JPRT system changes

Kelly O'Hair kelly.ohair at oracle.com
Fri Oct 26 10:52:03 PDT 2012


On Oct 26, 2012, at 10:31 AM, Tim Bell wrote:

> On 10/26/12 10:14, Vladimir Kozlov wrote:
>> Tim,
>> 
>> It is not just JBB. Here is current status:
> 
> Hmmm... not sure what it is, then.
> 
>>  [ 3 windows_x64_6.1 clients, 3 hosts ]
>>    sc11136526(P1)[DevOps VM Windows 7 X64]: running <2012-10-26-154155.jcoomes.gc-push> windows_x64-product-c2-GCBasher_ParOldGC(46m 40s)
> 
> No traces of that job on sc11136526:

I'm trying to kill off the JPRT client on these systems.

> 
> jprtadm at SC11136526:/cygdrive/c/MKS/mksnt> cd /cygdrive/c/MKS/mksnt && ./ps -ef | grep $USER
> jprtadm   1092    524  0   Oct 20 con  0:00 C:\cygwin\bin\cygrunsrv.exe
> jprtadm   1116    376  0   Oct 20 con  0:04 \??\C:\Windows\system32\conhost.exe "-1483141665-707719824-1770841038-590923786-3215500991937909109-844736201686794369
> jprtadm   1476   1140  0   Oct 20 con  0:01 C:\cygwin\usr\sbin\sshd.exe -D
> jprtadm   1196    524  0   Oct 24 con  0:00 "taskhost.exe"
> jprtadm   1652    372  0   Oct 24 con  0:00 rdpclip
> jprtadm   2052    864  0   Oct 24 con  0:00 "C:\Windows\system32\Dwm.exe"
> jprtadm   3572   3536  0   Oct 24 con  0:05 C:\Windows\Explorer.EXE
> jprtadm   1992   3084  0   Oct 24 con  0:00 "C:\Program Files (x86)\Common Files\Java\Java Update\jucheck.exe" -auto -scheduled -critical
> jprtadm   2580   1316  0   Oct 24 con  0:00 "C:\Program Files (x86)\McAfee\Common Framework\UdaterUI.exe" /EventName=UPDATER_UI_EVENT128252be
> jprtadm   1552   2580  0   Oct 24 con  0:00 /load
> jprtadm   4012   3188  0 10:17:05 con  0:00 C:\cygwin\usr\sbin\sshd.exe -D -R
> jprtadm   3136   4108  0 10:17:07 con  0:00 C:\cygwin\bin\bash.exe
> jprtadm   3044   3136  0 10:19:33 con  0:00 C:\cygwin\bin\bash.exe
> jprtadm   3796   3044  0 10:19:33 con  0:00 C:\MKS\mksnt\ps.exe -ef
> jprtadm   3372   3796  0 10:19:33 con  0:00 ntps.exe -ef
> jprtadm   2000   4824  0 10:19:33 con  0:00 C:\cygwin\bin\grep.exe jprtadm
> 
> 
>> sc11136598(P1)[DevOps VM Windows 7 X64]: running <2012-10-26-003719.vkozlov.7163534> windows_x64-fastdebug-c2-jbb_default_nontiered(56m 10s)
> 
> No sign of that job on sc11136598:
> 
> jprtadm at SC11136598:/cygdrive/c/MKS/mksnt> cd /cygdrive/c/MKS/mksnt && ./ps -ef | grep $USER
> jprtadm   3156    524  0   Oct 24 con  0:00 "taskhost.exe"
> jprtadm    600    372  0   Oct 24 con  0:00 rdpclip
> jprtadm   2472    860  0   Oct 24 con  0:00 "C:\Windows\system32\Dwm.exe"
> jprtadm   3196    588  0   Oct 24 con  0:04 C:\Windows\Explorer.EXE
> jprtadm   3940   3724  0   Oct 24 con  0:00 "C:\Program Files (x86)\Common Files\Java\Java Update\jusched.exe"
> jprtadm   1904   3724  0   Oct 24 con  0:01 "C:\Program Files (x86)\McAfee\Host Intrusion Prevention\FireTray.exe"
> jprtadm   3504   1392  0   Oct 24 con  0:00 "C:\Program Files (x86)\McAfee\Common Framework\UdaterUI.exe" /EventName=UPDATER_UI_EVENT12e6c98d
> jprtadm   1816   3504  0   Oct 24 con  0:00 /load
> jprtadm   2332   3940  0   Oct 24 con  0:00 "C:\Program Files (x86)\Common Files\Java\Java Update\jucheck.exe" -auto -scheduled -critical
> jprtadm   4796    524  0 09:37:10 con  0:00 C:\cygwin\bin\cygrunsrv.exe
> jprtadm   5880    376  0 09:37:10 con  0:00 \??\C:\Windows\system32\conhost.exe "-1794119873914084936-13665211491113271794989327951480364587-1969885781-473977465
> jprtadm   4512    824  0 09:37:10 con  0:00 C:\cygwin\usr\sbin\sshd.exe -D
> jprtadm   5500   5532  0 10:20:33 con  0:00 C:\cygwin\usr\sbin\sshd.exe -D -R
> jprtadm   5612   4044  0 10:20:35 con  0:00 C:\cygwin\bin\bash.exe
> jprtadm   1776   5612  0 10:20:56 con  0:00 C:\cygwin\bin\bash.exe
> jprtadm   5556   1776  0 10:20:56 con  0:00 C:\MKS\mksnt\ps.exe -ef
> jprtadm   2544   2268  0 10:20:56 con  0:00 C:\cygwin\bin\grep.exe jprtadm
> jprtadm   3864   5556  0 10:20:56 con  0:00 ntps.exe -ef
> 
>> sc11136603(P1)[DevOps VM Windows 7 X64]: running <2012-10-26-154155.jcoomes.gc-push> windows_x64-product-c2-runThese_Xcomp(36m 52s)
> 
> sc11136603 is not responding to ssh or to remote desktop.

Not sure what happened on that system.

You will need to reboot via the DevOps interface, I can't do that since the machine is assigned to you.


-kto

> 
> 
> Tim
> 
>> 
>> Vladimir
>> 
>> Tim Bell wrote:
>>> Adding hotspot-dev for the question about jbb_default below.
>>> 
>>> Kelly-
>>> 
>>>> I'll be taking all the Windows 7 systems out of the queues until we resolve this.
>>> 
>>> Rather than take them out, could you make thw W7 systems build-only?
>>> 
>>> The root of the problem is that jbb does not really run properly in headless mode.  Everything after that has been us standing on our heads trying to work around breakage in jbb or in AWT.
>>> 
>>> Is jbb_default worth keeping, or is it simply run out of habit? Just asking because it is causing a fair amount of trouble.
>>> 
>>> 
>>> Tim
>>> 
>>> On 10/26/12 09:56, Kelly O'Hair wrote:
>>>> It's the Windows 7 jbb issue.
>>>> 
>>>> We have seen this before in JPRT Stockholm, which was the first JPRT system that had Windows 7.
>>>> 
>>>> Current theory, not sure how much we understand this, is that the CYGWIN sshd service is somehow
>>>> involved, and that if we manually start sshd the hang goes away.
>>>> But so far we haven't had any solutions on how to get this manual sshd startup to work on reboots.
>>>> 
>>>> I'll be taking all the Windows 7 systems out of the queues until we resolve this.
>>>> 
>>>> -kto
>>>> 
>>>> On Oct 26, 2012, at 9:31 AM, Vladimir Kozlov wrote:
>>>> 
>>>>> It looks like some of new machines periodically timeout during jbb test run (>25min) so out jobs failed.
>>>>> 
>>>>>  windows_x64-fastdebug-c2-jbb_default FAILED(25m 40s)
>>>>>        USED:     hostname=sc11136526 platform=windows_x64_6.1 osname=windows osarch=x64 cpus=2 parallelcount=2 ram=7899MB instance=P1 compiler=VS2010 mks=true cygwin=true installshield=true dxsdk=true pstools=true
>>>>>        ATTRS:    cygwin=true
>>>>>        TIMING:   clean=1s init=13s work=25m12s fini=5s
>>>>>        VMFLAGS:  -server -Djava.awt.headless=true
>>>>>        NEEDS:    cygwin,gnumake381,jbb,jtreg
>>>>> 
>>>>> 
>>>>>  windows_x64-fastdebug-c2-jbb_default FAILED(25m 50s)
>>>>>        USED:     hostname=sc11136603 platform=windows_x64_6.1 osname=windows osarch=x64 cpus=2 parallelcount=2 ram=7899MB instance=P1 compiler=VS2010 mks=true cygwin=true installshield=true dxsdk=true pstools=true
>>>>>        ATTRS:    mks=true
>>>>>        TIMING:   clean=12s init=13s work=25m13s fini=4s
>>>>>        VMFLAGS:  -server -Djava.awt.headless=true
>>>>>        NEEDS:    gnumake381,jbb,jtreg,mks
>>>>> 
>>>>> Vladimir
>>>>> 
>>>>> Kelly O'Hair wrote:
>>>>>> I have updated all the JPRT systems to a new version.
>>>>>> JPRT Version: 3.0.47: (2012-10-23) Case of the Unwelcome Well
>>>>>> Changes:
>>>>>> * hotspotwest will get all the Windows X64 machines currently used by the sfbay queue (4 X64 systems, 1 is a VM)
>>>>>> * sfbay will get 6 new Windows X64 DevOps VM's
>>>>>> * Both sfbay and hotspotwest will each get 3 new Windows 7 X64 DevOp VMs
>>>>>> * some changes with regards to how JPRT kills off processes on clients, should help kill off failed or hung cygwin builds
>>>>>> * An attempt to allow ccache to work on Solaris with build-infra builds
>>>>>> There are some JPRT sfbay machines that are down, tickets have been filed. So sfbay is shy Solaris SPARC and one MacPro.
>>>>>> JPRT East is missing some Solaris SPARC machines.
>>>>>> -kto
>>> 
>>> 
> 
> 



More information about the hotspot-dev mailing list