JPRT system changes
Kelly O'Hair
kelly.ohair at oracle.com
Fri Oct 26 10:52:03 PDT 2012
On Oct 26, 2012, at 10:31 AM, Tim Bell wrote:
> On 10/26/12 10:14, Vladimir Kozlov wrote:
>> Tim,
>>
>> It is not just JBB. Here is current status:
>
> Hmmm... not sure what it is, then.
>
>> [ 3 windows_x64_6.1 clients, 3 hosts ]
>> sc11136526(P1)[DevOps VM Windows 7 X64]: running <2012-10-26-154155.jcoomes.gc-push> windows_x64-product-c2-GCBasher_ParOldGC(46m 40s)
>
> No traces of that job on sc11136526:
I'm trying to kill off the JPRT client on these systems.
>
> jprtadm at SC11136526:/cygdrive/c/MKS/mksnt> cd /cygdrive/c/MKS/mksnt && ./ps -ef | grep $USER
> jprtadm 1092 524 0 Oct 20 con 0:00 C:\cygwin\bin\cygrunsrv.exe
> jprtadm 1116 376 0 Oct 20 con 0:04 \??\C:\Windows\system32\conhost.exe "-1483141665-707719824-1770841038-590923786-3215500991937909109-844736201686794369
> jprtadm 1476 1140 0 Oct 20 con 0:01 C:\cygwin\usr\sbin\sshd.exe -D
> jprtadm 1196 524 0 Oct 24 con 0:00 "taskhost.exe"
> jprtadm 1652 372 0 Oct 24 con 0:00 rdpclip
> jprtadm 2052 864 0 Oct 24 con 0:00 "C:\Windows\system32\Dwm.exe"
> jprtadm 3572 3536 0 Oct 24 con 0:05 C:\Windows\Explorer.EXE
> jprtadm 1992 3084 0 Oct 24 con 0:00 "C:\Program Files (x86)\Common Files\Java\Java Update\jucheck.exe" -auto -scheduled -critical
> jprtadm 2580 1316 0 Oct 24 con 0:00 "C:\Program Files (x86)\McAfee\Common Framework\UdaterUI.exe" /EventName=UPDATER_UI_EVENT128252be
> jprtadm 1552 2580 0 Oct 24 con 0:00 /load
> jprtadm 4012 3188 0 10:17:05 con 0:00 C:\cygwin\usr\sbin\sshd.exe -D -R
> jprtadm 3136 4108 0 10:17:07 con 0:00 C:\cygwin\bin\bash.exe
> jprtadm 3044 3136 0 10:19:33 con 0:00 C:\cygwin\bin\bash.exe
> jprtadm 3796 3044 0 10:19:33 con 0:00 C:\MKS\mksnt\ps.exe -ef
> jprtadm 3372 3796 0 10:19:33 con 0:00 ntps.exe -ef
> jprtadm 2000 4824 0 10:19:33 con 0:00 C:\cygwin\bin\grep.exe jprtadm
>
>
>> sc11136598(P1)[DevOps VM Windows 7 X64]: running <2012-10-26-003719.vkozlov.7163534> windows_x64-fastdebug-c2-jbb_default_nontiered(56m 10s)
>
> No sign of that job on sc11136598:
>
> jprtadm at SC11136598:/cygdrive/c/MKS/mksnt> cd /cygdrive/c/MKS/mksnt && ./ps -ef | grep $USER
> jprtadm 3156 524 0 Oct 24 con 0:00 "taskhost.exe"
> jprtadm 600 372 0 Oct 24 con 0:00 rdpclip
> jprtadm 2472 860 0 Oct 24 con 0:00 "C:\Windows\system32\Dwm.exe"
> jprtadm 3196 588 0 Oct 24 con 0:04 C:\Windows\Explorer.EXE
> jprtadm 3940 3724 0 Oct 24 con 0:00 "C:\Program Files (x86)\Common Files\Java\Java Update\jusched.exe"
> jprtadm 1904 3724 0 Oct 24 con 0:01 "C:\Program Files (x86)\McAfee\Host Intrusion Prevention\FireTray.exe"
> jprtadm 3504 1392 0 Oct 24 con 0:00 "C:\Program Files (x86)\McAfee\Common Framework\UdaterUI.exe" /EventName=UPDATER_UI_EVENT12e6c98d
> jprtadm 1816 3504 0 Oct 24 con 0:00 /load
> jprtadm 2332 3940 0 Oct 24 con 0:00 "C:\Program Files (x86)\Common Files\Java\Java Update\jucheck.exe" -auto -scheduled -critical
> jprtadm 4796 524 0 09:37:10 con 0:00 C:\cygwin\bin\cygrunsrv.exe
> jprtadm 5880 376 0 09:37:10 con 0:00 \??\C:\Windows\system32\conhost.exe "-1794119873914084936-13665211491113271794989327951480364587-1969885781-473977465
> jprtadm 4512 824 0 09:37:10 con 0:00 C:\cygwin\usr\sbin\sshd.exe -D
> jprtadm 5500 5532 0 10:20:33 con 0:00 C:\cygwin\usr\sbin\sshd.exe -D -R
> jprtadm 5612 4044 0 10:20:35 con 0:00 C:\cygwin\bin\bash.exe
> jprtadm 1776 5612 0 10:20:56 con 0:00 C:\cygwin\bin\bash.exe
> jprtadm 5556 1776 0 10:20:56 con 0:00 C:\MKS\mksnt\ps.exe -ef
> jprtadm 2544 2268 0 10:20:56 con 0:00 C:\cygwin\bin\grep.exe jprtadm
> jprtadm 3864 5556 0 10:20:56 con 0:00 ntps.exe -ef
>
>> sc11136603(P1)[DevOps VM Windows 7 X64]: running <2012-10-26-154155.jcoomes.gc-push> windows_x64-product-c2-runThese_Xcomp(36m 52s)
>
> sc11136603 is not responding to ssh or to remote desktop.
Not sure what happened on that system.
You will need to reboot via the DevOps interface, I can't do that since the machine is assigned to you.
-kto
>
>
> Tim
>
>>
>> Vladimir
>>
>> Tim Bell wrote:
>>> Adding hotspot-dev for the question about jbb_default below.
>>>
>>> Kelly-
>>>
>>>> I'll be taking all the Windows 7 systems out of the queues until we resolve this.
>>>
>>> Rather than take them out, could you make thw W7 systems build-only?
>>>
>>> The root of the problem is that jbb does not really run properly in headless mode. Everything after that has been us standing on our heads trying to work around breakage in jbb or in AWT.
>>>
>>> Is jbb_default worth keeping, or is it simply run out of habit? Just asking because it is causing a fair amount of trouble.
>>>
>>>
>>> Tim
>>>
>>> On 10/26/12 09:56, Kelly O'Hair wrote:
>>>> It's the Windows 7 jbb issue.
>>>>
>>>> We have seen this before in JPRT Stockholm, which was the first JPRT system that had Windows 7.
>>>>
>>>> Current theory, not sure how much we understand this, is that the CYGWIN sshd service is somehow
>>>> involved, and that if we manually start sshd the hang goes away.
>>>> But so far we haven't had any solutions on how to get this manual sshd startup to work on reboots.
>>>>
>>>> I'll be taking all the Windows 7 systems out of the queues until we resolve this.
>>>>
>>>> -kto
>>>>
>>>> On Oct 26, 2012, at 9:31 AM, Vladimir Kozlov wrote:
>>>>
>>>>> It looks like some of new machines periodically timeout during jbb test run (>25min) so out jobs failed.
>>>>>
>>>>> windows_x64-fastdebug-c2-jbb_default FAILED(25m 40s)
>>>>> USED: hostname=sc11136526 platform=windows_x64_6.1 osname=windows osarch=x64 cpus=2 parallelcount=2 ram=7899MB instance=P1 compiler=VS2010 mks=true cygwin=true installshield=true dxsdk=true pstools=true
>>>>> ATTRS: cygwin=true
>>>>> TIMING: clean=1s init=13s work=25m12s fini=5s
>>>>> VMFLAGS: -server -Djava.awt.headless=true
>>>>> NEEDS: cygwin,gnumake381,jbb,jtreg
>>>>>
>>>>>
>>>>> windows_x64-fastdebug-c2-jbb_default FAILED(25m 50s)
>>>>> USED: hostname=sc11136603 platform=windows_x64_6.1 osname=windows osarch=x64 cpus=2 parallelcount=2 ram=7899MB instance=P1 compiler=VS2010 mks=true cygwin=true installshield=true dxsdk=true pstools=true
>>>>> ATTRS: mks=true
>>>>> TIMING: clean=12s init=13s work=25m13s fini=4s
>>>>> VMFLAGS: -server -Djava.awt.headless=true
>>>>> NEEDS: gnumake381,jbb,jtreg,mks
>>>>>
>>>>> Vladimir
>>>>>
>>>>> Kelly O'Hair wrote:
>>>>>> I have updated all the JPRT systems to a new version.
>>>>>> JPRT Version: 3.0.47: (2012-10-23) Case of the Unwelcome Well
>>>>>> Changes:
>>>>>> * hotspotwest will get all the Windows X64 machines currently used by the sfbay queue (4 X64 systems, 1 is a VM)
>>>>>> * sfbay will get 6 new Windows X64 DevOps VM's
>>>>>> * Both sfbay and hotspotwest will each get 3 new Windows 7 X64 DevOp VMs
>>>>>> * some changes with regards to how JPRT kills off processes on clients, should help kill off failed or hung cygwin builds
>>>>>> * An attempt to allow ccache to work on Solaris with build-infra builds
>>>>>> There are some JPRT sfbay machines that are down, tickets have been filed. So sfbay is shy Solaris SPARC and one MacPro.
>>>>>> JPRT East is missing some Solaris SPARC machines.
>>>>>> -kto
>>>
>>>
>
>
More information about the hotspot-dev
mailing list