Stack traces for a stuck test in mach5?
mikhailo.seledtsov at oracle.com
Thu Oct 29 20:25:03 UTC 2020
OK, thanks for clarifying. Then right, file a bug in infrabugs, add
subcomponent 'host', add me as a watcher. If you already did, please let
me know the bug id.
Thanks,
Misha
On 10/29/20 1:21 PM, Igor Ignatyev wrote:
> right, timeout handler uses tools from PATH, but as far as I can tell
> the problem here isn't w/ the used lldb, but w/ host security
> policies: if DevToolsSecurity isn't enabled, macOS asks you for
> login/password every time you try to attach w/ lldb (or any other tools)
>
> -- Igor
>
>> On Oct 29, 2020, at 1:13 PM, mikhailo.seledtsov at oracle.com wrote:
>>
>> Adding Leonid, since he worked on similar issue recently.
>>
>> It was something about timeout handler possibly referencing a wrong
>> lldb (platform natively installed vs installed by JIB), IIRC.
>>
>>
>> Misha
>>
>> On 10/27/20 1:07 PM, Igor Ignatyev wrote:
>>> Hi Evgeny,
>>>
>>> if you look at `DevToolsSecurity` results, you will see that
>>> "Developer mode is currently disabled.", meaning this host isn't
>>> properly configured and `lldb` can't attach to the process. you need
>>> to open a bug in infra JIRA --
>>> https://java.se.oracle.com/infrabugs/ (you can always find link to
>>> it from 'services' menu on infra landing page
>>> https://java.se.oracle.com ). I
>>> *assume* the appropriate project is MACH5 w/ 'host' being the
>>> component, Misha (cc'ed) might know better.
>>>
>>> HTH,
>>> -- Igor
>>>
>>> PS even if the host was properly configured, we wouldn't get any
>>> meaningful data in this particular case, as by the time jtreg
>>> invoked failure-handler, the test process had already finished and
>>> exited. that's why there is no "common" section (which has `jstack`,
>>> `jcmd` and other java specific tools) in `processes.html`, and why
>>> `pgrep` (in "test_processes") exited w/ 1 and `kill` ("core"
>>> subsection) said "No such process". this is by no means to say that
>>> we shouldn't fix the hosts. I just don't want to get your hopes too
>>> high: failure-handler is useful and all (mostly b/c it's me who
>>> implemented it ;) ) but b/c it's run concurrently to a test process,
>>> there always will be cases w/ missed data, esp. when a test is
>>> having almost enough time to finish.
>>>
>>>
>>>> On Oct 27, 2020, at 12:51 PM, Evgeny Nikitin
>>>> <evgeny.nikitin at oracle.com> wrote:
>>>>
>>>> Hi Igor,
>>>>
>>>> May I ask for your advice about a stuck-test failure in mach5?
>>>>
>>>> Here's the job, it contains only one test failure:
>>>>
>>>> https://mach5.us.oracle.com/mdash/jobs/mach5-one-jdk-16+22-1219-tier3-20201022-1017-15212551/tasks/mach5-one-jdk-16+22-1219-tier3-20201022-1017-15212551-tier3-comp-open_test_hotspot_jtreg_hotspot_slow_compiler-macosx-x64-debug-40/results?search=status%3Afailed%20AND%20-state%3Ainvalid
>>>>
>>>> The test hung, jtreg tried to gather stack traces, but with no success.
>>>> Output for stack traces:
>>>>
>>>> ----------------------------------------
>>>> [2020-10-22 10:42:02] [/bin/bash, -c, DevToolsSecurity --status |
>>>> grep -q enabled && lldb -o 'attach 23355' -o 'thread backtrace all'
>>>> -o 'detach' -o 'quit'] timeout=20000
>>>> ----------------------------------------
>>>> ----------------------------------------
>>>> [2020-10-22 10:42:02] exit code: 1 time: 29 ms
>>>> ----------------------------------------
>>>>
>>>> Output for spindump:
>>>> ----------------------------------------
>>>> [2020-10-22 10:42:02] [/usr/sbin/spindump, 23355, -stdout]
>>>> timeout=20000
>>>> ----------------------------------------
>>>> spindump must be run as root
>>>> ----------------------------------------
>>>> [2020-10-22 10:42:02] exit code: 77 time: 98 ms
>>>> ----------------------------------------
>>>>
>>>> There's obviously some problem with gathering stack traces. Is that
>>>> expected? If not... how and where can I open a bug about that? I'm
>>>> guessing it is for the Infra team, and not in the openjdk JIRA, right?
>>>>
>>>> Regards,
>>>> // Evgeny.
>>>>
>>>>
>>>
>
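
For anyone reading this thread later, here is a rough shell sketch of the two checks discussed above: the developer-mode gate that the failure handler puts in front of lldb, and a check of whether the target process still exists. The function names (dev_mode_enabled, process_alive, attach_if_allowed) are made up for illustration and are not part of jtreg or the failure handler. The status wording is assumed from the "Developer mode is currently disabled." message quoted in the thread. With the gate failing, lldb is never started, which would be consistent with the "exit code: 1 time: 29 ms" result in the log.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only if the given DevToolsSecurity status
# text reports developer mode as enabled (same grep as in the logged command).
dev_mode_enabled() {
    printf '%s\n' "$1" | grep -q enabled
}

# Hypothetical helper: succeeds if a process with the given pid still exists.
# Note: kill -0 also fails when we are not permitted to signal the process.
process_alive() {
    kill -0 "$1" 2>/dev/null
}

# Mirror the "status check && debugger" gate from the log. The debugger
# command is passed as arguments so the sketch can be tried without lldb.
attach_if_allowed() {
    status_text="$1"
    shift
    dev_mode_enabled "$status_text" && "$@"
}
```

On a macOS host this could be tried as something like attach_if_allowed "$(DevToolsSecurity --status)" lldb -o 'attach 23355' -o 'thread backtrace all' -o 'detach' -o 'quit', using the same lldb arguments as in the log above. Running process_alive 23355 first would show whether there is still anything to attach to.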
More information about the hotspot-runtime-dev
mailing list