RFR(XS): 8215042: Move serviceability/sa tests from tier1 to tier3.

Leonid Mesnik leonid.mesnik at oracle.com
Mon Dec 10 07:42:52 UTC 2018


Hi

Here is summary of  such "non test-specific" bugs:

Following bugs affect all (or mostly all) tests:
JDK-8202884 <https://bugs.openjdk.java.net/browse/JDK-8202884>SA: Attach/detach might fail on Linux if debugee application create/destroy threads during attaching
JDK-8204994 <https://bugs.openjdk.java.net/browse/JDK-8204994>SA might fail to attach to process with "Windbg Error: WaitForEvent failed"
JDK-8197591 <https://bugs.openjdk.java.net/browse/JDK-8197591>Tests failing with App waiting timeout
JDK-8203364 <https://bugs.openjdk.java.net/browse/JDK-8203364>Some serviceability/sa/ tests intermittently fail with java.io.IOException: LingeredApp terminated with non-zero exit code 3
and the similar bug for Solaris-SPARC
   JDK-8193639 <https://bugs.openjdk.java.net/browse/JDK-8193639> tests failing intermittently with Error attaching to process: Can't create thread_db agent!

SA might fail connecting to any process because of JDK-8202884 <https://bugs.openjdk.java.net/browse/JDK-8202884>, JDK-8204994 <https://bugs.openjdk.java.net/browse/JDK-8204994>. Also all tests relying on LingeredApp might affected by JDK-8197591 <https://bugs.openjdk.java.net/browse/JDK-8197591>, JDK-8203364 <https://bugs.openjdk.java.net/browse/JDK-8203364>. 

You have seen a lot of failures caused by https://bugs.openjdk.java.net/browse/JDK-8202884 <https://bugs.openjdk.java.net/browse/JDK-8202884> because it is a Linux-specific bug which is often reproduced on multi-core host.

Leonid

> On Dec 9, 2018, at 11:38 PM, Jini George <jini.george at oracle.com> wrote:
> 
> Hi Leonid,
> 
> Looks like all the SA failures here are all due to https://bugs.openjdk.java.net/browse/JDK-8202884. Do let me know if I am mistaken. We will work on fixing that issue faster.
> 
> Thanks,
> Jini.
> 
> On 12/10/2018 12:51 PM, Leonid Mesnik wrote:
>> David, Jini
>> I understand your concerns. But the original idea of tiered testing is that tier1 failures are treated as urgent issues and to resolve. [1]
>> Here is list of test failures for 1000 runs of tier1 tests in Mach5. (I am not able to provide a link here) Please note that all SA tests are excluded on Solaris and MacosX already.
>> 1 compiler/aot/calls/fromAot/AotInvokeSpecial2AotTest.java
>> 2 serviceability/sa/ClhsdbFindPC.java
>> 3 serviceability/sa/TestPrintMdo.java
>> 4 serviceability/sa/ClhsdbJstack.java
>> 5 serviceability/sa/ClhsdbJdis.java
>> 6 compiler/c2/Test8004741.java
>> 7 runtime/handshake/HandshakeWalkSuspendExitTest.java
>> 8 runtime/handshake/HandshakeWalkSuspendExitTest.java
>> 9 compiler/aot/calls/fromAot/AotInvokeVirtual2AotTest.java
>> 10 runtime/handshake/HandshakeWalkExitTest.java
>> 11 runtime/handshake/HandshakeWalkSuspendExitTest.java
>> 12 serviceability/sa/ClhsdbRegionDetailsScanOopsForG1.java
>> 13 serviceability/sa/ClhsdbRegionDetailsScanOopsForG1.java
>> 14 compiler/aot/calls/fromAot/AotInvokeVirtual2AotTest.java
>> The failures in of 'runtime/handshake/' are relatively caused by https://bugs.openjdk.java.net/browse/JDK-8214174 but should be also fixed/excluded. SA tests are also unstable and there are no plans to fix them soon.
>> So it means that we are going to have tier1 tests unstable for a long time.
>> The possible way to make tier1 more stable would be to run only some very basic sanity SA tests in tier1.  Might be to develop new sanity test which have some failover for existing SA bugs.
>> Leonid
>> [1] http://mail.openjdk.java.net/pipermail/jdk9-dev/2015-March/001991.html
>>> On Dec 9, 2018, at 7:19 PM, Jini George <jini.george at oracle.com <mailto:jini.george at oracle.com>> wrote:
>>> 
>>> Hi Leonid,
>>> 
>>> I agree with David. I am also concerned about us not detecting SA breakages (which could happen along with hotspot changes) soon enough. (Which was the primary reason to get these tests in).
>>> 
>>> Thank you,
>>> Jini.
>>> 
>>> On 12/8/2018 4:48 PM, David Holmes wrote:
>>>> Hi Leonid,
>>>> My concern here, if we care about keeping the SA operational, is that in tier3 these tests will not be covered by the jdk/submit testing process.
>>>> David
>>>> On 8/12/2018 3:53 pm, Leonid Mesnik wrote:
>>>>> Hi
>>>>> 
>>>>> Could you please review following fix which moves SA tests from tier1 to tier3. There are some bugs which cause intermittent failures of any test. SA tests fail intermittently are not stable enough for tier1.
>>>>> However failures are not very frequent. Also I don't think that putting all test in Problemlist.txt is very good idea because it left SA without any testing at all.
>>>>> So now all SA tests which are included in hotspot_tier3_runtime group.
>>>>> 
>>>>> webrev: http://cr.openjdk.java.net/~lmesnik/8215042/webrev.00/
>>>>> bug: https://bugs.openjdk.java.net/browse/JDK-8215042
>>>>> 
>>>>> Leonid

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.openjdk.java.net/pipermail/serviceability-dev/attachments/20181209/94ee21e9/attachment-0001.html>


More information about the serviceability-dev mailing list