RFR 8184445: JShell tests: fail intermittently if tests are run in high concurrent mode.
Joseph D. Darcy
joe.darcy at oracle.com
Wed Feb 28 02:14:13 UTC 2018
Hi Robert,
I'd prefer if only the launching tests and other known failures were
segregated into a non-concurrent area. That would still let ~3/4 of the
tests proceed normally.
As a follow-up, can an RFE be filed to harden the intermittently failing
tests against concurrent networking?
Thanks,
-Joe
On 2/27/2018 12:21 PM, Robert Field wrote:
> OK, I did a survey of all the JShell bugs. There are over a dozen
> intermittent test failures, almost all of which are probably network
> related. But if we limit this to just intermittent failures to launch,
> then there are seven.
>
> There are 17 tests of launching configuration, and 75 'normal' tests.
> So, the launching configuration tests do fail disproportionately, 3
> mentioned failures vs 5 mentioned failing files.
>
> The bug that highlighted the concurrent testing -- "JShell tests: fail
> intermittently if tests are run in high concurrent mode":
> https://bugs.openjdk.java.net/browse/JDK-8184445
> mentioned 'several' issues; the two included JTR files are, tellingly,
> normal tests.
>
> The non-launching intermittent failures are all normal tests.
>
> So, where does that leave us? I could reduce the failures a bit at
> low time-cost by putting the launching configuration tests in the
> exclusiveAccess.dirs. Or, I could, at considerable testing cost,
> address the broad swath.
>
> -Robert
>
> On 02/26/18 17:28, joe darcy wrote:
>> Hi Robert,
>>
>> On 2/26/2018 10:57 AM, Robert Field wrote:
>>>
>>>
>>> On 02/26/18 10:23, joe darcy wrote:
>>>> Hi Robert,
>>>>
>>>> The fix looks acceptable in terms of addressing the problem, but is
>>>> there a sense of how this might impact running time of the test suite?
>>>>
>>>> Phrased differently, are there plans to make the tests more robust
>>>> to concurrent runs in the future?
>>>
>>> Hi Joe,
>>>
>>> There is a lot of network connection happening in these tests, most
>>> of which is in layers we don't control (JDI). We have been trying
>>> to lower the risk and we don't see failures running the tests
>>> ourselves, but intermittent failures scattered through the suite
>>> during testing (e.g. mach5) have been a constant problem.
>>>
>>> We will see the impact on test duration. The default connection has
>>> three-level fail-over; the tests of other connection modes see
>>> failure far more frequently, so, if necessary, we can look at tuning
>>> this.
>>>
>>
>> From some quick checking, there are about 80 tests in that directory.
>> From one sample point on my laptop, the tests took a good long while
>> to run. If some of the tests can be reliably run concurrently, I'd
>> much prefer to see a subset of tests moved to a sheltered directory.
>>
>> Thanks,
>>
>> -Joe
>