RFR: 8277617: Adjust AVX3Threshold for copy/fill stubs
Jie Fu
jiefu at openjdk.java.net
Wed Dec 1 07:11:27 UTC 2021
On Wed, 1 Dec 2021 03:38:00 GMT, Jie Fu <jiefu at openjdk.org> wrote:
>>> @dholmes-ora We see about 25% gain on a micro on our latest platform. There is no cpuid bit for this, so the closest was to check for the new serialize ISA supported on this platform.
>>
>> It would be better to add a jmh test for this opt.
>> Thanks.
>
>> @DamonFool There are jmh tests for Arraycopy in test/micro/org/openjdk/bench/java/lang/Arraycopy.java.
>
> So how about posting the detailed perf data before and after this patch?
> Thanks.
> > > @DamonFool There are jmh tests for Arraycopy in test/micro/org/openjdk/bench/java/lang/Arraycopy.java.
> >
> >
> > So how about posting the detailed perf data before and after this patch? Thanks.
>
> Before: Benchmark Mode Cnt Score Error Units ArrayCopy.arrayCopyObject avgt 5 19.538 ± 0.073 ns/op ArrayCopy.arrayCopyObjectNonConst avgt 5 20.513 ± 0.104 ns/op ArrayCopy.arrayCopyObjectSameArraysBackward avgt 5 15.919 ± 0.652 ns/op ArrayCopy.arrayCopyObjectSameArraysForward avgt 5 15.669 ± 0.359 ns/op
>
> After: Benchmark Mode Cnt Score Error Units ArrayCopy.arrayCopyObject avgt 5 16.957 ± 0.584 ns/op ArrayCopy.arrayCopyObjectNonConst avgt 5 17.221 ± 0.036 ns/op ArrayCopy.arrayCopyObjectSameArraysBackward avgt 5 12.952 ± 0.068 ns/op ArrayCopy.arrayCopyObjectSameArraysForward avgt 5 13.562 ± 0.124 ns/op
Thanks @sviswa7 for your sharing.
So the performance number looks good on Intel's latest AVX512 platform.
We don't use the 64-byte instructions as default on Intel's old AVX512 platforms, right?
If so, is it possible a performance regression for the old platforms after this patch?
Thanks.
-------------
PR: https://git.openjdk.java.net/jdk/pull/6512
More information about the hotspot-compiler-dev
mailing list