RFR: 8257436: [aarch64] Regressions in ArrayCopyUnalignedDst.testByte/testChar for 65-78 bytes when UseSIMDForMemoryOps is on [v4]

Volker Simonis simonis at openjdk.java.net
Thu Dec 3 14:10:15 UTC 2020


On Thu, 3 Dec 2020 14:05:35 GMT, Evgeny Astigeevich <github.com+42899633+eastig at openjdk.org> wrote:

>> Copying data in range 65 - 80 bytes with
>> ldpq
>> ldpq 
>> ldp
>> stpq
>> stpq
>> stp
>> is faster than with
>> ldpq
>> ldpq
>> ldpq
>> stpq
>> stpq
>> stpq
>> when pointers involved are not properly aligned. This mostly affects byte and char arraycopy. 
>> This patch fixes regressions of ArrayCopyUnalignedDst.testByte/testChar when UseSIMDForMemoryOps is on.
>> See the attached results for performance data
>> [test_byte_results.txt](https://github.com/openjdk/jdk/files/5629676/test_byte_results.txt)
>> [test_char_results.txt](https://github.com/openjdk/jdk/files/5629673/test_char_results.txt)
>
> Evgeny Astigeevich has refreshed the contents of this pull request, and previous commits have been removed. The incremental views will show differences compared to the previous content of the PR. The pull request contains one new commit since the last revision:
> 
>   8257436: [aarch64] Regressions in ArrayCopyUnalignedDst.testByte/testChar for 65-78 bytes when UseSIMDForMemoryOps is on
>   
>   On Graviton 2 ArrayCopyUnaligned of data range 65 - 78 bytes with
>   the sequence of 'ldpq,ldpq,ldp,stpq,stpq,stp' is faster than with
>   the sequence of 'ldpq,ldpq,ldpq,stpq,stpq,stpq'.

Looks good now. Thanks!

-------------

Marked as reviewed by simonis (Reviewer).

PR: https://git.openjdk.java.net/jdk/pull/1558


More information about the hotspot-compiler-dev mailing list