RFR: 8257436: [aarch64] Regressions in ArrayCopyUnalignedDst.testByte/testChar for 65-78 bytes when UseSIMDForMemoryOps is on [v2]
Evgeny Astigeevich
github.com+42899633+eastig at openjdk.java.net
Wed Dec 2 14:08:13 UTC 2020
> Copying data in range 65 - 80 bytes with
> ldpq
> ldpq
> ldp
> stpq
> stpq
> stp
> is faster than with
> ldpq
> ldpq
> ldpq
> stpq
> stpq
> stpq
> when pointers involved are not properly aligned. This mostly affects byte and char arraycopy.
> This patch fixes regressions of ArrayCopyUnalignedDst.testByte/testChar when UseSIMDForMemoryOps is on.
> See the attached results for performance data
> [test_byte_results.txt](https://github.com/openjdk/jdk/files/5629676/test_byte_results.txt)
> [test_char_results.txt](https://github.com/openjdk/jdk/files/5629673/test_char_results.txt)
Evgeny Astigeevich has refreshed the contents of this pull request, and previous commits have been removed. The incremental views will show differences compared to the previous content of the PR. The pull request contains one new commit since the last revision:
8257436: [aarch64] Regressions in ArrayCopyUnalignedDst.testByte/testChar for 65-78 bytes when UseSIMDForMemoryOps is on
On Graviton 2 arraycopy of data range 65 - 78 bytes with the sequence
of 'ldpq,ldpq,ldp,stpq,stpq,stp' is faster than with the sequence of
'ldpq,ldpq,ldpq,stpq,stpq,stpq' for ArrayCopyUnaligned.
-------------
Changes:
- all: https://git.openjdk.java.net/jdk/pull/1558/files
- new: https://git.openjdk.java.net/jdk/pull/1558/files/16b36d36..3d9d9195
Webrevs:
- full: https://webrevs.openjdk.java.net/?repo=jdk&pr=1558&range=01
- incr: https://webrevs.openjdk.java.net/?repo=jdk&pr=1558&range=00-01
Stats: 15 lines in 1 file changed: 3 ins; 1 del; 11 mod
Patch: https://git.openjdk.java.net/jdk/pull/1558.diff
Fetch: git fetch https://git.openjdk.java.net/jdk pull/1558/head:pull/1558
PR: https://git.openjdk.java.net/jdk/pull/1558
More information about the hotspot-compiler-dev
mailing list