RFR: 8357460: RISC-V: Optimize array fill stub for small size [v3]
Fei Yang
fyang at openjdk.org
Mon May 26 00:46:52 UTC 2025
On Fri, 23 May 2025 15:38:36 GMT, Feilong Jiang <fjiang at openjdk.org> wrote:
>> Please consider.
>> As discussed in https://github.com/openjdk/jdk/pull/23890#discussion_r2094920943, we can further optimize the array fill stub by unrolling the stores when the size is less than 8.
>>
>> This PR also removes the **aligned tail part**, in consideration of code size and testing coverage. The benchmark results below show no significant regressions.
>>
>>
>> Before:
>> Benchmark                 (size)  Mode  Cnt   Score   Error  Units
>> ArrayFill.fillByteArray        7  avgt   12  27.215 ± 0.073  ns/op
>> ArrayFill.fillByteArray       15  avgt   12  32.687 ± 0.904  ns/op
>> ArrayFill.fillIntArray         7  avgt   12  28.629 ± 0.006  ns/op
>> ArrayFill.fillIntArray        15  avgt   12  29.351 ± 0.009  ns/op
>> ArrayFill.fillShortArray       7  avgt   12  30.776 ± 0.006  ns/op
>> ArrayFill.fillShortArray      15  avgt   12  31.724 ± 0.447  ns/op
>> ArrayFill.zeroByteArray        7  avgt   12  27.199 ± 0.006  ns/op
>> ArrayFill.zeroByteArray       15  avgt   12  32.685 ± 0.900  ns/op
>> ArrayFill.zeroIntArray         7  avgt   12  28.630 ± 0.007  ns/op
>> ArrayFill.zeroIntArray        15  avgt   12  29.352 ± 0.011  ns/op
>> ArrayFill.zeroShortArray       7  avgt   12  30.776 ± 0.006  ns/op
>> ArrayFill.zeroShortArray      15  avgt   12  31.497 ± 0.012  ns/op
>>
>> After:
>> Benchmark                 (size)  Mode  Cnt   Score   Error  Units
>> ArrayFill.fillByteArray        7  avgt   12  20.137 ± 0.042  ns/op
>> ArrayFill.fillByteArray       15  avgt   12  32.928 ± 0.004  ns/op
>> ArrayFill.fillIntArray         7  avgt   12  28.630 ± 0.004  ns/op
>> ArrayFill.fillIntArray        15  avgt   12  29.344 ± 0.005  ns/op
>> ArrayFill.fillShortArray       7  avgt   12  31.494 ± 0.004  ns/op
>> ArrayFill.fillShortArray      15  avgt   12  31.492 ± 0.008  ns/op
>> ArrayFill.zeroByteArray        7  avgt   12  19.980 ± 0.164  ns/op
>> ArrayFill.zeroByteArray       15  avgt   12  32.927 ± 0.004  ns/op
>> ArrayFill.zeroIntArray         7  avgt   12  28.629 ± 0.005  ns/op
>> ArrayFill.zeroIntArray        15  avgt   12  29.346 ± 0.006  ns/op
>> ArrayFill.zeroShortArray       7  avgt   12  32.193 ± 0.027  ns/op
>> ArrayFill.zeroShortArray      15  avgt   12  31.495 ± 0.010  ns/op
>>
>>
>> Testing:
>> - [x] tier1
>
> Feilong Jiang has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains four additional commits since the last revision:
>
> - Merge branch 'openjdk:master' into riscv-optimize-generate-fill
> - Merge branch 'openjdk:master' into riscv-optimize-generate-fill
> - Merge branch 'master' of https://github.com/openjdk/jdk into riscv-optimize-generate-fill
> - optimize array fill stub for small size
Looks good. Thanks.
-------------
Marked as reviewed by fyang (Reviewer).
PR Review: https://git.openjdk.org/jdk/pull/25350#pullrequestreview-2867010574
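
For readers unfamiliar with the technique named in the PR description, the idea of unrolling the stores for sizes below 8 can be illustrated in plain Java. This is only a sketch under assumptions: the real stub is generated RISC-V assembly inside HotSpot, and the class name `SmallFillSketch`, the method `fillSmall`, and the bit-test structure are hypothetical, not taken from the patch.

```java
import java.util.Arrays;

public class SmallFillSketch {
    // Hypothetical helper: fills a[0..count) with value for 0 <= count < 8.
    // Instead of a loop, the bits of count select straight-line groups of
    // 4, 2, and 1 stores, so every element is written exactly once.
    static void fillSmall(byte[] a, int count, byte value) {
        int i = 0;
        if ((count & 4) != 0) {
            a[i] = value;
            a[i + 1] = value;
            a[i + 2] = value;
            a[i + 3] = value;
            i += 4;
        }
        if ((count & 2) != 0) {
            a[i] = value;
            a[i + 1] = value;
            i += 2;
        }
        if ((count & 1) != 0) {
            a[i] = value;
        }
    }

    public static void main(String[] args) {
        byte[] a = new byte[8];
        fillSmall(a, 7, (byte) 5);
        System.out.println(Arrays.toString(a)); // prints [5, 5, 5, 5, 5, 5, 5, 0]
    }
}
```

A stub generator could emit wider stores for each group where alignment allows; the sketch keeps one store per element for clarity.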