RFR: 8338967: Improve performance for MemorySegment::fill [v5]

Paul Sandoz psandoz at openjdk.org
Tue Aug 27 20:28:04 UTC 2024


On Tue, 27 Aug 2024 10:38:46 GMT, Per Minborg <pminborg at openjdk.org> wrote:

>> The performance of `MemorySegment::fill` can be improved by replacing the `checkAccess()` method call with a call to `checkReadOnly()` (the bounds of the segment itself do not need to be checked).
>> 
>> Also, smaller segments can be handled directly by Java code rather than transitioning to native code.
>> 
>> Here is how the `MemorySegment::fill` performance is improved by this PR:
>> 
>> ![image](https://github.com/user-attachments/assets/92a0bcf2-f5b0-4a91-9c02-39423f870209)
>> 
>> Operations involving 8 or more bytes are delegated to native code whereas smaller segments are handled via a switch rake.
>> 
>> It should be noted that `Arena::allocate` uses `MemorySegment::fill`. Hence, this PR will also have a positive effect on memory allocation performance.
>
> Per Minborg has updated the pull request with a new target base due to a merge or a rebase. The incremental webrev excludes the unrelated changes brought in by the merge/rebase. The pull request contains six additional commits since the last revision:
> 
>  - Merge branch 'master' into fill-performance
>  - Fix typo
>  - Add a comment about the old switch type
>  - Remove unused import
>  - Reduce kick-in size and add test
>  - Initial implementation

How fast do we need to be here, given that we are measuring a few nanoseconds per operation?

What if the goal is simply not to regress relative to, say, explicitly filling a small segment or a comparable array (e.g., < 8 bytes)? Then maybe a loop suffices and the code stays simple.
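
To make the suggestion concrete, here is a minimal sketch of what such a loop could look like. It is not the code in the PR: the class name `SmallFill`, the helper `fill`, and the `LOOP_THRESHOLD` constant are hypothetical, and the cutoff of 8 bytes is only taken from the description quoted above. It assumes a JDK where `java.lang.foreign` is final (22 or later).

```java
import java.lang.foreign.MemorySegment;
import java.lang.foreign.ValueLayout;

public final class SmallFill {

    // Hypothetical cutoff: segments smaller than this are filled by a plain loop.
    private static final long LOOP_THRESHOLD = 8;

    // Fills every byte of the segment with the given value and returns the segment.
    static MemorySegment fill(MemorySegment segment, byte value) {
        long size = segment.byteSize();
        if (size < LOOP_THRESHOLD) {
            // Small segment: one byte store per element, no bulk operation.
            for (long i = 0; i < size; i++) {
                segment.set(ValueLayout.JAVA_BYTE, i, value);
            }
            return segment;
        }
        // Larger segment: defer to the existing bulk fill.
        return segment.fill(value);
    }

    public static void main(String[] args) {
        byte[] small = new byte[5];
        fill(MemorySegment.ofArray(small), (byte) 0x2A);
        System.out.println(java.util.Arrays.toString(small));
    }
}
```

Unlike this public-API sketch, an implementation inside the JDK could do the read-only check once up front rather than relying on the checks performed by each `set` call.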

-------------

PR Comment: https://git.openjdk.org/jdk/pull/20712#issuecomment-2313446118

