RFR: 8355094: Performance drop in auto-vectorized kernel due to split store [v2]
Emanuel Peter
epeter at openjdk.org
Fri May 16 07:29:53 UTC 2025
On Thu, 15 May 2025 21:25:08 GMT, Vladimir Ivanov <vlivanov at openjdk.org> wrote:
>> Emanuel Peter has updated the pull request incrementally with one additional commit since the last revision:
>>
>> Update src/hotspot/share/opto/superword.cpp
>>
>> Co-authored-by: Manuel Hässig <manuel at haessig.org>
>
> Impressive analysis, Emanuel! Very deep, thorough, and insightful.
>
> Looks good.
>
> Speaking of Vector API, we experimented with getting access alignment under control. Unfortunately, when it comes to on-heap accesses it boils down to hyper-aligned objects support which is not there yet.
>
> PS: yay, you found a way to turn PRs into blog posts! :-)
@iwanowww Thanks for your kind words 😊
Indeed: on-heap access would benefit from hyper-aligned objects. Are there any ideas on how to do that? I wonder if it is worth it, or if it is good enough to just use off-heap (native) MemorySegments to guarantee alignment for very performance-critical cases?
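
For illustration only, here is a minimal sketch of the off-heap option, assuming JDK 22 or later where the java.lang.foreign API is final. The class name, the 64-byte alignment, and the fill-and-sum loop are hypothetical choices for this example, not code from the PR. It requests an aligned native segment from an Arena and checks the base address before using it:

```java
import java.lang.foreign.Arena;
import java.lang.foreign.MemorySegment;
import java.lang.foreign.ValueLayout;

public class AlignedSegmentSketch {
    // Assumed alignment for this example: 64 bytes (a common cache-line / AVX-512 vector size).
    static final long ALIGNMENT = 64;

    // True if the segment's base address is a multiple of the (power-of-two) alignment.
    static boolean isAligned(MemorySegment segment, long alignment) {
        return (segment.address() & (alignment - 1)) == 0;
    }

    // Allocates an aligned native int buffer, fills it with 0..length-1, and sums it.
    static long fillAndSum(int length) {
        try (Arena arena = Arena.ofConfined()) {
            // allocate(byteSize, byteAlignment) returns a native segment whose
            // base address honors the requested alignment.
            MemorySegment seg = arena.allocate((long) length * Integer.BYTES, ALIGNMENT);
            if (!isAligned(seg, ALIGNMENT)) {
                throw new IllegalStateException("segment is not aligned as requested");
            }
            for (int i = 0; i < length; i++) {
                seg.setAtIndex(ValueLayout.JAVA_INT, i, i);
            }
            long sum = 0;
            for (int i = 0; i < length; i++) {
                sum += seg.getAtIndex(ValueLayout.JAVA_INT, i);
            }
            return sum;
        } // the native memory is freed when the arena is closed
    }

    public static void main(String[] args) {
        System.out.println(fillAndSum(1024));
    }
}
```

Whether the auto-vectorizer then emits fully aligned vector accesses for such loops is a separate question; the sketch only shows how the base-address guarantee can be obtained, which an on-heap array does not offer today.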
-------------
PR Comment: https://git.openjdk.org/jdk/pull/25065#issuecomment-2885883207