RFR: 8288047: Accelerate Poly1305 on x86_64 using AVX512 instructions [v5]
Vladimir Ivanov
vlivanov at openjdk.org
Tue Nov 1 23:34:34 UTC 2022
On Fri, 28 Oct 2022 20:19:35 GMT, vpaprotsk <duke at openjdk.org> wrote:
> And just looking now on uops.info, they seem to have identical timings?
Actual instruction being used (aligned vs unaligned versions) doesn't matter much here, because it's a dynamic property of the address being accessed: misaligned accesses that cross cache line boundary incur a penalty. Since the cache line size is 64 byte in size, every misaligned 512-bit access is penalized.
-------------
PR: https://git.openjdk.org/jdk/pull/10582
More information about the hotspot-dev
mailing list