RFR: 8322768: Optimize non-subword vector compress and expand APIs for AVX2 target. [v5]

Emanuel Peter epeter at openjdk.org
Fri Jan 19 07:46:28 UTC 2024


On Thu, 18 Jan 2024 17:06:55 GMT, Jatin Bhateja <jbhateja at openjdk.org> wrote:

>> @jatin-bhateja so why do you shift by 5? I thought 4 longs are 32 bit?
>
> For long/double each permute row is 32 byte in size, so a shift by 5 to compute row address.

Ah right. Maybe we could say `32byte = 4 long = 4 * 64bit`.
Because "64bit row" sounds like the whole row is only 64 bit long. It is actually the cells that are 64bits, not the rows!

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/17261#discussion_r1458509886


More information about the hotspot-compiler-dev mailing list