RFR: 8322768: Optimize non-subword vector compress and expand APIs for AVX2 target. [v5]
Jatin Bhateja
jbhateja at openjdk.org
Tue Jan 16 06:16:22 UTC 2024
On Mon, 15 Jan 2024 09:10:38 GMT, Emanuel Peter <epeter at openjdk.org> wrote:
>> Jatin Bhateja has updated the pull request incrementally with one additional commit since the last revision:
>>
>> Using emulated variable blend E-Core optimized instruction.
>
> src/hotspot/cpu/x86/c2_MacroAssembler_x86.cpp line 5309:
>
>> 5307: assert(bt == T_LONG || bt == T_DOUBLE, "");
>> 5308: vmovmskpd(rtmp, mask, vec_enc);
>> 5309: shlq(rtmp, 5); // for 64 bit rows (4 longs)
>
> Suggestion:
>
> shlq(rtmp, 5); // for 32 bit rows (4 longs)
Each long/double permute lane holds 64 bit value.
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/17261#discussion_r1452967063
More information about the hotspot-compiler-dev
mailing list