RFR: 8282711: Accelerate Math.signum function for AVX and AVX512 target. [v2]
Quan Anh Mai
duke at openjdk.java.net
Tue Mar 8 14:13:01 UTC 2022
On Tue, 8 Mar 2022 13:11:24 GMT, Jatin Bhateja <jbhateja at openjdk.org> wrote:
>> src/hotspot/cpu/x86/c2_MacroAssembler_x86.cpp line 4334:
>>
>>> 4332: vsubpd(xtmp3, zero, one, vec_enc);
>>> 4333: vblendvpd(dst, xtmp3, one, xtmp2, vec_enc);
>>> 4334: vblendvpd(dst, dst, src, xtmp1, vec_enc);
>>
>> The same applies here, suggestion:
>>
>> vsubpd(dst, zero, one, vec_enc);
>> vblendvpd(dst, src, one, dst, vec_enc);
>> vcmppd(xtmp1, src, zero, Assembler::EQ_UQ, vec_enc);
>> vblendvpd(dst, xtmp1, dst, zero, vec_enc);
>>
>> Thanks.
>
> Same responses as above.
I don't fully understand what you mean by "same as above". The logic here is using `src` to blend `one` and `minus_one` so that every positive will get `one` and every negative will get `minus_one`, the others are `zero` and `Nan` will make the unordered comparison with zero return true, thus the final blend will result in the correct output.
Thanks.
-------------
PR: https://git.openjdk.java.net/jdk/pull/7717
More information about the hotspot-compiler-dev
mailing list