[vectorIntrinsics] RFR: Improve mask reduction operations on AVX [v3]

Sandhya Viswanathan sviswanathan at openjdk.java.net
Wed Nov 17 18:41:11 UTC 2021


On Tue, 9 Nov 2021 16:30:20 GMT, Mai Đặng Quân Anh <duke at openjdk.java.net> wrote:

>> Mai Đặng Quân Anh has updated the pull request incrementally with two additional commits since the last revision:
>> 
>>  - support for non-bmi, some refinement
>>  - restore VectorStoreMaskNode, move logic to backend
>
> The latest change reverts the change in the mid-end and lets the x86 back-end perform the elision when matching. Some minor changes in the C2_MacroAssembler were made to improve the code emitted by first true operations, as well as to reduce the width of instructions that operate on index calculations from 64 bits to 32 bits.
> Thank you very much.

@merykitty Please send this PR directly versus the openjdk/jdk mainline. This is a good optimization to have in JDK 18.

-------------

PR: https://git.openjdk.java.net/panama-vector/pull/158


More information about the panama-dev mailing list