[vectorIntrinsics] RFR: RFC: Vector API masking support proposal for Arm SVE [v3]

Xiaohong Gong xgong at openjdk.java.net
Fri Mar 19 03:49:52 UTC 2021


On Mon, 15 Mar 2021 03:55:34 GMT, Xiaohong Gong <xgong at openjdk.org> wrote:

>> src/hotspot/cpu/aarch64/aarch64_sve.ad line 1301:
>> 
>>> 1299:   ins_encode %{
>>> 1300:     // Load mask values which are boolean type, and extend them to the
>>> 1301:     // expected vector element type.
>> 
>> Isn't this same as existing VectorLoadMaskNode ?
>
> I'm not sure whether this can benefit AVX-512 masking load. It loads the values from boolean memory and extends the boolean values to relative other basic types (short,int,long). Different with AVX-512, SVE predicate register controls the vector activity depending on the element type, i.e. it uses 1-bit to control a byte lane, and 2-bits for a short lane, and 4-bits for an int lane, and so on. The type extending/narrowing between real basic type and boolean is always needed. As SVE provides a single instruction to implement the `load+extend` function, we don't need to load values and extend them with external instructions. So that's what the new added node used to do. Maybe a better name for it is `LoadVectorExtendNode`.

This new node is performance friendly to SVE. For SVE, we changed the original `VectorLoadMask (LoadVector mem)` to `VectorToMask (LoadVectorMask mem)`. And the same logic to mask store.

-------------

PR: https://git.openjdk.java.net/panama-vector/pull/40


More information about the panama-dev mailing list