RFR: 8283667: [vectorapi] Vectorization for masked load with IOOBE with predicate feature [v2]
Xiaohong Gong
xgong at openjdk.java.net
Thu May 5 02:17:20 UTC 2022
On Thu, 31 Mar 2022 02:15:26 GMT, Quan Anh Mai <duke at openjdk.java.net> wrote:
>> I'm afraid not. "Load + Blend" makes the elements of unmasked lanes to be `0`. Then a full store may change the values in the unmasked memory to be 0, which is different with the mask store API definition.
>
> The blend should be with the intended-to-store vector, so that masked lanes contain the need-to-store elements and unmasked lanes contain the loaded elements, which would be stored back, which results in unchanged values.
Hi @merykitty @jatin-bhateja , could you please help to take a review at the similar store masked PR https://github.com/openjdk/jdk/pull/8544 ? Any feedback is welcome! Thanks so much!
-------------
PR: https://git.openjdk.java.net/jdk/pull/8035
More information about the hotspot-compiler-dev
mailing list