RFR: 8318650: Optimized subword gather for x86 targets. [v3]
Jatin Bhateja
jbhateja at openjdk.org
Tue Nov 7 01:13:34 UTC 2023
On Mon, 6 Nov 2023 18:33:46 GMT, Sandhya Viswanathan <sviswanathan at openjdk.org> wrote:
> This is not a masked operation so every lane of dst will be written through pinsrw/pinsrb. A vpxor before is not required.
The xor here clears the intermediate vector after each iteration; that intermediate vector is eventually ORed with the destination.
Check out these lines:
https://github.com/openjdk/jdk/pull/16354/files/86783403c453d329e33d94f787a5709ec35f7099#diff-318d0e76b9a97e8cf8936be1de34e52735c4d947a77cac38babbbf9a081d16fcR1644
https://github.com/openjdk/jdk/pull/16354/files/86783403c453d329e33d94f787a5709ec35f7099#diff-318d0e76b9a97e8cf8936be1de34e52735c4d947a77cac38babbbf9a081d16fcR1651
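
To illustrate the point, here is a rough scalar model in Java. It is not the HotSpot code: the class name, the LANES and CHUNK sizes, and the rotate step (a stand-in for whatever moves each chunk to its final position) are all assumptions made for this sketch. It only shows why an intermediate vector that is ORed into the destination has to be cleared between iterations, even though the insert step itself writes every lane of the chunk.

```java
import java.util.Arrays;

// Hypothetical scalar model; names and sizes are illustrative assumptions.
final class SubwordGatherModel {
    static final int LANES = 16; // lanes in the modeled vector (assumption)
    static final int CHUNK = 4;  // lanes gathered per loop iteration (assumption)

    // Moves every lane toward higher positions by `shift`, wrapping around.
    // Stands in for the step that places a chunk at its final position.
    static byte[] rotate(byte[] v, int shift) {
        byte[] out = new byte[v.length];
        for (int i = 0; i < v.length; i++) {
            out[(i + shift) % v.length] = v[i];
        }
        return out;
    }

    // Gathers src[idx[i]] into lane i, one chunk per iteration.
    // clearTmp plays the role of the xor on the intermediate vector.
    static byte[] gather(byte[] src, int[] idx, boolean clearTmp) {
        byte[] dst = new byte[LANES];
        byte[] tmp = new byte[LANES];
        for (int base = 0; base < LANES; base += CHUNK) {
            if (clearTmp) {
                Arrays.fill(tmp, (byte) 0); // clear the intermediate vector
            }
            for (int j = 0; j < CHUNK; j++) {
                tmp[j] = src[idx[base + j]]; // per-lane insert into the low lanes
            }
            tmp = rotate(tmp, base); // place the chunk at its final position
            for (int i = 0; i < LANES; i++) {
                dst[i] |= tmp[i]; // accumulate into the destination with OR
            }
        }
        return dst;
    }
}
```

In this model, calling gather with clearTmp set to false leaves the previous iteration's chunk in the upper lanes of tmp; the next rotate moves those stale lanes to other positions and the OR merges them into dst. With clearTmp set to true, each iteration contributes only its own chunk.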
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/16354#discussion_r1384225640