RFR: 8318650: Optimized subword gather for x86 targets. [v3]

Jatin Bhateja jbhateja at openjdk.org
Tue Nov 7 01:13:34 UTC 2023


On Mon, 6 Nov 2023 18:33:46 GMT, Sandhya Viswanathan <sviswanathan at openjdk.org> wrote:

> This is not a masked operation, so every lane of dst will be written through pinsrw/pinsrb. A vpxor beforehand is not required.

The xor here clears the intermediate vector after each iteration; that intermediate vector is eventually ORed with the destination.

Check out line https://github.com/openjdk/jdk/pull/16354/files/86783403c453d329e33d94f787a5709ec35f7099#diff-318d0e76b9a97e8cf8936be1de34e52735c4d947a77cac38babbbf9a081d16fcR1644

https://github.com/openjdk/jdk/pull/16354/files/86783403c453d329e33d94f787a5709ec35f7099#diff-318d0e76b9a97e8cf8936be1de34e52735c4d947a77cac38babbbf9a081d16fcR1651
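To illustrate why clearing matters when the intermediate vector is ORed into the destination, here is a rough scalar model in Python. It is only a sketch, not the code in the PR: the lane count, the chunk size, the in-place rotation of the intermediate vector, and the names `gather_shorts`, `LANES`, `CHUNK` and `clear_tmp` are all assumptions made for illustration.

```python
LANES = 16   # assumed: 16-bit lanes in a 256-bit vector
CHUNK = 4    # assumed: lanes filled per loop iteration


def gather_shorts(base, indices, clear_tmp=True):
    """Scalar model of a chunked subword gather (hypothetical, not HotSpot code)."""
    dst = [0] * LANES
    tmp = [0] * LANES
    for chunk in range(LANES // CHUNK):
        if clear_tmp:
            # Models the xor that zeroes the intermediate vector.
            tmp = [0] * LANES
        # Models pinsrw-style inserts: only the low CHUNK lanes are written.
        for i in range(CHUNK):
            tmp[i] = base[indices[chunk * CHUNK + i]] & 0xFFFF
        # Move the freshly filled lanes to their final position, in place.
        shift = chunk * CHUNK
        tmp = [tmp[(i - shift) % LANES] for i in range(LANES)]
        # Merge into the destination with OR.
        dst = [d | t for d, t in zip(dst, tmp)]
    return dst


if __name__ == "__main__":
    base = [1 << i for i in range(16)]
    indices = list(reversed(range(16)))
    print(gather_shorts(base, indices))
    print(gather_shorts(base, indices, clear_tmp=False))
```

In this model every low lane is overwritten on each iteration, yet skipping the clear still leaves repositioned lanes from earlier iterations in the intermediate vector, and the OR then merges them into the wrong destination lanes.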

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/16354#discussion_r1384225640


More information about the hotspot-compiler-dev mailing list