RFR: 8342095: Add autovectorizer support for subword vector casts [v3]
Emanuel Peter
epeter at openjdk.org
Mon Feb 17 12:06:17 UTC 2025
On Sun, 9 Feb 2025 05:59:37 GMT, Jasmine Karthikeyan <jkarthikeyan at openjdk.org> wrote:
>> Jasmine Karthikeyan has updated the pull request incrementally with one additional commit since the last revision:
>>
>> Fix some tests that now vectorize
>
> I also updated the benchmark, and got these results:
>
>                                               Baseline                Patch
> Benchmark                  (SIZE)  Mode  Cnt    Score   Error  Units   Score   Error  Units  Improvement
> VectorSubword.byteToInt      1024  avgt   12  185.700 ± 0.798  ns/op  37.427 ± 0.276  ns/op  (4.96x)
> VectorSubword.byteToShort    1024  avgt   12  240.737 ± 1.087  ns/op  23.094 ± 0.502  ns/op  (10.42x)
> VectorSubword.intToByte      1024  avgt   12  181.680 ± 0.553  ns/op  49.873 ± 1.613  ns/op  (3.64x)
> VectorSubword.intToShort     1024  avgt   12  176.256 ± 1.414  ns/op  43.933 ± 4.310  ns/op  (4.01x)
> VectorSubword.shortToByte    1024  avgt   12  245.600 ± 6.217  ns/op  28.426 ± 0.649  ns/op  (8.64x)
> VectorSubword.shortToInt     1024  avgt   12  178.364 ± 2.921  ns/op  34.140 ± 0.229  ns/op  (5.22x)
@jaskarth just ping me whenever I should have a look again!
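
For context, the benchmark names in the quoted table suggest simple loops that cast between byte, short and int arrays, which is the pattern the autovectorizer change targets. A minimal sketch of such loops follows. The class name, method bodies and sample values are assumptions inferred from the benchmark names, not code copied from the PR or from the VectorSubword benchmark.

```java
import java.util.Arrays;

// Hypothetical sketch: loop shapes assumed from the benchmark names
// (byteToInt, intToShort, shortToByte); not taken from the PR itself.
final class SubwordCastSketch {

    // Widening cast: each byte is sign-extended to an int.
    static void byteToInt(byte[] src, int[] dst) {
        for (int i = 0; i < src.length; i++) {
            dst[i] = src[i];
        }
    }

    // Narrowing cast: keeps only the low 16 bits of each int.
    static void intToShort(int[] src, short[] dst) {
        for (int i = 0; i < src.length; i++) {
            dst[i] = (short) src[i];
        }
    }

    // Narrowing cast: keeps only the low 8 bits of each short.
    static void shortToByte(short[] src, byte[] dst) {
        for (int i = 0; i < src.length; i++) {
            dst[i] = (byte) src[i];
        }
    }

    public static void main(String[] args) {
        byte[] bytes = {1, -2, 127, -128};
        int[] ints = new int[bytes.length];
        byteToInt(bytes, ints);
        System.out.println(Arrays.toString(ints));

        short[] shorts = new short[ints.length];
        intToShort(ints, shorts);
        System.out.println(Arrays.toString(shorts));
    }
}
```

Whether C2 actually vectorizes loops of this shape depends on the JDK build and the target CPU, so the sketch only illustrates the scalar semantics that a vectorized version has to preserve.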
-------------
PR Comment: https://git.openjdk.org/jdk/pull/23413#issuecomment-2662917708