Integrated: 8338126: C2 SuperWord: VectorCastF2HF / vcvtps2ph produces wrong results for vector length 2

Mon Oct 21 15:01:27 UTC 2024

On Fri, 11 Oct 2024 23:27:35 GMT, Sandhya Viswanathan <sviswanathan at openjdk.org> wrote:

> When Float.floatToFloat16 is vectorized with a 2-element vector width due to dependencies, we incorrectly generate a 4-element vcvtps2ph with a memory destination, which stores 8 bytes instead of the desired 4 bytes. This PR fixes the issue by limiting the memory version of the match rule to vectors of 4 elements and above.
> A regression test case is also added.
> 
> Best Regards,
> Sandhya
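
For readers who want to see what "8 bytes instead of 4" means at the Java level, here is a minimal sketch. It is not the regression test from the changeset, and running it is not claimed to trigger the C2 bug; the class and helper names (F2HFPairCheck, storeBytes, convertPair, convertWithSentinels) are hypothetical. It assumes JDK 20 or later, where Float.floatToFloat16 is available. The idea: a 2-lane float16 store should touch only two shorts, so sentinel values placed in the following two slots must survive, whereas a 4-lane store would overwrite them.

```java
import java.util.Arrays;

// Hypothetical illustration class; not part of the JDK change.
final class F2HFPairCheck {
    // Arbitrary marker value for slots that must not be written.
    static final short SENTINEL = (short) 0x5555;

    // Bytes written by a store of `lanes` float16 values (2 bytes per lane).
    static int storeBytes(int lanes) {
        return lanes * Short.BYTES;
    }

    // Converts exactly two floats into out[pos] and out[pos + 1].
    static void convertPair(float[] in, short[] out, int pos) {
        out[pos] = Float.floatToFloat16(in[0]);
        out[pos + 1] = Float.floatToFloat16(in[1]);
    }

    // Fills a 4-element buffer with sentinels, converts one pair into
    // the first two slots, and returns the buffer for inspection.
    static short[] convertWithSentinels(float a, float b) {
        short[] out = new short[4];
        Arrays.fill(out, SENTINEL);
        convertPair(new float[] {a, b}, out, 0);
        return out;
    }

    public static void main(String[] args) {
        short[] out = convertWithSentinels(1.0f, 2.0f);
        System.out.println("2-lane store: " + storeBytes(2) + " bytes, "
                + "4-lane store: " + storeBytes(4) + " bytes");
        boolean intact = out[2] == SENTINEL && out[3] == SENTINEL;
        System.out.println(intact ? "neighbours intact" : "neighbours clobbered");
    }
}
```

A check of this shape only detects the problem if the JIT actually vectorizes the conversion at width 2, which depends on the surrounding loop and on CPU features; the sketch is meant to show the invariant, not to reproduce the failure.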

This pull request has now been integrated.

Changeset: 153ad911
Author:    Sandhya Viswanathan <sviswanathan at openjdk.org>
URL:       https://git.openjdk.org/jdk/commit/153ad911f9fa3389ab92a1acab44526e3f4be4a2
Stats:     31 lines in 3 files changed: 24 ins; 3 del; 4 mod

8338126: C2 SuperWord: VectorCastF2HF / vcvtps2ph produces wrong results for vector length 2

Reviewed-by: thartmann, jbhateja, epeter

-------------

PR: https://git.openjdk.org/jdk/pull/21480