[lworld+fp16] RFR: 8341414: Add support for FP16 conversion routines [v2]

Bhavana Kilambi bkilambi at openjdk.org
Thu Oct 31 13:50:40 UTC 2024


> This patch adds intrinsic support for the FP16 conversion routines to int, long and double, together with the corresponding aarch64 backend support. Both scalar and vector versions of these conversions are implemented.
> 
> Performance numbers on an aarch64 machine with SVE support:
> 
> 
> Benchmark                         (vectorDim)   Gain
> Float16OpsBenchmark.fp16ToDouble  1024          18.23
> Float16OpsBenchmark.fp16ToInt     1024          1.93
> Float16OpsBenchmark.fp16ToLong    1024          3.95
> 
> 
> The Gain column is the ratio of the throughput with this patch to the throughput with the intrinsics disabled (which generates FP32 arithmetic).
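
For readers unfamiliar with the conversions being measured, here is a minimal sketch of what an FP16 to double/int/long conversion looks like when it goes through FP32, which is presumably close to the non-intrinsic baseline described above. It assumes JDK 20 or later for `Float.float16ToFloat` and `Float.floatToFloat16`. The class and method names are hypothetical and are not the Float16 API touched by this PR.

```java
// Illustrative only: hypothetical helpers, not the Float16 class from this PR.
// Requires JDK 20+ for Float.float16ToFloat / Float.floatToFloat16.
public class Fp16ConversionSketch {

    // Widen the raw binary16 bits to FP32, then to FP64. Both steps are exact.
    static double fp16ToDouble(short bits) {
        return (double) Float.float16ToFloat(bits);
    }

    // Widen to FP32, then apply Java's float-to-int cast (rounds toward zero,
    // NaN becomes 0, out-of-range values saturate).
    static int fp16ToInt(short bits) {
        return (int) Float.float16ToFloat(bits);
    }

    // Same as above, with a 64-bit integer result.
    static long fp16ToLong(short bits) {
        return (long) Float.float16ToFloat(bits);
    }

    public static void main(String[] args) {
        // Build a few FP16 inputs from float values that binary16 represents exactly.
        short[] src = {
            Float.floatToFloat16(1.5f),
            Float.floatToFloat16(-2.75f),
            Float.floatToFloat16(1024.0f)
        };
        // Scalar loop over an array; a vectorizing compiler may turn a loop of
        // this shape into vector conversion instructions.
        for (short s : src) {
            System.out.println(fp16ToDouble(s) + " " + fp16ToInt(s) + " " + fp16ToLong(s));
        }
    }
}
```

With intrinsics, a backend can instead emit a direct FP16 conversion instruction and skip the intermediate FP32 step, which is one plausible source of the gains reported in the table.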

Bhavana Kilambi has updated the pull request incrementally with one additional commit since the last revision:

  Remove intrinsification of conversion methods in Float16

-------------

Changes:
  - all: https://git.openjdk.org/valhalla/pull/1283/files
  - new: https://git.openjdk.org/valhalla/pull/1283/files/fe9e31ce..0809cedc

Webrevs:
 - full: https://webrevs.openjdk.org/?repo=valhalla&pr=1283&range=01
 - incr: https://webrevs.openjdk.org/?repo=valhalla&pr=1283&range=00-01

  Stats: 307 lines in 10 files changed: 188 ins; 110 del; 9 mod
  Patch: https://git.openjdk.org/valhalla/pull/1283.diff
  Fetch: git fetch https://git.openjdk.org/valhalla.git pull/1283/head:pull/1283

PR: https://git.openjdk.org/valhalla/pull/1283

