[lworld+fp16] RFR: 8341003: [lworld+fp16] Benchmarks for various Float16 operations

Jatin Bhateja jbhateja at openjdk.org
Thu Sep 26 08:30:33 UTC 2024


- Adding micro-benchmarks for various Float16 operations.
- Adding similarity search targeting micro-benchmarks.

Please find below the results of performance testing over recently released Intel Xeon6 Granite Rapids:-


Benchmark                                               (vectorDim)   Mode  Cnt      Score   Error   Units
Float16OpsBenchmark.absBenchmark                               1024  thrpt    2  25605.990          ops/ms
Float16OpsBenchmark.addBenchmark                               1024  thrpt    2  19222.468          ops/ms
Float16OpsBenchmark.cosineSimilarityDequantizedFP16            1024  thrpt    2    528.738          ops/ms
Float16OpsBenchmark.cosineSimilarityDoubleRoundingFP16         1024  thrpt    2    660.018          ops/ms
Float16OpsBenchmark.cosineSimilaritySingleRoundingFP16         1024  thrpt    2    659.799          ops/ms
Float16OpsBenchmark.divBenchmark                               1024  thrpt    2   1974.039          ops/ms
Float16OpsBenchmark.euclideanDistanceDequantizedFP16           1024  thrpt    2    743.071          ops/ms
Float16OpsBenchmark.euclideanDistanceFP16                      1024  thrpt    2    682.440          ops/ms
Float16OpsBenchmark.fmaBenchmark                               1024  thrpt    2  14052.422          ops/ms
Float16OpsBenchmark.isFiniteBenchmark                          1024  thrpt    2   3851.234          ops/ms
Float16OpsBenchmark.isInfiniteBenchmark                        1024  thrpt    2   1496.207          ops/ms
Float16OpsBenchmark.isNaNBenchmark                             1024  thrpt    2   2778.822          ops/ms
Float16OpsBenchmark.maxBenchmark                               1024  thrpt    2  19231.326          ops/ms
Float16OpsBenchmark.minBenchmark                               1024  thrpt    2  19257.589          ops/ms
Float16OpsBenchmark.mulBenchmark                               1024  thrpt    2  19236.498          ops/ms
Float16OpsBenchmark.negateBenchmark                            1024  thrpt    2  25938.789          ops/ms
Float16OpsBenchmark.sqrtBenchmark                              1024  thrpt    2   1759.051          ops/ms
Float16OpsBenchmark.subBenchmark                               1024  thrpt    2  19242.967          ops/ms


Best Regrads,
Jatin

-------------

Commit messages:
 - Update Float16OpsBenchmark.java
 - 8341003: [lworld+fp16] Benchmarks for various Float16 operations

Changes: https://git.openjdk.org/valhalla/pull/1254/files
  Webrev: https://webrevs.openjdk.org/?repo=valhalla&pr=1254&range=00
  Issue: https://bugs.openjdk.org/browse/JDK-8341003
  Stats: 235 lines in 1 file changed: 235 ins; 0 del; 0 mod
  Patch: https://git.openjdk.org/valhalla/pull/1254.diff
  Fetch: git fetch https://git.openjdk.org/valhalla.git pull/1254/head:pull/1254

PR: https://git.openjdk.org/valhalla/pull/1254


More information about the valhalla-dev mailing list