[code-reflection] RFR: [hat] Extensions of F16 (API and codegen) [v2]
Juan Fumero
jfumero at openjdk.org
Wed Nov 5 14:29:30 UTC 2025
> - New dialect nodes to do `OpConvert` from `float` to `half` and `half` to `float`: note that conversions differ between CUDA and OpenCL
> - Add Fluent API for F16 operations (similar to `Float4`)
> - Add mixed float-precision operations (`F16` -> `float`)
> - Refactor of `F16` interface to a new file
> - Fix <struct/union> parenthesis in the codegen
> - Initialization of `F16` values using the `float2F16` builtin and `Float16Tofloat` also allowed on the GPU
Juan Fumero has updated the pull request with a new target base due to a merge or a rebase. The pull request now contains 17 commits:
- Merge branch 'code-reflection' into hat/fp16/extension
- [hat] minor change
- [hat] F16 with mixed float types supported for CUDA backend
- [hat] F16 ops with mixed f32 operations
- [hat][f16] Concatenation of F16 operations supported
- Merge branch 'code-reflection' into hat/fp16/extension
- [hat][f16] WIP for local memory
- minor recformating
- patch for matmul in fp16
- [hat] refine F16Phase
- ... and 7 more: https://git.openjdk.org/babylon/compare/6be4588e...84e6de6e
-------------
Changes: https://git.openjdk.org/babylon/pull/663/files
Webrev: https://webrevs.openjdk.org/?repo=babylon&pr=663&range=01
Stats: 1155 lines in 21 files changed: 1004 ins; 72 del; 79 mod
Patch: https://git.openjdk.org/babylon/pull/663.diff
Fetch: git fetch https://git.openjdk.org/babylon.git pull/663/head:pull/663
PR: https://git.openjdk.org/babylon/pull/663
More information about the babylon-dev
mailing list