[code-reflection] RFR: Proposal for supporting Float4.of for CPU host code and GPU device code [v8]
Juan Fumero
jfumero at openjdk.org
Fri Oct 31 08:27:19 UTC 2025
> This PR adds support for creating float4 objects within the device code:
>
>
> Float4 f = Float4.of(x, y, z ,w)
>
>
> In addition, it provides a CPU implementation for this type a new method to operate in `lanewise`. This operation is CPU only at the moment:
>
>
> Float4 vA ...
> Float4 vB ...
> Float4 vC = vA.linewise(vB, Float::sum);
Juan Fumero has updated the pull request incrementally with one additional commit since the last revision:
[hat][cuda] buildType method for vload/vstore restored
-------------
Changes:
- all: https://git.openjdk.org/babylon/pull/642/files
- new: https://git.openjdk.org/babylon/pull/642/files/aec780a9..dfb349ce
Webrevs:
- full: https://webrevs.openjdk.org/?repo=babylon&pr=642&range=07
- incr: https://webrevs.openjdk.org/?repo=babylon&pr=642&range=06-07
Stats: 14 lines in 3 files changed: 14 ins; 0 del; 0 mod
Patch: https://git.openjdk.org/babylon/pull/642.diff
Fetch: git fetch https://git.openjdk.org/babylon.git pull/642/head:pull/642
PR: https://git.openjdk.org/babylon/pull/642
More information about the babylon-dev
mailing list