[code-reflection] RFR: Proposal for supporting Float4.of for CPU host code and GPU device code [v8]
Juan Fumero
jfumero at openjdk.org
Fri Oct 31 08:30:39 UTC 2025
On Fri, 31 Oct 2025 08:27:19 GMT, Juan Fumero <jfumero at openjdk.org> wrote:
>> This PR adds support for creating float4 objects within the device code:
>>
>>
>> Float4 f = Float4.of(x, y, z ,w)
>>
>>
>> In addition, it provides a CPU implementation for this type a new method to operate in `lanewise`. This operation is CPU only at the moment:
>>
>>
>> Float4 vA ...
>> Float4 vB ...
>> Float4 vC = vA.linewise(vB, Float::sum);
>
> Juan Fumero has updated the pull request incrementally with one additional commit since the last revision:
>
> [hat][cuda] buildType method for vload/vstore restored
PR ready and passing for both OpenCL and CUDA.
-------------
PR Comment: https://git.openjdk.org/babylon/pull/642#issuecomment-3471830825
More information about the babylon-dev
mailing list