[code-reflection] RFR: Proposal for supporting Float4.of for CPU host code and GPU device code [v8]

Juan Fumero jfumero at openjdk.org
Fri Oct 31 08:30:39 UTC 2025


On Fri, 31 Oct 2025 08:27:19 GMT, Juan Fumero <jfumero at openjdk.org> wrote:

>> This PR adds support for creating float4 objects within the device code:
>> 
>> 
>> Float4 f = Float4.of(x, y, z ,w)
>> 
>> 
>> In addition, it provides a CPU implementation for this type a new method to operate in `lanewise`. This operation is CPU only at the moment:
>> 
>> 
>> Float4 vA ... 
>> Float4 vB ... 
>> Float4 vC = vA.linewise(vB, Float::sum);
>
> Juan Fumero has updated the pull request incrementally with one additional commit since the last revision:
> 
>   [hat][cuda] buildType method for vload/vstore restored

PR ready and passing for both OpenCL and CUDA.

-------------

PR Comment: https://git.openjdk.org/babylon/pull/642#issuecomment-3471830825


More information about the babylon-dev mailing list