[code-reflection] RFR: [hat][cuda] 2D/3D NDrange supported for CUDA
Juan Fumero
duke at openjdk.org
Wed Jul 30 12:27:45 UTC 2025
This PR extends the 2D and 3D NDRange support developed for the OpenCL backend/runtime to CUDA devices.
In addition, it improves the thread-block selection.
The heuristic for thread-block selection here is quite simple (a static decision based on the number of dimensions), but we can improve it in the future. My goal with this PR is to showcase 2D and 3D ranges for CUDA-compatible devices.
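For readers unfamiliar with this kind of heuristic, here is a minimal sketch of what a static, dimension-based thread-block selection could look like. It is not the code from this PR: the class name BlockSelection, the methods blockFor and gridFor, and the block shapes (256x1x1, 16x16x1, 8x8x4) are all assumptions chosen for illustration. The grid size is derived with a ceiling division so that the blocks cover the whole global range.

```java
import java.util.Arrays;

// Illustrative only: names and block shapes are hypothetical, not taken from the PR.
final class BlockSelection {

    // Pick a fixed block shape from the number of dimensions (static decision).
    static int[] blockFor(int dims) {
        return switch (dims) {
            case 1 -> new int[] {256, 1, 1};   // assumed 1D shape
            case 2 -> new int[] {16, 16, 1};   // assumed 2D shape
            case 3 -> new int[] {8, 8, 4};     // assumed 3D shape
            default -> throw new IllegalArgumentException("dims must be 1, 2 or 3: " + dims);
        };
    }

    // Number of blocks per dimension: ceil(global / block), with unused dimensions left at 1.
    static int[] gridFor(int[] global, int[] block) {
        if (global.length < 1 || global.length > 3) {
            throw new IllegalArgumentException("global range must have 1 to 3 dimensions");
        }
        int[] grid = {1, 1, 1};
        for (int i = 0; i < global.length; i++) {
            grid[i] = (global[i] + block[i] - 1) / block[i];
        }
        return grid;
    }

    public static void main(String[] args) {
        int[] global = {1024, 1024};            // e.g. a 2D range for a 1024x1024 matrix
        int[] block = blockFor(global.length);
        int[] grid = gridFor(global, block);
        // prints: block=[16, 16, 1] grid=[64, 64, 1]
        System.out.println("block=" + Arrays.toString(block) + " grid=" + Arrays.toString(grid));
    }
}
```

Because the grid is rounded up, a global size that is not a multiple of the block size launches extra threads, so a kernel using such a scheme would need a bounds check on its global index.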
How to test?
java @hat/run ffi-cuda matmul 1D
java @hat/run ffi-cuda matmul 2D
# Old examples should continue to work
java @hat/run ffi-cuda squares
-------------
Commit messages:
- [hat][cuda] 2D/3D NDrange supported for CUDA
Changes: https://git.openjdk.org/babylon/pull/504/files
Webrev: https://webrevs.openjdk.org/?repo=babylon&pr=504&range=00
Stats: 63 lines in 3 files changed: 31 ins; 17 del; 15 mod
Patch: https://git.openjdk.org/babylon/pull/504.diff
Fetch: git fetch https://git.openjdk.org/babylon.git pull/504/head:pull/504
PR: https://git.openjdk.org/babylon/pull/504