Files
composable_kernel/include/ck/tensor_operation/gpu/device/impl
Wojciech Laskowski 5e10274417 WMMA support for GEMM reduce (#2823)
Added gemm + reduce instance library for RDNA4. This includes:

- New device implementation running GEMM and reduction kernel
- instances for wmma (xdl parity)
- examples for wmma (xdl parity)
- tests for existing xdl and wmma

[ROCm/composable_kernel commit: b25d4d684a]
2025-09-12 21:36:43 +02:00
..
2024-05-10 09:41:39 -07:00
2023-06-19 09:44:22 -05:00