Files
composable_kernel/include/ck/tensor_operation/gpu/device/impl
Enrico Degregori 7414a0f4d4 Wmma support for gemm_reduce (#3145)
* Initial implementation GEMM+Reduce:

 - device struct
 - epilogue struct

* Fix tests, improve profiler and add initial instances

* Add instances

* Fix compilation error

* Address review comments

* Fix logging

---------

Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
2025-11-12 11:23:54 -08:00
..
2024-05-10 09:41:39 -07:00
2023-06-19 09:44:22 -05:00