Files
composable_kernel/include/ck/tensor_operation/gpu/device/impl
Haocong WANG 5b10dae6a4 Add gemm universal bf16 instances (#1484)
* revert ckprofiler change

* temp save

* Add test and test pass

* test pass

* Fix bug inside rotating buffer when tensor is not packed

* bug fix

* clang format

---------

Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
2024-09-04 20:58:54 -07:00
..
2024-05-10 09:41:39 -07:00
2023-06-19 09:44:22 -05:00