composable_kernel/include/ck/tensor_operation/gpu/device/impl
Johannes Graner 323e014799 [CK Grouped Gemm] Fix workspace stride in two stage kernel (#3412)
* Use correct workspace stride

* Use correct stride in elementwise kernel

* Fix test by adding padder

* No UTF-8 in comments

* Remove unnecessary changes

* Remove more unnecessary changes

* Use non-padded strides for workspace

* Disable two stage kernel for RRR+MNKPadding+kbatch>2

Partially fixes AICK-441
2025-12-19 10:04:48 +01:00