mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-14 02:02:46 +00:00
* retune & add conflict-free bf16/fp16 c-shuffle gemm instances
amend wrong K1 value in some fp16/bf16 kernel instances
* make gemm cshuffle's timing behavior consistent with all other functions
* clang-format
* retune & add conflict-free fp32 c-shuffle gemm instances
* retune & add conflict-free int8 c-shuffle gemm instances
* update the underlying gridwise gemm of all c-shuffle gemm kernels
* typo
[ROCm/composable_kernel commit: 7db48f9008]