mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-06-28 18:56:59 +00:00
Two new template parameters are introduced: * CShuffleDataType allows to apply multiple Ds before downcasting to ODataType (prevents unexpected precision loss and/or overflow); * CComputeDataType to use scales with int32 AccDataType (int8 gemms);