Files
composable_kernel/include
Anton Gorenko 5ea73e4075 Impove precision of CShuffle with scales or multi D
Two new template parameters are introduced:
 * CShuffleDataType allows to apply multiple Ds before downcasting to
 ODataType (prevents unexpected precision loss and/or overflow);
 * CComputeDataType to use scales with int32 AccDataType (int8 gemms);
2026-06-19 15:34:02 +05:00
..