Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-05-14 10:09:41 +00:00)
* quant
* fix bug
* simple smoothquant after softmax
* update kv-quant
* update stride
* fix fp8-pertoken-kvcache
* update int8/fp8 quant support
---------
Co-authored-by: so <a.com>
Co-authored-by: Po Yen Chen <PoYen.Chen@amd.com>
[ROCm/composable_kernel commit: 6df5fe2ad8]