Files
composable_kernel/include/ck/tensor_operation/gpu
huaiguxu 0ac91713ae Huaiguxu/moe fp8 pertoken scale fix (#2391)
* Fix pertoken_scale a_scale dimension

* clang-format

* Fix moe_gemm2_fp8 perTokenScale reference and example.

[ROCm/composable_kernel commit: e1c5172fdb]
2025-06-27 10:24:34 +08:00