Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-05-20 12:59:49 +00:00)
ck moe gemm implement (#1936)
* port all moe changes from ck_moe_gemm branch
* refine codes in the pr
* fix tail odd
* fix clang format
* fix clang format2
* make hot loop scheduler compatible with 16x16 and 32x32
* clang format
* fix per token quant
* rename moe example
* clang format
---------
Co-authored-by: coderfeli <coderfeli@163.com>
[ROCm/composable_kernel commit: 3786e16375]
include/ck/tensor_operation/gpu/grid/gridwise_moe_gemm.hpp (normal file, 2144 lines changed)
File diff suppressed because it is too large