ck moe gemm implement (#1936)

* port all moe changes from ck_moe_gemm branch

* refine codes in the pr

* fix tail odd

* fix clang format

* fix clang format2

* make hot loop scheduler compatible with 16x16 and 32x32

* clang format

* fix per token quant

* rename moe example

* clang format

---------

Co-authored-by: coderfeli <coderfeli@163.com>
This commit is contained in:
feli
2025-03-05 15:56:55 +08:00
committed by GitHub
parent c95bda93ba
commit 3786e16375
13 changed files with 6144 additions and 7 deletions

File diff suppressed because it is too large Load Diff