ROCm/composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-04-19 22:39:03 +00:00
8aff45a8af0c868d8c3513dab3335e3b1d3e111f
composable_kernel/include/ck_tile/ops/fused_moe/kernel
carlushuang  8aff45a8af  [CK_TILE] moe sorting optimization : refactor subtoken logic to let more kernel pickup mp kernel (#2327)  2025-06-12 11:44:22 +08:00
* refactor subtoken logic to let more kernel pickup mp kernel
* typo
fused_moegemm_kernel.hpp  [CK_TILE] moe sorting ex kernel to support expert > 128 (#1840)  2025-02-11 17:49:17 +08:00
fused_moegemm_shape.hpp  [CK_TILE] fused-moe first version (#1634)  2024-11-26 11:14:56 +08:00
fused_moegemm_tile_partitioner.hpp  [CK_TILE] fused-moe first version (#1634)  2024-11-26 11:14:56 +08:00
moe_sorting_kernel.hpp  [CK_TILE] moe sorting optimization : refactor subtoken logic to let more kernel pickup mp kernel (#2327)  2025-06-12 11:44:22 +08:00
moe_sorting_problem.hpp  [CK_TILE] optimize moe sorting kernel, boost large context case up to 20x (#2153)  2025-05-06 17:32:07 +08:00