composable_kernel/include/ck_tile/ops
Ali Nouri 91317bdfe9 Add atomic-free MOE GEMM implementation
- Add FusedMoeGemmTilePartitioner_NoAtomic: Forces single workgroup per expert
- Add FusedMoeGemmPipelineFlatmmPolicy_NoAtomic: Fixes alignment consistency
- Update API to use no-atomic approach when intermediate_size <= Block_N0

Eliminates atomic operations by ensuring that each workgroup handles an
expert's complete computation, with no K-dimension splitting.
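
The dispatch rule in the third bullet can be sketched as below. This is a minimal illustration, not the ck_tile API: `MoeGemmPath`, `select_moe_gemm_path`, and `intermediate_tiles` are hypothetical names, and the reading that a single Block_N0 tile covering the whole intermediate dimension is what makes the no-atomic path possible is an assumption drawn from this commit message.

```cpp
#include <cstdint>

// Hypothetical names for illustration only; these are NOT ck_tile identifiers.
enum class MoeGemmPath { Atomic, NoAtomic };

// Number of Block_N0-wide tiles needed to cover the intermediate dimension
// (ceiling division).
constexpr std::int64_t intermediate_tiles(std::int64_t intermediate_size,
                                          std::int64_t block_n0)
{
    return (intermediate_size + block_n0 - 1) / block_n0;
}

// Assumed rule from the commit message: take the no-atomic path when
// intermediate_size <= Block_N0, i.e. when one tile spans the whole
// intermediate dimension and the reduction is not split across workgroups.
constexpr MoeGemmPath select_moe_gemm_path(std::int64_t intermediate_size,
                                           std::int64_t block_n0)
{
    return intermediate_size <= block_n0 ? MoeGemmPath::NoAtomic
                                         : MoeGemmPath::Atomic;
}

// Compile-time checks with made-up sizes.
static_assert(select_moe_gemm_path(512, 512) == MoeGemmPath::NoAtomic);
static_assert(select_moe_gemm_path(1024, 512) == MoeGemmPath::Atomic);
static_assert(intermediate_tiles(512, 512) == 1);
```

When `intermediate_tiles` returns 1, no second workgroup holds a partial sum for the same expert, so under the stated assumption results can be written with plain stores instead of atomic adds.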
2025-09-26 22:50:37 +00:00