mirror of
https://github.com/NVIDIA/cutlass.git
synced 2026-06-29 19:07:07 +00:00
* Blockscaled Ragged Contiguous Grouped Gemm for MoEs (#2790)

* Adding blockscaled ragged contiguous grouped gemm for MoEs

* cleaning up the example

* introduction to example improved

---------

Co-authored-by: Shreya Gaur <shgaur@dc2-container-xterm-012.prd.it.nvidia.com>

* v4.3.1 update.

---------

Co-authored-by: Shreya Gaur <48754356+Shreya-gaur@users.noreply.github.com>
Co-authored-by: Shreya Gaur <shgaur@dc2-container-xterm-012.prd.it.nvidia.com>