mirror of
https://github.com/NVIDIA/cutlass.git
synced 2026-06-10 08:18:42 +00:00
* Add rmsnorm example * Address reviewer comments. (1) use the cute.runtime definition directly. (2) use the nvvm_wrapper's warp reduce directly * Separate out reduce.py * Change copyright notice years