mirror of
https://github.com/NVIDIA/cutlass.git
synced 2026-05-24 06:44:30 +00:00
Fix typo in CHANGELOG.md for mixed precision
@@ -7,7 +7,7 @@
 ### CuTe DSL
 * New features
 - New Block API `block_copy()` to simplify TMA and S2T copy. Users can ignore detail about multicast and 2CTA partition for TMA by `block_copy()` and need not to invoke `tma_partition()`. And users can remove bulk of S2T initialization to simplify S2T copy.
-- MXF8F6F4 mixed precision supoort
+- MXF8F6F4 mixed precision support
 - BlockScaled MMA now supports MXF8*MXF4 or MXF8*MXF6
 - Block Scaled MMA for SM120 now works on Spark
 - EFC broadcast semantics support