cutlass

mirror of https://github.com/NVIDIA/cutlass.git synced 2026-06-28 18:37:05 +00:00

Files

Blake Ledden 087c84df83 docs: Fix float16 documentation in elementwise_add notebook (#2949 ) (#3047 )

The notebook uses float16 tensors but the vectorized kernel documentation
incorrectly describes elements as 32-bit and uses 4-element vectorization.
Updated to correctly state 16-bit elements with 8-element vectorization
for proper 128-bit loads/stores.

Signed-off-by: Blake Ledden <bledden@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-12 10:29:46 +08:00

CuTeDSL

docs: Fix float16 documentation in elementwise_add notebook (#2949 ) (#3047 )

2026-03-12 10:29:46 +08:00

deprecated

v4.3.5 update. (#2934 )

2026-01-08 15:02:56 -05:00