Files
cutlass/python/cutlass_cppgen
Junkai-Wu 5873443bb6 v4.3.3 update (#2869)
* v4.3.3 update.

* fix print_layout printf format in device code (#2688)

* fix print_layout printf format in device code

* Replace %.*s format specifier with explicit loop
* Remove unused delim variable

The printf format %.*s with dynamic width does not work correctly
in CUDA device code, causing literal %.*s to appear in output.

Fixes #2496

* Update include/cute/util/print_tensor.hpp

Co-authored-by: Cris Cecka <ccecka@users.noreply.github.com>

* Update include/cute/util/print_tensor.hpp

Co-authored-by: Cris Cecka <ccecka@users.noreply.github.com>

---------

Co-authored-by: Cris Cecka <ccecka@users.noreply.github.com>

* Support PDL for SM90 Array TMA GEMM

* Update changelog

---------

Co-authored-by: Amin Sedaghat <35748194+Aminsed@users.noreply.github.com>
Co-authored-by: Cris Cecka <ccecka@users.noreply.github.com>
2025-12-11 00:26:17 -05:00
..
2025-11-20 20:49:44 -05:00
2025-12-11 00:26:17 -05:00