Files
composable_kernel/library
Po Yen Chen b6e54f589e Update tuning parameter & compilation options of DeviceGemmXdl<> instance (layout=TT) (#819)
* Enable pipeline v2 opt for layout=TT instance

* Use better thread mapping for reading A tile

* Conditionally enable pipeline v2 opt

* Allow enabling only fp16 gemm instances in profiler

* Fix formatting error

* Fix compilation error if we enable fp32 in profiler

[ROCm/composable_kernel commit: f7cc8c3b03]
2023-08-02 10:32:22 -05:00
..