mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-11 17:00:18 +00:00
* Use better ThreadClusterLengths to speed up * Update B tile reading pattern for layout=NN instance
* Use better ThreadClusterLengths to speed up * Update B tile reading pattern for layout=NN instance