mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-14 10:09:41 +00:00
* avoid LDS data hazard in gemm_softmax_gemm pipeline
* trivial refactors
* comments
* shrink blockwise gemm v2 thread buffer size
* reclaim A block lds space when during 2nd gemm
* amend
* amend
[ROCm/composable_kernel commit: c961ce9226]