composable_kernel/include/ck/tensor_operation/gpu/block
Anthony Chang c961ce9226 Hotfix LDS data hazard in fused attention (#360)
* avoid LDS data hazard in gemm_softmax_gemm pipeline

* trivial refactors

* comments

* shrink blockwise gemm v2 thread buffer size

* reclaim A block LDS space during 2nd gemm

* amend

* amend
2022-08-15 12:04:20 -05:00