Commit Graph

3 Commits

Author SHA1 Message Date
Shaojie WANG
6446894289 Padding for attention: bmm+scale+softmax+bmm kernel (#385)
* add padding algo for bmm+scale+softmax+bmm. Version for verification

* remove verification code

* remove comments

* add padded bmm scale softmax bmm example

* format

* refactor

* add comments for usages of padding bmm+scale+softmax+bmm

Co-authored-by: Chao Liu <lc.roy86@gmail.com>

[ROCm/composable_kernel commit: 45adb736e7]
2022-08-30 11:01:37 -05:00
Anthony Chang
31cacabafa Attention with output permutation (#370)
* comment on specialization for TensorSpecialization::Packed

* gemm_softmax_gemm with output permutation

* scaling

* refactor MatrixPadder; rename to GemmPadder

* remove old sanity check

* restore original gemm_softmax_gemm

* revise comment in gemm_softmax_gemm example

* use GetElementSpaceSize()

* remove extra header

* typo

* remove archaic DeviceOpPtr

[ROCm/composable_kernel commit: e0d8806ca1]
2022-08-23 14:52:56 -05:00
Chao Liu
de15764ff1 use scale (#363)
[ROCm/composable_kernel commit: bac7df8faf]
2022-08-17 10:38:00 -05:00