Files
composable_kernel/include/ck/tensor_operation/gpu/device
Anthony Chang 9287b7c6b3 Grouped batched attention + permute (#412)
* grouped attn without batch validates; now move toward grouped batched attn

* grouped batched attention

* working

* remove debug logging

clean up

clean up

* reintroduce g_ prefix back to host tensor variables

* format

* rename file

* restore old file

* rename

* consolidate padded/non-padded attention example

* harmonize padding specialization in attn examples
2022-09-19 16:09:44 -05:00
..
2022-09-19 11:25:28 -05:00
2022-06-24 23:32:43 -05:00
2022-07-02 09:15:38 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00