Files
composable_kernel/include/ck/tensor_operation/gpu/device
zjing14 7959dad566 Grouped Gemm device with multiD grid (#319)
* replace gridwise_v2r3 with multiD

* adjust parameters

* add instances

* fixed test_grouped_gemm

* fix standalone softmax race condition around blockwise reduction

* fixed ci

* fixed comment: remove redundant workspace

* use instanceFactory

* add test layout

* add empty Ds

* add bias example

* use array

* sperate examples

Co-authored-by: Anthony Chang <ac.chang@outlook.com>
2022-07-21 10:07:01 -05:00
..
2022-06-24 23:32:43 -05:00
2022-07-02 09:15:38 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-07-02 09:15:38 -05:00
2022-06-24 23:32:43 -05:00