Files
composable_kernel/include/ck/tensor_operation/gpu/device
Bartłomiej Kocot fd46a01d8b Grouped convolution backward weight special vector size loads (#1772)
* Grouped convolution backward weight special vector size loads

* Instnaces and tests

* Fixes

* Add 7 and 13 special cases

* fix comments

* Fix

* Fix2

* fixes

* fix atomic add bf16
2025-01-10 22:02:30 +08:00
..
2024-03-08 17:11:51 -08:00
2023-08-15 02:25:28 +08:00
2024-03-08 17:11:51 -08:00
2024-06-25 16:37:35 -05:00