Files
composable_kernel/include/ck/tensor_operation/gpu/device/impl
Bartłomiej Kocot fd46a01d8b Grouped convolution backward weight special vector size loads (#1772)
* Grouped convolution backward weight special vector size loads

* Instances and tests

* Fixes

* Add 7 and 13 special cases

* fix comments

* Fix

* Fix2

* fixes

* fix atomic add bf16
2025-01-10 22:02:30 +08:00