Files
composable_kernel/include/ck/tensor_operation/gpu/device/impl
Bartłomiej Kocot c885afdaae Support access per groups and filter3x3 in grouped conv fwd (#1382)
* Support access per groups and filter3x3 in grouped conv fwd

* Fixes for large cases

* Fixes for large tensors

[ROCm/composable_kernel commit: 82e8a78a3f]
2024-07-12 11:08:42 -07:00
..
2024-05-10 09:41:39 -07:00
2023-06-19 09:44:22 -05:00