damien-lejeune
46c10c316d
Update include path to break the remod's cyclic dep issue (#2978)
* Update include path to break the cyclic dep issue
* Use ck_tile::permute_vectors_i4x4_b in tile engine
---------
Co-authored-by: Damien Lejeune <damien.lejeune@amd.com>
Co-authored-by: Adam Osewski <19374865+aosewski@users.noreply.github.com>
2025-10-13 13:24:47 +02:00
Bartłomiej Kocot
4212bbc170
[CK Tile] Grouped convolution backward data (#2652)
* base working version for single grouped conv bwd data
* Fix 2d descriptor
* fix groups
* Add 3d support
* fixes
* fixes
* fixes
---------
Co-authored-by: Jakub Piasecki <jakpia21@gmail.com>
2025-08-20 05:29:57 -07:00
jakpiase
6681593864
[CK_TILE] Grouped Convolution Backward Weight Kernel (#2357)
* [CK TILE] Grouped Convolution Forward Kernel
* custom vector size
* fixes
* refactor
* resolved conflicts
* rebase fixes
* fixes
* tmp
* add working support for splitk
* minor fix
* fixes
* fixes
* minor fix
* small fix
* Split K and preprocessing fixes
---------
Co-authored-by: Bartlomiej Kocot <barkocot@amd.com>
2025-07-24 10:41:35 +02:00
Bartłomiej Kocot
cebdee4d9e
[CK TILE] Grouped Convolution Forward Kernel (#2188)
* [CK TILE] Grouped Convolution Forward Kernel
* custom vector size
* fixes
* refactor
* rebase fixes
* fixes
* fixes
2025-06-20 15:44:36 -07:00