Aviral Goel
e044db7202
feat(precommit-hooks): add check for correct copyright header ( #3302 )
...
* chore(copyright): update copyright header for left files
* feat(copyright): add copyright check to precommit hooks
* chore(copyright): update copyright header for include/ck_tile directory
* chore(copyright): update copyright header for example directory
* chore(copyright): update copyright header for .github directory
* refactor: copyright_check script with better if else handling
* chore(copyright): update compyright header for remaining files
* feat: add script to automate copyright addition
[ROCm/composable_kernel commit: 6d25525adc ]
2025-12-10 22:50:43 -08:00
Ville Pietilä
fbf53fb970
[CK_TILE] Split-K autodeduction ( #3351 )
...
* First version of split-K autodeduction.
* Fix circular dependency and kernel construction.
* Fix tolerance calculation for bwd weight example.
* Simplify kernel construction.
* Fix kernel launching bug for split-K autodeduce.
* Add split-K autodeduction support for the two stage example.
* Fix a corner case.
* Fix clang-format.
* Fix clang-format for inc files.
* Add missing header.
* Prevent too large split-K values.
* Fix formatting.
* Add unit tests for IsSupportedArgument in grouped bwd conv.
* clang-format.
* Fix merge conflicts.
* Address feedback from code review.
* clang-format
* Fix new tests after merge.
---------
Co-authored-by: Ville Pietilä <>
[ROCm/composable_kernel commit: fc22320d78 ]
2025-12-10 09:30:30 +02:00
damien-lejeune
b904c41e44
Update include path to break the remod's cyclic dep issue ( #2978 )
...
* Update include path to break the cyclic dep issue
* Use ck_tile::permute_vectors_i4x4_b in tile engine
---------
Co-authored-by: Damien Lejeune <damien.lejeune@amd.com >
Co-authored-by: Adam Osewski <19374865+aosewski@users.noreply.github.com >
[ROCm/composable_kernel commit: 46c10c316d ]
2025-10-13 13:24:47 +02:00
Bartłomiej Kocot
092ba92f26
[CK Tile] Grouped convolution backward data ( #2652 )
...
* base working version for single groupped conv bwd data
* Fix 2d descriptor
* fix groups
* Add 3d support
* fixes
* fixes
* fixes
---------
Co-authored-by: Jakub Piasecki <jakpia21@gmail.com >
[ROCm/composable_kernel commit: 4212bbc170 ]
2025-08-20 05:29:57 -07:00
jakpiase
91f32588d1
[CK_TILE] Grouped Convolution Backward Weight Kernel ( #2357 )
...
* [CK TILE] Grouped Convolution Forward Kernel
* custom vector size
* fixes
* refactor
* resolved conflicts
* rebase fixes
* fixes
* tmp
* add working support for splitk
* minor fix
* fixes
* fixes
* minor fix
* small fix
* Split K and preprocessing fixes
---------
Co-authored-by: Bartlomiej Kocot <barkocot@amd.com >
[ROCm/composable_kernel commit: 6681593864 ]
2025-07-24 10:41:35 +02:00
Bartłomiej Kocot
29cfe38b42
[CK TILE] Grouped Convolution Forward Kernel ( #2188 )
...
* [CK TILE] Grouped Convolution Forward Kernel
* custom vector size
* fixes
* refactor
* rebase fixes
* fixes
* fixes
[ROCm/composable_kernel commit: cebdee4d9e ]
2025-06-20 15:44:36 -07:00