mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-01 20:21:23 +00:00
[CK][CK TILE] Improve oob check ## Motivation Improve OOB checks. Remove permutes which have been generated by thread buffer zero clear. at now in assembly there is only condmask instead of permute + condmask. Change number of KPack for generated instances ## Technical Details Remove permute instructions from assembly ## Test Plan test_grouped_convnd_fwd_tile ## Test Result passed ## Submission Checklist - [x] Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.
Grouped Convolution Tile Instances Generator
CK Tile Convolution instances implemented via builder and generated via python script. It is integrated with tests and ckProfiler This functionality will be refactored and moved under the Tile Engine. At now to speed up development and provide tests for CK Tile Convolution it has been implemented under experimental directory.