composable_kernel/experimental/grouped_convolution_tile_instances
Bartłomiej Kocot 1a2c0d835a [rocm-libraries] ROCm/rocm-libraries#4791 (commit 6cc17c6)
[CK][CK TILE] Improve oob check

## Motivation

Improve OOB checks. Remove the permutes that were generated by the thread
buffer zero clear. The assembly now contains only condmask instead of
permute + condmask.

Change the KPack value for generated instances.

## Technical Details

Remove permute instructions from assembly
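
As a rough illustration of the two patterns contrasted above, the sketch below is plain host-side C++, not CK Tile code. The names `kTile`, `load_zero_clear_then_fill`, and `load_with_select` are hypothetical and exist only for this example. The first variant zero-clears the buffer and then overwrites the in-bounds elements; the second produces each element with a single conditional select. Both return the same values. Which instructions a GPU compiler actually emits for either form depends on the toolchain and is not shown here.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical tile width for this sketch; not a CK Tile parameter.
constexpr std::size_t kTile = 8;

// Variant A: zero-clear the whole buffer first, then overwrite the
// elements that are in bounds.
std::array<float, kTile> load_zero_clear_then_fill(const float* src, std::size_t valid_len)
{
    assert(valid_len <= kTile);
    std::array<float, kTile> buf{}; // value-initialized: all zeros
    for (std::size_t i = 0; i < kTile; ++i)
    {
        if (i < valid_len)
        {
            buf[i] = src[i];
        }
    }
    return buf;
}

// Variant B: no separate clear pass; each element is written exactly once,
// chosen by a per-element select on the bounds check.
std::array<float, kTile> load_with_select(const float* src, std::size_t valid_len)
{
    assert(valid_len <= kTile);
    std::array<float, kTile> buf;
    for (std::size_t i = 0; i < kTile; ++i)
    {
        const bool in_bounds = i < valid_len;
        // src[i] is only read when in_bounds is true.
        buf[i] = in_bounds ? src[i] : 0.0f;
    }
    return buf;
}
```

Calling both functions with the same source pointer and `valid_len` yields identical arrays, with every element at index `valid_len` or above equal to zero.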

## Test Plan

test_grouped_convnd_fwd_tile

## Test Result

passed

## Submission Checklist

- [x] Look over the contributing guidelines at
https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.
2026-02-24 21:41:44 +00:00

Grouped Convolution Tile Instances Generator

CK Tile convolution instances are implemented via the builder and generated by a Python script. They are integrated with the tests and ckProfiler. This functionality will be refactored and moved under the Tile Engine. For now, to speed up development and provide tests for CK Tile convolution, it is implemented under the experimental directory.