composable_kernel

mirror of https://github.com/ROCm/composable_kernel.git synced 2026-07-16 16:51:26 +00:00

JP-Fernando 2c850cd693 [CK] Unify the grouped convolution gridwise Run() functions (#4421)

## Motivation

`gridwise_gemm_wmma_cshuffle_v3.hpp` currently contains three `Run()`
function overloads for grouped convolution, one for each type: forward,
backward weights, and backward data.
The functions are very similar and should be unified into a single `Run()`
function that serves all types of grouped convolution.

## Technical Details

The three old `Run<>()` functions were replaced with a single unified
function.
The new `Run<>()` function is called from the following device
implementations:

- `DeviceGroupedConvFwdMultipleABD_Wmma_CShuffle_V3`

- `DeviceGroupedConvBwdDataMultipleD_Wmma_CShuffleV3`

- `DeviceGroupedConvBwdWeightMultipleD_Wmma_CShuffleV3`

- `DeviceGroupedConvBwdWeightTwoStage_Wmma_CShuffleV3`

- `DeviceGroupedConvBwdWeight_Wmma_CShuffleV3`

The `DeviceGroupedConvFwdMultipleD_Wmma_CShuffle_V3_Large_Tensor`
implementation uses a different `Run<>()` overload and was therefore not
modified.
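
The general idea of folding several near-identical overloads into one
templated entry point can be sketched as follows. This is only an
illustration and not the code from `gridwise_gemm_wmma_cshuffle_v3.hpp`:
`ConvDirection`, `GridwiseGemmSketch`, and the string labels are
hypothetical names, and the actual `Run<>()` signature and the way it
distinguishes the convolution types may differ. The sketch assumes C++17
for `if constexpr`.

```cpp
#include <cassert>
#include <string>

// Hypothetical tag naming the three grouped-convolution directions.
enum class ConvDirection { Forward, BackwardWeight, BackwardData };

// Hypothetical stand-in for a gridwise GEMM struct. A real kernel works on
// tensor descriptors and device pointers; this sketch only returns a label.
struct GridwiseGemmSketch
{
    // Single entry point: the direction is a template parameter, so the
    // per-direction differences are resolved at compile time.
    template <ConvDirection Dir>
    static std::string Run(int group_idx)
    {
        assert(group_idx >= 0);

        // Only this small part differs between directions.
        std::string output;
        if constexpr(Dir == ConvDirection::Forward)
            output = "out";
        else if constexpr(Dir == ConvDirection::BackwardWeight)
            output = "wei_grad";
        else
            output = "in_grad";

        // Shared body: with separate overloads, this logic would be
        // repeated once per direction.
        return "gemm(group=" + std::to_string(group_idx) + ") -> " + output;
    }
};
```

In this sketch a caller would pick the direction at the call site, for
example `GridwiseGemmSketch::Run<ConvDirection::BackwardData>(2)`, while
the shared body exists only once.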

## Test Plan

Run the following grouped convolution tests on `gfx1201`, as this
architecture is WMMA-capable:

- `test_grouped_convnd_fwd`

- `test_grouped_convnd_bwd_weight`

- `test_grouped_convnd_bwd_data`

Compilation and testing were also executed on `gfx1100` to avoid CI
problems.

## Test Result

First part (unification of `Run<>()` function): All tests successful.

Second part (integration of single `Run<>()` function as a direct call):
All tests successful.

## Submission Checklist

- [x] Look over the contributing guidelines at
https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

---------

Co-authored-by: Fernando Jiménez <fernando.jimenez@streamhpc.com>

2026-03-11 17:38:55 +01:00

block

[CK] Workaround blockscale wp test failure (#4372)

2026-02-06 16:09:08 -08:00

device

[CK] Unify the grouped convolution gridwise Run() functions (#4421)

2026-03-11 17:38:55 +01:00

element

Test fix for gemm_b_scale_xdl_v3. (#3674)

2026-01-30 10:34:54 -07:00

grid

[CK] Unify the grouped convolution gridwise Run() functions (#4421)