Files
composable_kernel/include/ck/tensor_operation/gpu/device/impl
linqunAMD b6cb76a555 [CK] Fix example_grouped_conv_bwd_data_xdl_fp16 with ksplit = 2 (#2943)
root cause:  AK1 and BK1 may different in class template. so we need calculate k0 per block separately when ksplit is not 1.

[ROCm/composable_kernel commit: 769c58f133]
2025-09-29 07:56:33 -07:00
..
2024-05-10 09:41:39 -07:00
2023-06-19 09:44:22 -05:00