Logo
Explore Help
Register Sign In
ROCm/composable_kernel
1
0
Fork 0
You've already forked composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git synced 2026-05-16 10:59:55 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
a53dbafec954f722ad61bd4881e164c376416f5d
composable_kernel/include/ck
History
Adam Osewski a53dbafec9 Always force output clearing for grouped conv bwd data (#2446)
* Always force output clearing

* dont run set zero for residual

---------

Co-authored-by: Bartlomiej Kocot <barkocot@amd.com>

[ROCm/composable_kernel commit: 3d70c638d1]
2025-07-04 07:49:52 -06:00
..
host_utility
enable gfx115x support (#2065)
2025-04-09 10:06:42 -07:00
library/utility
Improve fmha_bwd tests performance (#2376)
2025-06-24 07:45:24 -07:00
problem_transform
…
tensor
Jing's contribution: prototype of mixed precision gemm FP16/BF16xint4 GEMM (#1762)
2025-01-02 11:48:06 +08:00
tensor_description
…
tensor_operation
Always force output clearing for grouped conv bwd data (#2446)
2025-07-04 07:49:52 -06:00
utility
Fix return value bug that drops minus sign in some cases. (#2415)
2025-07-02 14:53:00 +08:00
wrapper
…
ck.hpp
Enable fp4 tests (#2329)
2025-06-25 07:38:54 -05:00
config.h.in
DeviceGemm_Wmma_CShuffleV3 with BlockGemmPipelineVersion::v3 (#2096)
2025-04-28 10:14:21 +05:00
filesystem.hpp
…
README.md
Add Grouped Convolution and GEMM documentation (#1719)
2025-02-04 16:41:49 +01:00
stream_config.hpp
…
version.h.in
…

README.md

Back to the main page

Composable Kernel supported operations

Supported device operations

  • GEMM
  • Grouped Convolution Forward
  • Grouped Convolution Backward Data
  • Grouped Convolution Backward Weight
Reference in New Issue View Git Blame Copy Permalink
Powered by Gitea Version: 1.25.4 Page: 1217ms Template: 16ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API