This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-06-30 19:57:40 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
33d4ae859eaec9c29bbb68bc8ad1615de656f97a
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
Bartłomiej Kocot
33d4ae859e
Fix grouped conv bwd data wmma check (
#3562
)
2026-01-15 09:29:15 -05:00
..
block
Implement grouped gemm tile loop for RDNA4 (
#3304
)
2026-01-15 09:29:13 -05:00
device
Fix grouped conv bwd data wmma check (
#3562
)
2026-01-15 09:29:15 -05:00
element
Implement grouped gemm tile loop for RDNA4 (
#3304
)
2026-01-15 09:29:13 -05:00
grid
Add support for direct store in epilogue and padding support for wave transfer without transpose (
#3465
)
2026-01-15 09:29:14 -05:00
thread
Grouped convolution forward device implementation and base flavors for RDNA3/4 (
#2964
)
2025-12-18 13:12:15 -07:00
warp
chore(copyright): update copyright header for include directory (
#3293
)
2025-11-26 11:00:05 -07:00