This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-07-01 20:27:42 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
4cf3e61954deec4bbcc3a2e1f879494d618ed2d5
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
kiefer
4cf3e61954
Grab device and gridwise files from bkp branch, this should enable splitK support for convolution and also we no longer ForceThreadTileTransfer for explicit gemm. Also grab some updates from 7e7243783008b11e904f127ecf1df55ef95e9af2 to fix building on clang20.
2025-12-09 09:13:31 +00:00
..
block
Wave Tile Transfer supporting global load with transpose (
#3027
)
2025-10-16 11:33:56 -07:00
device
Grab device and gridwise files from bkp branch, this should enable splitK support for convolution and also we no longer ForceThreadTileTransfer for explicit gemm. Also grab some updates from 7e7243783008b11e904f127ecf1df55ef95e9af2 to fix building on clang20.
2025-12-09 09:13:31 +00:00
element
Wmma support for multiple Ds based GEMMs (
#2613
)
2025-09-05 16:31:08 +02:00
grid
Grab device and gridwise files from bkp branch, this should enable splitK support for convolution and also we no longer ForceThreadTileTransfer for explicit gemm. Also grab some updates from 7e7243783008b11e904f127ecf1df55ef95e9af2 to fix building on clang20.
2025-12-09 09:13:31 +00:00
thread
Extend XDL kernel to Support RDNA3/4 - Part 4 (
#2724
)
2025-09-12 08:17:07 -07:00
warp
fix:tf32:fix build fail for all supported targets (
#2942
)
2025-09-29 08:04:11 -07:00