ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-07-05 22:57:11 +00:00
00a3ce734adce7cc3a754f845cf60e2976c65f64
composable_kernel/include/ck/tensor_operation/gpu
Latest commit: 00a3ce734a by Ville Pietilä, 2025-08-15 12:06:44 +00:00
Integrate new packed cast threadwise tensor slice transfer into gridwise gemm pipelines.
Name      Date                        Last commit
block     2025-07-28 11:34:07 -07:00  upgrade from clang-format-12 to clang-format-18 (#2568)
device    2025-07-31 12:08:45 +02:00  Automatic deduction of split-K value for grouped convolution (#2491)
element   2025-08-14 11:33:02 +00:00  Add more unit tests.
grid      2025-08-15 12:06:44 +00:00  Integrate new packed cast threadwise tensor slice transfer into gridwise gemm pipelines.
thread    2025-08-15 10:41:06 +00:00  Fix a bug in the packed cast threadwise transfer.
warp      2025-07-07 10:33:26 -06:00  MX GEMM - FP6 Example (#2419)