This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-12 09:16:52 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
417a6b65b6436b533c64966a6b8825f12c31d8ef
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
Thomas Ning
1386924749
Add the instances for small sized GEMM in preshuffle and improve CMake Flag (
#2212
)
...
* Add small instance, add the bug fix, & improve the example CMake * clang format
2025-05-20 15:05:08 -07:00
..
block
Add the instances for small sized GEMM in preshuffle and improve CMake Flag (
#2212
)
2025-05-20 15:05:08 -07:00
device
Narrowing error fix for codegen compilation (
#2194
)
2025-05-16 11:11:54 -07:00
element
Add grouped conv fwd bias relu instances (
#2179
)
2025-05-09 22:52:34 +02:00
grid
Use new mfma instructions for FP8 on gfx950 (
#2202
)
2025-05-19 17:29:51 -07:00
thread
Moe gemm activation (
#2026
)
2025-04-23 10:35:34 +08:00
warp
Use new mfma instructions for FP8 on gfx950 (
#2202
)
2025-05-19 17:29:51 -07:00