ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-06-07 00:04:37 +00:00
7dff5fe4ffb62a12aeccbf3b31d4149d8fb051db
composable_kernel/include/ck/tensor_operation/gpu
Latest commit: 7dff5fe4ff by Anton Gorenko, 2025-06-04 12:29:34 +05:00
Clone for device_gemm_wmma_cshuffle_v3.hpp for future Multiple D support
Name     Last commit message                                                              Date
block    WMMA GEMM universal pipeline v1, mixed precision and paddings, examples (#2230)  2025-06-04 12:22:33 +06:00
device   Clone for device_gemm_wmma_cshuffle_v3.hpp for future Multiple D support         2025-06-04 12:29:34 +05:00
element  Add Clamp/Relu bf16/fp16 cast fixes (#2279)                                      2025-06-03 18:31:46 +02:00
grid     Use ThreadGroupTensorSliceTransfer_v7r3                                          2025-06-04 12:29:34 +05:00
thread   Moe gemm activation (#2026)                                                      2025-04-23 10:35:34 +08:00
warp     Use new mfma instructions for FP8 on gfx950 (#2202)                              2025-05-19 17:29:51 -07:00