This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-07-01 04:07:56 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
d60d23ea8e97ee202f175dc920e41f663950e496
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
aska-0096
d60d23ea8e
v1 performance debugging
2025-03-19 12:57:16 +00:00
..
block
v1 performance debugging
2025-03-19 12:57:16 +00:00
device
v1 performance debugging
2025-03-19 12:57:16 +00:00
element
Rebase the PR
#1520
to ROCm repo. (
#1574
)
2025-02-20 18:58:14 -08:00
grid
add ckProfiler. performance debugging for blockscale_wp prefill
2025-03-17 07:19:54 +00:00
thread
[A8W8 GEMM] Optimized weight-preshuffled implementation & add quantization datatype for CK TILE rms_norm (
#1862
)
2025-02-20 14:00:27 -08:00
warp
MX FP GEMM - Test MX FP8 MFMA Instructions (
#1902
)
2025-02-21 13:35:54 -07:00