ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-06-30 11:47:48 +00:00
61172ea4bb9f56f8bfdd25da93c762ab4e0e2ab2
composable_kernel/include/ck/tensor_operation/gpu
letaoqin | 61172ea4bb | really padding N for B matrix | 2025-03-10 10:52:59 +00:00
..
block | [BlockScale GEMM] FP8 Blockscale GEMM optimization and ckProfiler (#1913) | 2025-02-25 15:42:20 +08:00
device | really padding N for B matrix | 2025-03-10 10:52:59 +00:00
element | Rebase the PR #1520 to ROCm repo. (#1574) | 2025-02-20 18:58:14 -08:00
grid | really padding N for B matrix | 2025-03-10 10:52:59 +00:00
thread | [A8W8 GEMM] Optimized weight-preshuffled implementation & add quantization datatype for CK TILE rms_norm (#1862) | 2025-02-20 14:00:27 -08:00
warp | MX FP GEMM - Test MX FP8 MFMA Instructions (#1902) | 2025-02-21 13:35:54 -07:00