ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-07-02 21:27:45 +00:00)
457d8b4f85c4adf2fd54b35ae4e60d1997743645
composable_kernel/include/ck/tensor_operation/gpu
letaoqin | 457d8b4f85 | move kernel parametor to host for performance | 2025-03-13 11:07:34 +00:00
Name | Last commit | Date
block | [BlockScale GEMM] FP8 Blockscale GEMM optimization and ckProfiler (#1913) | 2025-02-25 15:42:20 +08:00
device | move kernel parametor to host for performance | 2025-03-13 11:07:34 +00:00
element | Rebase the PR #1520 to ROCm repo. (#1574) | 2025-02-20 18:58:14 -08:00
grid | move kernel parametor to host for performance | 2025-03-13 11:07:34 +00:00
thread | [A8W8 GEMM] Optimized weight-preshuffled implementation & add quantization datatype for CK TILE rms_norm (#1862) | 2025-02-20 14:00:27 -08:00
warp | MX FP GEMM - Test MX FP8 MFMA Instructions (#1902) | 2025-02-21 13:35:54 -07:00