ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-07-02 21:27:45 +00:00)
295384ddf796f32450feb20d70316533a85391b2
composable_kernel/include/ck/tensor_operation/gpu
ltqin  295384ddf7  Merge branch 'develop' into update_cka8w8_uc_padding  2025-02-25 19:41:58 +08:00
block    [BlockScale GEMM] FP8 Blockscale GEMM optimization and ckProfiler (#1913)  2025-02-25 15:42:20 +08:00
device   Merge branch 'develop' into update_cka8w8_uc_padding  2025-02-25 19:41:58 +08:00
element  Rebase the PR #1520 to ROCm repo. (#1574)  2025-02-20 18:58:14 -08:00
grid     Merge branch 'develop' into update_cka8w8_uc_padding  2025-02-25 19:41:58 +08:00
thread   [A8W8 GEMM] Optimized weight-preshuffled implementation & add quantization datatype for CK TILE rms_norm (#1862)  2025-02-20 14:00:27 -08:00
warp     MX FP GEMM - Test MX FP8 MFMA Instructions (#1902)  2025-02-21 13:35:54 -07:00