This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-06-30 03:37:38 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
df43b0c8581d4f3d9ea25e104d224fe5a9d04ebb
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
qin letao
df43b0c858
Merge branch 'develop' into update_cka8w8_uc_padding
2025-02-24 06:55:52 +00:00
..
block
[A8W8 GEMM] Optimized weight-preshuffled implementation & add quantization datatype for CK TILE rms_norm (
#1862
)
2025-02-20 14:00:27 -08:00
device
Merge branch 'develop' into update_cka8w8_uc_padding
2025-02-24 06:55:52 +00:00
element
Rebase the PR
#1520
to ROCm repo. (
#1574
)
2025-02-20 18:58:14 -08:00
grid
Merge branch 'develop' into update_cka8w8_uc_padding
2025-02-24 06:55:52 +00:00
thread
[A8W8 GEMM] Optimized weight-preshuffled implementation & add quantization datatype for CK TILE rms_norm (
#1862
)
2025-02-20 14:00:27 -08:00
warp
MX FP GEMM - Test MX FP8 MFMA Instructions (
#1902
)
2025-02-21 13:35:54 -07:00