ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-06-30 11:47:48 +00:00
fbfe57e7c28632f16d5bf4fae99aa2fe728c46aa
composable_kernel/include/ck/tensor_operation/gpu
Ville Pietilä  5d7a0487f8  Refactor conv profiler to produce statistics for analysing split-K autodeduction performance.  2025-06-30 14:20:10 +00:00
block    Add MoE & FP8 Blockscale WP Kernels for GFX950 (#2297)  2025-06-12 09:25:59 +08:00
device   Refactor conv profiler to produce statistics for analysing split-K autodeduction performance.  2025-06-30 14:20:10 +00:00
element  Optimized GEMMs for MX FP4/8 (#2294)  2025-06-05 13:54:15 -06:00
grid     WIP: Oversubscription factor.  2025-06-19 15:12:21 +00:00
thread   Add MoE & FP8 Blockscale WP Kernels for GFX950 (#2297)  2025-06-12 09:25:59 +08:00
warp     Add MoE & FP8 Blockscale WP Kernels for GFX950 (#2297)  2025-06-12 09:25:59 +08:00