ROCm/composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-05-13 17:55:48 +00:00
composable_kernel/include/ck/tensor_operation/gpu @ 74a34e0f507cde4502f397dffd0b15fcea5e9982
chenjun  74a34e0f50  fix KPerBlock = 64 a8w8 bpreshulle gemm build fail in gfx950 (#2437)  2025-07-02 19:12:07 +08:00
...
Co-authored-by: valarLip <340077269@qq.com>
..
block    fix moe i4 bug from aiter (#2339)  2025-06-24 14:51:29 +08:00
device   [CK][CONV] Support NCHW in class DeviceGroupedConvFwdMultipleABD_Xdl_CShuffle (#2375)  2025-06-26 08:32:39 +08:00
element  Grouped convolution forward with clamp (#2334)  2025-06-16 15:36:53 +02:00
grid     fix KPerBlock = 64 a8w8 bpreshulle gemm build fail in gfx950 (#2437)  2025-07-02 19:12:07 +08:00
thread   Add MoE & FP8 Blockscale WP Kernels for GFX950 (#2297)  2025-06-12 09:25:59 +08:00
warp     Add MoE & FP8 Blockscale WP Kernels for GFX950 (#2297)  2025-06-12 09:25:59 +08:00