This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-07-02 21:27:45 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
d58d55e5404b1cb7e9c382d0ebfa65cf94d59154
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
chenjun
988478d452
edit fp8 ab scale for Scale_Block_M=1
2024-12-26 07:27:49 +00:00
..
block
edit fp8 ab scale for Scale_Block_M=1
2024-12-26 07:27:49 +00:00
device
Enable multiply_multiply for Scale_Block_M = 1 for deepseek
2024-12-23 20:35:34 +08:00
element
Remove virtual destructors from unary ops (
#1610
)
2024-10-30 17:42:50 +01:00
grid
edit fp8 ab scale for Scale_Block_M=1
2024-12-26 07:27:49 +00:00
thread
Moficiation to fix this issue "threadwise_tensor_slice_transfer_v5r1 issue
#1279
" (
#1492
)
2024-09-04 21:52:55 -07:00
warp
Fix compilation errors with Clang20.0. (
#1533
)
2024-09-25 13:45:38 -07:00