ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-05-12 17:26:00 +00:00)
ae3b8ff86c7796c0146d2b014903ed7b7483ca4a
composable_kernel/include/ck/tensor_operation/gpu
jakpiase  b74d4d4d54  Fix for beta!=0 in reduce (#1440)  2024-08-06 09:10:39 -07:00
* fix for beta!=0 in reduce
* add reviewers suggestions
block    2024-07-19 22:06:52 +08:00  [GEMM] F8 GEMM, performance optimized. (#1384)
device   2024-08-06 10:06:10 +02:00  Add Grouped Conv Fwd Large Tensor kernel (#1432)
element  2024-07-24 15:49:55 -05:00  Adding more instances of grouped convolution 3d forward for FP8 with ConvScale+Bias element-wise operation. (#1412)
grid     2024-08-06 09:10:39 -07:00  Fix for beta!=0 in reduce (#1440)
thread   2024-06-27 00:33:34 -07:00  Merging the gfx12 code into public repo. (#1362)
warp     2024-07-04 12:00:14 +02:00  Add structural sparsity xdlops (#1363)