This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-04-19 14:29:05 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
6300ad3c62298dc6fdddfcf19ecd074f7f08fa96
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
music-dino
6300ad3c62
Batched gemm softmax gemm descriptor fix (
#3564
)
...
* Add rocm to prefix path for codegen * Fix issue with c0_matrix_mask construction
2026-01-20 07:25:30 -08:00
..
block
Implement grouped gemm tile loop for RDNA4 (
#3304
)
2026-01-13 07:14:23 +01:00
device
Batched gemm softmax gemm descriptor fix (
#3564
)
2026-01-20 07:25:30 -08:00
element
Implement grouped gemm tile loop for RDNA4 (
#3304
)
2026-01-13 07:14:23 +01:00
grid
Implement batched gemm bias permute for RDNA4 (
#3534
)
2026-01-17 08:30:27 +01:00
thread
Grouped convolution forward device implementation and base flavors for RDNA3/4 (
#2964
)
2025-12-18 13:12:15 -07:00
warp
chore(copyright): update copyright header for include directory (
#3293
)
2025-11-26 11:00:05 -07:00