This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-04 05:31:24 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
88e43744d829858deedbbeb036a89759d536b79c
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
Po Yen Chen
88e43744d8
Refactor the design of DeviceGemmMultipleDMultipleR_Xdl_CShuffle (
#378
)
2022-08-24 10:12:54 -05:00
..
block
Hotfix LDS data hazard in fused attention (
#360
)
2022-08-15 12:04:20 -05:00
device
Refactor the design of DeviceGemmMultipleDMultipleR_Xdl_CShuffle (
#378
)
2022-08-24 10:12:54 -05:00
element
Add example of Gemm + AddAddFastGelu (data type: int4) (
#369
)
2022-08-23 10:38:41 -05:00
grid
Refactor the design of DeviceGemmMultipleDMultipleR_Xdl_CShuffle (
#378
)
2022-08-24 10:12:54 -05:00
thread
Layernorm welford (
#346
)
2022-08-13 09:43:18 -05:00
warp
Hotfix LDS data hazard in fused attention (
#360
)
2022-08-15 12:04:20 -05:00