ROCm/composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-05-12 01:10:17 +00:00)
29dcb956dbe1fcb166b4646633837b7da8e5fdba
composable_kernel/include/ck/tensor_operation/gpu
Lakhinder Walia  1f306024d0  fast_gelu: minor code reorg to enhance ref & gpu performance (#1162)  2024-02-07 19:24:51 -08:00
block    [GEMM] Optimization for MI200/300. (#1135)                            2024-01-19 07:02:22 -06:00
device   Implement direct loads split-K GEMM kernel (#1137)                    2024-02-07 01:08:34 +01:00
element  fast_gelu: minor code reorg to enhance ref & gpu performance (#1162)  2024-02-07 19:24:51 -08:00
grid     Implement direct loads split-K GEMM kernel (#1137)                    2024-02-07 01:08:34 +01:00
thread   add vector_type support into thread_copy_v3r1 (#969)                  2023-10-13 15:11:43 -05:00
warp     enable compilation of INSTANCES_ONLY for Windows (#1082)              2023-12-20 14:34:53 -08:00