This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-12 17:26:00 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
4581b5d504ad400dd36e641856e1a2ce7d91955d
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
Lakhinder Walia
1f306024d0
fast_gelu: minor code reorg to enhance ref & gpu performance (
#1162
)
2024-02-07 19:24:51 -08:00
..
block
[GEMM] Optimization for MI200/300. (
#1135
)
2024-01-19 07:02:22 -06:00
device
Implement direct loads split-K GEMM kernel (
#1137
)
2024-02-07 01:08:34 +01:00
element
fast_gelu: minor code reorg to enhance ref & gpu performance (
#1162
)
2024-02-07 19:24:51 -08:00
grid
Implement direct loads split-K GEMM kernel (
#1137
)
2024-02-07 01:08:34 +01:00
thread
add vector_type support into thread_copy_v3r1 (
#969
)
2023-10-13 15:11:43 -05:00
warp
enable compilation of INSTANCES_ONLY for Windows (
#1082
)
2023-12-20 14:34:53 -08:00