This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-11 08:50:17 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
290
Commits
738
Branches
38
Tags
bc9ea646f8bb006913713dffacafdb0c929c899e
Go to file
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Chao Liu
bc9ea646f8
use ford/for instead of static_ford/static_for in threadwise copy, somehow register spill is greatly reduced on AMD
2019-08-07 19:09:13 -05:00
composable_kernel
/include
use ford/for instead of static_ford/static_for in threadwise copy, somehow register spill is greatly reduced on AMD
2019-08-07 19:09:13 -05:00
driver
use ford/for instead of static_ford/static_for in threadwise copy, somehow register spill is greatly reduced on AMD
2019-08-07 19:09:13 -05:00
script
refactored implicit gemm v1r3
2019-07-29 15:25:38 -05:00
.clang-format
start adding convolution
2018-10-08 22:49:58 -05:00
CMakeLists.txt
refactor
2019-06-19 17:43:56 -05:00
Description
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
MIT
228
MiB
Languages
C++
93.1%
Python
4.5%
CMake
1.5%
Shell
0.5%
Pawn
0.2%