This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-15 18:42:06 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
183
Commits
745
Branches
38
Tags
2fc34a91695ab90b389f1c6be5a20e9b2d3df748
Go to file
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Chao Liu
2fc34a9169
tuned implicit gemm v1 for 3x3 on AMD to 82%. Fixed a bug in 4d tensor blockwise copy.
...
[ROCm/composable_kernel commit:
96ee9571e2
]
2019-04-10 18:10:18 -05:00
driver
tuned implicit gemm v1 for 3x3 on AMD to 82%. Fixed a bug in 4d tensor blockwise copy.
2019-04-10 18:10:18 -05:00
script
enabled ds_read_b128 and ds_write_b128 on hip c++
2019-04-09 19:05:44 -05:00
src
tuned implicit gemm v1 for 3x3 on AMD to 82%. Fixed a bug in 4d tensor blockwise copy.
2019-04-10 18:10:18 -05:00
.clang-format
start adding convolution
2018-10-08 22:49:58 -05:00
CMakeLists.txt
updated build
2019-02-15 02:17:00 -06:00
Description
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
MIT
235
MiB
Languages
C++
93.1%
Python
4.5%
CMake
1.5%
Shell
0.5%
Pawn
0.2%