This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-12 01:10:17 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
183
Commits
739
Branches
38
Tags
96ee9571e2c96ba6eb6972da1be75453d6c6e9fa
Go to file
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Chao Liu
96ee9571e2
tuned implicit gemm v1 for 3x3 on AMD to 82%. Fixed a bug in 4d tensor blockwise copy.
2019-04-10 18:10:18 -05:00
driver
tuned implicit gemm v1 for 3x3 on AMD to 82%. Fixed a bug in 4d tensor blockwise copy.
2019-04-10 18:10:18 -05:00
script
enabled ds_read_b128 and ds_write_b128 on hip c++
2019-04-09 19:05:44 -05:00
src
tuned implicit gemm v1 for 3x3 on AMD to 82%. Fixed a bug in 4d tensor blockwise copy.
2019-04-10 18:10:18 -05:00
.clang-format
start adding convolution
2018-10-08 22:49:58 -05:00
CMakeLists.txt
updated build
2019-02-15 02:17:00 -06:00
Description
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
MIT
234
MiB
Languages
C++
93.1%
Python
4.5%
CMake
1.5%
Shell
0.5%
Pawn
0.2%