This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-11 17:00:18 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
191
Commits
739
Branches
38
Tags
6d066ede00be22d5c4f23c812c48b60b0525b4f1
Go to file
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Chao Liu
6d066ede00
added implicit gemm v1r3, refactored decomposition of wei tensor (loop over y, x first, and C second) to allow easy lds double buffer on C
2019-04-19 16:46:29 -05:00
driver
added implicit gemm v1r3, refactored decomposition of wei tensor (loop over y, x first, and C second) to allow easy lds double buffer on C
2019-04-19 16:46:29 -05:00
script
enabled ds_read_b128 and ds_write_b128 on hip c++
2019-04-09 19:05:44 -05:00
src
added implicit gemm v1r3, refactored decomposition of wei tensor (loop over y, x first, and C second) to allow easy lds double buffer on C
2019-04-19 16:46:29 -05:00
.clang-format
start adding convolution
2018-10-08 22:49:58 -05:00
CMakeLists.txt
updated build
2019-02-15 02:17:00 -06:00
Description
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
MIT
228
MiB
Languages
C++
93.1%
Python
4.5%
CMake
1.5%
Shell
0.5%
Pawn
0.2%