ROCm/composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git synced 2026-05-13 01:36:06 +00:00
198 Commits 742 Branches 38 Tags
3ce77700b62df4bb17832c048bbbcb965a457833
Commit Graph

9 Commits

Author | SHA1 | Message | Date
Chao Liu | a903146427 | implicit gemm v1r3 nchw_cyxk_nkhw | 2019-04-25 15:14:39 -05:00
Chao Liu | 569ad66e2a | added implicit gemm v1r3 lds_double_buffer NCHW * CYXK = KNHW, reworked static functionals | 2019-04-23 17:51:14 -05:00
Chao Liu | 5ce19234a4 | added GridwiseConvolutionImplicitGemm_v1r2_nchw_cyxk_khwn | 2019-04-19 14:22:02 -05:00
Chao Liu | 19f17df47a | implicit gemm v1r2: adding support for nchw | 2019-04-18 11:49:09 -05:00
Chao Liu | 00899f191b | implicit gemm v1r2: only load 1d filter | 2019-04-13 11:19:17 -05:00
Chao Liu | 766b0a9eaf | experimenting | 2019-03-24 12:09:57 -05:00
Chao Liu | 050a1a6890 | adding int8 direct that reads pre-vectorized data | 2019-03-19 00:05:41 -05:00
Chao Liu | a65ef90308 | device_implicit_gemm_convolution_1_chwn_csrk_khwn: use tensor copy (instead of pointwise) for writing output, 3x3 increased from 78% to 84%, 5x5 from 80% to 84% | 2019-02-19 11:47:46 -06:00
Chao Liu | b2888adfbe | change file extension to hip.hpp and hip.cpp | 2019-02-15 02:13:21 -06:00