Commit Graph

6 Commits

Author SHA1 Message Date
Chao Liu
a65ef90308 device_implicit_gemm_convolution_1_chwn_csrk_khwn: use tensor copy (instead of pointwise) for writing output, 3x3 increased from 78% to 84%, 5x5 from 80% to 84% 2019-02-19 11:47:46 -06:00
Chao Liu
1cb9885058 add anther verision of batch gemm 2019-02-17 01:50:57 -06:00
Chao Liu
9f2e8f8bb4 2-type implicit gemm using chwn 2019-02-15 22:51:51 -06:00
Chao Liu
d7c84daf66 delete useless code 2019-02-15 22:24:18 -06:00
Chao Liu
b2888adfbe change file extension to hip.hpp and hip.cpp 2019-02-15 02:13:21 -06:00
Chao Liu
a414e3fdf8 update build 2019-02-15 02:06:34 -06:00