Commit Graph

23 Commits

Author SHA1 Message Date
Chao Liu
1cc683a3a3 adding implicit gemm v3 2019-05-23 22:10:40 -05:00
Chao Liu
8a4b59785b adding implicit gemm v3 2019-05-22 19:39:56 -05:00
Chao Liu
2a48812edb behavior has changed (better and worse), figuring out why 2019-05-21 16:43:56 -05:00
Chao Liu
acd7082fe1 adding ConstantMergedTensorDescriptor, refactering ConstantTensorDescriptor, Sequence 2019-05-21 16:17:58 -05:00
Chao Liu
a6b95c393b rework sequence 2019-05-18 23:21:02 -05:00
Chao Liu
df73287b82 rework sequence 2019-05-17 14:56:39 -05:00
Chao Liu
33b5a8556b adding implicit gemm v3 2019-05-16 22:23:18 -05:00
Chao Liu
5e5c27a63b adding implicit gemm v3 2019-05-16 13:22:40 -05:00
Chao Liu
b7d052459d adding implicit gemm v3 2019-05-15 09:58:17 -05:00
Chao Liu
2603bb0fe3 tuning on vega 20 2019-04-25 17:28:59 -05:00
Chao Liu
a903146427 implicit gemm v1r3 nchw_cyxk_nkhw 2019-04-25 15:14:39 -05:00
Chao Liu
569ad66e2a added implicit gemm v1r3 lds_double_buffer NCHW * CYXK = KNHW, reworked static functionals 2019-04-23 17:51:14 -05:00
Chao Liu
19f17df47a implicit gemm v1r2: adding support for nchw 2019-04-18 11:49:09 -05:00
Chao Liu
17f3d2d4bc refactor ConstantTensorDescriptor and functional 2019-04-16 17:36:18 -05:00
Chao Liu
00899f191b implicit gemm v1r2: only load 1d filter 2019-04-13 11:19:17 -05:00
Chao Liu
471830a052 tidy yp 2019-04-09 18:07:36 -05:00
Chao Liu
c075d3f7d9 add more assertion 2019-04-08 12:02:56 -05:00
Chao Liu
c9fa46af0b debugging implicit gemm v1: use 10d tensor output 2019-04-08 10:27:32 -05:00
Chao Liu
e43d7bc63c refactor 2019-04-01 15:17:22 -05:00
Chao Liu
766b0a9eaf experimenting 2019-03-24 12:09:57 -05:00
Chao Liu
a0584426ff refactoring ConstantTensorDescriptor 2019-03-17 03:22:41 -05:00
Chao Liu
a65ef90308 device_implicit_gemm_convolution_1_chwn_csrk_khwn: use tensor copy (instead of pointwise) for writing output, 3x3 increased from 78% to 84%, 5x5 from 80% to 84% 2019-02-19 11:47:46 -06:00
Chao Liu
b2888adfbe change file extension to hip.hpp and hip.cpp 2019-02-15 02:13:21 -06:00