Chao Liu
|
1cc683a3a3
|
adding implicit gemm v3
|
2019-05-23 22:10:40 -05:00 |
|
Chao Liu
|
8a4b59785b
|
adding implicit gemm v3
|
2019-05-22 19:39:56 -05:00 |
|
Chao Liu
|
2a48812edb
|
behavior has changed (better and worse), figuring out why
|
2019-05-21 16:43:56 -05:00 |
|
Chao Liu
|
acd7082fe1
|
adding ConstantMergedTensorDescriptor, refactoring ConstantTensorDescriptor, Sequence
|
2019-05-21 16:17:58 -05:00 |
|
Chao Liu
|
a6b95c393b
|
rework sequence
|
2019-05-18 23:21:02 -05:00 |
|
Chao Liu
|
df73287b82
|
rework sequence
|
2019-05-17 14:56:39 -05:00 |
|
Chao Liu
|
33b5a8556b
|
adding implicit gemm v3
|
2019-05-16 22:23:18 -05:00 |
|
Chao Liu
|
5e5c27a63b
|
adding implicit gemm v3
|
2019-05-16 13:22:40 -05:00 |
|
Chao Liu
|
b7d052459d
|
adding implicit gemm v3
|
2019-05-15 09:58:17 -05:00 |
|
Chao Liu
|
2603bb0fe3
|
tuning on vega 20
|
2019-04-25 17:28:59 -05:00 |
|
Chao Liu
|
a903146427
|
implicit gemm v1r3 nchw_cyxk_nkhw
|
2019-04-25 15:14:39 -05:00 |
|
Chao Liu
|
569ad66e2a
|
added implicit gemm v1r3 lds_double_buffer NCHW * CYXK = KNHW, reworked static functionals
|
2019-04-23 17:51:14 -05:00 |
|
Chao Liu
|
19f17df47a
|
implicit gemm v1r2: adding support for nchw
|
2019-04-18 11:49:09 -05:00 |
|
Chao Liu
|
17f3d2d4bc
|
refactor ConstantTensorDescriptor and functional
|
2019-04-16 17:36:18 -05:00 |
|
Chao Liu
|
00899f191b
|
implicit gemm v1r2: only load 1d filter
|
2019-04-13 11:19:17 -05:00 |
|
Chao Liu
|
471830a052
|
tidy up
|
2019-04-09 18:07:36 -05:00 |
|
Chao Liu
|
c075d3f7d9
|
add more assertion
|
2019-04-08 12:02:56 -05:00 |
|
Chao Liu
|
c9fa46af0b
|
debugging implicit gemm v1: use 10d tensor output
|
2019-04-08 10:27:32 -05:00 |
|
Chao Liu
|
e43d7bc63c
|
refactor
|
2019-04-01 15:17:22 -05:00 |
|
Chao Liu
|
766b0a9eaf
|
experimenting
|
2019-03-24 12:09:57 -05:00 |
|
Chao Liu
|
a0584426ff
|
refactoring ConstantTensorDescriptor
|
2019-03-17 03:22:41 -05:00 |
|
Chao Liu
|
a65ef90308
|
device_implicit_gemm_convolution_1_chwn_csrk_khwn: use tensor copy (instead of pointwise) for writing output, 3x3 increased from 78% to 84%, 5x5 from 80% to 84%
|
2019-02-19 11:47:46 -06:00 |
|
Chao Liu
|
b2888adfbe
|
change file extension to hip.hpp and hip.cpp
|
2019-02-15 02:13:21 -06:00 |
|