Commit Graph

22 Commits

Author SHA1 Message Date
Chao Liu
db876ea7ec adding implicit gemm v4 (nchw, kcyx)
[ROCm/composable_kernel commit: b2439ec9dd]
2019-05-30 17:50:49 -05:00
Chao Liu
979dc4da2e adding implicit gemm v3
[ROCm/composable_kernel commit: 8a4b59785b]
2019-05-22 19:39:56 -05:00
Chao Liu
ffd172378a adding implicit gemm v3
[ROCm/composable_kernel commit: 5e5c27a63b]
2019-05-16 13:22:40 -05:00
Chao Liu
aeeefc1de3 refactored
[ROCm/composable_kernel commit: 4957d5a399]
2019-05-02 14:49:20 -05:00
Chao Liu
21988c32b4 added implicit gemm v1r3 lds_double_buffer NCHW * CYXK = KNHW, reworked static functionals
[ROCm/composable_kernel commit: 569ad66e2a]
2019-04-23 17:51:14 -05:00
Chao Liu
d0244d3a51 implicit gemm v1r2: adding support for nchw
[ROCm/composable_kernel commit: 19f17df47a]
2019-04-18 11:49:09 -05:00
Chao Liu
20df62cc31 clean up
[ROCm/composable_kernel commit: 5245a0162b]
2019-04-06 16:27:07 -05:00
Chao Liu
a3f850c5e6 debugging
[ROCm/composable_kernel commit: f6cb5b846d]
2019-04-06 15:10:40 -05:00
Chao Liu
e277457dce tidy up
[ROCm/composable_kernel commit: e2313c9eca]
2019-04-02 20:30:00 -05:00
Chao Liu
e423954e6e puting gridwise convolution into its own class
[ROCm/composable_kernel commit: 6290e0b080]
2019-04-02 20:18:01 -05:00
Chao Liu
7cbd63b2d0 refactor
[ROCm/composable_kernel commit: e43d7bc63c]
2019-04-01 15:17:22 -05:00
Chao Liu
2f8828e8d5 Jing's ds_read inline asm
[ROCm/composable_kernel commit: d6d9a8e4ce]
2019-03-28 19:46:29 -05:00
Chao Liu
cd883e7581 experimenting
[ROCm/composable_kernel commit: 766b0a9eaf]
2019-03-24 12:09:57 -05:00
Chao Liu
2c3bd06d25 Merge branch 'direct_fp16'
[ROCm/composable_kernel commit: fdaaaa500c]
2019-03-22 16:46:41 -05:00
Chao Liu
1f925812b2 hip build
[ROCm/composable_kernel commit: 8c923db423]
2019-03-22 14:22:58 -05:00
Chao Liu
732984e63b adding fp16 direct that reads pre-vectorized data
[ROCm/composable_kernel commit: 79d9b1084b]
2019-03-18 18:16:02 -05:00
Chao Liu
5ba0f64087 adding fp16 direct that reads pre-vectorized data
[ROCm/composable_kernel commit: 4f0fc72e91]
2019-03-18 15:03:17 -05:00
Chao Liu
6fd0910da8 refactoring ConstantTensorDescriptor
[ROCm/composable_kernel commit: a0584426ff]
2019-03-17 03:22:41 -05:00
Chao Liu
7bfe330532 update hip build
[ROCm/composable_kernel commit: 2c9b8c2432]
2019-03-12 17:20:11 -05:00
Chao Liu
e3ab560c50 refactor
[ROCm/composable_kernel commit: 04c5527d07]
2019-03-04 17:09:20 -06:00
Chao Liu
13388cef47 add anther verision of batch gemm
[ROCm/composable_kernel commit: 1cb9885058]
2019-02-17 01:50:57 -06:00
Chao Liu
c0baa18a3f change file extension to hip.hpp and hip.cpp
[ROCm/composable_kernel commit: b2888adfbe]
2019-02-15 02:13:21 -06:00