Chao Liu
|
db876ea7ec
|
adding implicit gemm v4 (nchw, kcyx)
[ROCm/composable_kernel commit: b2439ec9dd]
|
2019-05-30 17:50:49 -05:00 |
|
Chao Liu
|
979dc4da2e
|
adding implicit gemm v3
[ROCm/composable_kernel commit: 8a4b59785b]
|
2019-05-22 19:39:56 -05:00 |
|
Chao Liu
|
ffd172378a
|
adding implicit gemm v3
[ROCm/composable_kernel commit: 5e5c27a63b]
|
2019-05-16 13:22:40 -05:00 |
|
Chao Liu
|
aeeefc1de3
|
refactored
[ROCm/composable_kernel commit: 4957d5a399]
|
2019-05-02 14:49:20 -05:00 |
|
Chao Liu
|
21988c32b4
|
added implicit gemm v1r3 lds_double_buffer NCHW * CYXK = KNHW, reworked static functionals
[ROCm/composable_kernel commit: 569ad66e2a]
|
2019-04-23 17:51:14 -05:00 |
|
Chao Liu
|
d0244d3a51
|
implicit gemm v1r2: adding support for nchw
[ROCm/composable_kernel commit: 19f17df47a]
|
2019-04-18 11:49:09 -05:00 |
|
Chao Liu
|
20df62cc31
|
clean up
[ROCm/composable_kernel commit: 5245a0162b]
|
2019-04-06 16:27:07 -05:00 |
|
Chao Liu
|
a3f850c5e6
|
debugging
[ROCm/composable_kernel commit: f6cb5b846d]
|
2019-04-06 15:10:40 -05:00 |
|
Chao Liu
|
e277457dce
|
tidy up
[ROCm/composable_kernel commit: e2313c9eca]
|
2019-04-02 20:30:00 -05:00 |
|
Chao Liu
|
e423954e6e
|
puting gridwise convolution into its own class
[ROCm/composable_kernel commit: 6290e0b080]
|
2019-04-02 20:18:01 -05:00 |
|
Chao Liu
|
7cbd63b2d0
|
refactor
[ROCm/composable_kernel commit: e43d7bc63c]
|
2019-04-01 15:17:22 -05:00 |
|
Chao Liu
|
2f8828e8d5
|
Jing's ds_read inline asm
[ROCm/composable_kernel commit: d6d9a8e4ce]
|
2019-03-28 19:46:29 -05:00 |
|
Chao Liu
|
cd883e7581
|
experimenting
[ROCm/composable_kernel commit: 766b0a9eaf]
|
2019-03-24 12:09:57 -05:00 |
|
Chao Liu
|
2c3bd06d25
|
Merge branch 'direct_fp16'
[ROCm/composable_kernel commit: fdaaaa500c]
|
2019-03-22 16:46:41 -05:00 |
|
Chao Liu
|
1f925812b2
|
hip build
[ROCm/composable_kernel commit: 8c923db423]
|
2019-03-22 14:22:58 -05:00 |
|
Chao Liu
|
732984e63b
|
adding fp16 direct that reads pre-vectorized data
[ROCm/composable_kernel commit: 79d9b1084b]
|
2019-03-18 18:16:02 -05:00 |
|
Chao Liu
|
5ba0f64087
|
adding fp16 direct that reads pre-vectorized data
[ROCm/composable_kernel commit: 4f0fc72e91]
|
2019-03-18 15:03:17 -05:00 |
|
Chao Liu
|
6fd0910da8
|
refactoring ConstantTensorDescriptor
[ROCm/composable_kernel commit: a0584426ff]
|
2019-03-17 03:22:41 -05:00 |
|
Chao Liu
|
7bfe330532
|
update hip build
[ROCm/composable_kernel commit: 2c9b8c2432]
|
2019-03-12 17:20:11 -05:00 |
|
Chao Liu
|
e3ab560c50
|
refactor
[ROCm/composable_kernel commit: 04c5527d07]
|
2019-03-04 17:09:20 -06:00 |
|
Chao Liu
|
13388cef47
|
add anther verision of batch gemm
[ROCm/composable_kernel commit: 1cb9885058]
|
2019-02-17 01:50:57 -06:00 |
|
Chao Liu
|
c0baa18a3f
|
change file extension to hip.hpp and hip.cpp
[ROCm/composable_kernel commit: b2888adfbe]
|
2019-02-15 02:13:21 -06:00 |
|