Chao Liu
|
1c962a13ee
|
device_implicit_gemm_convolution_1_chwn_csrk_khwn: use tensor copy (instead of pointwise) for writing output, 3x3 increased from 78% to 84%, 5x5 from 80% to 84%
[ROCm/composable_kernel commit: a65ef90308]
|
2019-02-19 11:47:46 -06:00 |
|
Chao Liu
|
13388cef47
|
add anther verision of batch gemm
[ROCm/composable_kernel commit: 1cb9885058]
|
2019-02-17 01:50:57 -06:00 |
|
Chao Liu
|
8bde1393d4
|
2-type implicit gemm using chwn
[ROCm/composable_kernel commit: 9f2e8f8bb4]
|
2019-02-15 22:51:51 -06:00 |
|
Chao Liu
|
64620dfcd4
|
delete useless code
[ROCm/composable_kernel commit: d7c84daf66]
|
2019-02-15 22:24:18 -06:00 |
|
Chao Liu
|
c0baa18a3f
|
change file extension to hip.hpp and hip.cpp
[ROCm/composable_kernel commit: b2888adfbe]
|
2019-02-15 02:13:21 -06:00 |
|
Chao Liu
|
153629655f
|
update build
[ROCm/composable_kernel commit: a414e3fdf8]
|
2019-02-15 02:06:34 -06:00 |
|