Chao Liu
|
6fd0910da8
|
refactoring ConstantTensorDescriptor
[ROCm/composable_kernel commit: a0584426ff]
|
2019-03-17 03:22:41 -05:00 |
|
Chao Liu
|
1c962a13ee
|
device_implicit_gemm_convolution_1_chwn_csrk_khwn: use tensor copy (instead of pointwise) for writing output, 3x3 increased from 78% to 84%, 5x5 from 80% to 84%
[ROCm/composable_kernel commit: a65ef90308]
|
2019-02-19 11:47:46 -06:00 |
|
Chao Liu
|
c0baa18a3f
|
change file extension to hip.hpp and hip.cpp
[ROCm/composable_kernel commit: b2888adfbe]
|
2019-02-15 02:13:21 -06:00 |
|