| Chao Liu | 4c9d05cc91 | added threadwise tensor reorder operation [ROCm/composable_kernel commit: 3dbd47252c] | 2019-01-04 15:34:13 -06:00 |
| Chao Liu | f9be16c4b6 | another version of direct conv [ROCm/composable_kernel commit: 39775d484c] | 2018-12-18 03:22:12 -06:00 |
| Chao Liu | b158b92068 | refactor [ROCm/composable_kernel commit: 1eafc9c1fb] | 2018-11-28 16:20:01 -06:00 |
| Chao Liu | 7780c99fba | changed direct conv [ROCm/composable_kernel commit: fee92fb636] | 2018-11-26 18:42:42 -06:00 |
| Chao Liu | 7569eeaf55 | refactor direct [ROCm/composable_kernel commit: 24d2f034fa] | 2018-11-25 01:10:11 -06:00 |
| Chao Liu | 779a90edb3 | faster: output skip LDS [ROCm/composable_kernel commit: c587726190] | 2018-11-16 06:08:11 -06:00 |
| Chao Liu | 3e4752bf7e | refactor [ROCm/composable_kernel commit: 73480fee36] | 2018-11-15 23:53:23 -06:00 |
| Chao Liu | cd469dcd63 | clean up [ROCm/composable_kernel commit: 29496c95d3] | 2018-11-15 20:16:58 -06:00 |
| Chao Liu | fbc1d7b252 | faster [ROCm/composable_kernel commit: c8d0356a34] | 2018-11-15 18:54:57 -06:00 |
| Chao Liu | e807d7ff6c | improved blockwise_tensor_op [ROCm/composable_kernel commit: 1812666a47] | 2018-11-14 08:55:45 -06:00 |
| Chao Liu | 9f6a585b84 | add 2nd version of blockwise_tensor_op [ROCm/composable_kernel commit: 08c7f74391] | 2018-11-07 18:35:54 -06:00 |