Chao Liu
|
c67332b930
|
Use Tuple and vector_type instead of Array for holding tensor data (#30)
* replacing array with tuple and vector for tensor data
[ROCm/composable_kernel commit: d075adf126]
|
2021-04-28 13:10:33 -05:00 |
|
Chao Liu
|
e2753e68bd
|
Dynamic tensor descriptor (#24)
* support dynamic tensor descriptor
* use buffer load OOB feature for padding case
* add navi support
* add int8x4 inference kernel
Co-authored-by: Chao Liu <chao@ixt-rack-81.local.lan>
Co-authored-by: Jing Zhang <jizhan@amd.com>
[ROCm/composable_kernel commit: fcbb978828]
|
2021-03-25 13:51:11 -05:00 |
|
Chao Liu
|
a9e6c3340c
|
Refactor for MIOpen integration (#4)
Refactor so that multi-index transformation and padding support can be brought into MIOpen
[ROCm/composable_kernel commit: 52c3fe05be]
|
2019-10-11 11:37:31 -05:00 |
|
Chao Liu
|
17564ecfec
|
adding merge transform
[ROCm/composable_kernel commit: ca42e9101d]
|
2019-09-10 01:53:49 -05:00 |
|
Chao Liu
|
399be319a2
|
more utility code
[ROCm/composable_kernel commit: 7a7fe16086]
|
2019-09-09 00:29:33 -05:00 |
|