Chao Liu
|
55cdfe9695
|
add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
[ROCm/composable_kernel commit: 3d32ae9404]
|
2021-07-30 17:50:17 -05:00 |
|
Chao Liu
|
67d45b2ee6
|
update to clang-format-10
[ROCm/composable_kernel commit: 82fae390fb]
|
2021-07-30 16:37:00 -05:00 |
|
Chao Liu
|
b4dbf677ce
|
Dynamic tensor descriptor (#24)
* support dynamic tensor descriptor
* use buffer load OOB feature for padding case
* add navi support
* add int8x4 inference kernel
Co-authored-by: Chao Liu <chao@ixt-rack-81.local.lan>
Co-authored-by: Jing Zhang <jizhan@amd.com>
[ROCm/composable_kernel commit: fcbb978828]
|
2021-03-25 13:51:11 -05:00 |
|
Chao Liu
|
0eb214d1cd
|
Code clean up (#20)
* tuning parameters
* testing on V100
* add fp16
* remove deprecated tensor descriptor
* sync with MIOpen
* update build script
Co-authored-by: Jing Zhang <jizhan@amd.com>
[ROCm/composable_kernel commit: 5c7cec1115]
|
2020-06-23 20:31:27 -05:00 |
|