Commit Graph

2 Commits

Author SHA1 Message Date
Anthony Chang
ea5f57fa92 revise count_vgpr script to capture all possible syntaxes (#124)
[ROCm/composable_kernel commit: c78d1be19c]
2022-03-11 13:30:50 -06:00
Chao Liu
e2753e68bd Dynamic tensor descriptor (#24)
* support dynamic tensor descriptor

* use buffer load OOB feature for padding case

* add navi support

* add int8x4 inference kernel

Co-authored-by: Chao Liu <chao@ixt-rack-81.local.lan>
Co-authored-by: Jing Zhang <jizhan@amd.com>

[ROCm/composable_kernel commit: fcbb978828]
2021-03-25 13:51:11 -05:00