composable_kernel/host/host_tensor/include
ltqin 627d8ef35a Backward weight v4r4r2 with xdlops (#18)
* start

* modify transformat

* modify device convolution

* modify host

* added host conv bwd and wrw

* remove bwd, separate wrw

* clean

* hacall k to zero

* out log

* fixed

* fixed

* change to (out in wei)

* input hack

* hack to out

* format

* fix by comments

* change wei hacks (wei transform has not been merged)

* fix program once issue

* fix review comment

* fix vector load issue

* tweak

Co-authored-by: ltqin <letaoqin@amd.com>
Co-authored-by: Jing Zhang <jizhan@amd.com>
Co-authored-by: Chao Liu <chao.liu2@amd.com>
2021-08-30 22:49:17 -05:00