carlushuang
|
efd1d25733
|
embedding fuse layernorm (#405)
* add gridwise/device sparse embedding
* update code
* update code
* remove useless makefile
* code fix
* workable
* work properly
* emb add
* add more instance
* format
* remove useless code
* fix format
* fix clang-tidy
* clean
* fix a compile error
Co-authored-by: Chao Liu <chao.liu2@amd.com>
Co-authored-by: Chao Liu <lc.roy86@gmail.com>
|
2022-09-09 10:41:15 -05:00 |
|