carlushuang
ededb608ed
embedding fuse layernorm (#405)
* add gridwise/device sparse embedding
* update code
* update code
* remove useless makefile
* code fix
* workable
* work properly
* emb add
* add more instance
* format
* remove useless code
* fix format
* fix clang-tidy
* clean
* fix a compile error
Co-authored-by: Chao Liu <chao.liu2@amd.com>
Co-authored-by: Chao Liu <lc.roy86@gmail.com>
[ROCm/composable_kernel commit: efd1d25733]
2022-09-09 10:41:15 -05:00
..
2022-06-24 23:32:43 -05:00
2022-08-15 10:11:02 -05:00
2022-08-15 10:11:02 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-09-08 09:27:50 -05:00
2022-09-08 09:27:50 -05:00
2022-06-24 23:32:43 -05:00
2022-08-15 10:11:02 -05:00
2022-06-27 14:25:10 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-08-24 10:12:54 -05:00
2022-07-29 18:19:25 -05:00
2022-06-24 23:32:43 -05:00
2022-07-08 15:55:14 -05:00
2022-06-27 14:25:10 -05:00
2022-07-08 15:55:14 -05:00
2022-07-01 01:38:00 -05:00
2022-06-24 23:32:43 -05:00
2022-08-13 09:18:58 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-08-23 14:41:56 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-08-13 09:43:18 -05:00
2022-08-13 09:43:18 -05:00
2022-06-24 23:32:43 -05:00
2022-08-15 10:11:02 -05:00
2022-07-14 22:52:45 -05:00
2022-09-09 10:41:15 -05:00