carlushuang
8d4fd1233d
embedding fuse layernorm (#405)
* add gridwise/device sparse embedding
* update code
* update code
* remove useless makefile
* code fix
* workable
* work properly
* emb add
* add more instance
* format
* remove useless code
* fix format
* fix clang-tidy
* clean
* fix a compile error
Co-authored-by: Chao Liu <chao.liu2@amd.com>
Co-authored-by: Chao Liu <lc.roy86@gmail.com>
[ROCm/composable_kernel commit: efd1d25733]
2022-09-09 10:41:15 -05:00
..
2022-09-01 09:31:17 -05:00
2022-07-29 18:19:25 -05:00
2022-07-29 18:19:25 -05:00
2022-08-23 10:38:41 -05:00
2022-09-01 09:31:17 -05:00
2022-08-31 16:32:17 -05:00
2022-08-25 16:58:48 -05:00
2022-07-29 18:19:25 -05:00
2022-07-29 18:19:25 -05:00
2022-08-25 17:19:15 -05:00
2022-08-30 11:38:26 -05:00
2022-07-29 18:19:25 -05:00
2022-08-13 00:16:14 -05:00
2022-08-15 10:11:02 -05:00
2022-07-29 18:19:25 -05:00
2022-08-15 10:11:02 -05:00
2022-08-25 17:19:15 -05:00
2022-09-06 12:22:48 -05:00
2022-08-25 17:19:15 -05:00
2022-08-23 14:41:56 -05:00
2022-08-12 15:22:39 -05:00
2022-07-29 18:19:25 -05:00
2022-08-13 09:43:18 -05:00
2022-08-10 12:20:29 -05:00
2022-08-10 12:20:29 -05:00
2022-08-31 11:27:11 -05:00
2022-09-01 09:31:17 -05:00
2022-09-08 09:27:50 -05:00
2022-08-15 10:11:02 -05:00
2022-08-15 10:11:02 -05:00
2022-08-25 17:19:15 -05:00
2022-09-09 10:41:15 -05:00
2022-09-01 09:31:17 -05:00
2022-09-09 10:41:15 -05:00