carlushuang
8d4fd1233d
embedding fuse layernorm (#405)
* add gridwise/device sparse embedding
* update code
* update code
* remove useless makefile
* code fix
* workable
* work properly
* emb add
* add more instance
* format
* remove useless code
* fix format
* fix clang-tidy
* clean
* fix a compile error
Co-authored-by: Chao Liu <chao.liu2@amd.com>
Co-authored-by: Chao Liu <lc.roy86@gmail.com>
[ROCm/composable_kernel commit: efd1d25733]
2022-09-09 10:41:15 -05:00