lalala-sh
39ba03f25d
Moe gemm activation (#2026)
* fix useless code and remove usless oob
* clang format
* fix coredump in e2e test
* fix2
* fix clang format
* fix output oob
* impl int64 but result not correct
* int64 index ok now
* input output all ok
* fix uint32
* revert v1 test
* use uint32
* mork to support 13w tokens
* moe sorting fix moebuf
* fix merge
* update moe api fix aiter build
* fix buid
* fuse silu
* silu ok
* acale ok
* add silu
* change code
* gemm2 ok
* gufusion compatible ok, fix warnings
* gu fusion for m32 m64 ok
* support bf16 cshuffle
* i4 gemm2 ok
* i4 gemm2 ok and i4 gemm1 build
* 16x16 run ok
* change flops; change cshuffle dtype
* fuse gelu silu act in moe gemm1
* fp8 with act ready
* int4 act ready
* remove useless changes
* remove useless code change
* fix clang format
* add the arch limit of int4 moe gemm
* fuse moe activation
* fix fp8 16x16
* fix no quant case
* fix bugs
* fix fp8 gufusion bug
* remove useless comments
* refine activation code & complete moe example
* fix int8 bugs
* merge tkw1
---------
Co-authored-by: coderfeli <coderfeli@163.com>
Co-authored-by: feli <felix.li@amd.com>
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: root <root@hjbog-srdc-51.amd.com>
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
2025-04-23 10:35:34 +08:00
..
2023-08-14 15:46:27 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2024-06-21 09:47:58 +02:00
2025-04-15 17:17:07 -06:00
2025-02-20 14:00:27 -08:00
2025-03-10 11:16:44 +08:00
2025-03-10 11:16:44 +08:00
2025-04-23 10:35:34 +08:00
2025-04-23 10:35:34 +08:00
2025-04-23 10:35:34 +08:00
2025-03-11 10:11:21 -07:00
2025-02-20 14:00:27 -08:00
2025-03-11 10:11:21 -07:00
2025-02-20 14:00:27 -08:00
2025-04-23 10:35:34 +08:00
2025-04-15 17:17:07 -06:00
2025-02-20 14:00:27 -08:00
2025-03-11 10:11:21 -07:00
2025-01-03 18:35:21 +08:00
2025-04-15 17:17:07 -06:00
2024-06-21 09:47:58 +02:00
2025-03-11 10:11:21 -07:00
2025-01-03 18:35:21 +08:00
2024-11-26 17:36:53 +01:00
2025-03-11 10:11:21 -07:00
2025-01-03 18:35:21 +08:00
2024-09-04 20:58:54 -07:00
2025-01-03 18:35:21 +08:00
2025-02-20 14:00:27 -08:00
2024-06-21 09:47:58 +02:00
2024-09-25 13:45:38 -07:00
2024-09-11 15:19:42 +02:00
2024-10-28 19:02:48 -07:00
2024-06-21 09:47:58 +02:00
2024-06-27 00:33:34 -07:00
2023-08-23 11:36:17 -07:00
2023-05-31 18:46:57 -05:00
2023-07-06 10:58:55 -05:00
2023-07-06 10:58:55 -05:00
2023-11-25 13:35:22 +01:00
2024-03-08 17:11:51 -08:00
2025-04-23 10:35:34 +08:00
2025-02-20 14:00:27 -08:00
2024-03-22 10:40:43 +01:00
2023-05-31 18:46:57 -05:00
2023-07-26 14:18:15 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2025-01-31 09:48:39 -08:00
2025-04-23 10:35:34 +08:00
2024-05-28 12:04:22 -05:00