lalala-sh
39ba03f25d
Moe gemm activation (#2026)
* fix useless code and remove usless oob
* clang format
* fix coredump in e2e test
* fix2
* fix clang format
* fix output oob
* impl int64 but result not correct
* int64 index ok now
* input output all ok
* fix uint32
* revert v1 test
* use uint32
* mork to support 13w tokens
* moe sorting fix moebuf
* fix merge
* update moe api fix aiter build
* fix buid
* fuse silu
* silu ok
* acale ok
* add silu
* change code
* gemm2 ok
* gufusion compatible ok, fix warnings
* gu fusion for m32 m64 ok
* support bf16 cshuffle
* i4 gemm2 ok
* i4 gemm2 ok and i4 gemm1 build
* 16x16 run ok
* change flops; change cshuffle dtype
* fuse gelu silu act in moe gemm1
* fp8 with act ready
* int4 act ready
* remove useless changes
* remove useless code change
* fix clang format
* add the arch limit of int4 moe gemm
* fuse moe activation
* fix fp8 16x16
* fix no quant case
* fix bugs
* fix fp8 gufusion bug
* remove useless comments
* refine activation code & complete moe example
* fix int8 bugs
* merge tkw1
---------
Co-authored-by: coderfeli <coderfeli@163.com>
Co-authored-by: feli <felix.li@amd.com>
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: root <root@hjbog-srdc-51.amd.com>
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
2025-04-23 10:35:34 +08:00
..
2025-04-23 10:35:34 +08:00
2023-09-27 17:19:06 +02:00
2023-07-18 11:01:33 -05:00
2023-05-31 18:46:57 -05:00
2025-01-31 09:48:39 -08:00
2023-08-10 12:04:35 +08:00
2025-02-20 18:58:14 -08:00
2023-05-31 18:46:57 -05:00
2022-07-29 18:19:25 -05:00
2023-05-31 18:46:57 -05:00
2024-12-13 21:08:35 +01:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2025-02-20 18:58:14 -08:00
2025-02-10 11:17:02 +08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2024-10-12 14:05:11 +08:00
2023-10-17 20:17:58 -05:00
2023-11-02 14:26:33 -07:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-10-31 10:46:32 +01:00
2023-05-31 18:46:57 -05:00
2024-04-19 13:31:17 +02:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2024-03-08 17:11:51 -08:00
2023-09-26 21:16:23 -05:00
2024-07-19 22:06:52 +08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2025-03-03 07:55:05 -08:00
2025-03-24 15:41:07 -06:00
2023-05-31 18:46:57 -05:00
2023-10-13 16:27:11 -05:00
2024-07-05 21:40:30 -07:00
2023-07-26 14:18:15 -05:00
2025-03-10 11:16:44 +08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-10-04 18:04:27 -05:00
2024-04-18 23:35:04 +02:00
2023-10-04 08:19:08 -05:00
2025-01-31 09:48:39 -08:00
2023-11-14 17:00:40 +01:00
2023-05-31 18:46:57 -05:00
2024-11-27 13:02:44 +01:00
2024-04-15 21:09:45 -05:00
2024-04-15 21:09:45 -05:00
2023-05-31 18:46:57 -05:00
2024-11-27 13:02:44 +01:00
2024-11-27 13:02:44 +01:00
2024-11-27 13:02:44 +01:00
2023-08-31 21:01:50 +08:00
2023-05-31 18:46:57 -05:00
2023-12-19 04:23:11 +08:00
2023-11-10 18:02:03 +08:00
2023-11-09 08:34:51 +08:00
2023-05-31 18:46:57 -05:00
2023-08-15 02:25:28 +08:00
2023-08-10 12:04:35 +08:00
2024-07-19 22:01:22 +08:00
2023-05-31 18:46:57 -05:00
2023-10-11 14:27:29 -05:00
2023-05-31 18:46:57 -05:00
2025-02-20 18:58:14 -08:00
2024-07-19 09:29:25 +02:00
2025-02-20 18:58:14 -08:00
2024-06-25 16:37:35 -05:00
2023-05-31 18:46:57 -05:00
2025-02-20 18:58:14 -08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00