lalala-sh
39ba03f25d
Moe gemm activation (#2026)
* fix useless code and remove useless oob
* clang format
* fix coredump in e2e test
* fix2
* fix clang format
* fix output oob
* impl int64 but result not correct
* int64 index ok now
* input output all ok
* fix uint32
* revert v1 test
* use uint32
* work to support 13w tokens
* moe sorting fix moebuf
* fix merge
* update moe api fix aiter build
* fix build
* fuse silu
* silu ok
* scale ok
* add silu
* change code
* gemm2 ok
* gufusion compatible ok, fix warnings
* gu fusion for m32 m64 ok
* support bf16 cshuffle
* i4 gemm2 ok
* i4 gemm2 ok and i4 gemm1 build
* 16x16 run ok
* change flops; change cshuffle dtype
* fuse gelu silu act in moe gemm1
* fp8 with act ready
* int4 act ready
* remove useless changes
* remove useless code change
* fix clang format
* add the arch limit of int4 moe gemm
* fuse moe activation
* fix fp8 16x16
* fix no quant case
* fix bugs
* fix fp8 gufusion bug
* remove useless comments
* refine activation code & complete moe example
* fix int8 bugs
* merge tkw1
---------
Co-authored-by: coderfeli <coderfeli@163.com>
Co-authored-by: feli <felix.li@amd.com>
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: root <root@hjbog-srdc-51.amd.com>
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
2025-04-23 10:35:34 +08:00