lalala-sh
8f426b1216
Moe gemm activation (#2026)
* fix useless code and remove usless oob
* clang format
* fix coredump in e2e test
* fix2
* fix clang format
* fix output oob
* impl int64 but result not correct
* int64 index ok now
* input output all ok
* fix uint32
* revert v1 test
* use uint32
* mork to support 13w tokens
* moe sorting fix moebuf
* fix merge
* update moe api fix aiter build
* fix buid
* fuse silu
* silu ok
* acale ok
* add silu
* change code
* gemm2 ok
* gufusion compatible ok, fix warnings
* gu fusion for m32 m64 ok
* support bf16 cshuffle
* i4 gemm2 ok
* i4 gemm2 ok and i4 gemm1 build
* 16x16 run ok
* change flops; change cshuffle dtype
* fuse gelu silu act in moe gemm1
* fp8 with act ready
* int4 act ready
* remove useless changes
* remove useless code change
* fix clang format
* add the arch limit of int4 moe gemm
* fuse moe activation
* fix fp8 16x16
* fix no quant case
* fix bugs
* fix fp8 gufusion bug
* remove useless comments
* refine activation code & complete moe example
* fix int8 bugs
* merge tkw1
---------
Co-authored-by: coderfeli <coderfeli@163.com>
Co-authored-by: feli <felix.li@amd.com>
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: root <root@hjbog-srdc-51.amd.com>
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
[ROCm/composable_kernel commit: 39ba03f25d]
2025-04-23 10:35:34 +08:00
..
2023-05-31 18:46:57 -05:00
2025-03-05 14:33:28 -08:00
2025-02-20 18:58:14 -08:00
2025-04-07 07:08:39 -07:00
2023-09-06 11:44:09 -05:00
2025-03-24 15:08:54 -07:00
2024-02-07 01:08:34 +01:00
2024-09-11 15:19:42 +02:00
2025-02-20 18:58:14 -08:00
2024-11-18 14:07:04 -08:00
2025-04-16 15:25:02 -06:00
2023-05-31 18:46:57 -05:00
2025-01-31 09:48:39 -08:00
2025-03-07 13:43:52 -08:00
2023-05-31 18:46:57 -05:00
2025-03-12 07:29:09 -07:00
2023-05-31 18:46:57 -05:00
2025-01-31 09:48:39 -08:00
2025-03-26 19:23:01 -05:00
2025-01-31 09:48:39 -08:00
2025-03-24 15:08:54 -07:00
2025-04-01 12:22:10 -07:00
2025-04-23 10:35:34 +08:00
2025-04-15 17:17:07 -06:00
2025-02-20 18:58:14 -08:00
2025-04-03 15:30:21 -07:00
2025-03-24 15:08:54 -07:00
2024-04-25 15:07:14 -05:00
2025-04-01 12:06:25 -07:00
2023-05-31 18:46:57 -05:00
2025-01-31 09:48:39 -08:00
2025-01-31 09:48:39 -08:00
2025-03-24 15:08:54 -07:00
2023-05-31 18:46:57 -05:00
2023-07-06 10:58:55 -05:00
2023-05-31 18:46:57 -05:00
2023-09-06 11:44:09 -05:00
2023-10-19 09:34:39 -07:00
2025-01-31 09:48:39 -08:00
2025-01-31 09:48:39 -08:00
2024-01-19 11:29:00 +01:00
2025-02-20 18:58:14 -08:00
2025-03-24 15:08:54 -07:00
2025-02-20 18:58:14 -08:00
2023-10-19 17:23:19 +02:00
2023-05-31 18:46:57 -05:00
2025-03-26 19:23:01 -05:00
2025-03-24 15:08:54 -07:00
2025-04-03 12:42:03 -05:00
2025-04-15 17:17:07 -06:00
2023-05-31 18:46:57 -05:00
2025-03-24 15:08:54 -07:00
2025-03-24 15:08:54 -07:00
2025-01-31 09:48:39 -08:00
2023-07-06 10:58:55 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2024-09-20 09:40:45 +02:00
2025-04-03 12:42:03 -05:00
2023-05-31 18:46:57 -05:00
2025-04-08 09:00:51 -07:00
2023-05-31 18:46:57 -05:00
2025-01-02 11:48:06 +08:00
2025-01-31 09:48:39 -08:00
2023-05-31 18:46:57 -05:00
2024-06-27 00:33:34 -07:00
2023-05-31 18:46:57 -05:00
2024-04-13 21:03:18 -05:00
2025-04-23 10:35:34 +08:00
2025-01-31 09:48:39 -08:00
2025-04-22 01:13:22 -07:00
2025-02-20 18:58:14 -08:00
2023-07-26 14:18:15 -05:00
2023-07-06 10:58:55 -05:00