Logo
Explore Help
Register Sign In
kvcache-ai/sglang
1
0
Fork 0
You've already forked sglang
mirror of https://github.com/kvcache-ai/sglang.git synced 2026-07-03 13:57:04 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
cfd49e233ceb94de3f702f70aef06e299836aede
sglang/sgl-kernel/csrc/moe
History
Mohammad Miadh Angkad f88acf8780 [JIT Kernel] Reland NVFP4 kernels to JIT (#20012)
2026-03-07 10:31:08 +08:00
..
cutlass_moe/w4a8
[perf]optimize w4afp8 kernel on deepseek-v3-0324 (#12921)
2025-12-18 18:13:22 +08:00
cutlass_moe_helper.cu
[Fix]Fix index oob in get_group_gemm_starts kernel. (#8564)
2025-07-30 19:49:35 -07:00
fp8_blockwise_moe_kernel.cu
Update CUTLASS. Refine KernelSchedule for fp8 (grouped) gemm. (#10491)
2025-09-16 02:47:37 -07:00
fused_qknorm_rope_kernel.cu
[sgl-kernel][1/2] Fused qk_norm_rope for GLM4.6 (#15141)
2025-12-18 17:07:04 +08:00
kimi_k2_moe_fused_gate.cu
Fix warp illegal instruction in kimi k2 thinking PCG (#15306)
2025-12-18 16:58:23 +08:00
moe_align_kernel.cu
Opt moe align block size kernel (#14133)
2025-12-02 19:13:55 +08:00
moe_fused_gate.cu
Fix correction bias undefined behavior for nvfp4 models (#10426)
2025-09-14 18:41:09 -07:00
moe_sum_reduce.cu
[sgl-kernel] Support float64 moe_sum_reduce cuda kernel (#11068)
2025-10-07 14:31:11 +00:00
moe_sum.cu
[7/n] decouple quantization impl from vllm dependency - gguf kernel (#11019)
2025-10-11 14:04:57 -07:00
moe_topk_sigmoid_kernels.cu
Support moe topk sigmoid kernel (#13049)
2025-11-20 00:24:37 +08:00
moe_topk_softmax_kernels.cu
[kernel][moe] add moe topk fast (#13969)
2025-12-14 22:26:40 +08:00
prepare_moe_input.cu
fix: fix apply_shuffle_mul_sum (#7444)
2025-07-04 23:23:30 -07:00
Powered by Gitea Version: 1.25.4 Page: 404ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API