sglang/benchmark/kernels at support_multi_protocol - sglang - Public git mirror

kvcache-ai/sglang

mirror of https://github.com/kvcache-ai/sglang.git synced 2026-06-30 03:37:51 +00:00

Files

History

RunningLeon 335dbd60b4 Support Intern-S2-Preview (#24875 )

2026-05-10 22:17:30 +08:00

..

[AMD][No-Merge] Simplify fused allreduce + RMSNorm and remove hidden_dim allowlist (#21986 )

2026-04-11 23:47:08 -07:00

decoding_attention_triton

Fix benchmark import for should_use_tensor_core (#17232 )

2026-01-16 17:48:36 -05:00

Add CLI args to conveniently support tuning more models (#12922 )

2026-03-12 23:10:55 -07:00

Fix Python 3.11 f-string lint error in deepgemm Blackwell benchmark (#22108 )

2026-04-04 21:15:22 +08:00

[Benchmark] use flashinfer bench_gpu_time instead of triton do_bench (#20305 )

2026-03-12 04:04:30 +00:00

flashinfer_allreduce_fusion

[kernel slimming] Clean many useless sgl-kernel deprecated kernels (#20277 )

2026-03-14 16:45:54 +08:00

fused_moe_triton

Support Intern-S2-Preview (#24875 )

2026-05-10 22:17:30 +08:00

Add offline auto-tuning for LoRA CSGMV kernel (#20391 )

2026-04-10 13:10:43 -07:00

feat: tiny improve fp8_gemm tune usage (#23912 )

2026-04-28 07:47:46 -04:00

scheduler_batch

[Benchmark] use flashinfer bench_gpu_time instead of triton do_bench (#20305 )

2026-03-12 04:04:30 +00:00

sliding_window_attention_triton

[Benchmark] use flashinfer bench_gpu_time instead of triton do_bench (#20305 )

2026-03-12 04:04:30 +00:00