sglang

mirror of https://github.com/kvcache-ai/sglang.git synced 2026-06-30 03:37:51 +00:00

Files

xwy-amd8 01202ee43d Add DeepseekV32ForCausalLM to NSA auto-selection model_arch list

DeepseekV32ForCausalLM was missing from the model_arch guard in
_handle_model_specific_adjustments(), so is_deepseek_nsa() was never
reached for V3.2 models. As a result, the NSA attention backend was not
auto-selected, which led to a q_rope TypeError with flashinfer or
incorrect behavior with other backends.

Upstream bug introduced in sgl-project/sglang#13687 (commit 618ca2380),
which refactored the flat is_deepseek_nsa() check into a nested block
under the model_arch guard but listed only DeepseekV3ForCausalLM.
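
The failure mode described in this commit message can be illustrated with a minimal sketch. This is not the sglang source: the function select_attention_backend(), the "uses_nsa" config key, the two arch lists, and the "nsa"/"flashinfer" return strings are all hypothetical stand-ins chosen only to show how a nested check becomes unreachable when the outer guard omits an architecture.

```python
# Hypothetical, simplified model of the guard described in the commit message.
# None of these names or values are taken from the real sglang code base.

# Outer guard as described for the buggy state: only the V3 arch is listed.
ARCHS_BEFORE_FIX = ["DeepseekV3ForCausalLM"]
# Outer guard after the fix: the V3.2 arch is listed as well.
ARCHS_AFTER_FIX = ["DeepseekV3ForCausalLM", "DeepseekV32ForCausalLM"]


def is_deepseek_nsa(config: dict) -> bool:
    # Assumption: a boolean "uses_nsa" flag stands in for whatever the real
    # check inspects in the model config.
    return bool(config.get("uses_nsa", False))


def select_attention_backend(model_arch, config, guarded_archs, default="flashinfer"):
    # The NSA check is nested under the arch guard, so it only runs for
    # architectures that appear in guarded_archs.
    if model_arch in guarded_archs:
        if is_deepseek_nsa(config):
            return "nsa"
    # Any arch missing from the guard falls through to the default backend,
    # even if its config would have passed the NSA check.
    return default


v32_config = {"uses_nsa": True}
before = select_attention_backend("DeepseekV32ForCausalLM", v32_config, ARCHS_BEFORE_FIX)
after = select_attention_backend("DeepseekV32ForCausalLM", v32_config, ARCHS_AFTER_FIX)
print(before, after)
```

In this sketch the V3.2 arch falls through to the default backend until it is added to the guard list, which mirrors the one-line nature of the fix.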

2026-02-26 14:03:55 +00:00

sglang

Add DeepseekV32ForCausalLM to NSA auto-selection model_arch list

2026-02-26 14:03:55 +00:00

pyproject_cpu.toml

refactor: replace local proto compilation with smg-grpc-proto package (#18682)

2026-02-12 05:29:24 -08:00

pyproject_npu.toml

[diffusion] model: LTX-2 Support PR3 (#19151)

2026-02-24 16:55:28 +08:00

pyproject_other.toml

[diffusion] model: LTX-2 Support PR3 (#19151)

2026-02-24 16:55:28 +08:00

pyproject_xpu.toml

refactor: replace local proto compilation with smg-grpc-proto package (#18682)

2026-02-12 05:29:24 -08:00

pyproject.toml

[diffusion] chore: tiny fix pyproject.toml (#19256)

2026-02-25 11:57:53 +08:00