Unverified commit 0297773a authored by Gao016, committed by GitHub

A tiny fix to support DeepSeek BF16 weights (#12313)


Co-authored-by: gaochang <gaochang@U-19PX2WQ1-0350.local>
parent 587deb15
```diff
@@ -2998,7 +2998,7 @@ class DeepseekV2ForCausalLM(nn.Module):
             disable_reason = "Only Deepseek V3/R1 on NV-platform with capability >= 80 can use shared experts fusion optimization."
         elif get_moe_expert_parallel_world_size() > 1:
             disable_reason = "Deepseek V3/R1 can not use shared experts fusion optimization under expert parallelism."
-        elif self.quant_config.get_name() == "w4afp8":
+        elif self.quant_config and self.quant_config.get_name() == "w4afp8":
             disable_reason = "Deepseek V3/R1 W4AFP8 model uses different quant method for routed experts and shared experts."
         if disable_reason is not None:
```
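For context: when the model is loaded with unquantized BF16 weights, no quantization config is present, so `self.quant_config` is `None` and the original `self.quant_config.get_name()` call raised an `AttributeError`. The added truthiness check short-circuits the comparison in that case. Below is a minimal, self-contained sketch of the guard pattern; the `QuantConfig` class and `shared_experts_fusion_disable_reason` function are hypothetical stand-ins, not the actual sglang API.

```python
# Minimal sketch of the None-guard pattern from this commit.
# QuantConfig and shared_experts_fusion_disable_reason are illustrative
# stand-ins; the real logic lives inside DeepseekV2ForCausalLM in sglang.
from typing import Optional


class QuantConfig:
    """Hypothetical stand-in for a quantization config (e.g. w4afp8)."""

    def __init__(self, name: str) -> None:
        self._name = name

    def get_name(self) -> str:
        return self._name


def shared_experts_fusion_disable_reason(
    quant_config: Optional[QuantConfig],
) -> Optional[str]:
    # Before the fix: `quant_config.get_name()` raised AttributeError
    # when quant_config was None (the BF16 / unquantized-weights case).
    # The truthiness check skips the comparison entirely when no
    # quantization config exists.
    if quant_config and quant_config.get_name() == "w4afp8":
        return (
            "Deepseek V3/R1 W4AFP8 model uses different quant method "
            "for routed experts and shared experts."
        )
    return None  # BF16 weights: no quant config, fusion stays enabled


# BF16 weights no longer crash; W4AFP8 still disables the fusion.
assert shared_experts_fusion_disable_reason(None) is None
assert shared_experts_fusion_disable_reason(QuantConfig("w4afp8")) is not None
```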