Unverified commit 33a0d43c, authored by Vadim Gimpelson, committed by GitHub

[BUGFIX][Qwen3.5] Hardcode `mlp.gate` as not quantizable (#35156)


Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
parent 80d93fd6
@@ -252,7 +252,7 @@ class Qwen3NextSparseMoeBlock(nn.Module):
             config.hidden_size,
             config.num_experts,
             bias=False,
-            quant_config=quant_config,
+            quant_config=None,
             prefix=f"{prefix}.gate",
         )
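The effect of this one-line change can be illustrated with a toy sketch. Everything below is a hypothetical stand-in rather than vLLM's actual API: `ToyQuantConfig`, `ToyLinear`, and `build_moe_block` are invented for illustration, and the sketch assumes that a layer constructed with `quant_config=None` keeps its weights unquantized while sibling layers still receive the model-wide config.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyQuantConfig:
    """Hypothetical stand-in for a model-wide quantization config."""
    method: str = "fp8"


class ToyLinear:
    """Hypothetical linear layer that records how its weights would be stored."""

    def __init__(self, in_features: int, out_features: int,
                 quant_config: Optional[ToyQuantConfig] = None,
                 prefix: str = "") -> None:
        self.in_features = in_features
        self.out_features = out_features
        self.prefix = prefix
        # Assumption: no quant config means the layer stays unquantized.
        self.weight_format = quant_config.method if quant_config else "unquantized"


def build_moe_block(prefix: str, hidden_size: int, num_experts: int,
                    quant_config: Optional[ToyQuantConfig]) -> dict:
    # The gate (router) ignores the model-wide config, mirroring the
    # hardcoded quant_config=None in the diff above.
    gate = ToyLinear(hidden_size, num_experts,
                     quant_config=None, prefix=f"{prefix}.gate")
    # Another layer in this toy block still receives the config.
    experts = ToyLinear(hidden_size, hidden_size,
                        quant_config=quant_config, prefix=f"{prefix}.experts")
    return {"gate": gate, "experts": experts}


layers = build_moe_block("model.layers.0.mlp", hidden_size=8, num_experts=4,
                         quant_config=ToyQuantConfig("fp8"))
for layer in layers.values():
    print(layer.prefix, layer.weight_format)
```

In this toy, the gate reports `unquantized` regardless of which config is passed to `build_moe_block`, which is the behavior the commit title describes for `mlp.gate`.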