Unverified Commit 52cefdbf authored by Yineng Zhang's avatar Yineng Zhang Committed by GitHub
Browse files

fix: resolve the fp8 bug introduced by vLLM 0.5.5 (#1276)

parent 51c554d8
......@@ -695,7 +695,7 @@ class DeepseekV2ForCausalLM(nn.Module):
weight_loader(
param,
loaded_weight,
weight_name,
name,
shard_id=shard_id,
expert_id=expert_id,
)
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment