Commit f67d1f45 authored by Yineng Zhang, committed by GitHub

[Auto Sync] Update deepseek_v2.py (20250922) (#10717)


Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Michael Granado <mgranado@together.ai>
parent 0f04a5f4
@@ -451,7 +451,7 @@ class MoEGate(nn.Module):
 _is_cuda
 and hidden_states.shape[0] <= 16
 and hidden_states.shape[1] == 7168
-and self.weight.shape[0] == 256
+and (self.weight.shape[0] == 256 or self.weight.shape[0] == 384)
 and _device_sm >= 90
 ):
 # router gemm output float32
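
Note: the sketch below restates the updated condition as a standalone predicate, to show which inputs take the branch after this change. It is an illustration, not code from deepseek_v2.py. The function name, its parameters, and the constant SUPPORTED_GATE_ROWS are hypothetical; only the numeric thresholds come from the hunk above.

```python
from typing import Tuple

# Hypothetical constant: the first dimension of the gate weight that the
# condition accepts. Before this commit only 256 was accepted.
SUPPORTED_GATE_ROWS = (256, 384)


def takes_router_gemm_branch(
    is_cuda: bool,
    hidden_shape: Tuple[int, int],
    weight_shape: Tuple[int, int],
    device_sm: int,
) -> bool:
    """Return True when all checks from the hunk's condition pass.

    Arguments stand in for _is_cuda, hidden_states.shape,
    self.weight.shape, and _device_sm respectively.
    """
    num_rows, hidden_size = hidden_shape
    return bool(
        is_cuda
        and num_rows <= 16            # hidden_states.shape[0] <= 16
        and hidden_size == 7168       # hidden_states.shape[1] == 7168
        and weight_shape[0] in SUPPORTED_GATE_ROWS
        and device_sm >= 90
    )


if __name__ == "__main__":
    # A gate weight with 384 rows now passes; it did not before the change.
    print(takes_router_gemm_branch(True, (8, 7168), (384, 7168), 90))
    # Any other row count still falls through to the other code path.
    print(takes_router_gemm_branch(True, (8, 7168), (128, 7168), 90))
```

Writing the accepted sizes as a tuple membership test is equivalent to the chained `or` in the commit and may be easier to extend if more sizes are added later.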