Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
afd089f2
Unverified
Commit
afd089f2
authored
Mar 01, 2026
by
lailoo
Committed by
GitHub
Mar 01, 2026
Browse files
[Bugfix][Model] Fix Qwen3.5/Qwen3Next ignoring --dtype flag on older GPUs (#35617)
parent
3ecd0bf9
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
0 additions
and
5 deletions
+0
-5
vllm/model_executor/models/qwen3_5.py
vllm/model_executor/models/qwen3_5.py
+0
-2
vllm/model_executor/models/qwen3_next.py
vllm/model_executor/models/qwen3_next.py
+0
-3
No files found.
vllm/model_executor/models/qwen3_5.py
View file @
afd089f2
...
...
@@ -274,7 +274,6 @@ class Qwen3_5DecoderLayer(Qwen3NextDecoderLayer):
1
,
1
,
config
.
hidden_size
,
dtype
=
config
.
dtype
,
),
)
self
.
ffn_layer_scale
=
torch
.
nn
.
Parameter
(
...
...
@@ -282,7 +281,6 @@ class Qwen3_5DecoderLayer(Qwen3NextDecoderLayer):
1
,
1
,
config
.
hidden_size
,
dtype
=
config
.
dtype
,
),
)
...
...
vllm/model_executor/models/qwen3_next.py
View file @
afd089f2
...
...
@@ -463,7 +463,6 @@ class Qwen3NextGatedDeltaNet(nn.Module, MambaBase):
group_size
=
None
,
norm_before_gate
=
True
,
device
=
current_platform
.
current_device
(),
dtype
=
config
.
dtype
,
)
self
.
out_proj
=
RowParallelLinear
(
...
...
@@ -1018,7 +1017,6 @@ class Qwen3NextDecoderLayer(nn.Module):
1
,
1
,
config
.
hidden_size
,
dtype
=
config
.
dtype
,
),
)
self
.
ffn_layer_scale
=
torch
.
nn
.
Parameter
(
...
...
@@ -1026,7 +1024,6 @@ class Qwen3NextDecoderLayer(nn.Module):
1
,
1
,
config
.
hidden_size
,
dtype
=
config
.
dtype
,
),
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment