Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
ae4e2806
Unverified
Commit
ae4e2806
authored
Feb 10, 2026
by
Roger Wang
Committed by
GitHub
Feb 10, 2026
Browse files
[Bugfix] Fix FI kernel`chunk_gated_delta_rule` output shape for Qwen3.5 (#34219)
Signed-off-by:
Roger Wang
<
hey@rogerw.io
>
parent
cbea11c9
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
1 deletion
+3
-1
vllm/model_executor/models/qwen3_next.py
vllm/model_executor/models/qwen3_next.py
+3
-1
No files found.
vllm/model_executor/models/qwen3_next.py
View file @
ae4e2806
...
@@ -135,7 +135,7 @@ def fi_chunk_gated_delta_rule(
...
@@ -135,7 +135,7 @@ def fi_chunk_gated_delta_rule(
fi_state
=
initial_state
.
to
(
torch
.
float32
)
fi_state
=
initial_state
.
to
(
torch
.
float32
)
fi_g
=
g
.
to
(
torch
.
float32
)
fi_g
=
g
.
to
(
torch
.
float32
)
fi_beta
=
beta
.
to
(
torch
.
float32
)
fi_beta
=
beta
.
to
(
torch
.
float32
)
return
chunk_gated_delta_rule_fi
(
output
,
final_state
=
chunk_gated_delta_rule_fi
(
q
=
q
,
q
=
q
,
k
=
k
,
k
=
k
,
v
=
v
,
v
=
v
,
...
@@ -145,6 +145,8 @@ def fi_chunk_gated_delta_rule(
...
@@ -145,6 +145,8 @@ def fi_chunk_gated_delta_rule(
output_final_state
=
output_final_state
,
output_final_state
=
output_final_state
,
cu_seqlens
=
cu_seqlens
,
cu_seqlens
=
cu_seqlens
,
)
)
# Unsqueeze back to 4D (1, L, H, D) to match fla output format
return
output
.
unsqueeze
(
0
),
final_state
@
CustomOp
.
register
(
"chunk_gated_delta_rule"
)
@
CustomOp
.
register
(
"chunk_gated_delta_rule"
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment