OpenDAS / vllm_cscc
Commit 73a99cc2 (Unverified)
Authored Oct 03, 2025 by Aleksandr Samarin
Committed by GitHub, Oct 03, 2025
[Model] Fixed stream generator for gpt-oss + spec-decoding (#26027)
Signed-off-by: Aleksandr Samarin <astrlrd@nebius.com>
Parent: adae0c1f
Showing 1 changed file with 3 additions and 1 deletion
vllm/entrypoints/openai/serving_chat.py
...
@@ -691,11 +691,13 @@ class OpenAIServingChat(OpenAIServing):
                 if self.use_harmony:
                     harmony_parser = harmony_parsers[i]
                     prev_recipient = harmony_parser.current_recipient
+                    delta_text = ""
                     for token_id in output.token_ids:
                         harmony_parser.process(token_id)
+                        delta_text += (harmony_parser.last_content_delta
+                                       or "")
                     cur_channel = harmony_parser.current_channel
                     cur_recipient = harmony_parser.current_recipient
-                    delta_text = harmony_parser.last_content_delta or ""
                 else:
                     delta_text = output.text
...
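The effect of this change can be illustrated with a minimal, self-contained sketch. It assumes, as the diff suggests, that each call to process() overwrites last_content_delta, so reading it once after the loop keeps only the text of the final token whenever a single output step carries several token IDs (as can happen with speculative decoding). ToyParser, VOCAB, delta_last_only and delta_accumulated are hypothetical names used only for illustration; they are not part of vLLM or of the Harmony parser.

```python
class ToyParser:
    """Hypothetical stand-in for a streaming parser (not the real Harmony parser)."""

    def __init__(self, vocab):
        self.vocab = vocab
        self.last_content_delta = None

    def process(self, token_id):
        # Assumption: every call replaces the previous delta.
        self.last_content_delta = self.vocab.get(token_id)


def delta_last_only(parser, token_ids):
    # Pre-fix pattern: read the delta once, after all tokens are processed.
    for token_id in token_ids:
        parser.process(token_id)
    return parser.last_content_delta or ""


def delta_accumulated(parser, token_ids):
    # Post-fix pattern: collect the delta after each token.
    delta_text = ""
    for token_id in token_ids:
        parser.process(token_id)
        delta_text += parser.last_content_delta or ""
    return delta_text


VOCAB = {1: "Hel", 2: "lo", 3: " world"}  # toy token-to-text mapping
step_token_ids = [1, 2, 3]  # several tokens arriving in one output step

old_delta = delta_last_only(ToyParser(VOCAB), step_token_ids)
new_delta = delta_accumulated(ToyParser(VOCAB), step_token_ids)
print(repr(old_delta))  # ' world'
print(repr(new_delta))  # 'Hello world'
```

With one token per step the two patterns return the same text, so the difference only shows up when a step delivers more than one token.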