Repository: OpenDAS / vllm_cscc
Commit 3f7662d6 (Unverified)
Authored Feb 03, 2026 by Patrick von Platen
Committed by GitHub, Feb 03, 2026

[Voxtral Realtime] Change name (#33716)

Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>

Parent: a372f3f4

Showing 4 changed files with 6 additions and 6 deletions (+6 -6)

examples/online_serving/openai_realtime_client.py (+2 -2)
examples/online_serving/openai_realtime_microphone_client.py (+2 -2)
tests/entrypoints/openai/test_realtime_validation.py (+1 -1)
tests/models/multimodal/generation/test_voxtral_realtime.py (+1 -1)
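
The change in every file is the same string substitution, from the 3B model name to the 4B one. As a rough illustration, the rename can be expressed as a small helper. This is only a sketch: `rename_model`, `OLD_NAME`, and `NEW_NAME` are hypothetical names and are not part of the commit.

```python
# Hypothetical helper, not part of the commit: applies the rename to a text blob.
OLD_NAME = "mistralai/Voxtral-Mini-3B-Realtime-2602"
NEW_NAME = "mistralai/Voxtral-Mini-4B-Realtime-2602"


def rename_model(text: str) -> tuple[str, int]:
    """Return the updated text and the number of occurrences replaced."""
    count = text.count(OLD_NAME)
    return text.replace(OLD_NAME, NEW_NAME), count


if __name__ == "__main__":
    sample = 'MODEL_NAME = "mistralai/Voxtral-Mini-3B-Realtime-2602"\n'
    updated, replaced = rename_model(sample)
    print(replaced, updated, end="")
```

Run over each changed file, the replacement count would be expected to match the per-file additions listed above (2, 2, 1, 1).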

examples/online_serving/openai_realtime_client.py
@@ -7,7 +7,7 @@ audio transcription by uploading an audio file.
 Before running this script, you must start the vLLM server with a realtime-capable
 model, for example:
-vllm serve mistralai/Voxtral-Mini-3B-Realtime-2602 --enforce-eager
+vllm serve mistralai/Voxtral-Mini-4B-Realtime-2602 --enforce-eager
 Requirements:
 - vllm with audio support
@@ -126,7 +126,7 @@ if __name__ == "__main__":
     parser.add_argument(
         "--model",
         type=str,
-        default="mistralai/Voxtral-Mini-3B-Realtime-2602",
+        default="mistralai/Voxtral-Mini-4B-Realtime-2602",
         help="Model that is served and should be pinged.",
     )
     parser.add_argument(
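
Because the help text describes `--model` as the model "that is served and should be pinged", the default presumably has to match the name given to `vllm serve`. That is why the docstring and the argument default change together. A minimal, self-contained sketch of the option follows; `build_parser` is a hypothetical wrapper, and the example script's other arguments are omitted because the diff does not show them.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical wrapper around the --model option shown in the diff."""
    parser = argparse.ArgumentParser(description="Realtime client sketch")
    parser.add_argument(
        "--model",
        type=str,
        default="mistralai/Voxtral-Mini-4B-Realtime-2602",
        help="Model that is served and should be pinged.",
    )
    return parser


if __name__ == "__main__":
    # With no --model flag, the new default is used.
    args = build_parser().parse_args([])
    print(args.model)
```

A server started under a different model name can still be targeted by passing `--model` explicitly.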

examples/online_serving/openai_realtime_microphone_client.py
@@ -5,7 +5,7 @@ Minimal Gradio demo for real-time speech transcription using the vLLM Realtime A
 Start the vLLM server first:
-vllm serve mistralai/Voxtral-Mini-3B-Realtime-2602 --enforce-eager
+vllm serve mistralai/Voxtral-Mini-4B-Realtime-2602 --enforce-eager
 Then run this script:
@@ -166,7 +166,7 @@ if __name__ == "__main__":
     parser.add_argument(
         "--model",
         type=str,
-        default="mistralai/Voxtral-Mini-3B-Realtime-2602",
+        default="mistralai/Voxtral-Mini-4B-Realtime-2602",
         help="Model that is served and should be pinged.",
     )
     parser.add_argument(

tests/entrypoints/openai/test_realtime_validation.py
@@ -24,7 +24,7 @@ MISTRAL_FORMAT_ARGS = [
     "mistral",
 ]
-MODEL_NAME = "mistralai/Voxtral-Mini-3B-Realtime-2602"
+MODEL_NAME = "mistralai/Voxtral-Mini-4B-Realtime-2602"
 def _audio_to_base64_pcm16(path: str, target_sr: int = 16000) -> str:
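
The diff shows only the signature of `_audio_to_base64_pcm16`, not its body. Judging by the name, it turns an audio file into base64-encoded 16-bit PCM at a 16 kHz default sample rate. The sketch below illustrates just the encoding step on in-memory samples, using the standard library. `floats_to_base64_pcm16` is a hypothetical name, and the little-endian layout and the [-1.0, 1.0] float input range are assumptions. It is not the test's actual implementation, which also loads and presumably resamples the file.

```python
import base64
import struct


def floats_to_base64_pcm16(samples: list[float]) -> str:
    """Encode float samples in [-1.0, 1.0] as base64 little-endian 16-bit PCM.

    Hypothetical illustration only; file loading and resampling are not covered.
    """
    # Clip out-of-range values so they cannot overflow the int16 range.
    clipped = [max(-1.0, min(1.0, s)) for s in samples]
    ints = [int(round(s * 32767)) for s in clipped]
    # "<" = little-endian, "h" = signed 16-bit integer (assumed byte order).
    raw = struct.pack(f"<{len(ints)}h", *ints)
    return base64.b64encode(raw).decode("ascii")


if __name__ == "__main__":
    encoded = floats_to_base64_pcm16([0.0, 1.0, -1.0])
    # Decode again to show the round trip back to int16 values.
    print(struct.unpack("<3h", base64.b64decode(encoded)))
```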

tests/models/multimodal/generation/test_voxtral_realtime.py
@@ -19,7 +19,7 @@ from vllm.engine.arg_utils import AsyncEngineArgs
 from vllm.inputs.data import TokensPrompt
 from vllm.v1.engine.async_llm import AsyncLLM, StreamingInput
-MODEL_NAME = "mistralai/Voxtral-Mini-3B-Realtime-2602"
+MODEL_NAME = "mistralai/Voxtral-Mini-4B-Realtime-2602"
 ENGINE_CONFIG = dict(
     model=MODEL_NAME,
     max_model_len=8192,