OpenDAS / vllm_cscc
Commit f33251ff
Authored Mar 11, 2026 by Silvia Colabrese
Committed by GitHub on Mar 11, 2026
[Bugfix] Fix Mistral-small `--format` (#36782)
Signed-off-by: 12010486 <silvia.colabrese@intel.com>
Parent: e584dce5
Showing 1 changed file with 6 additions and 6 deletions

examples/offline_inference/mistral-small.py (+6 -6)
@@ -62,9 +62,9 @@ def run_simple_demo(args: argparse.Namespace):
     llm = LLM(
         model=model_name,
-        tokenizer_mode="mistral" if args.format == "mistral" else "auto",
-        config_format="mistral" if args.format == "mistral" else "auto",
-        load_format="mistral" if args.format == "mistral" else "auto",
+        tokenizer_mode="mistral" if args.format == "mistral" else "hf",
+        config_format="mistral" if args.format == "mistral" else "hf",
+        load_format="mistral" if args.format == "mistral" else "hf",
         limit_mm_per_prompt={"image": 1},
         max_model_len=4096,
         max_num_seqs=2,
@@ -102,9 +102,9 @@ def run_advanced_demo(args: argparse.Namespace):
     sampling_params = SamplingParams(max_tokens=8192, temperature=0.7)
     llm = LLM(
         model=model_name,
-        tokenizer_mode="mistral" if args.format == "mistral" else "auto",
-        config_format="mistral" if args.format == "mistral" else "auto",
-        load_format="mistral" if args.format == "mistral" else "auto",
+        tokenizer_mode="mistral" if args.format == "mistral" else "hf",
+        config_format="mistral" if args.format == "mistral" else "hf",
+        load_format="mistral" if args.format == "mistral" else "hf",
         limit_mm_per_prompt={"image": max_img_per_msg},
         max_model_len=max_img_per_msg * max_tokens_per_img,
         tensor_parallel_size=2,
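
Both hunks repeat the same three conditionals, so the change can be read as a single mapping from the `--format` value to three keyword arguments. The sketch below is one way to express that mapping. The helper name `format_kwargs` is hypothetical and does not appear in the commit or in the example script. The sketch assumes, as the diff suggests, that any value other than "mistral" should select "hf".

```python
def format_kwargs(fmt: str) -> dict[str, str]:
    """Map a --format value to the three format-related keyword arguments.

    Hypothetical helper for illustration only; it mirrors the conditionals
    in the diff and is not part of the example script.
    """
    # Same rule as the patched lines: "mistral" stays "mistral",
    # every other value falls back to "hf" instead of "auto".
    value = "mistral" if fmt == "mistral" else "hf"
    return {
        "tokenizer_mode": value,
        "config_format": value,
        "load_format": value,
    }


if __name__ == "__main__":
    print(format_kwargs("mistral"))
    print(format_kwargs("hf"))
```

Under these assumptions, the result could be unpacked into the constructor call shown in the diff, for example `LLM(model=model_name, **format_kwargs(args.format), ...)`, which would keep the three arguments consistent in both demo functions.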