OpenDAS / vllm_cscc
Commit 65225897
Authored Aug 22, 2025 by zhuwenwen
Parent: 9b67a1e0

support adaptive max_num_seqs and cuda_graph_sizes

Showing 1 changed file with 2 additions and 2 deletions

vllm/config.py (+2 -2)
@@ -4754,8 +4754,8 @@ class VllmConfig:
         batch_size_capture_list = []
         if self.model_config is not None and \
             not self.model_config.enforce_eager:
-            if self.model_config.use_mla and self.compilation_config.full_cuda_graph:
-                cuda_graph_sizes = [256]
+            if self.model_config.use_mla and self.compilation_config.full_cuda_graph and self.scheduler_config.max_num_seqs <= 256:
+                cuda_graph_sizes = [self.scheduler_config.max_num_seqs]
             else:
                 cuda_graph_sizes = self.scheduler_config.cuda_graph_sizes
             if len(cuda_graph_sizes) == 1:
...
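
The effect of this change can be sketched as a standalone function. This is a minimal illustration under stated assumptions, not the code in vllm/config.py: the function name `pick_cuda_graph_sizes` and its flat parameters are hypothetical stand-ins for the `model_config`, `compilation_config`, and `scheduler_config` attributes that appear in the diff. The handling that follows under `if len(cuda_graph_sizes) == 1:` is not shown in the diff and is left out here.

```python
from typing import List


def pick_cuda_graph_sizes(
    use_mla: bool,
    full_cuda_graph: bool,
    max_num_seqs: int,
    configured_sizes: List[int],
) -> List[int]:
    """Hypothetical helper mirroring the branch changed in this commit."""
    # New condition: the MLA + full CUDA graph path now also requires
    # max_num_seqs <= 256, and uses max_num_seqs itself as the single
    # size instead of the previously hard-coded 256.
    if use_mla and full_cuda_graph and max_num_seqs <= 256:
        return [max_num_seqs]
    # Otherwise fall back to the sizes configured on the scheduler.
    return configured_sizes


# Before this commit, the first call would have yielded [256].
print(pick_cuda_graph_sizes(True, True, 64, [512]))   # [64]
print(pick_cuda_graph_sizes(True, True, 512, [512]))  # [512]
print(pick_cuda_graph_sizes(False, True, 64, [512]))  # [512]
```

Two behaviors differ from the parent commit in this sketch: a small `max_num_seqs` no longer produces a size of 256, and a `max_num_seqs` above 256 now falls through to the scheduler's configured `cuda_graph_sizes` rather than being pinned to `[256]`.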