Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
14772eeb
Unverified
Commit
14772eeb
authored
May 22, 2024
by
Kante Yin
Committed by
GitHub
May 21, 2024
Browse files
[Bugfix] Fix flag name for `max_seq_len_to_capture` (#4935)
Signed-off-by:
kerthcet
<
kerthcet@gmail.com
>
parent
757b62c4
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
2 deletions
+2
-2
vllm/engine/arg_utils.py
vllm/engine/arg_utils.py
+2
-2
No files found.
vllm/engine/arg_utils.py
View file @
14772eeb
...
@@ -341,9 +341,9 @@ class EngineArgs:
...
@@ -341,9 +341,9 @@ class EngineArgs:
help
=
'Maximum context length covered by CUDA '
help
=
'Maximum context length covered by CUDA '
'graphs. When a sequence has context length '
'graphs. When a sequence has context length '
'larger than this, we fall back to eager mode. '
'larger than this, we fall back to eager mode. '
'(DEPRECATED. Use --max-seq
_
len-to-capture instead'
'(DEPRECATED. Use --max-seq
-
len-to-capture instead'
')'
)
')'
)
parser
.
add_argument
(
'--max-seq
_
len-to-capture'
,
parser
.
add_argument
(
'--max-seq
-
len-to-capture'
,
type
=
int
,
type
=
int
,
default
=
EngineArgs
.
max_seq_len_to_capture
,
default
=
EngineArgs
.
max_seq_len_to_capture
,
help
=
'Maximum sequence length covered by CUDA '
help
=
'Maximum sequence length covered by CUDA '
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment