Commit 290e015c
Authored Dec 18, 2023 by Suhong Moon
Committed by GitHub, Dec 18, 2023
Update Help Text for --gpu-memory-utilization Argument (#2183)
Parent: 1b7c791d
Showing 1 changed file with 7 additions and 5 deletions
vllm/engine/arg_utils.py
@@ -156,11 +156,13 @@ class EngineArgs:
                             type=int,
                             default=EngineArgs.swap_space,
                             help='CPU swap space size (GiB) per GPU')
-        parser.add_argument('--gpu-memory-utilization',
-                            type=float,
-                            default=EngineArgs.gpu_memory_utilization,
-                            help='the percentage of GPU memory to be used for '
-                            'the model executor')
+        parser.add_argument(
+            '--gpu-memory-utilization',
+            type=float,
+            default=EngineArgs.gpu_memory_utilization,
+            help='the fraction of GPU memory to be used for '
+            'the model executor, which can range from 0 to 1.'
+            'If unspecified, will use the default value of 0.9.')
         parser.add_argument('--max-num-batched-tokens',
                             type=int,
                             default=EngineArgs.max_num_batched_tokens,
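The new help text describes the value as a fraction between 0 and 1 with a default of 0.9. The following is a minimal standalone sketch of how such an argument could behave under `argparse`. It is not vLLM code: `fraction` and `build_parser` are hypothetical helpers, the default is hard-coded to 0.9 instead of being read from `EngineArgs`, and rejecting values outside (0, 1] is an assumption that the commit itself does not enforce. Because Python joins adjacent string literals with no separator, the sketch puts a trailing space after `1.` so the two sentences do not run together in `--help` output.

```python
import argparse


def fraction(text: str) -> float:
    """Hypothetical validator: parse a float and require 0 < value <= 1."""
    value = float(text)
    if not 0.0 < value <= 1.0:
        raise argparse.ArgumentTypeError(
            f"expected a fraction in (0, 1], got {text!r}")
    return value


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with only the one argument touched by this commit."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        '--gpu-memory-utilization',
        type=fraction,
        default=0.9,  # assumed default, taken from the help text in the diff
        help='the fraction of GPU memory to be used for '
        'the model executor, which can range from 0 to 1. '
        'If unspecified, will use the default value of 0.9.')
    return parser


if __name__ == "__main__":
    # argparse exposes the flag as an attribute with dashes turned into underscores.
    args = build_parser().parse_args(['--gpu-memory-utilization', '0.5'])
    print(args.gpu_memory_utilization)
```

Passing a value such as `90` to this sketch makes `argparse` exit with a usage error, which is one way to catch callers who still read the option as a percentage.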