Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
norm
vllm
Commits
cb7a1c1c
Unverified
Commit
cb7a1c1c
authored
Jan 13, 2024
by
Ben
Committed by
GitHub
Jan 12, 2024
Browse files
Suggest using dtype=half when OOM.
parent
7878958c
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
1 deletion
+3
-1
vllm/worker/worker.py
vllm/worker/worker.py
+3
-1
No files found.
vllm/worker/worker.py
View file @
cb7a1c1c
...
@@ -239,4 +239,6 @@ def _check_if_gpu_supports_dtype(torch_dtype: torch.dtype):
...
@@ -239,4 +239,6 @@ def _check_if_gpu_supports_dtype(torch_dtype: torch.dtype):
raise
ValueError
(
raise
ValueError
(
"Bfloat16 is only supported on GPUs with compute capability "
"Bfloat16 is only supported on GPUs with compute capability "
f
"of at least 8.0. Your
{
gpu_name
}
GPU has compute capability "
f
"of at least 8.0. Your
{
gpu_name
}
GPU has compute capability "
f
"
{
compute_capability
[
0
]
}
.
{
compute_capability
[
1
]
}
."
)
f
"
{
compute_capability
[
0
]
}
.
{
compute_capability
[
1
]
}
."
f
" You can explicitly specify the data type by using the --dtype option, for example: --dtype=half."
)
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment