Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
0b8e871e
Unverified
Commit
0b8e871e
authored
Nov 05, 2025
by
Zhewen Li
Committed by
GitHub
Nov 05, 2025
Browse files
[CI/Build] Fix `test_defaults_with_usage_context` in AMD CI (#27926)
Signed-off-by:
zhewenli
<
zhewenli@meta.com
>
parent
5ee93a59
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
2 deletions
+4
-2
tests/v1/engine/test_engine_args.py
tests/v1/engine/test_engine_args.py
+4
-2
No files found.
tests/v1/engine/test_engine_args.py
View file @
0b8e871e
...
@@ -53,10 +53,12 @@ def test_defaults_with_usage_context():
...
@@ -53,10 +53,12 @@ def test_defaults_with_usage_context():
vllm_config
:
VllmConfig
=
engine_args
.
create_engine_config
(
UsageContext
.
LLM_CLASS
)
vllm_config
:
VllmConfig
=
engine_args
.
create_engine_config
(
UsageContext
.
LLM_CLASS
)
from
vllm.platforms
import
current_platform
from
vllm.platforms
import
current_platform
from
vllm.utils.mem_constants
import
GiB_bytes
device_memory
=
current_platform
.
get_device_total_memory
()
device_name
=
current_platform
.
get_device_name
().
lower
()
device_name
=
current_platform
.
get_device_name
().
lower
()
if
"h100"
in
device_
na
me
or
"h200"
in
device_name
:
if
device_me
m
or
y
>=
70
*
GiB_bytes
and
"a100"
not
in
device_name
:
# For
H100
and
H2
00
,
w
e use larger default values.
# For
GPUs like H100, H200,
and
MI3
00
x
w
ith >= 70GB memory
default_llm_tokens
=
16384
default_llm_tokens
=
16384
default_server_tokens
=
8192
default_server_tokens
=
8192
default_max_num_seqs
=
1024
default_max_num_seqs
=
1024
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment