OpenDAS / vllm_cscc
Commit 707b4891, authored Mar 26, 2026 by wanglong3
Force VLLM_W8A8_BACKEND to 1 on the gfx928 architecture
Parent: 0bd5fcd2
Showing 1 changed file with 4 additions and 1 deletion
vllm/envs.py (+4, -1)
@@ -8,6 +8,7 @@ import os
 import sys
 import tempfile
 import uuid
+import torch
 from collections.abc import Callable
 from typing import TYPE_CHECKING, Any, Literal
@@ -1896,7 +1897,9 @@ environment_variables: dict[str, Callable[[], Any]] = {
     # cutlass: 2 (will remove in the future)
     # blaslt: 3 (default)
     # rocblas: others
-    "VLLM_W8A8_BACKEND": lambda: int(os.getenv("VLLM_W8A8_BACKEND", "3")),
+    "VLLM_W8A8_BACKEND": lambda: int(
+        1 if "gfx928" in torch.cuda.get_device_properties("cuda").gcnArchName.split(':')[0] else os.getenv("VLLM_W8A8_BACKEND", "3")
+    ),
     # Capture MoE router logits for debugging/analysis.
     "VLLM_MOE_ROUTER_CAPTURE": lambda: (os.getenv("VLLM_MOE_ROUTER_CAPTURE", "0").lower() in ("true", "1")),
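The selection rule in the changed lambda can be illustrated in isolation. The following is a minimal sketch, not the code from this commit: the function name `select_w8a8_backend`, its parameters, and the sample architecture strings are hypothetical, and the architecture name is passed in as a plain string so the logic can be exercised without `torch` or a GPU. It assumes `gcnArchName` has the colon-separated form `<arch>:<feature>:<feature>` commonly reported on ROCm builds, which is why the commit splits on `':'` and inspects only the first field.

```python
import os

# Default backend id when the variable is unset ("blaslt: 3 (default)" in the diff).
DEFAULT_W8A8_BACKEND = "3"


def select_w8a8_backend(gcn_arch_name, env=None):
    """Return the W8A8 backend id for a device (hypothetical helper).

    gcn_arch_name: architecture string such as the value of
        torch.cuda.get_device_properties(...).gcnArchName, or None if unknown.
    env: mapping used instead of os.environ (handy for testing).
    """
    if env is None:
        env = os.environ
    # Keep only the base architecture, dropping any ":feature" suffixes.
    base_arch = (gcn_arch_name or "").split(":")[0]
    if "gfx928" in base_arch:
        # On gfx928 the backend is forced to 1, ignoring VLLM_W8A8_BACKEND.
        return 1
    return int(env.get("VLLM_W8A8_BACKEND", DEFAULT_W8A8_BACKEND))
```

With this sketch, `select_w8a8_backend("gfx928:sramecc+:xnack-", {"VLLM_W8A8_BACKEND": "2"})` yields 1 because the environment variable is ignored on gfx928, while any other architecture falls back to the variable or to the default of 3. Note that the lambda in the commit queries the device directly, so it presumably requires a visible GPU whenever `VLLM_W8A8_BACKEND` is read.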