OpenDAS / vllm_cscc
Commit 83fd49b1 (unverified)
Authored Oct 28, 2025 by Zhewen Li; committed by GitHub Oct 29, 2025
[CI/Build][Bugfix]Fix Quantized Models Test on AMD (#27712)
Signed-off-by: zhewenli <zhewenli@meta.com>
Parent: a4a4f0f6
Showing 3 changed files with 8 additions and 2 deletions:
.buildkite/test-amd.yaml (+1 -1)
tests/models/quantization/test_bitsandbytes.py (+6 -0)
vllm/platforms/rocm.py (+1 -1)
.buildkite/test-amd.yaml
@@ -908,7 +908,7 @@ steps:
 - label: Quantized Models Test # 45 min
   timeout_in_minutes: 60
-  mirror_hardwares: [amdexperimental]
+  mirror_hardwares: [amdexperimental, amdproduction]
   agent_pool: mi325_1
   # grade: Blocking
   source_file_dependencies:
tests/models/quantization/test_bitsandbytes.py
@@ -9,10 +9,16 @@ import pytest
 from transformers import BitsAndBytesConfig

 from tests.quantization.utils import is_quant_method_supported
+from vllm.platforms import current_platform

 from ...utils import compare_two_settings, multi_gpu_test
 from ..utils import check_embeddings_close, check_logprobs_close

+pytestmark = pytest.mark.skipif(
+    current_platform.is_rocm(),
+    reason="bitsandbytes quantization not supported on ROCm (CUDA-only kernels)",
+)
+
 models_4bit_to_test = [
     ("facebook/opt-125m", "quantize opt model inflight"),
     (
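The hunk above relies on pytest's module-level `pytestmark` variable, which applies a marker to every test in the file. A minimal sketch of the same pattern follows; `is_rocm()` here is a hypothetical stand-in for `current_platform.is_rocm()` so the example runs without vLLM installed, and `test_placeholder` is an illustrative name that is not part of the commit.

```python
import pytest


def is_rocm() -> bool:
    """Hypothetical stand-in for current_platform.is_rocm(); always False here."""
    return False


# A module-level `pytestmark` applies the marker to every test in this file,
# so no individual test needs its own skip decorator.
pytestmark = pytest.mark.skipif(
    is_rocm(),
    reason="bitsandbytes quantization not supported on ROCm (CUDA-only kernels)",
)


def test_placeholder():
    # Collected and run only when is_rocm() is False; reported as skipped otherwise.
    assert True
```

The condition is evaluated once, when the module is imported during collection, so the platform check should be cheap and free of side effects.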
vllm/platforms/rocm.py
@@ -413,7 +413,7 @@ class RocmPlatform(Platform):
                     "Using AWQ quantization with ROCm, but VLLM_USE_TRITON_AWQ"
                     " is not set, enabling VLLM_USE_TRITON_AWQ."
                 )
-            envs.VLLM_USE_TRITON_AWQ = True
+            os.environ["VLLM_USE_TRITON_AWQ"] = "1"

     @classmethod
     def get_punica_wrapper(cls) -> str:
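The commit does not state why the attribute assignment was replaced with a write to `os.environ`. One general difference between the two, sketched below under that caveat, is that a value written to `os.environ` is inherited by child processes, while an attribute set on an in-process object is not. `FakeEnvs` and `child_sees` are hypothetical names for illustration only and do not model vLLM's `envs` module.

```python
import os
import subprocess
import sys


def child_sees(name: str) -> str:
    """Return the value of environment variable `name` as seen by a fresh subprocess."""
    code = f"import os; print(os.environ.get({name!r}, '<unset>'))"
    result = subprocess.run(
        [sys.executable, "-c", code], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()


class FakeEnvs:
    """Hypothetical settings holder; NOT vLLM's envs module."""

    VLLM_USE_TRITON_AWQ = False


# Start from a clean state so the comparison is meaningful.
os.environ.pop("VLLM_USE_TRITON_AWQ", None)

# Attribute assignment changes only the object in this process.
FakeEnvs.VLLM_USE_TRITON_AWQ = True
before = child_sees("VLLM_USE_TRITON_AWQ")

# Writing to os.environ updates the process environment, which children inherit.
os.environ["VLLM_USE_TRITON_AWQ"] = "1"
after = child_sees("VLLM_USE_TRITON_AWQ")

print(before, after)
```

Note that environment values are always strings, which is why the new line in the diff assigns `"1"` rather than `True`.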