OpenDAS / vllm_cscc
Commit 82a006be
Authored Apr 02, 2026 by Bowen Bao; committed by GitHub on Apr 03, 2026
[CI][ROCm] Add gpt-oss w4a8 in CI (#38292)
Signed-off-by: Bowen Bao <bowenbao@amd.com>
Parent: a9b4f07b
Showing 2 changed files with 10 additions and 1 deletion:
tests/evals/gpt_oss/configs/gpt-oss-20b-rocm-mxfp4-fp8.yaml (+8 -0)
tests/evals/gpt_oss/configs/models-gfx950.txt (+2 -1)
tests/evals/gpt_oss/configs/gpt-oss-20b-rocm-mxfp4-fp8.yaml (new file, mode 0 → 100644) @ 82a006be
# SPDX-License-Identifier: Apache-2.0
# SPDX-FileCopyrightText: Copyright contributors to the vLLM project
model_name: amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8
metric_threshold: 0.568
reasoning_effort: low
server_args: "--attention-backend ROCM_AITER_UNIFIED_ATTN"
env:
  VLLM_ROCM_USE_AITER: "1"
\ No newline at end of file
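The commit does not show how the evaluation harness consumes these fields. As a rough sketch only, the config could map onto a server launch as follows. The `build_launch` helper and the dict layout are hypothetical, and the `vllm serve` invocation is an assumption about how the server is started; only the field values are taken from the file above.

```python
import os
import shlex

# Values copied from gpt-oss-20b-rocm-mxfp4-fp8.yaml.
# Holding them in a plain dict (instead of parsing YAML) is an assumption
# made to keep this sketch free of third-party dependencies.
config = {
    "model_name": "amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8",
    "metric_threshold": 0.568,
    "reasoning_effort": "low",
    "server_args": "--attention-backend ROCM_AITER_UNIFIED_ATTN",
    "env": {"VLLM_ROCM_USE_AITER": "1"},
}


def build_launch(cfg, base_env=None):
    """Hypothetical helper: return (argv, env) for launching a server."""
    # server_args is a single quoted string in the config, so split it
    # into individual command-line tokens.
    extra_args = shlex.split(cfg.get("server_args", ""))
    argv = ["vllm", "serve", cfg["model_name"], *extra_args]

    # Overlay the config's env block on top of the base environment.
    env = dict(os.environ if base_env is None else base_env)
    env.update({key: str(value) for key, value in cfg.get("env", {}).items()})
    return argv, env


argv, env = build_launch(config, base_env={})
print(argv)
print(env)
```

Passing `base_env={}` keeps the example deterministic; a real launch would more likely inherit the current process environment.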
tests/evals/gpt_oss/configs/models-gfx950.txt @ 82a006be
# GFX950 model configurations for GPQA evaluation
# Tests different environment variable combinations
gpt-oss-20b-rocm-baseline.yaml
gpt-oss-20b-rocm-mxfp4-fp8.yaml
\ No newline at end of file
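The list file above appears to hold one config filename per line, with `#` lines as comments. A minimal sketch of reading it, assuming that format holds; `read_model_list` is a hypothetical helper, not a function from the vLLM test suite.

```python
def read_model_list(text):
    """Hypothetical helper: return config filenames, skipping comments and blanks."""
    names = []
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            names.append(line)
    return names


# Contents of models-gfx950.txt after this commit (no trailing newline).
MODELS_GFX950 = """\
# GFX950 model configurations for GPQA evaluation
# Tests different environment variable combinations
gpt-oss-20b-rocm-baseline.yaml
gpt-oss-20b-rocm-mxfp4-fp8.yaml"""

configs = read_model_list(MODELS_GFX950)
print(configs)
```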