Unverified commit 82a006be, authored by Bowen Bao and committed by GitHub

[CI][ROCm] Add gpt-oss w4a8 in CI (#38292)


Signed-off-by: Bowen Bao <bowenbao@amd.com>
parent a9b4f07b
# SPDX-License-Identifier: Apache-2.0
# SPDX-FileCopyrightText: Copyright contributors to the vLLM project
model_name: amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8
metric_threshold: 0.568
reasoning_effort: low
server_args: "--attention-backend ROCM_AITER_UNIFIED_ATTN"
env:
  VLLM_ROCM_USE_AITER: "1"
\ No newline at end of file
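As a rough illustration of how a config like the one above might be consumed, the sketch below turns it into a `vllm serve` command line, an environment overlay, and a pass/fail check. It is not taken from the vLLM CI harness: the helper names `build_server_command`, `build_server_env`, and `passes_threshold` are hypothetical, the config is written as an already-parsed dict to avoid a YAML dependency, and the rule "pass when score >= metric_threshold" is an assumption about how the threshold is applied.

```python
import os
import shlex

# Values copied from the YAML config above, written as an already-parsed dict.
config = {
    "model_name": "amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8",
    "metric_threshold": 0.568,
    "reasoning_effort": "low",
    "server_args": "--attention-backend ROCM_AITER_UNIFIED_ATTN",
    "env": {"VLLM_ROCM_USE_AITER": "1"},
}


def build_server_command(cfg):
    """Hypothetical helper: assemble a `vllm serve` argv list from the config."""
    # shlex.split keeps quoted arguments together, as a shell would.
    extra_args = shlex.split(cfg.get("server_args", ""))
    return ["vllm", "serve", cfg["model_name"], *extra_args]


def build_server_env(cfg, base_env=None):
    """Hypothetical helper: overlay the config's env block on a base environment."""
    env = dict(os.environ if base_env is None else base_env)
    env.update(cfg.get("env") or {})
    return env


def passes_threshold(score, cfg):
    """Assumption: a run passes when the score is at least metric_threshold."""
    return score >= cfg["metric_threshold"]


command = build_server_command(config)
env = build_server_env(config, base_env={})
```

The resulting `command` and `env` could be handed to `subprocess.Popen(command, env=env)` to start the server. Keeping `server_args` as a single string in the config and splitting it at launch time mirrors how the flag would be typed on a command line.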
 # GFX950 model configurations for GPQA evaluation
 # Tests different environment variable combinations
-gpt-oss-20b-rocm-baseline.yaml
\ No newline at end of file
+gpt-oss-20b-rocm-baseline.yaml
+gpt-oss-20b-rocm-mxfp4-fp8.yaml
\ No newline at end of file