"vscode:/vscode.git/clone" did not exist on "a298ce9c1db2abcc32b69db07683724c9d96f034"
Unverified Commit 6bfc6d1f authored by knarangN's avatar knarangN Committed by GitHub
Browse files

fix(ci): move test_multimodal_b64 to gpu_1 and force tool calls in...


fix(ci): move test_multimodal_b64 to gpu_1 and force tool calls in aggregated_toolcalling (DYN-2662) (#8126)
Signed-off-by: default avatarKavita Narang <knarang@nvidia.com>
parent e3e728a8
...@@ -509,7 +509,7 @@ vllm_configs = { ...@@ -509,7 +509,7 @@ vllm_configs = {
}, },
} }
], ],
"tool_choice": "auto", "tool_choice": "required",
"max_tokens": 1024, "max_tokens": 1024,
}, },
repeat_count=1, repeat_count=1,
...@@ -656,7 +656,7 @@ def test_serve_deployment( ...@@ -656,7 +656,7 @@ def test_serve_deployment(
@pytest.mark.vllm @pytest.mark.vllm
@pytest.mark.e2e @pytest.mark.e2e
@pytest.mark.gpu_2 @pytest.mark.gpu_1
@pytest.mark.nightly @pytest.mark.nightly
@pytest.mark.model("Qwen/Qwen2.5-VL-7B-Instruct") @pytest.mark.model("Qwen/Qwen2.5-VL-7B-Instruct")
@pytest.mark.timeout(360) # Match VLLMConfig.timeout for this multimodal deployment @pytest.mark.timeout(360) # Match VLLMConfig.timeout for this multimodal deployment
...@@ -672,9 +672,10 @@ def test_multimodal_b64( ...@@ -672,9 +672,10 @@ def test_multimodal_b64(
This test is separate because it loads the required image at runtime This test is separate because it loads the required image at runtime
(not collection time), ensuring it only fails when actually executed. (not collection time), ensuring it only fails when actually executed.
Uses ``@pytest.mark.model`` so nightly multi-GPU jobs (gpu_2 without the Runs on gpu_1 alongside other single-GPU multimodal tests that use the
gpu_1 multimodal_agg_qwen param) still predownload Qwen2.5-VL-7B before same model (mm_agg_qwen2.5-vl-7b). The ``@pytest.mark.model`` mark is
``HF_HUB_OFFLINE=1``. kept as a safety net so the model is predownloaded even if no other
gpu_1 config collects this model in a given CI job.
""" """
# Load B64 image at test execution time (uses real PNG even if MULTIMODAL_IMG is LFS pointer) # Load B64 image at test execution time (uses real PNG even if MULTIMODAL_IMG is LFS pointer)
b64_img = base64.b64encode(get_multimodal_test_image_bytes()).decode() b64_img = base64.b64encode(get_multimodal_test_image_bytes()).decode()
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment