[Bug] Fix Test in Batch Invariant (#26128)

Signed-off-by: yewentao256 <zhyanwentao@126.com>

[Bug] Fix Test in Batch Invariant (#26128)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
4ba88757 · Wentao Ye · GitHub · 6273fe8d · 4ba88757
Unverified Commit 4ba88757 authored Oct 08, 2025 by Wentao Ye Committed by GitHub Oct 08, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 4 additions and 1 deletion

tests/v1/generation/test_batch_invariance.py tests/v1/generation/test_batch_invariance.py +4 -1

No files found.
--- a/tests/v1/generation/test_batch_invariance.py
+++ b/tests/v1/generation/test_batch_invariance.py
@@ -292,8 +292,11 @@ def LLM_with_max_seqs(
        # Allow some CPU offload if needed.
        swap_space=swap_space,
        # Keep things lean and CI-friendly.
-        dtype="float16",
+        dtype="auto",
        # Single-GPU by default; override externally if desired.
        tensor_parallel_size=int(os.getenv("VLLM_TP_SIZE", "1")),
        trust_remote_code=os.getenv("VLLM_TRUST_REMOTE_CODE", "0") == "1",
+        enable_prefix_caching=False,
+        # Enable for MOE models
+        # enable_expert_parallel=True,
    )