Unverified Commit be8921fb authored by Thomas Parnell's avatar Thomas Parnell Committed by GitHub
Browse files

Change size of single CUDA graph for CI to 4 (#26089)


Signed-off-by: default avatarThomas Parnell <tpa@zurich.ibm.com>
parent d4e7a115
......@@ -732,7 +732,7 @@ class VllmRunner:
set_default_torch_num_threads(default_torch_num_threads))
if not kwargs.get("compilation_config", None):
kwargs["compilation_config"] = {"cudagraph_capture_sizes": [8]}
kwargs["compilation_config"] = {"cudagraph_capture_sizes": [4]}
with init_ctx:
self.llm = LLM(
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment