"ssh:/git@developer.sourcefind.cn:2222/OpenDAS/vllm_cscc.git" did not exist on "9bdb06b4368e304bc5e23c8df2dff8f8b2ccf0f6"
Unverified Commit 2faad083 authored by Alessandro Sangiorgi's avatar Alessandro Sangiorgi Committed by GitHub
Browse files

[compile] Nest inductor cache under AOT compile dir (#39718)


Signed-off-by: default avatarAlessandro Sangiorgi <asangior@redhat.com>
parent 23f37602
...@@ -507,6 +507,16 @@ def _support_torch_compile( ...@@ -507,6 +507,16 @@ def _support_torch_compile(
hash_key, hash_key,
) )
# Hash-level dir; shared across ranks on the same node.
self.compilation_config.local_cache_dir = cache_dir
inductor_cache = os.path.join(cache_dir, "inductor_cache")
os.makedirs(inductor_cache, exist_ok=True)
# Process-wide: post-load execution, CUDA-graph capture, and later
# autotune/recompile all need to write under {hash}/inductor_cache/.
# Unconditional because torch's cache_dir() may have pre-filled the
# /tmp default during import, making setdefault a no-op.
os.environ["TORCHINDUCTOR_CACHE_DIR"] = inductor_cache
rank = self.vllm_config.parallel_config.rank rank = self.vllm_config.parallel_config.rank
dp_rank = self.vllm_config.parallel_config.data_parallel_index dp_rank = self.vllm_config.parallel_config.data_parallel_index
cache_dir = os.path.join(cache_dir, f"rank_{rank}_{dp_rank}") cache_dir = os.path.join(cache_dir, f"rank_{rank}_{dp_rank}")
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment