Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
aceadb5e
Unverified
Commit
aceadb5e
authored
Mar 23, 2026
by
Angela Yi
Committed by
GitHub
Mar 23, 2026
Browse files
Use lazy graph module during split_module to defer recompile() (#37609)
Signed-off-by:
angelayi
<
yiangela7@gmail.com
>
parent
ec228061
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
3 deletions
+8
-3
vllm/compilation/backends.py
vllm/compilation/backends.py
+8
-3
No files found.
vllm/compilation/backends.py
View file @
aceadb5e
...
...
@@ -20,6 +20,7 @@ import torch
import
torch.fx
as
fx
from
torch._dynamo.utils
import
dynamo_timed
from
torch._logging._internal
import
trace_structured
from
torch.fx._lazy_graph_module
import
_use_lazy_graph_module
import
vllm.envs
as
envs
from
vllm.config
import
CompilationConfig
,
CUDAGraphMode
,
VllmConfig
...
...
@@ -573,9 +574,13 @@ def split_graph(
# otherwise pytorch might reorder the nodes and
# the semantics of the graph will change when we
# have mutations in the graph
split_gm
=
torch
.
fx
.
passes
.
split_module
.
split_module
(
graph
,
None
,
lambda
node
:
node_to_subgraph_id
[
node
],
keep_original_order
=
True
)
with
_use_lazy_graph_module
(
True
):
split_gm
=
torch
.
fx
.
passes
.
split_module
.
split_module
(
graph
,
None
,
lambda
node
:
node_to_subgraph_id
[
node
],
keep_original_order
=
True
,
)
outputs
=
[]
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment