OpenDAS / vllm_cscc
Commit 682e0b6d (Unverified)
Authored Apr 19, 2025 by Richard Zou
Committed by GitHub, Apr 19, 2025
Log how much time loading a compiled artifact takes (#16848)
Signed-off-by: rzou <zou3519@gmail.com>
Parent: d6195a74
Changes: 1 changed file with 8 additions and 4 deletions
vllm/compilation/backends.py (+8 -4)
--- a/vllm/compilation/backends.py
+++ b/vllm/compilation/backends.py (view file @ 682e0b6d)
@@ -110,10 +110,14 @@ class CompilerManager:
         compiled_graph = self.load(graph, example_inputs, graph_index,
                                    runtime_shape)
         if compiled_graph is not None:
-            if graph_index == 0:
-                # adds some info logging for the first graph
-                logger.info("Directly load the compiled graph for shape %s "
-                            "from the cache", str(runtime_shape))  # noqa
+            if graph_index == num_graphs - 1:
+                # after loading the last graph for this shape, record the time.
+                # there can be multiple graphs due to piecewise compilation.
+                now = time.time()
+                elapsed = now - compilation_start_time
+                logger.info(
+                    "Directly load the compiled graph(s) for shape %s "
+                    "from the cache, took %.3f s", str(runtime_shape), elapsed)
             return compiled_graph
 
         # no compiler cached the graph, or the cache is disabled,
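The change moves the log call from the first graph to the last one, so that a single message can report the total time spent loading every piece for a given shape. A minimal standalone sketch of that pattern follows. It is not the vLLM implementation: `load_cached_graphs`, `load_fn`, and the logger name are hypothetical names introduced for illustration, and the loop stands in for whatever code calls the loader once per graph.

```python
import logging
import time

# Hypothetical logger name, used only for this illustration.
logger = logging.getLogger("compile_cache_demo")


def load_cached_graphs(load_fn, num_graphs, runtime_shape, start_time):
    """Load each cached piece and log the total elapsed time once.

    load_fn is a hypothetical callable (graph_index, runtime_shape) that
    returns a compiled artifact, or None on a cache miss.
    """
    loaded = []
    for graph_index in range(num_graphs):
        compiled = load_fn(graph_index, runtime_shape)
        if compiled is None:
            # Cache miss: a real caller would fall back to compiling.
            return None
        if graph_index == num_graphs - 1:
            # Log only after the last piece, so the elapsed time covers
            # all graphs produced by piecewise compilation.
            elapsed = time.time() - start_time
            logger.info(
                "Directly load the compiled graph(s) for shape %s "
                "from the cache, took %.3f s", str(runtime_shape), elapsed)
        loaded.append(compiled)
    return loaded
```

Logging on the first graph, as the old code did, can only announce that loading has started. Logging on the last graph is what makes the elapsed figure meaningful. The sketch mirrors the diff in using `time.time()`; a monotonic clock such as `time.perf_counter()` would be a reasonable alternative for interval measurements.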