Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
db87eb6c
Unverified
Commit
db87eb6c
authored
Dec 05, 2024
by
youkaichao
Committed by
GitHub
Dec 05, 2024
Browse files
[torch.compile] use size tuning for specific sizes (#10933)
Signed-off-by:
youkaichao
<
youkaichao@gmail.com
>
parent
9743d64e
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
6 additions
and
0 deletions
+6
-0
vllm/compilation/backends.py
vllm/compilation/backends.py
+6
-0
No files found.
vllm/compilation/backends.py
View file @
db87eb6c
...
@@ -43,6 +43,12 @@ def wrap_inductor(graph,
...
@@ -43,6 +43,12 @@ def wrap_inductor(graph,
if
additional_inductor_config
is
not
None
:
if
additional_inductor_config
is
not
None
:
current_config
.
update
(
additional_inductor_config
)
current_config
.
update
(
additional_inductor_config
)
if
isinstance
(
runtime_shape
,
int
):
# for a specific batchsize, tuning triton kernel parameters
# can be beneficial
current_config
[
"max_autotune"
]
=
True
current_config
[
"coordinate_descent_tuning"
]
=
True
# inductor can inplace modify the graph, so we need to copy it
# inductor can inplace modify the graph, so we need to copy it
# see https://github.com/pytorch/pytorch/issues/138980
# see https://github.com/pytorch/pytorch/issues/138980
graph
=
copy
.
deepcopy
(
graph
)
graph
=
copy
.
deepcopy
(
graph
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment