OpenDAS / vllm_cscc
Commit 8ca89548 (Unverified)
Authored Oct 22, 2024 by yudian0504; committed by GitHub on Oct 21, 2024
[Bugfix][Misc]: fix graph capture for decoder (#9549)
Parent: f6b97293
Showing 1 changed file with 1 addition and 1 deletion
vllm/worker/model_runner.py
...
@@ -828,7 +828,7 @@ class ModelInputForGPUBuilder(ModelRunnerInputBuilderBase[ModelInputForGPU]):
             cuda_graph_pad_size = self._get_cuda_graph_pad_size(
                 num_seqs=len(seq_lens),
-                max_decode_seq_len=max_encoder_seq_len,
+                max_decode_seq_len=max_decode_seq_len,
                 max_encoder_seq_len=max_encoder_seq_len)
             batch_size = len(input_tokens)
...
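The diff does not show the body of `_get_cuda_graph_pad_size`, so the following is only a rough sketch of why the swapped argument could matter. It assumes a hypothetical stand-in, `get_pad_size`, that refuses graph replay (returns -1) when either sequence length exceeds a capture limit. The names `get_pad_size`, `max_len_to_capture`, and `capture_step`, and the values used, are illustrative assumptions and not vLLM's actual implementation.

```python
def get_pad_size(num_seqs, max_decode_seq_len, max_encoder_seq_len=0,
                 max_len_to_capture=8192, capture_step=8):
    """Hypothetical stand-in for a pad-size helper.

    Returns the number of padding slots needed to reach the next
    captured batch size, or -1 when a captured graph cannot be used.
    """
    # Assumed rule: a captured graph is only usable when both lengths
    # fit within the limit the graph was captured for.
    if (max_decode_seq_len > max_len_to_capture
            or max_encoder_seq_len > max_len_to_capture):
        return -1
    # Round the batch up to the next multiple of capture_step.
    padded = -(-num_seqs // capture_step) * capture_step
    return padded - num_seqs


seq_lens = [9000, 120, 64]         # one decode sequence exceeds the limit
max_decode_seq_len = max(seq_lens)
max_encoder_seq_len = 0            # decoder-only model: no encoder input

# Before the fix: the encoder length is passed as the decoder length,
# so the over-long decode sequence is never seen by the check.
buggy = get_pad_size(num_seqs=len(seq_lens),
                     max_decode_seq_len=max_encoder_seq_len,
                     max_encoder_seq_len=max_encoder_seq_len)

# After the fix: the real decoder length is checked.
fixed = get_pad_size(num_seqs=len(seq_lens),
                     max_decode_seq_len=max_decode_seq_len,
                     max_encoder_seq_len=max_encoder_seq_len)

print(buggy, fixed)
```

Under these assumptions, the pre-fix call reports a valid pad size for a batch that should have fallen back to eager execution, while the post-fix call returns -1.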