Commit 9c9467da authored by zhuwenwen's avatar zhuwenwen
Browse files

Merge branch 'v0.5.0-dtk24.04.1_wangmin' into 'v0.5.0-dtk24.04.1'

fix error in test_long_context.py::test_batched_rope_kernel

See merge request dcutoolkit/deeplearing/vllm!16
parents 8ec2f591 65d20f7f
......@@ -830,6 +830,7 @@ class ModelRunner:
multi_modal_data=dummy_multi_modal_data,
)
seqs.append(seq)
max_num_batched_tokens -= seq_len
for group_id in range(max_num_seqs):
seq_len = (max_num_batched_tokens // max_num_seqs +
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment