OpenDAS / vllm_cscc @ b3f2fddd172cef23b42ffaf6c226877b6588964c
vllm/v1/worker/tpu_model_runner.py
[TPU][V1] Fix exponential padding when `max-num-batched-tokens` is not a power of 2 (#16596)
Commit b3f2fddd · Nicolò Lucchesi authored Apr 14, 2025
Signed-off-by: NickLucche <nlucches@redhat.com>
tpu_model_runner.py (47.2 KB)