Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
ad9d09e2b8a601b50d07c76fb8736c2bbda2d6fb
Switch branch/tag
vllm_cscc
vllm
v1
worker
gpu_model_runner.py
Find file
Blame
History
Permalink
[Perf] [Hybrid] Copy num_accepted_tokens in non-blocking way when not using prefix caching (#35442)
· ad9d09e2
Thomas Parnell
authored
Mar 03, 2026
Signed-off-by:
Thomas Parnell
<
tpa@zurich.ibm.com
>
ad9d09e2
gpu_model_runner.py
269 KB
Edit
Web IDE
Replace gpu_model_runner.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace gpu_model_runner.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.