Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
f256ebe4df6757d76f1f1642d7e110268a2f8190
Switch branch/tag
vllm_cscc
vllm
spec_decode
draft_model_runner.py
Find file
Blame
History
Permalink
Remove hard-dependencies of Speculative decode to CUDA workers (#10587)
· 0a71900b
Chendi.Xue
authored
Nov 26, 2024
Signed-off-by:
Chendi Xue
<
chendi.xue@intel.com
>
0a71900b
draft_model_runner.py
13.6 KB
Edit
Web IDE
Replace draft_model_runner.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace draft_model_runner.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.