"vllm/vscode:/vscode.git/clone" did not exist on "c0c25e25fa93ee7c3f279abbba5597c0fafa74ee"
Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 (#10136)
Signed-off-by:
Sourashis Roy <sroy@roblox.com>
Showing
Please register or sign in to comment