Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 (#10136)
Signed-off-by: Sourashis Roy <sroy@roblox.com>