[Bugfix] [SpecDecode] Default speculative_draft_tensor_parallel_size to 1 when using MLPSpeculator (#7105)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>