Thomas Parnell authored
[Bugfix] [SpecDecode] Default speculative_draft_tensor_parallel_size to 1 when using MLPSpeculator (#7105)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
b1c9aa3d