[Spec Decode] Add Batch Parallel Ngram. Upto 8x lower overhead. (#24986)
Signed-off-by:Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
Showing
Please register or sign in to comment