[Perf] Parallelize fill_bitmask to accelerate high-throughput guided decoding (#21862)
Signed-off-by:
Benjamin Chislett <benjamin.chislett@centml.ai>
Showing
Please register or sign in to comment
Signed-off-by:
Benjamin Chislett <benjamin.chislett@centml.ai>