[Bugfix] Fix tensor shape mismatch in sparse attention with speculative decoding (#39542)
Signed-off-by:
Santino Ramos <santinor@inferact.ai>
Showing
Please register or sign in to comment
Signed-off-by:
Santino Ramos <santinor@inferact.ai>