[Bugfix] Fix cuda graph sizes when running with speculative decoding (#30330)
Signed-off-by:Patryk Saffer <patryk.saffer99@gmail.com> Signed-off-by:
PatrykSaffer <patryk.saffer@mistral.ai> Co-authored-by:
Patryk Saffer <patryk.saffer99@gmail.com>
Showing
Please register or sign in to comment