[Bugfix] Restore CUDA graph persistent buffers for FP8 FlashMLA decode (#35175)
Signed-off-by:haosdent <haosdent@gmail.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
Showing
Please register or sign in to comment
Signed-off-by:haosdent <haosdent@gmail.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>