Support speculative decoding in the trtllm_mha attention backend (#9331)
Co-authored-by: ispobock <ispobaoke@gmail.com>