"git@developer.sourcefind.cn:modelzoo/qwen_lmdeploy.git" did not exist on "8cdcb2a92f80bc7752949ad3266edf1bc6595b5c"
Support speculative decoding in the trtllm_mha attention backend (#9331)
Co-authored-by:
ispobock <ispobaoke@gmail.com>
Showing
This diff is collapsed.
Please register or sign in to comment