Fix cache block size for flash decoding (#2351)
* Fix cache block size for flash decoding This seems to have been accidentally dropped during the TRT-LLM PR rebase. * Also run CI on changes to `backends`
Showing
Please register or sign in to comment