fix(mistralrs): Disable paged attention (#234)
Under load, mistralrs with paged attention sometimes drops a request: the request is added to the batch (sequence) and immediately receives a FinishReason Stop. The cause is not yet known. This does not happen with the default scheduler (non-paged attention), so switch to that for now.