Unverified Commit 3e1a5534 authored by Will Killian's avatar Will Killian Committed by GitHub
Browse files

fix: add missing await in vllm-v1 `clear_kv_blocks` endpoint (#1642)


Signed-off-by: default avatarWill Killian <wkillian@nvidia.com>
parent 34256389
......@@ -126,7 +126,7 @@ class RequestHandler:
async def clear_kv_blocks(self, request=None):
try:
self.engine_client.reset_prefix_cache()
await self.engine_client.reset_prefix_cache()
yield {"status": "success", "message": "KV cache cleared"}
except Exception as e:
yield {"status": "error", "message": str(e)}
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment