[v1][KVCacheManager] Avoid full cache hit by controlling max_length (#17999)
Signed-off-by:Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
Showing
Please register or sign in to comment
Signed-off-by:Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>