Unverified Commit 870c3748 authored by Woosuk Kwon's avatar Woosuk Kwon Committed by GitHub
Browse files

[V1][Minor] Remove outdated comment (#12968)


Signed-off-by: default avatarWoosuk Kwon <woosuk.kwon@berkeley.edu>
parent 86222a3d
...@@ -205,8 +205,6 @@ class KVCacheManager: ...@@ -205,8 +205,6 @@ class KVCacheManager:
# Should not exceed the maximum number of blocks per request. # Should not exceed the maximum number of blocks per request.
# This is especially because the block table has the shape # This is especially because the block table has the shape
# [..., max_num_blocks_per_req]. # [..., max_num_blocks_per_req].
# TODO(woosuk): Check and reject requests if
# num_prompt_tokens + max_tokens > max_model_len.
self.max_num_blocks_per_req - len(req_blocks), self.max_num_blocks_per_req - len(req_blocks),
) )
assert num_new_blocks > 0 assert num_new_blocks > 0
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment