Unverified Commit e535d90d authored by Nick Hill's avatar Nick Hill Committed by GitHub
Browse files

[ModelRunner V2] Misc minor simplifications and optimizations (#33467)


Signed-off-by: default avatarNick Hill <nickhill123@gmail.com>
parent 0b225fb7
......@@ -10,12 +10,7 @@ from vllm.v1.worker.gpu.input_batch import InputBatch
class StructuredOutputsWorker:
def __init__(
self,
max_num_logits: int,
vocab_size: int,
device: torch.device,
):
def __init__(self, max_num_logits: int, vocab_size: int, device: torch.device):
self.logits_indices = torch.zeros(
max_num_logits, dtype=torch.int32, device=device
)
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment