Unverified Commit 8a7368e0 authored by Jade Zheng's avatar Jade Zheng Committed by GitHub
Browse files

[Misc] Remove redundant comment (#16703)


Signed-off-by: default avatarJade Zheng <zheng.shoujian@outlook.com>
parent 93e561ec
......@@ -540,9 +540,6 @@ class GPUModelRunner(LoRAModelRunnerMixin):
# because M (max_model_len) is not necessarily divisible by block_size.
block_table_indices = (req_indices * self.max_num_blocks_per_req +
positions_np // self.block_size)
# NOTE(woosuk): We use torch.index_select instead of np.take here
# because torch.index_select is much faster than np.take for large
# tensors.
block_table_cpu = self.input_batch.block_table.get_cpu_tensor()
block_numbers = block_table_cpu.flatten()[block_table_indices].numpy()
block_offsets = positions_np % self.block_size
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment