Commit 51d3cb95 (unverified), authored by Woosuk Kwon, committed by GitHub

Remove max_num_seqs in latency benchmark script (#1855)

parent e74b1736
@@ -19,7 +19,6 @@ def main(args: argparse.Namespace):
         tokenizer=args.tokenizer,
         quantization=args.quantization,
         tensor_parallel_size=args.tensor_parallel_size,
-        max_num_seqs=args.batch_size,
         trust_remote_code=args.trust_remote_code,
         dtype=args.dtype,
     )