Unverified Commit d07e797a authored by fzyzcjy's avatar fzyzcjy Committed by GitHub
Browse files

Fix bench_one_batch producing unnatural results for expert parallel (#5149)

parent c555d794
...@@ -207,7 +207,7 @@ def prepare_extend_inputs_for_correctness_test( ...@@ -207,7 +207,7 @@ def prepare_extend_inputs_for_correctness_test(
def prepare_synthetic_inputs_for_latency_test(batch_size, input_len): def prepare_synthetic_inputs_for_latency_test(batch_size, input_len):
input_ids = np.ones((batch_size, input_len), dtype=np.int32) input_ids = np.random.randint(0, 10000, (batch_size, input_len), dtype=np.int32)
sampling_params = SamplingParams( sampling_params = SamplingParams(
temperature=0, temperature=0,
max_new_tokens=BenchArgs.output_len, max_new_tokens=BenchArgs.output_len,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment