Unverified Commit a98406d4 authored by Rohan Varma's avatar Rohan Varma Committed by GitHub
Browse files

fix: Update TRT-LLM Wide-EP Disagg GB200 Recipe to be compatible with TRT-LLM Version (#5383)

parent 9ca2923d
......@@ -20,7 +20,6 @@ metadata:
name: prefill-config
data:
prefill_config.yaml: |
build_config:
max_batch_size: 4
max_num_tokens: 4608
max_seq_len: 1227
......@@ -52,7 +51,6 @@ data:
moe_expert_parallel_size: 32
enable_attention_dp: true
pipeline_parallel_size: 1
build_config:
max_batch_size: 32
max_num_tokens: 32
max_seq_len: 2251
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment