[Docs] Fix typos in EP deployment doc (#24669)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

[Docs] Fix typos in EP deployment doc (#24669)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
51d41265 · Harry Mellor · GitHub · 4984a291 · 51d41265
Unverified Commit 51d41265 authored Sep 11, 2025 by Harry Mellor Committed by GitHub Sep 11, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

docs/serving/expert_parallel_deployment.md docs/serving/expert_parallel_deployment.md +2 -2

No files found.
--- a/docs/serving/expert_parallel_deployment.md
+++ b/docs/serving/expert_parallel_deployment.md
@@ -158,10 +158,10 @@ vllm serve Qwen/Qwen3-30B-A3B \
 ### Memory Footprint Overhead
-EPLB uses redundant experts to that need to fit in GPU memory. This means that EPLB may not be a good fit for memory constrained environments or when KV cache space is at a premium.
+EPLB uses redundant experts that need to fit in GPU memory. This means that EPLB may not be a good fit for memory constrained environments or when KV cache space is at a premium.
 This overhead equals `NUM_MOE_LAYERS * BYTES_PER_EXPERT * (NUM_TOTAL_EXPERTS + NUM_REDUNDANT_EXPERTS) ÷ NUM_EP_RANKS`.
-For DeepSeekV3, this is approximately `2.4 GB` for one redundant expert per rank.
+For DeepSeekV3, this is approximately `2.4 GB` for one redundant expert per EP rank.
 ### Example Command