docs: Update dynamo_glossary.md (#2082)

Signed-off-by: Anish <80174047+athreesh@users.noreply.github.com>

Commit 7fbd43ae (unverified), authored Jul 29, 2025 by Anish, committed by GitHub Jul 29, 2025

Showing with 3 additions and 4 deletions

docs/dynamo_glossary.md +3 -4

--- a/docs/dynamo_glossary.md
+++ b/docs/dynamo_glossary.md
@@ -11,16 +11,12 @@
 ## D
 **Decode Phase** - The second phase of LLM inference that generates output tokens one at a time.
-**depends()** - A Dynamo function that creates dependencies between services, enabling automatic client generation and service discovery.
 **Disaggregated Serving** - Dynamo's core architecture that separates prefill and decode phases into specialized engines to maximize GPU throughput and improve performance.
 **Distributed Runtime** - Dynamo's Rust-based core system that manages service discovery, communication, and component lifecycle across distributed clusters.
 **Dynamo** - NVIDIA's high-performance distributed inference framework for Large Language Models (LLMs) and generative AI models, designed for multinode environments with disaggregated serving and cache-aware routing.
-**Dynamo Artifact** - A packaged archive containing an inference graph and its dependencies, created using `dynamo build`. It's the containerized, deployable version of a Graph.
 **Dynamo Cloud** - A Kubernetes platform providing managed deployment experience for Dynamo inference graphs.
 ## E
@@ -80,5 +76,8 @@
 ## V
 **vLLM** - High-throughput LLM serving engine with Ray distributed support and PagedAttention.
+## W
+**Wide Expert Parallelism (WideEP)** - Mixture-of-Experts deployment strategy that spreads experts across many GPUs (e.g., 64-way EP) so each GPU hosts only a few experts.
 ## X
 **xPyD (x Prefill y Decode)** - Dynamo notation describing disaggregated serving configurations where x prefill workers serve y decode workers. Dynamo supports runtime-reconfigurable xPyD.
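
For readers of the new WideEP entry, here is a minimal sketch of the arithmetic behind "each GPU hosts only a few experts". It is not Dynamo code: the function name `assign_experts`, the contiguous placement scheme, and the 256-expert model size are all assumptions chosen for illustration. Only the 64-way EP figure comes from the glossary text.

```python
def assign_experts(num_experts: int, ep_size: int) -> dict[int, list[int]]:
    """Map each expert-parallel rank (one GPU) to the expert ids it hosts.

    Hypothetical helper: uses a simple contiguous split and requires an
    even division, which real deployments may not.
    """
    if ep_size <= 0:
        raise ValueError("ep_size must be positive")
    if num_experts % ep_size != 0:
        raise ValueError("this sketch assumes num_experts is divisible by ep_size")
    per_rank = num_experts // ep_size
    return {
        rank: list(range(rank * per_rank, (rank + 1) * per_rank))
        for rank in range(ep_size)
    }


# Assumed model size of 256 experts, spread 64-way as in the glossary example.
placement = assign_experts(num_experts=256, ep_size=64)
experts_per_gpu = len(placement[0])
print(f"{len(placement)} GPUs, {experts_per_gpu} experts per GPU")
```

Under these assumptions, widening the EP degree from 8 to 64 shrinks the per-GPU expert count from 32 to 4, which is the effect the glossary entry describes.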