This directory contains an SGLang component for Dynamo and reference implementations for deploying Large Language Models (LLMs) in various configurations using SGLang. SGLang internally uses ZMQ to communicate between the ingress and the engine processes. For Dynamo, we leverage the runtime to communicate directly with the engine processes and handle ingress and pre/post processing on our end.
...
...
@@ -224,7 +224,7 @@ Below we provide a selected list of advanced examples. Please open up an issue i
### Large scale P/D disaggregation with WideEP
-**[Run DeepSeek-R1 on 104+ H100s](docs/dsr1-wideep-h100.md)**
-**[Run DeepSeek-R1 on GB200s](docs/dsr1-wideep-gb200.md)**
-**[Run DeepSeek-R1-FP8 on GB200s](docs/dsr1-wideep-gb200.md)**