add AMD guide for DeepSeek-R1 (#3338)

7348d962 · Yineng Zhang · GitHub · 25ed22b6 · 7348d962
Commit 7348d962 (unverified), authored Feb 06, 2025 by Yineng Zhang; committed by GitHub Feb 06, 2025

Showing with 2 additions and 0 deletions

benchmark/deepseek_v3/README.md +2 -0

--- a/benchmark/deepseek_v3/README.md
+++ b/benchmark/deepseek_v3/README.md
@@ -11,6 +11,8 @@ For optimizations made on the DeepSeek series models regarding SGLang, please re
 If you do not have GPUs with large enough memory, please try multi-node tensor parallelism. There is an example serving with [2 H20 nodes](https://github.com/sgl-project/sglang/tree/main/benchmark/deepseek_v3#example-serving-with-2-h208) below.
+For running on AMD MI300X, use this as a reference. [Running DeepSeek-R1 on a single NDv5 MI300X VM](https://techcommunity.microsoft.com/blog/azurehighperformancecomputingblog/running-deepseek-r1-on-a-single-ndv5-mi300x-vm/4372726)
 ## Installation & Launch
 If you encounter errors when starting the server, ensure the weights have finished downloading. It's recommended to download them beforehand or restart multiple times until all weights are downloaded.