add AMD guide for DeepSeek-R1 (#3338)

Yineng Zhang
2025-02-06 16:54:40 +08:00
committed by GitHub
parent 25ed22b685
commit 7348d9627e

@@ -11,6 +11,8 @@ For optimizations made on the DeepSeek series models regarding SGLang, please re
If you do not have GPUs with large enough memory, please try multi-node tensor parallelism. There is an example serving with [2 H20 nodes](https://github.com/sgl-project/sglang/tree/main/benchmark/deepseek_v3#example-serving-with-2-h208) below.
For running on AMD MI300X, see [Running DeepSeek-R1 on a single NDv5 MI300X VM](https://techcommunity.microsoft.com/blog/azurehighperformancecomputingblog/running-deepseek-r1-on-a-single-ndv5-mi300x-vm/4372726) as a reference.
## Installation & Launch
If you encounter errors when starting the server, check that the weights have finished downloading. We recommend downloading them beforehand; otherwise, restart the server until all weights are downloaded.
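
One way to check whether a download finished is to compare the shard files listed in the checkpoint index against the files on disk. The sketch below is not part of SGLang. It assumes the weights sit in a local directory that contains the usual `model.safetensors.index.json` file with a `weight_map` entry, and the helper name `missing_shards` is hypothetical. It only checks that each shard file exists, not that its contents are intact.

```python
import json
from pathlib import Path


def missing_shards(model_dir):
    """Return shard file names listed in the index but absent from model_dir.

    Assumes a sharded safetensors checkpoint whose index file maps each
    parameter name to the shard file that stores it.
    """
    model_dir = Path(model_dir)
    index_path = model_dir / "model.safetensors.index.json"
    if not index_path.is_file():
        # Without the index we cannot tell which shards are expected.
        raise FileNotFoundError(f"index file not found: {index_path}")
    with index_path.open() as f:
        weight_map = json.load(f)["weight_map"]
    # Many parameters share a shard, so deduplicate the file names.
    expected = sorted(set(weight_map.values()))
    return [name for name in expected if not (model_dir / name).is_file()]
```

Calling `missing_shards("/path/to/DeepSeek-R1")` (a placeholder path) returns an empty list when every shard named in the index is present. A non-empty list names the shard files that still need to be downloaded before launching the server.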