[Doc] Update DeepSeek V3.1/R1 2P1D doc (#5387)

### What this PR does / why we need it?
The PR updates the documentation for DeepSeek-V3.1 and DeepSeek-R1 in
the scenario of prefill-decode disaggregation.

Updated some PD separation-related setting parameters and optimal
configurations. This script has been verified.

- vLLM version: release/v0.13.0
- vLLM main:
bc0a5a0c08

Signed-off-by: chenmenglong <chenmenglong1@huawei.com>
This commit is contained in:
MengLong Chen
2025-12-27 17:28:43 +08:00
committed by GitHub
parent 843751768e
commit b8b5521f5b
3 changed files with 34 additions and 143 deletions

View File

@@ -231,7 +231,7 @@ vllm serve vllm-ascend/DeepSeek-R1-W8A8 \
### Prefill-Decode Disaggregation
We recommend using Mooncake for deployment: [Mooncake](./pd_disaggregation_mooncake_multi_node.md).
We recommend using DeepSeek-V3.1 for deployment: [DeepSeek-V3.1](./DeepSeek-V3.1.md).
This solution has been tested and demonstrates excellent performance.