Files

yupeng a746f8274f [DOC] Qwen3 PD disaggregation user guide (#2751)

### What this PR does / why we need it?
This PR adds the deployment guide for prefiller and decoder (PD)
disaggregation.

The scenario of the guide is:
- 3 nodes in total, with 2 NPUs on each node
- Qwen3-30B-A3B
- 1P2D
- Expert Parallel

This deployment can be used to verify the PD disaggregation and Expert
Parallel features with slightly fewer resources.

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
No.


- vLLM version: v0.10.1.1
- vLLM main: e599e2c65e

---------

Signed-off-by: paulyu12 <507435917@qq.com>

2025-09-07 10:35:37 +08:00

source

[DOC] Qwen3 PD disaggregation user guide (#2751)

2025-09-07 10:35:37 +08:00

Makefile

[Doc] Add Chinese translation for documentation (#1870)

2025-07-21 11:26:27 +08:00

README.md

[Doc] Add Chinese translation for documentation (#1870)

2025-07-21 11:26:27 +08:00

requirements-docs.txt

[Doc] Add Chinese translation for documentation (#1870)

2025-07-21 11:26:27 +08:00

requirements-test.txt

static EPLB fix bug, add unit test (#1186)

2025-06-18 19:46:56 +08:00

README.md

vLLM Ascend Plugin documents

Live doc: https://vllm-ascend.readthedocs.io

Build the docs

# Install dependencies.
pip install -r requirements-docs.txt

# Build the docs.
make clean
make html

# Build the docs with translation
make intl

# Serve the built docs locally (http.server listens on port 8000 by default)
python -m http.server -d _build/html/

Launch your browser and open:

English version: http://localhost:8000
Chinese version: http://localhost:8000/zh_CN
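
Before starting the server, it can help to confirm that both builds actually produced an entry page. The helper below is a minimal sketch, not part of the repository: the function name `check_docs` is hypothetical, and the `zh_CN/index.html` location is an assumption inferred from the URLs above (the server root being `_build/html/`).

```shell
# Hypothetical helper: report whether the English and Chinese entry pages exist.
# The argument is the build output directory; it defaults to _build/html.
check_docs() {
  root="${1:-_build/html}"
  status=0
  # Assumed layout: English at the root, Chinese under zh_CN/.
  for page in "$root/index.html" "$root/zh_CN/index.html"; do
    if [ -f "$page" ]; then
      echo "ok: $page"
    else
      echo "missing: $page"
      status=1
    fi
  done
  return "$status"
}
```

Run `check_docs` after `make html` and `make intl`. A non-zero exit status means at least one of the two pages was not found under the given directory.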