Files

xiemingda 59ea23d0d3 [Doc] Add Single NPU (Qwen2.5-VL-7B) tutorial (#311 )

Run vllm-ascend on Single NPU

What this PR does / why we need it?
Add vllm-ascend tutorial doc for Qwen/Qwen2.5-VL-7B-Instruct model
Inference/Serving doc

Does this PR introduce any user-facing change?
no

How was this patch tested?
no

Signed-off-by: xiemingda <xiemingda1002@gmail.com>

2025-03-12 20:37:12 +08:00

source

[Doc] Add Single NPU (Qwen2.5-VL-7B) tutorial (#311 )

2025-03-12 20:37:12 +08:00

Makefile

[Doc] Add sphinx build for vllm-ascend (#55 )

2025-02-13 18:44:17 +08:00

README.md

[Docs] Add dynamic version in docs (#90 )

2025-02-19 08:57:27 +08:00

requirements-docs.txt

[Docs] Add dynamic version in docs (#90 )

2025-02-19 08:57:27 +08:00

requirements-test.txt

[Doc] Add sphinx build for vllm-ascend (#55 )

2025-02-13 18:44:17 +08:00

README.md

vLLM Ascend Plugin documents

Live doc: https://vllm-ascend.readthedocs.io

Build the docs

# Install dependencies.
pip install -r requirements-docs.txt

# Build the docs.
make clean
make html

Open the docs with your browser

python -m http.server -d build/html/

Launch your browser and open http://localhost:8000/.