Files
xc-llm-ascend/docs/source/index.md
Yikun Jiang 38334f5daa [Docs] Re-arch on doc and make QwQ doc work (#271)
### What this PR does / why we need it?
Re-arch on tutorials, move singe npu / multi npu / multi node to index.
- Unifiy docker run cmd
- Use dropdown to hide build from source installation doc
- Re-arch tutorials to include Qwen/QwQ/DeepSeek
- Make QwQ doc works

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
CI test



Signed-off-by: Yikun Jiang <yikunkero@gmail.com>
2025-03-10 09:27:48 +08:00

1.9 KiB

Welcome to vLLM Ascend Plugin

:::{figure} ./logos/vllm-ascend-logo-text-light.png :align: center :alt: vLLM :class: no-scaled-link :width: 70% :::

:::{raw} html

vLLM Ascend Plugin

<script async defer src="https://buttons.github.io/buttons.js"></script> Star Watch Fork

:::

vLLM Ascend plugin (vllm-ascend) is a community maintained hardware plugin for running vLLM on the Ascend NPU.

This plugin is the recommended approach for supporting the Ascend backend within the vLLM community. It adheres to the principles outlined in the [RFC]: Hardware pluggable, providing a hardware-pluggable interface that decouples the integration of the Ascend NPU with vLLM.

By using vLLM Ascend plugin, popular open-source models, including Transformer-like, Mixture-of-Expert, Embedding, Multi-modal LLMs can run seamlessly on the Ascend NPU.

Documentation

% How to start using vLLM on Ascend NPU? :::{toctree} :caption: Getting Started :maxdepth: 1 quick_start installation tutorials/index.md faqs :::

% What does vLLM Ascend Plugin support? :::{toctree} :caption: User Guide :maxdepth: 1 user_guide/suppoted_features user_guide/supported_models user_guide/release_notes :::

% How to contribute to the vLLM project :::{toctree} :caption: Developer Guide :maxdepth: 1 developer_guide/contributing developer_guide/versioning_policy :::