2025-02-13 18:44:17 +08:00
# Welcome to vLLM Ascend Plugin
:::{figure} ./logos/vllm-ascend-logo-text-light.png
:align: center
:alt: vLLM
:class: no-scaled-link
:width: 70%
:::
:::{raw} html
< p style = "text-align:center" >
< strong > vLLM Ascend Plugin
< / strong >
< / p >
< p style = "text-align:center" >
< script async defer src = "https://buttons.github.io/buttons.js" > < / script >
< a class = "github-button" href = "https://github.com/vllm-project/vllm-ascend" data-show-count = "true" data-size = "large" aria-label = "Star" > Star< / a >
< a class = "github-button" href = "https://github.com/vllm-project/vllm-ascend/subscription" data-icon = "octicon-eye" data-size = "large" aria-label = "Watch" > Watch< / a >
< a class = "github-button" href = "https://github.com/vllm-project/vllm-ascend/fork" data-icon = "octicon-repo-forked" data-size = "large" aria-label = "Fork" > Fork< / a >
< / p >
:::
2026-02-13 15:50:05 +08:00
vLLM Ascend plugin (vllm-ascend) is a community-maintained hardware plugin for running vLLM on the Ascend NPU.
2025-02-13 18:44:17 +08:00
This plugin is the recommended approach for supporting the Ascend backend within the vLLM community. It adheres to the principles outlined in the [[RFC]: Hardware pluggable](https://github.com/vllm-project/vllm/issues/11162), providing a hardware-pluggable interface that decouples the integration of the Ascend NPU with vLLM.
2025-11-08 18:48:59 +08:00
By using vLLM Ascend plugin, popular open-source models, including Transformer-like, Mixture-of-Experts, Embedding, Multi-modal LLMs can run seamlessly on the Ascend NPU.
2025-02-13 18:44:17 +08:00
## Documentation
% How to start using vLLM on Ascend NPU?
:::{toctree}
:caption: Getting Started
:maxdepth: 1
quick_start
installation
2026-02-10 15:03:35 +08:00
tutorials/models/index
tutorials/features/index
tutorials/hardwares/index
2025-03-06 10:42:42 +08:00
faqs
2025-02-13 18:44:17 +08:00
:::
% What does vLLM Ascend Plugin support?
:::{toctree}
2025-02-18 11:20:06 +08:00
:caption: User Guide
2025-02-13 18:44:17 +08:00
:maxdepth: 1
2025-07-10 14:26:59 +08:00
user_guide/support_matrix/index
user_guide/configuration/index
user_guide/feature_guide/index
2025-12-23 17:46:04 +08:00
user_guide/deployment_guide/index
2025-02-18 11:20:06 +08:00
user_guide/release_notes
2025-02-13 18:44:17 +08:00
:::
2025-03-26 16:25:57 +08:00
% How to contribute to the vLLM Ascend project
2025-02-13 18:44:17 +08:00
:::{toctree}
:caption: Developer Guide
2025-04-30 14:53:58 +08:00
:maxdepth: 1
2025-06-25 12:00:45 +08:00
developer_guide/contribution/index
developer_guide/feature_guide/index
2025-03-31 00:24:25 +08:00
developer_guide/evaluation/index
2025-11-24 21:58:31 +08:00
developer_guide/performance_and_debug/index
2025-03-26 16:25:57 +08:00
:::
2025-06-05 11:56:51 +08:00
% How to involve vLLM Ascend
:::{toctree}
:caption: Community
:maxdepth: 1
community/governance
community/contributors
2025-06-25 12:00:45 +08:00
community/versioning_policy
2025-06-17 09:36:35 +08:00
community/user_stories/index
2025-03-26 16:25:57 +08:00
:::