2025-02-13 18:44:17 +08:00
# Welcome to vLLM Ascend Plugin
:::{figure} ./logos/vllm-ascend-logo-text-light.png
:align: center
:alt: vLLM
:class: no-scaled-link
:width: 70%
:::
:::{raw} html
< p style = "text-align:center" >
< strong > vLLM Ascend Plugin
< / strong >
< / p >
< p style = "text-align:center" >
< script async defer src = "https://buttons.github.io/buttons.js" > < / script >
< a class = "github-button" href = "https://github.com/vllm-project/vllm-ascend" data-show-count = "true" data-size = "large" aria-label = "Star" > Star< / a >
< a class = "github-button" href = "https://github.com/vllm-project/vllm-ascend/subscription" data-icon = "octicon-eye" data-size = "large" aria-label = "Watch" > Watch< / a >
< a class = "github-button" href = "https://github.com/vllm-project/vllm-ascend/fork" data-icon = "octicon-repo-forked" data-size = "large" aria-label = "Fork" > Fork< / a >
< / p >
:::
vLLM Ascend plugin (vllm-ascend) is a community maintained hardware plugin for running vLLM on the Ascend NPU.
This plugin is the recommended approach for supporting the Ascend backend within the vLLM community. It adheres to the principles outlined in the [[RFC]: Hardware pluggable](https://github.com/vllm-project/vllm/issues/11162), providing a hardware-pluggable interface that decouples the integration of the Ascend NPU with vLLM.
By using vLLM Ascend plugin, popular open-source models, including Transformer-like, Mixture-of-Expert, Embedding, Multi-modal LLMs can run seamlessly on the Ascend NPU.
## Documentation
% How to start using vLLM on Ascend NPU?
:::{toctree}
:caption: Getting Started
:maxdepth: 1
quick_start
installation
2025-03-10 09:27:48 +08:00
tutorials/index.md
2025-03-06 10:42:42 +08:00
faqs
2025-02-13 18:44:17 +08:00
:::
% What does vLLM Ascend Plugin support?
:::{toctree}
2025-02-18 11:20:06 +08:00
:caption: User Guide
2025-02-13 18:44:17 +08:00
:maxdepth: 1
2025-07-10 14:26:59 +08:00
user_guide/support_matrix/index
user_guide/configuration/index
user_guide/feature_guide/index
2025-02-18 11:20:06 +08:00
user_guide/release_notes
2025-02-13 18:44:17 +08:00
:::
2025-03-26 16:25:57 +08:00
% How to contribute to the vLLM Ascend project
2025-02-13 18:44:17 +08:00
:::{toctree}
:caption: Developer Guide
2025-04-30 14:53:58 +08:00
:maxdepth: 1
2025-06-25 12:00:45 +08:00
developer_guide/contribution/index
developer_guide/feature_guide/index
2025-03-31 00:24:25 +08:00
developer_guide/evaluation/index
2025-06-25 12:00:45 +08:00
developer_guide/performance/index
2025-06-27 16:46:49 +08:00
developer_guide/modeling/index
2025-03-26 16:25:57 +08:00
:::
2025-06-05 11:56:51 +08:00
% How to involve vLLM Ascend
:::{toctree}
:caption: Community
:maxdepth: 1
community/governance
community/contributors
2025-06-25 12:00:45 +08:00
community/versioning_policy
2025-06-17 09:36:35 +08:00
community/user_stories/index
2025-03-26 16:25:57 +08:00
:::