Add official doc index. Move the release content to the right place. Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
16 lines
897 B
Markdown
16 lines
897 B
Markdown
# Ascend plugin for vLLM
|
|
vLLM Ascend plugin (vllm-ascend) is a community maintained hardware plugin for running vLLM on the Ascend NPU.
|
|
|
|
This plugin is the recommended approach for supporting the Ascend backend within the vLLM community. It adheres to the principles outlined in the [[RFC]: Hardware pluggable](https://github.com/vllm-project/vllm/issues/11162), providing a hardware-pluggable interface that decouples the integration of the Ascend NPU with vLLM.
|
|
|
|
By using vLLM Ascend plugin, popular open-source models, including Transformer-like, Mixture-of-Expert, Embedding, Multi-modal LLMs can run seamlessly on the Ascend NPU.
|
|
|
|
## Contents
|
|
|
|
- [Quick Start](./quick_start.md)
|
|
- [Installation](./installation.md)
|
|
- Usage
|
|
- [Running vLLM with Ascend](./usage/running_vllm_with_ascend.md)
|
|
- [Feature Support](./usage/feature_support.md)
|
|
- [Supported Models](./usage/supported_models.md)
|