### What this PR does / why we need it?
add feature guide for disaggregated-prefill
### Does this PR introduce _any_ user-facing change?
no
### How was this patch tested?
by ci
- vLLM version: v0.11.0
- vLLM main:
83f478bb19
---------
Signed-off-by: wangxiaoteng <wangxiaoteng@huawei.com>
Signed-off-by: liziyu <liziyu16@huawei.com>
Signed-off-by: wangxiaoteng888 <56506195+wangxiaoteng888@users.noreply.github.com>
Co-authored-by: liziyu <liziyu16@huawei.com>
16 lines
353 B
Markdown
16 lines
353 B
Markdown
# Feature Guide
|
|
|
|
This section provides an overview of the features implemented in vLLM Ascend. Developers can refer to this guide to understand how vLLM Ascend works.
|
|
|
|
:::{toctree}
|
|
:caption: Feature Guide
|
|
:maxdepth: 1
|
|
patch
|
|
ModelRunner_prepare_inputs
|
|
disaggregated_prefill
|
|
eplb_swift_balancer.md
|
|
Multi_Token_Prediction
|
|
ACL_Graph
|
|
KV_Cache_Pool_Guide
|
|
:::
|