Files
xc-llm-ascend/docs/source/user_guide/feature_guide/index.md
shaopeng-666 592661e787 [Doc] EPD doc and load-balance proxy example (#6221)
Add EPD doc and load-balance proxy example

- vLLM version: v0.14.0
- vLLM main:
d68209402d

---------

Signed-off-by: 李少鹏 <lishaopeng21@huawei.com>
2026-03-12 16:17:17 +08:00

32 lines
483 B
Markdown

# Feature Guide
This section provides a detailed usage guide of vLLM Ascend features.
:::{toctree}
:caption: Feature Guide
:maxdepth: 1
graph_mode
cpu_binding
quantization
sleep_mode
structured_output
lora
eplb_swift_balancer
netloader
Multi_Token_Prediction
dynamic_batch
epd_disaggregation
kv_pool
external_dp
large_scale_ep
ucm_deployment
Fine_grained_TP
layer_sharding
speculative_decoding
context_parallel
npugraph_ex
weight_prefetch
sequence_parallelism
batch_invariance
:::