1. Add context parallel user guide
2. Add context parallel related message in supported features/models
- vLLM version: release/v0.13.0
- vLLM main:
bc0a5a0c08
Signed-off-by: zhangsicheng5 <zhangsicheng5@huawei.com>
376 B
376 B
Feature Guide
This section provides a detailed usage guide of vLLM Ascend features.
:::{toctree} :caption: Feature Guide :maxdepth: 1 graph_mode quantization quantization-llm-compressor sleep_mode structured_output lora eplb_swift_balancer netloader dynamic_batch kv_pool external_dp large_scale_ep ucm_deployment Fine_grained_TP speculative_decoding context_parallel :::