Files
xc-llm-ascend/docs/source/user_guide/feature_guide/index.md
zhaomingyu13 039cc65e58 [Doc] Add user guide of speculative decoding (#5074)
### What this PR does / why we need it?
Add user guide of speculative decoding that includes n-grams, EAGLE,
MTP, and suffix.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?

- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c

Signed-off-by: zhaomingyu <zhaomingyu13@h-partners.com>
2025-12-16 17:01:44 +08:00

343 B

Feature Guide

This section provides a detailed usage guide of vLLM Ascend features.

:::{toctree} :caption: Feature Guide :maxdepth: 1 graph_mode quantization quantization-llm-compressor sleep_mode structured_output lora eplb_swift_balancer netloader dynamic_batch kv_pool external_dp large_scale_ep ucm_deployment speculative_decoding :::