### What this PR does / why we need it?
Support an new load format: RFORK
For implementation details of this feature, please refer to #7441
### Does this PR introduce _any_ user-facing change?
add an new options for load-format: rfork
e.g.
```bash
vllm serve /workspace/models/Qwen3-8B --load-format rfork
```
### How was this patch tested?
- vLLM version: v0.17.0
- vLLM main:
4034c3d32e
Signed-off-by: Marck <1412354149@qq.com>
34 lines
515 B
Markdown
34 lines
515 B
Markdown
# Feature Guide
|
|
|
|
This section provides a detailed usage guide of vLLM Ascend features.
|
|
|
|
:::{toctree}
|
|
:caption: Feature Guide
|
|
:maxdepth: 1
|
|
graph_mode
|
|
cpu_binding
|
|
quantization
|
|
sleep_mode
|
|
structured_output
|
|
lora
|
|
eplb_swift_balancer
|
|
netloader
|
|
rfork
|
|
Multi_Token_Prediction
|
|
dynamic_batch
|
|
epd_disaggregation
|
|
kv_pool
|
|
external_dp
|
|
large_scale_ep
|
|
ucm_deployment
|
|
Fine_grained_TP
|
|
layer_sharding
|
|
speculative_decoding
|
|
context_parallel
|
|
npugraph_ex
|
|
weight_prefetch
|
|
sequence_parallelism
|
|
batch_invariance
|
|
lmcache_ascend_deployment
|
|
:::
|