[ModelLoader][Feature] Add rfork support for fast model loading (#7392)

### What this PR does / why we need it?
Support a new load format: RFORK.

For implementation details of this feature, please refer to #7441


### Does this PR introduce _any_ user-facing change?

Adds a new option for `--load-format`: `rfork`.

e.g.
```bash
vllm serve /workspace/models/Qwen3-8B --load-format rfork
```

### How was this patch tested?

- vLLM version: v0.17.0
- vLLM main: 4034c3d32e

Signed-off-by: Marck <1412354149@qq.com>
Marck
2026-03-25 16:40:30 +08:00
committed by GitHub
parent 6ddfc41312
commit 17da96658f
11 changed files with 1510 additions and 0 deletions

@@ -13,6 +13,7 @@ structured_output
 lora
 eplb_swift_balancer
 netloader
+rfork
 Multi_Token_Prediction
 dynamic_batch
 epd_disaggregation