Sync from v0.13
This commit is contained in:
8
examples/online_serving/disaggregated_serving/README.md
Normal file
8
examples/online_serving/disaggregated_serving/README.md
Normal file
@@ -0,0 +1,8 @@
|
||||
# Disaggregated Serving
|
||||
|
||||
This example contains scripts that demonstrate the disaggregated serving features of vLLM.
|
||||
|
||||
## Files
|
||||
|
||||
- `disagg_proxy_demo.py` - Demonstrates XpYd (X prefill instances, Y decode instances).
|
||||
- `kv_events.sh` - Demonstrates KV cache event publishing.
|
||||
Reference in New Issue
Block a user