[router] update router documentation (#9121)

This commit is contained in:
Simo Lin
2025-08-12 13:16:34 -07:00
committed by GitHub
parent c9ee738515
commit 1ce30dd13e
2 changed files with 361 additions and 85 deletions

View File

@@ -17,6 +17,10 @@ For the design details, please refer to [link](https://docs.google.com/document/
Currently, we support Mooncake and NIXL as the transfer engine.
## Router Integration
For deploying PD disaggregation at scale with load balancing and fault tolerance, SGLang provides a router. The router can distribute requests between prefill and decode instances using various routing policies. For detailed information on setting up routing with PD disaggregation, including configuration options and deployment patterns, see the [SGLang Router documentation](router.md#mode-3-prefill-decode-disaggregation).
## Mooncake
### Requirements