[Docs]: Fix Multi-User Port Allocation Conflicts (#3601)
Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com> Co-authored-by: simveit <simp.veitner@gmail.com>
This commit is contained in:
@@ -1,5 +1,5 @@
|
||||
SGLang Documentation
|
||||
====================================
|
||||
====================
|
||||
|
||||
SGLang is a fast serving framework for large language models and vision language models.
|
||||
It makes your interaction with models faster and more controllable by co-designing the backend runtime and frontend language.
|
||||
@@ -10,7 +10,6 @@ The core features include:
|
||||
- **Extensive Model Support**: Supports a wide range of generative models (Llama, Gemma, Mistral, QWen, DeepSeek, LLaVA, etc.), embedding models (e5-mistral, gte) and reward models (Skywork), with easy extensibility for integrating new models.
|
||||
- **Active Community**: SGLang is open-source and backed by an active community with industry adoption.
|
||||
|
||||
|
||||
.. toctree::
|
||||
:maxdepth: 1
|
||||
:caption: Getting Started
|
||||
@@ -39,7 +38,6 @@ The core features include:
|
||||
frontend/frontend.md
|
||||
frontend/choices_methods.md
|
||||
|
||||
|
||||
.. toctree::
|
||||
:maxdepth: 1
|
||||
:caption: SGLang Router
|
||||
@@ -47,24 +45,47 @@ The core features include:
|
||||
router/router.md
|
||||
|
||||
|
||||
References
|
||||
==========
|
||||
|
||||
General
|
||||
---------------------
|
||||
.. toctree::
|
||||
:maxdepth: 1
|
||||
:caption: References
|
||||
|
||||
references/supported_models.md
|
||||
references/contribution_guide.md
|
||||
references/troubleshooting.md
|
||||
references/faq.md
|
||||
references/learn_more.md
|
||||
|
||||
Hardware
|
||||
--------------------------
|
||||
.. toctree::
|
||||
:maxdepth: 1
|
||||
|
||||
references/AMD.md
|
||||
references/amd_configure.md
|
||||
references/nvidia_jetson.md
|
||||
|
||||
Advanced Models & Deployment
|
||||
------------------------------
|
||||
.. toctree::
|
||||
:maxdepth: 1
|
||||
|
||||
references/deepseek.md
|
||||
references/multi_node.md
|
||||
references/multi_node_inference_k8s_lws.md
|
||||
references/modelscope.md
|
||||
|
||||
Performance & Tuning
|
||||
--------------------
|
||||
.. toctree::
|
||||
:maxdepth: 1
|
||||
|
||||
references/sampling_params.md
|
||||
references/hyperparameter_tuning.md
|
||||
references/benchmark_and_profiling.md
|
||||
references/accuracy_evaluation.md
|
||||
references/custom_chat_template.md
|
||||
references/amd_configure.md
|
||||
references/deepseek.md
|
||||
references/multi_node.md
|
||||
references/multi_node_inference_k8s_lws.md
|
||||
references/modelscope.md
|
||||
references/quantization.md
|
||||
references/contribution_guide.md
|
||||
references/troubleshooting.md
|
||||
references/nvidia_jetson.md
|
||||
references/faq.md
|
||||
references/learn_more.md
|
||||
|
||||
Reference in New Issue
Block a user