[Lint] Style: reformat markdown files via markdownlint (#5884)

### What this PR does / why we need it?
reformat markdown files via markdownlint

- vLLM version: v0.13.0
- vLLM main:
bde38c11df

---------

Signed-off-by: root <root@LAPTOP-VQKDDVMG.localdomain>
Signed-off-by: MrZ20 <2609716663@qq.com>
Co-authored-by: root <root@LAPTOP-VQKDDVMG.localdomain>
This commit is contained in:
SILONG ZENG
2026-01-15 09:06:01 +08:00
committed by GitHub
parent 96edd4673f
commit 4811ba62e0
75 changed files with 711 additions and 308 deletions


@@ -14,9 +14,12 @@ Expert balancing for MoE models in LLM serving is essential for optimal performa
## Support Scenarios
### Models
DeepseekV3/V3.1/R1, Qwen3-MOE
### MOE QuantType
W8A8-dynamic
## How to Use EPLB
@@ -37,6 +40,7 @@ vllm serve Qwen/Qwen3-235B-A22 \
```
### Static EPLB
#### Initial Setup (Record Expert Map)
We need to set the environment variable `export EXPERT_MAP_RECORD="true"` to record the expert map. Generate the initial expert distribution map using `expert_map_record_path`; this creates a baseline configuration for future deployments.
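A minimal sketch of the recording step, assuming the `--additional-config` option accepts an `expert_map_record_path` key as described above; the model name, parallel sizes, and output path are placeholders:

```shell
# Profiling run that records the expert map (hypothetical flag values)
export EXPERT_MAP_RECORD="true"
vllm serve Qwen/Qwen3-235B-A22 \
  --tensor-parallel-size 4 \
  --enable-expert-parallel \
  --additional-config '{"expert_map_record_path": "/path/to/expert_map.json"}'
```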
@@ -54,6 +58,7 @@ vllm serve Qwen/Qwen3-235B-A22 \
```
#### Subsequent Deployments (Use Recorded Map)
Load the pre-recorded expert map for consistent performance. This avoids recalculating distributions at runtime.
```shell
@@ -66,6 +71,7 @@ vllm serve Qwen/Qwen3-235B-A22 \
```
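A sketch of a subsequent deployment that loads the recorded map, assuming a hypothetical `expert_map_path` key in `--additional-config`; model name and path are placeholders:

```shell
# Deployment run that reuses the recorded expert map (hypothetical key name)
vllm serve Qwen/Qwen3-235B-A22 \
  --tensor-parallel-size 4 \
  --enable-expert-parallel \
  --additional-config '{"expert_map_path": "/path/to/expert_map.json"}'
```

Because the map is loaded rather than recomputed, every restart reproduces the same expert placement.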
## Critical Considerations
1. Parameter Tuning:
- `num_iterations_eplb_update`: Higher values (e.g., 400+) for stable workloads; lower values (e.g., 100-200) for fluctuating traffic.
- `num_wait_worker_iterations`: Should be ≥ 30 to avoid premature balancing during startup.
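The tuning knobs above can be sketched as `--additional-config` entries; the `dynamic_eplb` key and the exact key names here are assumptions based on the parameters discussed, and the values reflect the stable-workload guidance:

```shell
# Dynamic EPLB tuned for a stable workload (hypothetical key names)
vllm serve Qwen/Qwen3-235B-A22 \
  --enable-expert-parallel \
  --additional-config '{"dynamic_eplb": true,
                        "num_iterations_eplb_update": 400,
                        "num_wait_worker_iterations": 30}'
```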