From 0cb4bca1ff93c4130b20afdfc9d6ad64c34eefa3 Mon Sep 17 00:00:00 2001 From: LI SHENGYONG <49200266+shenchuxiaofugui@users.noreply.github.com> Date: Wed, 29 Apr 2026 17:15:29 +0800 Subject: [PATCH] [Doc] EPLB update the documentation (#8795) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit ### What this PR does / why we need it? 1. Update documentation:In the current version, we recommend using the following: policy of swift balancer(2). ### Does this PR introduce _any_ user-facing change? --additional-config '{ "eplb_config": { "dynamic_eplb": true, "expert_heat_collection_interval": 600, "algorithm_execution_interval": 50, "eplb_policy_type": 2, "num_redundant_experts": {ep_size} }}' ### How was this patch tested? Test in DSV3.1 EP32 Signed-off-by: shenchuxiaofugui <1311027364@qq.com> --- .../user_guide/feature_guide/eplb_swift_balancer.md | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/docs/source/user_guide/feature_guide/eplb_swift_balancer.md b/docs/source/user_guide/feature_guide/eplb_swift_balancer.md index 410ca0e6..312669d2 100644 --- a/docs/source/user_guide/feature_guide/eplb_swift_balancer.md +++ b/docs/source/user_guide/feature_guide/eplb_swift_balancer.md @@ -26,7 +26,7 @@ W8A8-Dynamic ### Dynamic EPLB -We need to add environment variable `export DYNAMIC_EPLB="true"` to enable vLLM EPLB. Enable dynamic balancing with auto-tuned parameters. Adjust expert_heat_collection_interval and algorithm_execution_interval based on workload patterns. +We need to add environment variable `export DYNAMIC_EPLB="true"` to enable vLLM EPLB. Enable dynamic balancing with auto-tuned parameters. Adjust expert_heat_collection_interval and algorithm_execution_interval based on workload patterns. In the current version, we recommend using the following: policy of swift balancer(2). ```shell vllm serve Qwen/Qwen3-235B-A22 \ @@ -34,8 +34,10 @@ vllm serve Qwen/Qwen3-235B-A22 \ --enable-expert-parallel \ --additional-config '{ "eplb_config": { "dynamic_eplb": true, - "expert_heat_collection_interval": 400, - "algorithm_execution_interval": 30 + "expert_heat_collection_interval": 600, + "algorithm_execution_interval": 50, + "eplb_policy_type": 2, + "num_redundant_experts": {ep_size}, }}' ```