Add static EPLB (#1116)

### What this PR does / why we need it?
   Add EPLB expert map import capabilities
### Does this PR introduce _any_ user-facing change?
When importing the EPLB expert map you need import expert map file by
vllm args additional_config
### How was this patch tested?
1.You need to collect expert hotness and generate an expert placement
file based on the hotness and the EPLB algorithm, or you can directly
use an existing expert placement table.
2.When launching vLLM, enable EC2 and pass the configuration via the
command-line argument:
      --additional-config '{"expert_map_path": "/xxx/xxx/xx.json"}
Co-authored-by: songshanhu07 <1763685535@qq.com>

---------

Signed-off-by: songshanhu07 <1763685535@qq.com>
Signed-off-by: Yuxiao-Xu <664988918@qq.com>
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
Co-authored-by: songshanhu07 <1763685535@qq.com>
Co-authored-by: Xu Yuxiao <xuyuxiao2@huawei.com>
Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com>

This commit is contained in:

Yuxiao-Xu

2025-06-09 19:28:11 +08:00

committed by

GitHub

parent cb341c7bcd

commit 6b853f15fe

6 changed files with 179 additions and 31 deletions

									
										1

vllm_ascend/ascend_config.py
									
												View File
												
				@@ -38,6 +38,7 @@ class AscendConfig:

				        self.expert_tensor_parallel_size = int(

				            additional_config.get("expert_tensor_parallel_size", 0))

				        self.expert_map_path = additional_config.get("expert_map_path", None)

				class TorchairGraphConfig:

Add static EPLB (#1116)

1 vllm_ascend/ascend_config.py Unescape Escape View File

1

vllm_ascend/ascend_config.py

View File