[releases/v0.18.0][Doc][Misc] Modifying Configuration Parameters (#8618)

### What this PR does / why we need it?
This PR renames the environment variable VLLM_NIXL_ABORT_REQUEST_TIMEOUT
to VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT to align with the Mooncake
connector naming convention. It also updates the documentation and test
configurations to reflect this change and adjusts the suggested timeout
value in the documentation to 480 seconds for consistency.

### Does this PR introduce _any_ user-facing change?
Yes. The environment variable for configuring the abort request timeout
has been renamed. Users should update their environment settings from
VLLM_NIXL_ABORT_REQUEST_TIMEOUT to VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT.

### How was this patch tested?
The changes were verified by updating the corresponding test
configuration files and ensuring consistency across the documentation.

---------

Signed-off-by: herizhen <1270637059@qq.com>
Signed-off-by: herizhen <59841270+herizhen@users.noreply.github.com>
This commit is contained in:
herizhen
2026-04-23 16:23:31 +08:00
committed by GitHub
parent ce92be29d2
commit ff76c6780e
20 changed files with 95 additions and 74 deletions

View File

@@ -134,7 +134,7 @@ msgstr ""
#: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:31
msgid "1. Combining KV Cache Pool with on-chip memory Prefix Caching"
msgstr "1. 将 KV 缓存池与片上内存前缀缓存结合"
msgstr "1.将 KV 缓存池与片上内存前缀缓存结合"
#: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:33
msgid ""
@@ -182,7 +182,7 @@ msgstr "将 KV 池中的 KV 缓存加载到片上内存后,剩余过程与片
#: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:46
msgid "2. Combining KV Cache Pool with Mooncake PD Disaggregation"
msgstr "2. 将 KV 缓存池与 Mooncake PD 解耦结合"
msgstr "2.将 KV 缓存池与 Mooncake PD 解耦结合"
#: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:48
msgid ""

View File

@@ -96,7 +96,7 @@ msgstr "工作原理"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:29
msgid "1. Design Approach"
msgstr "1. 设计思路"
msgstr "1.设计思路"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:31
msgid ""
@@ -110,7 +110,7 @@ msgstr ""
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:33
msgid "2. Implementation Design"
msgstr "2. 实现设计"
msgstr "2.实现设计"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:35
msgid ""
@@ -246,7 +246,7 @@ msgstr "**MooncakeConnectorWorker**:用于在工作进程中管理 KV 缓存
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:64
msgid "4. Specifications Design"
msgstr "4. 规格设计"
msgstr "4.规格设计"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:66
msgid ""
@@ -322,7 +322,7 @@ msgstr "DFX 分析"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:87
msgid "1. Config Parameter Validation"
msgstr "1. 配置参数验证"
msgstr "1.配置参数验证"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:89
msgid ""
@@ -335,7 +335,7 @@ msgstr ""
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:91
msgid "2. Port Conflict Detection"
msgstr "2. 端口冲突检测"
msgstr "2.端口冲突检测"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:93
msgid ""
@@ -348,7 +348,7 @@ msgstr ""
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:95
msgid "3. PD Ratio Validation"
msgstr "3. PD 比例验证"
msgstr "3.PD 比例验证"
#: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:97
msgid ""

View File

@@ -434,6 +434,10 @@ msgstr ""
msgid "Consistency"
msgstr "一致性"
#: ../../source/developer_guide/Design_Documents/eplb_swift_balancer.md:236
msgid "Expert Map"
msgstr "专家映射"
#: ../../source/developer_guide/Design_Documents/eplb_swift_balancer.md:237
msgid ""
"The expert map must be globally unique during initialization and update. "

View File

@@ -38,7 +38,7 @@ msgstr "在线服务器"
#: ../../source/developer_guide/evaluation/using_ais_bench.md:7
msgid "1. Start the vLLM server"
msgstr "1. 启动 vLLM 服务器"
msgstr "1.启动 vLLM 服务器"
#: ../../source/developer_guide/evaluation/using_ais_bench.md:9
msgid "You can run docker container to start the vLLM server on a single NPU:"
@@ -60,7 +60,7 @@ msgstr "如果看到如下日志,则 vLLM 服务器启动成功:"
#: ../../source/developer_guide/evaluation/using_ais_bench.md:56
msgid "2. Run different datasets using AISBench"
msgstr "2. 使用 AISBench 运行不同数据集"
msgstr "2.使用 AISBench 运行不同数据集"
#: ../../source/developer_guide/evaluation/using_ais_bench.md:58
msgid "Install AISBench"
@@ -227,7 +227,7 @@ msgstr "执行后,您可以从保存的文件中获取结果,示例如下:
#: ../../source/developer_guide/evaluation/using_ais_bench.md:300
msgid "3. Troubleshooting"
msgstr "3. 故障排除"
msgstr "3.故障排除"
#: ../../source/developer_guide/evaluation/using_ais_bench.md:302
msgid "Invalid Image Path Error"

View File

@@ -28,7 +28,7 @@ msgstr ""
#: ../../source/developer_guide/evaluation/using_evalscope.md:5
msgid "1. Online server"
msgstr "1. 在线服务器"
msgstr "1.在线服务器"
#: ../../source/developer_guide/evaluation/using_evalscope.md:7
msgid "You can run docker container to start the vLLM server on a single NPU:"
@@ -48,7 +48,7 @@ msgstr "服务器启动后,你可以在新的终端中使用输入提示词查
#: ../../source/developer_guide/evaluation/using_evalscope.md:56
msgid "2. Install EvalScope using pip"
msgstr "2. 使用 pip 安装 EvalScope"
msgstr "2.使用 pip 安装 EvalScope"
#: ../../source/developer_guide/evaluation/using_evalscope.md:58
msgid "You can install EvalScope as follows:"
@@ -56,7 +56,7 @@ msgstr "你可以通过以下方式安装 EvalScope"
#: ../../source/developer_guide/evaluation/using_evalscope.md:66
msgid "3. Run GSM8K using EvalScope for accuracy testing"
msgstr "3. 使用 EvalScope 运行 GSM8K 进行精度测试"
msgstr "3.使用 EvalScope 运行 GSM8K 进行精度测试"
#: ../../source/developer_guide/evaluation/using_evalscope.md:68
msgid ""
@@ -81,7 +81,7 @@ msgstr ""
#: ../../source/developer_guide/evaluation/using_evalscope.md:92
msgid "4. Run model inference stress testing using EvalScope"
msgstr "4. 使用 EvalScope 运行模型推理压力测试"
msgstr "4.使用 EvalScope 运行模型推理压力测试"
#: ../../source/developer_guide/evaluation/using_evalscope.md:94
msgid "Install EvalScope[perf] using pip"

View File

@@ -33,7 +33,7 @@ msgstr "在线服务器"
#: ../../source/developer_guide/evaluation/using_lm_eval.md:7
msgid "1. Start the vLLM server"
msgstr "1. 启动 vLLM 服务器"
msgstr "1.启动 vLLM 服务器"
#: ../../source/developer_guide/evaluation/using_lm_eval.md:9
msgid "You can run docker container to start the vLLM server on a single NPU:"
@@ -48,7 +48,7 @@ msgid ""
"2. Run GSM8K using the vLLM server (curl) and then run lm-eval for "
"accuracy testing"
msgstr ""
"2. 使用 vLLM 服务器curl运行 GSM8K然后运行 lm-eval 进行准确率测试"
"2.使用 vLLM 服务器curl运行 GSM8K然后运行 lm-eval 进行准确率测试"
#: ../../source/developer_guide/evaluation/using_lm_eval.md:48
msgid "You can query the result with input prompts:"
@@ -90,7 +90,7 @@ msgstr "离线服务器"
#: ../../source/developer_guide/evaluation/using_lm_eval.md:145
msgid "1. Run docker container"
msgstr "1. 运行 docker 容器"
msgstr "1.运行 docker 容器"
#: ../../source/developer_guide/evaluation/using_lm_eval.md:147
msgid "You can run docker container on a single NPU:"
@@ -98,7 +98,7 @@ msgstr "您可以在单个 NPU 上运行 docker 容器:"
#: ../../source/developer_guide/evaluation/using_lm_eval.md:175
msgid "2. Run GSM8K using lm-eval for accuracy testing"
msgstr "2. 使用 lm-eval 运行 GSM8K 进行准确率测试"
msgstr "2.使用 lm-eval 运行 GSM8K 进行准确率测试"
#: ../../source/developer_guide/evaluation/using_lm_eval.md:203
msgid "After 1 to 2 minutes, the output is shown below:"

View File

@@ -33,7 +33,7 @@ msgstr ""
#: ../../source/developer_guide/evaluation/using_opencompass.md:5
msgid "1. Online Server"
msgstr "1. 在线服务"
msgstr "1.在线服务"
#: ../../source/developer_guide/evaluation/using_opencompass.md:7
msgid "You can run a docker container to start the vLLM server on a single NPU:"
@@ -53,7 +53,7 @@ msgstr "服务器启动后,你可以在新的终端中使用输入提示词来
msgid ""
"2. Run C-Eval (a Chinese language model evaluation benchmark) using "
"OpenCompass for accuracy testing"
msgstr "2. 使用 OpenCompass 运行 C-Eval 进行准确率测试"
msgstr "2.使用 OpenCompass 运行 C-Eval 进行准确率测试"
#: ../../source/developer_guide/evaluation/using_opencompass.md:58
msgid ""