[releases/v0.18.0][Doc][Misc] Modifying Configuration Parameters (#8618)
### What this PR does / why we need it?

This PR renames the environment variable VLLM_NIXL_ABORT_REQUEST_TIMEOUT to VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT to align with the Mooncake connector naming convention. It also updates the documentation and test configurations to reflect this change and adjusts the suggested timeout value in the documentation to 480 seconds for consistency.

### Does this PR introduce _any_ user-facing change?

Yes. The environment variable for configuring the abort request timeout has been renamed. Users should update their environment settings from VLLM_NIXL_ABORT_REQUEST_TIMEOUT to VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT.

### How was this patch tested?

The changes were verified by updating the corresponding test configuration files and ensuring consistency across the documentation.

---------

Signed-off-by: herizhen <1270637059@qq.com>
Signed-off-by: herizhen <59841270+herizhen@users.noreply.github.com>
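For users migrating launch scripts, the rename described above could be handled with a small wrapper like the one below. This is only a sketch under stated assumptions: `resolve_abort_timeout` is a hypothetical helper and is not part of vLLM or vllm-ascend, the fallback to the old variable name is an illustration rather than documented behavior, and using 480 as the default simply mirrors the value the updated documentation suggests.

```python
import os
import warnings

# Variable names taken from the PR description.
OLD_NAME = "VLLM_NIXL_ABORT_REQUEST_TIMEOUT"
NEW_NAME = "VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT"

# Assumption: fall back to the value suggested in the updated docs (seconds).
SUGGESTED_TIMEOUT_S = 480


def resolve_abort_timeout(env=None):
    """Return the abort-request timeout in seconds (hypothetical helper).

    Prefers the new variable name, falls back to the old one with a
    deprecation warning, and otherwise returns the suggested value.
    """
    env = os.environ if env is None else env
    if NEW_NAME in env:
        return int(env[NEW_NAME])
    if OLD_NAME in env:
        warnings.warn(
            f"{OLD_NAME} has been renamed to {NEW_NAME}; please update your environment.",
            DeprecationWarning,
            stacklevel=2,
        )
        return int(env[OLD_NAME])
    return SUGGESTED_TIMEOUT_S


if __name__ == "__main__":
    print(resolve_abort_timeout())
```

Passing a plain dictionary instead of `os.environ` keeps the helper easy to check in isolation, for example `resolve_abort_timeout({"VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT": "600"})`.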
@@ -134,7 +134,7 @@ msgstr ""

 #: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:31
 msgid "1. Combining KV Cache Pool with on-chip memory Prefix Caching"
-msgstr "1. 将 KV 缓存池与片上内存前缀缓存结合"
+msgstr "1.将 KV 缓存池与片上内存前缀缓存结合"

 #: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:33
 msgid ""
@@ -182,7 +182,7 @@ msgstr "将 KV 池中的 KV 缓存加载到片上内存后,剩余过程与片

 #: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:46
 msgid "2. Combining KV Cache Pool with Mooncake PD Disaggregation"
-msgstr "2. 将 KV 缓存池与 Mooncake PD 解耦结合"
+msgstr "2.将 KV 缓存池与 Mooncake PD 解耦结合"

 #: ../../source/developer_guide/Design_Documents/KV_Cache_Pool_Guide.md:48
 msgid ""
@@ -96,7 +96,7 @@ msgstr "工作原理"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:29
 msgid "1. Design Approach"
-msgstr "1. 设计思路"
+msgstr "1.设计思路"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:31
 msgid ""
@@ -110,7 +110,7 @@ msgstr ""

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:33
 msgid "2. Implementation Design"
-msgstr "2. 实现设计"
+msgstr "2.实现设计"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:35
 msgid ""
@@ -246,7 +246,7 @@ msgstr "**MooncakeConnectorWorker**:用于在工作进程中管理 KV 缓存

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:64
 msgid "4. Specifications Design"
-msgstr "4. 规格设计"
+msgstr "4.规格设计"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:66
 msgid ""
@@ -322,7 +322,7 @@ msgstr "DFX 分析"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:87
 msgid "1. Config Parameter Validation"
-msgstr "1. 配置参数验证"
+msgstr "1.配置参数验证"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:89
 msgid ""
@@ -335,7 +335,7 @@ msgstr ""

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:91
 msgid "2. Port Conflict Detection"
-msgstr "2. 端口冲突检测"
+msgstr "2.端口冲突检测"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:93
 msgid ""
@@ -348,7 +348,7 @@ msgstr ""

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:95
 msgid "3. PD Ratio Validation"
-msgstr "3. PD 比例验证"
+msgstr "3.PD 比例验证"

 #: ../../source/developer_guide/Design_Documents/disaggregated_prefill.md:97
 msgid ""
@@ -434,6 +434,10 @@ msgstr ""
 msgid "Consistency"
 msgstr "一致性"

+#: ../../source/developer_guide/Design_Documents/eplb_swift_balancer.md:236
+msgid "Expert Map"
+msgstr "专家映射"
+
 #: ../../source/developer_guide/Design_Documents/eplb_swift_balancer.md:237
 msgid ""
 "The expert map must be globally unique during initialization and update. "
@@ -38,7 +38,7 @@ msgstr "在线服务器"

 #: ../../source/developer_guide/evaluation/using_ais_bench.md:7
 msgid "1. Start the vLLM server"
-msgstr "1. 启动 vLLM 服务器"
+msgstr "1.启动 vLLM 服务器"

 #: ../../source/developer_guide/evaluation/using_ais_bench.md:9
 msgid "You can run docker container to start the vLLM server on a single NPU:"
@@ -60,7 +60,7 @@ msgstr "如果看到如下日志,则 vLLM 服务器启动成功:"

 #: ../../source/developer_guide/evaluation/using_ais_bench.md:56
 msgid "2. Run different datasets using AISBench"
-msgstr "2. 使用 AISBench 运行不同数据集"
+msgstr "2.使用 AISBench 运行不同数据集"

 #: ../../source/developer_guide/evaluation/using_ais_bench.md:58
 msgid "Install AISBench"
@@ -227,7 +227,7 @@ msgstr "执行后,您可以从保存的文件中获取结果,示例如下:

 #: ../../source/developer_guide/evaluation/using_ais_bench.md:300
 msgid "3. Troubleshooting"
-msgstr "3. 故障排除"
+msgstr "3.故障排除"

 #: ../../source/developer_guide/evaluation/using_ais_bench.md:302
 msgid "Invalid Image Path Error"
@@ -28,7 +28,7 @@ msgstr ""

 #: ../../source/developer_guide/evaluation/using_evalscope.md:5
 msgid "1. Online server"
-msgstr "1. 在线服务器"
+msgstr "1.在线服务器"

 #: ../../source/developer_guide/evaluation/using_evalscope.md:7
 msgid "You can run docker container to start the vLLM server on a single NPU:"
@@ -48,7 +48,7 @@ msgstr "服务器启动后,你可以在新的终端中使用输入提示词查

 #: ../../source/developer_guide/evaluation/using_evalscope.md:56
 msgid "2. Install EvalScope using pip"
-msgstr "2. 使用 pip 安装 EvalScope"
+msgstr "2.使用 pip 安装 EvalScope"

 #: ../../source/developer_guide/evaluation/using_evalscope.md:58
 msgid "You can install EvalScope as follows:"
@@ -56,7 +56,7 @@ msgstr "你可以通过以下方式安装 EvalScope:"

 #: ../../source/developer_guide/evaluation/using_evalscope.md:66
 msgid "3. Run GSM8K using EvalScope for accuracy testing"
-msgstr "3. 使用 EvalScope 运行 GSM8K 进行精度测试"
+msgstr "3.使用 EvalScope 运行 GSM8K 进行精度测试"

 #: ../../source/developer_guide/evaluation/using_evalscope.md:68
 msgid ""
@@ -81,7 +81,7 @@ msgstr ""

 #: ../../source/developer_guide/evaluation/using_evalscope.md:92
 msgid "4. Run model inference stress testing using EvalScope"
-msgstr "4. 使用 EvalScope 运行模型推理压力测试"
+msgstr "4.使用 EvalScope 运行模型推理压力测试"

 #: ../../source/developer_guide/evaluation/using_evalscope.md:94
 msgid "Install EvalScope[perf] using pip"
@@ -33,7 +33,7 @@ msgstr "在线服务器"

 #: ../../source/developer_guide/evaluation/using_lm_eval.md:7
 msgid "1. Start the vLLM server"
-msgstr "1. 启动 vLLM 服务器"
+msgstr "1.启动 vLLM 服务器"

 #: ../../source/developer_guide/evaluation/using_lm_eval.md:9
 msgid "You can run docker container to start the vLLM server on a single NPU:"
@@ -48,7 +48,7 @@ msgid ""
 "2. Run GSM8K using the vLLM server (curl) and then run lm-eval for "
 "accuracy testing"
 msgstr ""
-"2. 使用 vLLM 服务器(curl)运行 GSM8K,然后运行 lm-eval 进行准确率测试"
+"2.使用 vLLM 服务器(curl)运行 GSM8K,然后运行 lm-eval 进行准确率测试"

 #: ../../source/developer_guide/evaluation/using_lm_eval.md:48
 msgid "You can query the result with input prompts:"
@@ -90,7 +90,7 @@ msgstr "离线服务器"

 #: ../../source/developer_guide/evaluation/using_lm_eval.md:145
 msgid "1. Run docker container"
-msgstr "1. 运行 docker 容器"
+msgstr "1.运行 docker 容器"

 #: ../../source/developer_guide/evaluation/using_lm_eval.md:147
 msgid "You can run docker container on a single NPU:"
@@ -98,7 +98,7 @@ msgstr "您可以在单个 NPU 上运行 docker 容器:"

 #: ../../source/developer_guide/evaluation/using_lm_eval.md:175
 msgid "2. Run GSM8K using lm-eval for accuracy testing"
-msgstr "2. 使用 lm-eval 运行 GSM8K 进行准确率测试"
+msgstr "2.使用 lm-eval 运行 GSM8K 进行准确率测试"

 #: ../../source/developer_guide/evaluation/using_lm_eval.md:203
 msgid "After 1 to 2 minutes, the output is shown below:"
@@ -33,7 +33,7 @@ msgstr ""

 #: ../../source/developer_guide/evaluation/using_opencompass.md:5
 msgid "1. Online Server"
-msgstr "1. 在线服务"
+msgstr "1.在线服务"

 #: ../../source/developer_guide/evaluation/using_opencompass.md:7
 msgid "You can run a docker container to start the vLLM server on a single NPU:"
@@ -53,7 +53,7 @@ msgstr "服务器启动后,你可以在新的终端中使用输入提示词来
 msgid ""
 "2. Run C-Eval (a Chinese language model evaluation benchmark) using "
 "OpenCompass for accuracy testing"
-msgstr "2. 使用 OpenCompass 运行 C-Eval 进行准确率测试"
+msgstr "2.使用 OpenCompass 运行 C-Eval 进行准确率测试"

 #: ../../source/developer_guide/evaluation/using_opencompass.md:58
 msgid ""