[releases/v0.18.0][Doc][Misc] Modifying Configuration Parameters (#8618)
### What this PR does / why we need it? This PR renames the environment variable VLLM_NIXL_ABORT_REQUEST_TIMEOUT to VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT to align with the Mooncake connector naming convention. It also updates the documentation and test configurations to reflect this change and adjusts the suggested timeout value in the documentation to 480 seconds for consistency. ### Does this PR introduce _any_ user-facing change? Yes. The environment variable for configuring the abort request timeout has been renamed. Users should update their environment settings from VLLM_NIXL_ABORT_REQUEST_TIMEOUT to VLLM_MOONCAKE_ABORT_REQUEST_TIMEOUT. ### How was this patch tested? The changes were verified by updating the corresponding test configuration files and ensuring consistency across the documentation. --------- Signed-off-by: herizhen <1270637059@qq.com> Signed-off-by: herizhen <59841270+herizhen@users.noreply.github.com>
This commit is contained in:
@@ -49,7 +49,7 @@ msgstr "通用常见问题"
|
||||
|
||||
#: ../../source/faqs.md:10
|
||||
msgid "1. What devices are currently supported?"
|
||||
msgstr "1. 目前支持哪些设备?"
|
||||
msgstr "1.目前支持哪些设备?"
|
||||
|
||||
#: ../../source/faqs.md:12
|
||||
msgid ""
|
||||
@@ -115,7 +115,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:28
|
||||
msgid "2. How to get our docker containers?"
|
||||
msgstr "2. 如何获取我们的 Docker 容器?"
|
||||
msgstr "2.如何获取我们的 Docker 容器?"
|
||||
|
||||
#: ../../source/faqs.md:30
|
||||
msgid ""
|
||||
@@ -154,7 +154,7 @@ msgstr "**在无互联网访问权限的环境中导入 Docker 镜像:**"
|
||||
|
||||
#: ../../source/faqs.md:70
|
||||
msgid "3. What models does vllm-ascend supports?"
|
||||
msgstr "3. vllm-ascend 支持哪些模型?"
|
||||
msgstr "3.vllm-ascend 支持哪些模型?"
|
||||
|
||||
#: ../../source/faqs.md:72
|
||||
msgid ""
|
||||
@@ -164,7 +164,7 @@ msgstr "更多详细信息请参见[<u>此处</u>](https://docs.vllm.ai/projects
|
||||
|
||||
#: ../../source/faqs.md:74
|
||||
msgid "4. How to get in touch with our community?"
|
||||
msgstr "4. 如何与我们的社区取得联系?"
|
||||
msgstr "4.如何与我们的社区取得联系?"
|
||||
|
||||
#: ../../source/faqs.md:76
|
||||
msgid ""
|
||||
@@ -205,7 +205,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:83
|
||||
msgid "5. What features does vllm-ascend V1 supports?"
|
||||
msgstr "5. vllm-ascend V1 支持哪些功能?"
|
||||
msgstr "5.vllm-ascend V1 支持哪些功能?"
|
||||
|
||||
#: ../../source/faqs.md:85
|
||||
msgid ""
|
||||
@@ -217,7 +217,7 @@ msgstr "更多详细信息请参见[<u>此处</u>](https://docs.vllm.ai/projects
|
||||
msgid ""
|
||||
"6. How to solve the problem of \"Failed to infer device type\" or "
|
||||
"\"libatb.so: cannot open shared object file\"?"
|
||||
msgstr "6. 如何解决“无法推断设备类型”或“libatb.so:无法打开共享对象文件”的问题?"
|
||||
msgstr "6.如何解决“无法推断设备类型”或“libatb.so:无法打开共享对象文件”的问题?"
|
||||
|
||||
#: ../../source/faqs.md:89
|
||||
msgid ""
|
||||
@@ -251,7 +251,7 @@ msgstr "如果以上所有步骤都无法解决问题,请随时提交一个 Gi
|
||||
|
||||
#: ../../source/faqs.md:105
|
||||
msgid "7. How vllm-ascend work with vLLM?"
|
||||
msgstr "7. vllm-ascend 如何与 vLLM 协同工作?"
|
||||
msgstr "7.vllm-ascend 如何与 vLLM 协同工作?"
|
||||
|
||||
#: ../../source/faqs.md:107
|
||||
msgid ""
|
||||
@@ -266,7 +266,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:109
|
||||
msgid "8. Does vllm-ascend support Prefill Disaggregation feature?"
|
||||
msgstr "8. vllm-ascend 是否支持 Prefill Disaggregation 功能?"
|
||||
msgstr "8.vllm-ascend 是否支持 Prefill Disaggregation 功能?"
|
||||
|
||||
#: ../../source/faqs.md:111
|
||||
msgid ""
|
||||
@@ -280,7 +280,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:113
|
||||
msgid "9. Does vllm-ascend support quantization method?"
|
||||
msgstr "9. vllm-ascend 是否支持量化方法?"
|
||||
msgstr "9.vllm-ascend 是否支持量化方法?"
|
||||
|
||||
#: ../../source/faqs.md:115
|
||||
msgid ""
|
||||
@@ -290,7 +290,7 @@ msgstr "目前,vllm-ascend 已支持 w8a8、w4a8 和 w4a4 量化方法。"
|
||||
|
||||
#: ../../source/faqs.md:117
|
||||
msgid "10. How is vllm-ascend tested?"
|
||||
msgstr "10. vllm-ascend 是如何测试的?"
|
||||
msgstr "10.vllm-ascend 是如何测试的?"
|
||||
|
||||
#: ../../source/faqs.md:119
|
||||
msgid ""
|
||||
@@ -339,7 +339,7 @@ msgstr "对于每个版本,我们未来都将发布性能测试和准确性测
|
||||
|
||||
#: ../../source/faqs.md:131
|
||||
msgid "11. How to fix the error \"InvalidVersion\" when using vllm-ascend?"
|
||||
msgstr "11. 使用 vllm-ascend 时如何修复 \"InvalidVersion\" 错误?"
|
||||
msgstr "11.使用 vllm-ascend 时如何修复 \"InvalidVersion\" 错误?"
|
||||
|
||||
#: ../../source/faqs.md:133
|
||||
msgid ""
|
||||
@@ -356,7 +356,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:135
|
||||
msgid "12. How to handle the out-of-memory issue?"
|
||||
msgstr "12. 如何处理内存不足问题?"
|
||||
msgstr "12.如何处理内存不足问题?"
|
||||
|
||||
#: ../../source/faqs.md:137
|
||||
msgid ""
|
||||
@@ -410,7 +410,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:147
|
||||
msgid "13. Failed to enable NPU graph mode when running DeepSeek"
|
||||
msgstr "13. 运行 DeepSeek 时无法启用 NPU 图模式"
|
||||
msgstr "13.运行 DeepSeek 时无法启用 NPU 图模式"
|
||||
|
||||
#: ../../source/faqs.md:149
|
||||
msgid ""
|
||||
@@ -438,7 +438,7 @@ msgstr ""
|
||||
msgid ""
|
||||
"14. Failed to reinstall vllm-ascend from source after uninstalling vllm-"
|
||||
"ascend"
|
||||
msgstr "14. 卸载 vllm-ascend 后无法从源码重新安装 vllm-ascend"
|
||||
msgstr "14.卸载 vllm-ascend 后无法从源码重新安装 vllm-ascend"
|
||||
|
||||
#: ../../source/faqs.md:160
|
||||
msgid ""
|
||||
@@ -452,7 +452,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:162
|
||||
msgid "15. How to generate deterministic results when using vllm-ascend?"
|
||||
msgstr "15. 使用 vllm-ascend 时如何生成确定性结果?"
|
||||
msgstr "15.使用 vllm-ascend 时如何生成确定性结果?"
|
||||
|
||||
#: ../../source/faqs.md:164
|
||||
msgid "There are several factors that affect output determinism:"
|
||||
@@ -473,7 +473,7 @@ msgid ""
|
||||
"16. How to fix the error \"ImportError: Please install vllm[audio] for "
|
||||
"audio support\" for the Qwen2.5-Omni model?"
|
||||
msgstr ""
|
||||
"16. 对于 Qwen2.5-Omni 模型,如何修复 \"ImportError: Please install vllm[audio] for"
|
||||
"16.对于 Qwen2.5-Omni 模型,如何修复 \"ImportError: Please install vllm[audio] for"
|
||||
" audio support\" 错误?"
|
||||
|
||||
#: ../../source/faqs.md:202
|
||||
@@ -493,7 +493,7 @@ msgstr ""
|
||||
msgid ""
|
||||
"17. How to troubleshoot and resolve size capture failures resulting from "
|
||||
"stream resource exhaustion, and what are the underlying causes?"
|
||||
msgstr "17. 如何排查和解决因流资源耗尽导致的尺寸捕获失败,其根本原因是什么?"
|
||||
msgstr "17.如何排查和解决因流资源耗尽导致的尺寸捕获失败,其根本原因是什么?"
|
||||
|
||||
#: ../../source/faqs.md:213
|
||||
msgid "Recommended mitigation strategies:"
|
||||
@@ -531,7 +531,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:221
|
||||
msgid "18. How to install custom version of torch_npu?"
|
||||
msgstr "18. 如何安装自定义版本的 torch_npu?"
|
||||
msgstr "18.如何安装自定义版本的 torch_npu?"
|
||||
|
||||
#: ../../source/faqs.md:223
|
||||
msgid ""
|
||||
@@ -546,7 +546,7 @@ msgstr ""
|
||||
msgid ""
|
||||
"19. On certain systems (e.g., Kylin OS), `docker pull` may fail with an "
|
||||
"`invalid tar header` error"
|
||||
msgstr "19. 在某些系统上(例如 Kylin OS),`docker pull` 可能因 `invalid tar header` 错误而失败"
|
||||
msgstr "19.在某些系统上(例如 Kylin OS),`docker pull` 可能因 `invalid tar header` 错误而失败"
|
||||
|
||||
#: ../../source/faqs.md:227
|
||||
msgid ""
|
||||
@@ -581,7 +581,7 @@ msgstr "将 `vllm_ascend_<tag>.tar` 文件(其中 `<tag>` 是你使用的镜
|
||||
msgid ""
|
||||
"20. Why am I getting an error when executing the script to start a Docker"
|
||||
" container? The error message is: \"operation not permitted\""
|
||||
msgstr "20. 为什么执行启动 Docker 容器的脚本时会出错?错误信息是:\"operation not permitted\""
|
||||
msgstr "20.为什么执行启动 Docker 容器的脚本时会出错?错误信息是:\"operation not permitted\""
|
||||
|
||||
#: ../../source/faqs.md:254
|
||||
msgid ""
|
||||
@@ -598,7 +598,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:256
|
||||
msgid "21. How to achieve low latency in a small batch scenario?"
|
||||
msgstr "21. 如何在小批量场景下实现低延迟?"
|
||||
msgstr "21.如何在小批量场景下实现低延迟?"
|
||||
|
||||
#: ../../source/faqs.md:258
|
||||
msgid ""
|
||||
@@ -636,7 +636,7 @@ msgstr ""
|
||||
msgid ""
|
||||
"22. How to set `SOC_VERSION` when building from source on a CPU-only "
|
||||
"machine?"
|
||||
msgstr "22. 在仅含 CPU 的机器上从源码构建时,如何设置 `SOC_VERSION`?"
|
||||
msgstr "22.在仅含 CPU 的机器上从源码构建时,如何设置 `SOC_VERSION`?"
|
||||
|
||||
#: ../../source/faqs.md:271
|
||||
msgid ""
|
||||
@@ -654,7 +654,7 @@ msgstr "你可以参考 `Dockerfile*` 中的默认值。例如:"
|
||||
|
||||
#: ../../source/faqs.md:289
|
||||
msgid "23. Compilation error occasionally encounters with triton-ascend"
|
||||
msgstr "23. triton-ascend 偶尔遇到编译错误"
|
||||
msgstr "23.triton-ascend 偶尔遇到编译错误"
|
||||
|
||||
#: ../../source/faqs.md:291
|
||||
msgid ""
|
||||
@@ -670,7 +670,7 @@ msgstr ""
|
||||
|
||||
#: ../../source/faqs.md:300
|
||||
msgid "24. Why TPOT increases drastically as concurrency grows?"
|
||||
msgstr "24. 为什么 TPOT 随着并发增长而急剧增加?"
|
||||
msgstr "24.为什么 TPOT 随着并发增长而急剧增加?"
|
||||
|
||||
#: ../../source/faqs.md:302
|
||||
msgid ""
|
||||
|
||||
Reference in New Issue
Block a user