From 4ae7588c5242229134bad8fd81257371c0faa678 Mon Sep 17 00:00:00 2001 From: Li Wang Date: Fri, 12 Dec 2025 15:38:29 +0800 Subject: [PATCH] [Doc] Upgrade outdated doc (#4957) ### What this PR does / why we need it? Updated some issues that caused sleep mode document content to be unavailable due to changes/outdated environment variables. --------- Signed-off-by: wangli --- docs/source/user_guide/feature_guide/sleep_mode.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/docs/source/user_guide/feature_guide/sleep_mode.md b/docs/source/user_guide/feature_guide/sleep_mode.md index 845e5a9b..5fa2ef1e 100644 --- a/docs/source/user_guide/feature_guide/sleep_mode.md +++ b/docs/source/user_guide/feature_guide/sleep_mode.md @@ -36,11 +36,12 @@ The following is a simple example of how to use sleep mode. import torch from vllm import LLM, SamplingParams - from vllm.utils import GiB_bytes + from vllm.utils.mem_constants import GiB_bytes os.environ["VLLM_USE_MODELSCOPE"] = "True" os.environ["VLLM_WORKER_MULTIPROC_METHOD"] = "spawn" + os.environ["VLLM_ASCEND_ENABLE_NZ"] = "0" if __name__ == "__main__": prompt = "How are you?" @@ -77,6 +78,7 @@ The following is a simple example of how to use sleep mode. export VLLM_SERVER_DEV_MODE="1" export VLLM_WORKER_MULTIPROC_METHOD="spawn" export VLLM_USE_MODELSCOPE="True" + export VLLM_ASCEND_ENABLE_NZ="0" vllm serve Qwen/Qwen2.5-0.5B-Instruct --enable-sleep-mode