[Doc][Misc] Comprehensive documentation cleanup and grammatical fixes (#8073)

What this PR does / why we need it? This pull request performs a comprehensive cleanup of the vLLM Ascend documentation. It fixes numerous typos, grammatical errors, and phrasing issues across community guidelines, developer documents, hardware tutorials, and feature guides. Key improvements include correcting hardware names (e.g., Atlas 300I), fixing broken links, cleaning up code examples (removing duplicate flags and trailing commas), and improving the clarity of technical explanations. These changes are necessary to ensure the documentation is professional, accurate, and easy for users to follow. Does this PR introduce any user-facing change? No, this PR contains documentation-only updates. How was this patch tested? The changes were manually reviewed for accuracy and grammatical correctness. No functional code changes were introduced. --------- Signed-off-by: herizhen <1270637059@qq.com> Signed-off-by: herizhen <59841270+herizhen@users.noreply.github.com>
2026-04-09 15:37:57 +08:00
parent c40a387f63
commit 0d1424d81a
71 changed files with 1295 additions and 1296 deletions
--- a/docs/source/tutorials/models/Qwen2.5-Omni.md
+++ b/docs/source/tutorials/models/Qwen2.5-Omni.md
@@ -69,7 +69,7 @@ docker run --rm \
 #### Single NPU (Qwen2.5-Omni-7B)

 :::{note}
-The env `LOCAL_MEDIA_PATH` which allowing API requests to read local images or videos from directories specified by the server file system. Please note this is a security risk. Should only be enabled in trusted environments.
+The **environment variable** `LOCAL_MEDIA_PATH` which **allows** API requests to read local images or videos from directories specified by the server file system. Please note this is a security risk. Should only be enabled in trusted environments.

 :::

@@ -99,7 +99,7 @@ VLLM_TARGET_DEVICE=empty pip install -v ".[audio]"

 `--allowed-local-media-path` is optional, only set it if you need infer model with local media file.

-`--gpu-memory-utilization` should not be set manually only if you know what this parameter aims to.
+`--gpu-memory-utilization` should not be set manually unless you know what this parameter does.

 #### Multiple NPU (Qwen2.5-Omni-7B)

@@ -128,7 +128,7 @@ Not supported yet.

 ## Functional Verification

-If your service start successfully, you can see the info shown below:
+If your service **starts** successfully, you can see the info shown below:

 ```bash
 INFO:     Started server process [2736]
@@ -195,7 +195,7 @@ Refer to [Using AISBench for performance evaluation](../../developer_guide/evalu

 Run performance evaluation of `Qwen2.5-Omni-7B` as an example.

-Refer to [vllm benchmark](https://docs.vllm.ai/en/latest/contributing/benchmarks.html) for more details.
+Refer to [vllm benchmark](https://docs.vllm.ai/en/latest/benchmarking/) for more details.

 There are three `vllm bench` subcommands: