What this PR does / why we need it? This pull request performs a comprehensive cleanup of the vLLM Ascend documentation. It fixes numerous typos, grammatical errors, and phrasing issues across community guidelines, developer documents, hardware tutorials, and feature guides. Key improvements include correcting hardware names (e.g., Atlas 300I), fixing broken links, cleaning up code examples (removing duplicate flags and trailing commas), and improving the clarity of technical explanations. These changes are necessary to ensure the documentation is professional, accurate, and easy for users to follow. Does this PR introduce any user-facing change? No, this PR contains documentation-only updates. How was this patch tested? The changes were manually reviewed for accuracy and grammatical correctness. No functional code changes were introduced. --------- Signed-off-by: herizhen <1270637059@qq.com> Signed-off-by: herizhen <59841270+herizhen@users.noreply.github.com>
20 lines
407 B
Markdown
20 lines
407 B
Markdown
# Design Documents
|
|
|
|
This section provides an overview of the features implemented in vLLM Ascend. Developers can refer to this guide to understand how vLLM Ascend works.
|
|
|
|
:::{toctree}
|
|
:caption: Design Documents
|
|
:maxdepth: 1
|
|
patch
|
|
cpu_binding
|
|
ModelRunner_prepare_inputs
|
|
disaggregated_prefill
|
|
eplb_swift_balancer
|
|
ACL_Graph
|
|
KV_Cache_Pool_Guide
|
|
add_custom_aclnn_op
|
|
context_parallel
|
|
quantization
|
|
npugraph_ex
|
|
:::
|