xc-llm-ascend

Author	SHA1	Message	Date
Wangbei25	571edc58fa	[Doc]Update DeepSeekOCR2.md for releases/v0.18.0 (#8604 ) ### What this PR does / why we need it? Update DeepSeekOCR2.md for releases/v0.18.0 ### Does this PR introduce _any_ user-facing change? NO ### How was this patch tested? vLLM version: v0.18.0 vLLM main: `bcf2be9612` --------- Signed-off-by: Wangbei25 <wangbei41@huawie.com> Signed-off-by: Wangbei25 <wangbei41@huawei.com> Co-authored-by: Wangbei25 <wangbei41@huawie.com>	2026-04-23 23:48:03 +08:00
SparrowMu	c1f323ee46	[Doc] Add new intro to MiniMax-M2.5/M2.7 (#8169 ) ### What this PR does / why we need it? 1. This PR cherry-picks the commit that contains the current best performance at 3.5k/1.5k and 128k/1k from main to the 0.18.0 branch. 2. This PR introduces MiniMax-M2.7 day-0 information to users. 3. To complete the previous step, it also renames the MiniMax doc from MiniMax-M2.5.md to MiniMax-M2.md --------- Signed-off-by: limuyuan <limuyuan3@huawei.com> Co-authored-by: limuyuan <limuyuan3@huawei.com>	2026-04-12 21:45:07 +08:00
herizhen	0d1424d81a	[Doc][Misc] Comprehensive documentation cleanup and grammatical fixes (#8073 ) What this PR does / why we need it? This pull request performs a comprehensive cleanup of the vLLM Ascend documentation. It fixes numerous typos, grammatical errors, and phrasing issues across community guidelines, developer documents, hardware tutorials, and feature guides. Key improvements include correcting hardware names (e.g., Atlas 300I), fixing broken links, cleaning up code examples (removing duplicate flags and trailing commas), and improving the clarity of technical explanations. These changes are necessary to ensure the documentation is professional, accurate, and easy for users to follow. Does this PR introduce any user-facing change? No, this PR contains documentation-only updates. How was this patch tested? The changes were manually reviewed for accuracy and grammatical correctness. No functional code changes were introduced. --------- Signed-off-by: herizhen <1270637059@qq.com> Signed-off-by: herizhen <59841270+herizhen@users.noreply.github.com>	2026-04-09 15:37:57 +08:00
yydyzr	8ce4cfdae7	[Doc][Misc][v0.18.0] Add GLM5 to supported model list and update deployment document for GLM5 (#7963 ) ### What this PR does / why we need it? 1. Add version notes for GLM5. 2. Add parameter modification for GLM5. 3. Add GLM5 to supported model list. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.18.0 - vLLM main: `35141a7eed` --------- Signed-off-by: yydyzr <liuyuncong1@huawei.com> Signed-off-by: Zhu Jiyang <zhujiyang2@huawei.com> Co-authored-by: Zhu Jiyang <zhujiyang2@huawei.com>	2026-04-03 10:15:39 +08:00
shaopeng-666	3218eb9fe1	[DOC]update Qwen3.5 user guide (#7934 ) This PR is a cherry-pick of #7866. It updates the model user guide. --------- Signed-off-by: 李少鹏 <lishaopeng21@huawei.com>	2026-04-02 22:09:00 +08:00
aipaes	5e65062973	[doc] Fix issues in the GLM4.7 documentation (#7457 ) ### What this PR does / why we need it? Fix issues in the GLM4.7 documentation and add some missing explanations. ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? document test - vLLM version: v0.17.0 - vLLM main: `8a680463fa` --------- Signed-off-by: zjks98 <zhangjiakang4@huawei.com> Co-authored-by: zjks98 <zhangjiakang4@huawei.com>	2026-03-19 16:42:59 +08:00
NJX	bb7ed759d4	[Doc] Fix broken chunked-prefill URL in supported features (#6963 ) ## What this PR does / why we need it? Fixes the broken URL for chunked-prefill in the supported features documentation page. The chunked prefill documentation URL was moved from `performance/optimization.html` to `configuration/optimization.html` in upstream vLLM docs. This PR updates the link to point to the correct location. Before: https://docs.vllm.ai/en/stable/performance/optimization.html#chunked-prefill (404) After: https://docs.vllm.ai/en/stable/configuration/optimization.html#chunked-prefill (working) ## Does this PR introduce _any_ user-facing change? Yes - fixes a broken documentation link that users encounter when clicking 'Chunked Prefill' in the supported features page. ## How was this patch tested? - Verified the new URL resolves correctly - Documentation change only Closes #4217 - vLLM version: v0.16.0 - vLLM main: `15d76f74e2` Signed-off-by: NJX-njx <3771829673@qq.com>	2026-03-10 10:10:07 +08:00
zzzzwwjj	f19f7b1fe2	[doc] fix supported_models (#6930 ) ### What this PR does / why we need it? Add Experimental supported model/feature for supported_models.md ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.16.0 - vLLM main: `15d76f74e2` Signed-off-by: zzzzwwjj <1183291235@qq.com>	2026-03-03 09:47:50 +08:00
zzzzwwjj	5c8ab7af39	[main]update release note & support matrix (#6759 ) ### What this PR does / why we need it? Update release note & support matrix to add experimental tag for features and models. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.15.0 - vLLM main: `9562912cea` 0.13.0 branch: https://github.com/vllm-project/vllm-ascend/pull/6751 Signed-off-by: zzzzwwjj <1183291235@qq.com>	2026-02-24 17:39:35 +08:00
wangxiyuan	7d4833bce9	[Doc][Misc] Restructure tutorial documentation (#6501 ) ### What this PR does / why we need it? This PR refactors the tutorial documentation by restructuring it into three categories: Models, Features, and Hardware. This improves the organization and navigation of the tutorials, making it easier for users to find relevant information. - The single `tutorials/index.md` is split into three separate index files: - `docs/source/tutorials/models/index.md` - `docs/source/tutorials/features/index.md` - `docs/source/tutorials/hardwares/index.md` - Existing tutorial markdown files have been moved into their respective new subdirectories (`models/`, `features/`, `hardwares/`). - The main `index.md` has been updated to link to these new tutorial sections. This change makes the documentation structure more logical and scalable for future additions. ### Does this PR introduce _any_ user-facing change? Yes, this PR changes the structure and URLs of the tutorial documentation pages. Users following old links to tutorials will encounter broken links. It is recommended to set up redirects if the documentation framework supports them. ### How was this patch tested? These are documentation-only changes. The documentation should be built and reviewed locally to ensure all links are correct and the pages render as expected. - vLLM version: v0.15.0 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.15.0 Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2026-02-10 15:03:35 +08:00
zhangxinyuehfad	08a45e6053	[Doc] update supported features (#6165 ) ### What this PR does / why we need it? update supported features - vLLM version: v0.13.0 - vLLM main: `d68209402d` Signed-off-by: hfadzxy <starmoon_zhang@163.com>	2026-01-23 09:50:11 +08:00
Canlin Guo	afabb49f00	[Docs][Model] Support Qwen3-VL-Embedding & Qwen3-VL-Reranker (#6034 ) ### What this PR does / why we need it? Add docs for Qwen3-VL-Embedding & Qwen3-VL-Reranker. - vLLM version: v0.13.0 - vLLM main: `2c24bc6996` --------- Signed-off-by: gcanlin <canlinguosdu@gmail.com>	2026-01-20 17:36:31 +08:00
herizhen	0eafed9bd6	[doc]Table split (#5929 ) ### What this PR does / why we need it? Added legend descriptions, and split redundant tables into core supported model tables and extended compatible model tables. ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? ut - vLLM version: v0.13.0 - vLLM main: `11b6af5280` --------- Signed-off-by: herizhen <1270637059@qq.com>	2026-01-19 09:15:04 +08:00
SILONG ZENG	4811ba62e0	[Lint]Style: reformat markdown files via markdownlint (#5884 ) ### What this PR does / why we need it? reformat markdown files via markdownlint - vLLM version: v0.13.0 - vLLM main: `bde38c11df` --------- Signed-off-by: root <root@LAPTOP-VQKDDVMG.localdomain> Signed-off-by: MrZ20 <2609716663@qq.com> Co-authored-by: root <root@LAPTOP-VQKDDVMG.localdomain>	2026-01-15 09:06:01 +08:00
wangxiyuan	354ee3b330	[Doc] Update doc url link (#5781 ) Drop `dev` suffix for doc url. Rename url to `https://docs.vllm.ai/projects/ascend` - vLLM version: v0.13.0 - vLLM main: `2f4e6548ef` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2026-01-12 11:21:31 +08:00
1092626063	3ba064f804	[Doc] Add GLM4.5 GLM4.6 doc (#5740 ) ### What this PR does / why we need it? Add GLM4.5 GLM4.6 doc - vLLM version: v0.13.0 - vLLM main: `2f4e6548ef` Signed-off-by: 1092626063 <1092626063@qq.com>	2026-01-09 16:40:49 +08:00
zyz111222	98c788a65a	[Doc] add PaddleOCR-VL tutorials guide (#5556 ) ### What this PR does / why we need it? 1. add PaddleOCR-VL.md in the `docs/source/tutorials/` 2. add PaddleOCR-VL index in `docs/source/tutorials/index.md` ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? by CI - vLLM version: v0.13.0 - vLLM main: `7157596103` Signed-off-by: zouyizhou <zouyizhou@huawei.com>	2026-01-09 11:01:25 +08:00
meihanc	503822c56c	[Doc] Add Qwen3-Omni-30B-A3B-Thinking Tutorials (#3991 ) ### What this PR does / why we need it? Add Qwen3-Omni-30B-A3B-Thinking Tutorials ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? - vLLM version: v0.13.0 - vLLM main: `5326c89803` --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>	2026-01-08 16:57:20 +08:00
zhangxinyuehfad	a099b994b3	[Doc] update supported models (#5379 ) ### What this PR does / why we need it? 1. update supported models: Llama2 & Kimi-K2-Thinking & ERNIE-4.5 & Qwen3-Omni 2. update Supported Hardware - vLLM version: release/v0.13.0 - vLLM main: `bc0a5a0c08` Signed-off-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: Mengqing Cao <cmq0113@163.com>	2026-01-05 09:21:52 +08:00
zhangsicheng5	8ed87dfa84	[doc] Add context parallel user guide (#5358 ) 1. Add context parallel user guide 2. Add context parallel related message in supported features/models - vLLM version: release/v0.13.0 - vLLM main: `bc0a5a0c08` Signed-off-by: zhangsicheng5 <zhangsicheng5@huawei.com>	2025-12-26 17:03:47 +08:00
zhangyiming	dc047489c7	[Doc] Fix DeepSeek-V3.2 tutorial. (#5190 ) ### What this PR does / why we need it? Fix DeepSeek-V3.2 tutorial. - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` Signed-off-by: menogrey <1299267905@qq.com>	2025-12-22 11:30:17 +08:00
luluxiu520	bc05a81bf2	Add Qwen3-VL-235B-A22B-Instruct tutorials (#5167 ) ### What this PR does / why we need it? This PR provides an introduction to the Qwen3-VL-235B-A22B-Instruct model, details on the features supported by the model in the current version, the model deployment process, as well as methods for performance testing and accuracy testing. With this document, the deployment and testing of the Qwen3-VL-235B-A22B-Instruct model can be implemented more easily. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` Signed-off-by: luluxiu520 <l2625793@outlook.com>	2025-12-19 14:56:17 +08:00
1092626063	f952de93df	【Doc】Deepseekv3.1/R1 doc enhancement (#4827 ) ### What this PR does / why we need it? Deepseekv3.1、DeepSeekR1 doc enhancement - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: 1092626063 <1092626063@qq.com>	2025-12-19 10:52:33 +08:00
TingW09	879ec2d1c4	[Doc] add qwen3 reranker (#5086 ) ### What this PR does / why we need it? add qwen3 reranker tutorials ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.12.0 --------- Signed-off-by: TingW09 <944713709@qq.com>	2025-12-18 10:54:07 +08:00
lilinsiman	31c94b7e7b	[doc][main] Correct more doc mistakes (#4958 ) ### What this PR does / why we need it? Correct more doc mistakes - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` Signed-off-by: lilinsiman <lilinsiman@gmail.com>	2025-12-13 18:36:58 +08:00
lilinsiman	fc818f1509	[doc][main] Correct mistakes in doc (#4945 ) ### What this PR does / why we need it? Correct mistakes in doc - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: lilinsiman <lilinsiman@gmail.com>	2025-12-12 19:17:10 +08:00
1092626063	62a9fea7af	【doc】Add model feature matrix (#4950 ) ### What this PR does / why we need it? Add a model feature matrix to the tutorial docs for: DeepSeekR1, DeepSeekV3.1, Qwen3-Dense, Qwen3-Moe, Qwen3-Next, Qwen2.5, Qwen2.5-VL, Qwen3-VL ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: 1092626063 <1092626063@qq.com>	2025-12-12 15:37:39 +08:00
wangxiyuan	e538fa6f9c	[Doc] Update tutorial index (#4920 ) Update tutorial index and remove useless doc - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-12-11 20:53:13 +08:00
SILONG ZENG	ff7d703192	[Doc]Add tutorial document for qwen-VL-Dense (#3516 ) ### What this PR does / why we need it? This document employs the qwen3-vl-8b model and qwen2.5-vl-32b to demonstrate the primary verification steps for the Qwen-VL series dense models, including supported features, feature configuration, environment preparation, NPU deployment, and accuracy and performance evaluation. - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: MrZ20 <2609716663@qq.com>	2025-12-11 08:55:23 +08:00
lianyibo	e32014ac1d	[Model] Support pooling models (#3122 ) ### What this PR does / why we need it? Support pooling models (like `bge-reranker-v2-m3`) in vllm-ascend. This PR covers the three embed model types (cls_token, mean_token, lasttoken). Since this [commit](`17373dcd93`), vllm has supported adapting pooling models on the v1 engine. This PR includes the corresponding adaptations on the vllm-ascend side. Fixes #1960 - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: lianyibo <lianyibo1@kunlunit.com> Signed-off-by: MengqingCao <cmq0113@163.com> Co-authored-by: MengqingCao <cmq0113@163.com>	2025-12-10 11:37:57 +08:00
yeyifan	8907010815	[Doc] Add tutorial for Qwen3-Coder-30B-A3B (#4391 ) ### What this PR does / why we need it? Add tutorial for Qwen3-Coder-30B-A3B - vLLM version: v0.11.2 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2 --------- Signed-off-by: wangli <wangli858794774@gmail.com> Signed-off-by: nsdie <yeyifan@huawei.com> Signed-off-by: herizhen <you@example.com> Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com> Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com> Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: wangxiaoxin-sherie <wangxiaoxin7@huawei.com> Signed-off-by: weijinqian_v1 <weijinqian@huawei.com> Signed-off-by: weijinqian0 <1184188277@qq.com> Co-authored-by: Li Wang <wangli858794774@gmail.com> Co-authored-by: herizhen <59841270+herizhen@users.noreply.github.com> Co-authored-by: herizhen <you@example.com> Co-authored-by: Yizhou <136800916+yiz-liu@users.noreply.github.com> Co-authored-by: jiangyunfan1 <jiangyunfan1@h-partners.com> Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com> Co-authored-by: XiaoxinWang <963372609@qq.com> Co-authored-by: wangxiaoxin-sherie <wangxiaoxin7@huawei.com> Co-authored-by: weijinqian0 <1184188277@qq.com> Co-authored-by: weijinqian_v1 <weijinqian@huawei.com>	2025-12-02 16:03:37 +08:00
lilinsiman	adee9dd3b1	[Info][main] Correct the mistake in information documents (#4157 ) ### What this PR does / why we need it? Correct the mistake in information documents ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? ut - vLLM version: v0.11.0 - vLLM main: `2918c1b49c` --------- Signed-off-by: lilinsiman <lilinsiman@gmail.com>	2025-11-13 15:53:58 +08:00
herizhen	75c3f9a780	[Typo] LLama has been changed to Llama (#4089 ) ### What this PR does / why we need it? The first-generation model uses "LLama"; subsequent models use "Llama". The second "L" here should be lowercase. Other instances of "LLama" on this page should be corrected accordingly. ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? ut - vLLM version: v0.11.0 - vLLM main: `83f478bb19` Signed-off-by: herizhen <you@example.com> Co-authored-by: herizhen <you@example.com>	2025-11-10 16:22:52 +08:00
lilinsiman	a3ff765c65	[Info][main] Corrected the errors in the information (#4055 ) ### What this PR does / why we need it? Corrected the errors in the information ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? ut - vLLM version: v0.11.0 - vLLM main: `83f478bb19` Signed-off-by: lilinsiman <lilinsiman@gmail.com>	2025-11-08 18:48:59 +08:00
zhangyiming	46ef280105	[Doc] Add model feature matrix table. (#4040 ) ### What this PR does / why we need it? Add model feature matrix table. - vLLM version: v0.11.0 - vLLM main: `83f478bb19` Signed-off-by: menogrey <1299267905@qq.com>	2025-11-07 11:28:05 +08:00
zhangxinyuehfad	789ba4c5c2	[Doc] Update doc (#3836 ) ### What this PR does / why we need it? Update doc ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/releases/v0.11.1 Signed-off-by: hfadzxy <starmoon_zhang@163.com>	2025-10-29 11:03:39 +08:00
zhangxinyuehfad	0637e8f021	[Doc] Update supported models (#3481 ) ### What this PR does / why we need it? Update supported models ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: hfadzxy <starmoon_zhang@163.com>	2025-10-25 11:13:46 +08:00
Mengqing Cao	4604882a3e	[ReleaseNote] Release note of v0.10.0rc1 (#2225 ) ### What this PR does / why we need it? Release note of v0.10.0rc1 - vLLM version: v0.10.0 - vLLM main: `8e8e0b6af1` --------- Signed-off-by: MengqingCao <cmq0113@163.com>	2025-08-07 14:46:49 +08:00
zhangxinyuehfad	92eebc0c9b	[Doc] Update user guide for suported models (#2263 ) ### What this PR does / why we need it? Update user guide for supported models - vLLM version: v0.10.0 - vLLM main: `4be02a3776` --------- Signed-off-by: hfadzxy <starmoon_zhang@163.com>	2025-08-07 14:39:51 +08:00
Li Wang	bdfb065b5d	[1/2/N] Enable pymarkdown and python __init__ for lint system (#2011 ) ### What this PR does / why we need it? 1. Enable pymarkdown check 2. Enable python `__init__.py` check for vllm and vllm-ascend 3. Make clean code ### How was this patch tested? - vLLM version: v0.9.2 - vLLM main: `29c6fbe58c` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-07-25 22:16:10 +08:00
wangxiyuan	326dcf2576	[Doc] Update support feature (#1828 ) The feature support matrix is out of date. This PR refreshes the content. - vLLM version: v0.9.2 - vLLM main: `107111a859` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-07-23 15:19:15 +08:00
wangxiyuan	3d1e6a5929	[Doc] Update user doc index (#1581 ) Add a user doc index to make the user guide clearer - vLLM version: v0.9.1 - vLLM main: `49e8c7ea25` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-07-10 14:26:59 +08:00

42 Commits