xc-llm-ascend

Author	SHA1	Message	Date
Shanshan Shen	e3eefdecbd	[Doc] Update `max_tokens` to `max_completion_tokens` in all docs (#6248 ) ### What this PR does / why we need it? Fix: ``` DeprecationWarning: max_tokens is deprecated in favor of the max_completion_tokens field. ``` - vLLM version: v0.14.1 - vLLM main: `d68209402d` Signed-off-by: shen-shanshan <467638484@qq.com>	2026-01-26 11:57:40 +08:00
SILONG ZENG	4811ba62e0	[Lint]Style: reformat markdown files via markdownlint (#5884 ) ### What this PR does / why we need it? reformat markdown files via markdownlint - vLLM version: v0.13.0 - vLLM main: `bde38c11df` --------- Signed-off-by: root <root@LAPTOP-VQKDDVMG.localdomain> Signed-off-by: MrZ20 <2609716663@qq.com> Co-authored-by: root <root@LAPTOP-VQKDDVMG.localdomain>	2026-01-15 09:06:01 +08:00
huqi	2d22700d69	Docs: Add A3 Docker image guidance for Atlas A3 machines (#5256 ) Fixes #3386 - Update Qwen3-30B-A3B.md to use A3-specific image tag - Update Qwen3-Dense.md to provide both A2 and A3 image options - Update Qwen3-Next.md to use A3-specific image for Atlas A3 environments Previously, documentation only mentioned A2 images (vllm-ascend:version) but Atlas A3 machines require A3-specific images (vllm-ascend:version-a3). This change ensures users select the correct image for their hardware. 🤖 Generated with [Claude Code](https://claude.com/claude-code) - vLLM version: release/v0.13.0 - vLLM main: `ad32e3e19c` Signed-off-by: hu-qi <huqi1024@gmail.com> Co-authored-by: Claude <noreply@anthropic.com>	2026-01-05 19:42:42 +08:00
wangxiyuan	2ae0bad96d	Remove VLLM_ASCEND_ENABLE_DENSE_OPTIMIZE (#5272 ) `VLLM_ASCEND_ENABLE_DENSE_OPTIMIZE` is only used together with `VLLM_ASCEND_ENABLE_PREFETCH_MLP` which is useless totally. This PR remove it. - vLLM version: release/v0.13.0 - vLLM main: `ad32e3e19c` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-12-25 11:09:56 +08:00
ZYang6263	a3f65b938f	[Doc] Add pa_shape_list description to qwen dense tutorial (#5225 ) ### What this PR does / why we need it? Add pa_shape_list description to qwen dense tutorial. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: release/v0.13.0 - vLLM main: `ad32e3e19c` Signed-off-by: ZYang6263 <zy626375@gmail.com> Co-authored-by: zzzzwwjj <34335947+zzzzwwjj@users.noreply.github.com>	2025-12-24 14:40:20 +08:00
wangxiyuan	e538fa6f9c	[Doc] Update tutorial index (#4920 ) Update tutorial index and remove useless doc - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-12-11 20:53:13 +08:00
wangxiyuan	c77dca54b2	[CI] fix lint (#4888 ) Fix lint CI error Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-12-10 16:57:24 +08:00
wind-all	1a443f2772	add multi_npu_qwen3_dense tutorials (#4543 ) ### What this PR does / why we need it? This PR adds tutorials for the Qwen3-Dense series models, including the A2 and A3 series, and provides accuracy validation results. - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: wind-all <anyuting@h-partners.com>	2025-12-10 16:09:56 +08:00

8 Commits