xc-llm-ascend

Author	SHA1	Message	Date
Angazenn	eab306b09c	[doc] Update Qwen3-235B doc for reproducing latest performance (#5323 ) ### What this PR does / why we need it? This PR updates the Qwen3-235B doc to give a simple recipe for reproducing our latest performance on Atlas A3 servers. - vLLM version: release/v0.13.0 - vLLM main: `5fbfa8d9ef` --------- Signed-off-by: Angazenn <supperccell@163.com>	2025-12-27 15:55:58 +08:00
rongfu.leng	c9b5881bcd	[Doc] fix docs set rope_theta value is 10e6 in qwen3-235b model (#5258 ) ### What this PR does / why we need it? Fixes https://github.com/vllm-project/vllm-ascend/issues/5201 ### Does this PR introduce _any_ user-facing change? No, doc only ### How was this patch tested? - vLLM version: release/v0.13.0 - vLLM main: `ad32e3e19c` Signed-off-by: rongfu.leng <lenronfu@gmail.com>	2025-12-23 10:21:46 +08:00
wangxiyuan	e538fa6f9c	[Doc] Update tutorial index (#4920 ) Update tutorial index and remove useless doc - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-12-11 20:53:13 +08:00
xuyexiong	193dc1703f	[Doc] Add Qwen3-235B tutorial (#4358 ) ### What this PR does / why we need it? Add Qwen3-235B tutorial including the following examples - Single-node Online Deployment for 128k context inference - Multi-node Deployment with MP - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: xuyexiong <xuyexiong@huawei.com> Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-12-08 20:06:46 +08:00