xc-llm-ascend

Author SHA1 Message Date

Author	SHA1	Message	Date
wangxiyuan	b5b7e0ecc7	[Doc] Add qwen3 embedding 8b guide (#1734 ) 1. Add the tutorials for qwen3-embedding-8b 2. Remove VLLM_USE_V1=1 in docs, it's useless any more from 0.9.2 - vLLM version: v0.9.2 - vLLM main: `5923ab9524` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-07-11 17:40:17 +08:00
leo-pony	b4b19ea588	[Doc] Add multi-npu qwen3-MoE-32B Tutorials (#1419 ) Signed-off-by: leo-pony <nengjunma@outlook.com> ### What this PR does / why we need it? Add multi-npu qwen3-MoE-32B Tutorials Relate RFC: https://github.com/vllm-project/vllm-ascend/issues/1248 - vLLM version: v0.9.1 - vLLM main: `5358cce5ff` --------- Signed-off-by: leo-pony <nengjunma@outlook.com>	2025-07-10 09:06:51 +08:00

wangxiyuan

b5b7e0ecc7

[Doc] Add qwen3 embedding 8b guide (#1734 )

1. Add the tutorials for qwen3-embedding-8b
2. Remove VLLM_USE_V1=1  in docs, it's useless any more from 0.9.2


- vLLM version: v0.9.2
- vLLM main:
5923ab9524

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>

2025-07-11 17:40:17 +08:00

leo-pony

b4b19ea588

[Doc] Add multi-npu qwen3-MoE-32B Tutorials (#1419 )

Signed-off-by: leo-pony <nengjunma@outlook.com>

### What this PR does / why we need it?
Add multi-npu qwen3-MoE-32B Tutorials
Relate RFC: https://github.com/vllm-project/vllm-ascend/issues/1248
- vLLM version: v0.9.1
- vLLM main:
5358cce5ff

---------

Signed-off-by: leo-pony <nengjunma@outlook.com>

2025-07-10 09:06:51 +08:00

2 Commits