1. Add tutorials for qwen3-embedding-8b.
2. Remove VLLM_USE_V1=1 from the docs; it is no longer needed as of 0.9.2.

- vLLM version: v0.9.2
- vLLM main: 5923ab9524

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
5923ab9524
### What this PR does / why we need it?

Add multi-NPU qwen3-MoE-32B tutorials.

Related RFC: https://github.com/vllm-project/vllm-ascend/issues/1248

- vLLM version: v0.9.1
- vLLM main: 5358cce5ff

---------

Signed-off-by: leo-pony <nengjunma@outlook.com>
5358cce5ff