Shanshan Shen 11ecbfdb31 [Doc] Update FAQ doc (#504)
### What this PR does / why we need it?
Update FAQ doc.
---------

Signed-off-by: shen-shanshan <467638484@qq.com>
2025-04-14 11:11:40 +08:00

FAQs

Version Specific FAQs

General FAQs

1. What devices are currently supported?

Currently, ONLY the Atlas A2 series (Ascend-cann-kernels-910b) is supported:

  • Atlas A2 Training series (Atlas 800T A2, Atlas 900 A2 PoD, Atlas 200T A2 Box16, Atlas 300T A2)
  • Atlas 800I A2 Inference series (Atlas 800I A2)

The following series are NOT supported yet:

  • Atlas 300I Duo, Atlas 300I Pro (Ascend-cann-kernels-310p): might be supported in 2025 Q2
  • Atlas 200I A2 (Ascend-cann-kernels-310b): not planned yet
  • Ascend 910, Ascend 910 Pro B (Ascend-cann-kernels-910): not planned yet

From a technical point of view, vllm-ascend can support a device as long as torch-npu supports it. Otherwise, the missing functionality has to be implemented with custom ops. You are welcome to join us and help improve device support.

2. How to get our docker containers?

You can get our containers at Quay.io, e.g., vllm-ascend and cann.

If you are in China, you can use the DaoCloud mirror to speed up downloads:

  1. Open daemon.json:
     vi /etc/docker/daemon.json
  2. Add https://docker.m.daocloud.io to registry-mirrors:
     {
       "registry-mirrors": [
         "https://docker.m.daocloud.io"
       ]
     }
  3. Restart your docker service:
     sudo systemctl daemon-reload
     sudo systemctl restart docker
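
If daemon.json already contains other settings, pasting the snippet above over the file would discard them. The following is a minimal Python sketch, not part of vllm-ascend, that merges the mirror into existing configuration text instead. The helper name add_registry_mirror is hypothetical, and the sketch only transforms text; writing the result back to /etc/docker/daemon.json and restarting docker are still up to you.

```python
import json

# Mirror URL taken from the FAQ steps above.
MIRROR = "https://docker.m.daocloud.io"


def add_registry_mirror(daemon_json_text: str, mirror: str = MIRROR) -> str:
    """Return daemon.json text with `mirror` listed under registry-mirrors.

    Hypothetical helper for illustration: existing keys are preserved and
    the mirror is not added twice.
    """
    # An empty file is treated as an empty configuration.
    config = json.loads(daemon_json_text) if daemon_json_text.strip() else {}
    mirrors = config.setdefault("registry-mirrors", [])
    if mirror not in mirrors:
        mirrors.append(mirror)
    return json.dumps(config, indent=2)


# Example: a file that already sets another option keeps that option.
existing = '{"log-driver": "json-file"}'
print(add_registry_mirror(existing))
```

Because the function is idempotent, running it again on its own output leaves the configuration unchanged.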

After configuration, you can download our container from m.daocloud.io/quay.io/ascend/vllm-ascend:v0.7.3rc2.
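
The mirrored address above is the original Quay.io reference with m.daocloud.io/ placed in front of it. As a small sketch, assuming that prefix pattern holds for other tags as well (the FAQ only shows this one example), a hypothetical helper mirrored_image can build the reference:

```python
# Prefix taken from the example address in the FAQ; whether it applies to
# every image and tag is an assumption.
DAOCLOUD_PREFIX = "m.daocloud.io"


def mirrored_image(image: str, prefix: str = DAOCLOUD_PREFIX) -> str:
    """Prepend the mirror host to an image reference (hypothetical helper)."""
    # Leave references that already point at the mirror untouched.
    if image.startswith(prefix + "/"):
        return image
    return f"{prefix}/{image}"


print(mirrored_image("quay.io/ascend/vllm-ascend:v0.7.3rc2"))
# prints: m.daocloud.io/quay.io/ascend/vllm-ascend:v0.7.3rc2
```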

3. What models does vllm-ascend support?

Currently, Qwen, Deepseek (V0 only), and Llama models are fully tested and supported; other models we have tested are shown here. According to user feedback, gemma3 and glm4 are not supported yet. More models still need to be tested.

4. How to get in touch with our community?

There are several channels through which you can communicate with our community developers and users:

  • Submit a GitHub issue.
  • Join our weekly meeting and share your ideas.
  • Join our WeChat group and ask your questions.
  • Join our ascend channel in vLLM forums and publish your topics.

5. What features does vllm-ascend V1 support?

Find more details here.