xc-llm-ascend/source at 052cc4e61bd7625d47f6e93ff03a272c0520be86

EngineX/xc-llm-ascend

Canlin Guo 052cc4e61b [Docs] Fix GLM-5 deploy command (#6711)

This pull request refines the GLM-5 deployment documentation: the Docker
run command now includes a more comprehensive set of device mappings, and
an extraneous quantization flag is removed from the `vllm serve` commands.
These changes correct and clarify the deployment instructions so that
users can set up and run the GLM-5 model as intended.


- vLLM version: v0.15.0
- vLLM main: 9562912cea

Signed-off-by: Canlin Guo <961750412@qq.com>

2026-02-12 08:55:48 +08:00

_templates/sections

[Doc] backport 0.13.0 release note (#6584)

2026-02-06 10:29:15 +08:00

[doc](cp) correct the prefill of GQA and adjust desc of block table. (#5697)

2026-01-19 18:53:48 +08:00

[main to main] upgrade main 0210 (#6673)

2026-02-11 18:10:14 +08:00

developer_guide

[main][Docs] Fix spelling errors across documentation (#6649)

2026-02-10 11:14:57 +08:00

locale/zh_CN/LC_MESSAGES

[main][Docs] Fix spelling errors across documentation (#6649)

2026-02-10 11:14:57 +08:00

[Doc] Add sphinx build for vllm-ascend (#55)

2025-02-13 18:44:17 +08:00

[Docs] Fix GLM-5 deploy command (#6711)

2026-02-12 08:55:48 +08:00

[npugraph_ex] enable npugraph_ex by default (#6664)

2026-02-12 08:44:06 +08:00

conf.py

[Main2Main][Deps][Misc] Upgrade vLLM to v0.15.0 (#6470)

2026-02-02 15:57:55 +08:00

faqs.md

[main][Docs] Fix spelling errors across documentation (#6649)

2026-02-10 11:14:57 +08:00

index.md

[Doc][Misc] Restructure tutorial documentation (#6501)

2026-02-10 15:03:35 +08:00

installation.md

[Doc][Misc] Restructure tutorial documentation (#6501)

2026-02-10 15:03:35 +08:00

quick_start.md

[Doc][Misc] Restructure tutorial documentation (#6501)

2026-02-10 15:03:35 +08:00