xc-llm-kunlun/vllm_kunlun at main - xc-llm-kunlun - Gitea: Git with a cup of tea

EngineX/xc-llm-kunlun

Files

History

starkwj 34e04c5569 update base image

2026-03-02 18:46:04 +08:00

..

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

[Feature] Support glmx (#194 )

2026-02-12 15:40:42 +08:00

add vxpu

2026-03-02 18:38:10 +08:00

device_allocator

add vxpu

2026-03-02 18:38:10 +08:00

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

entrypoints/openai

[dev]add glm4.7 tool-parser (#151 )

2026-02-01 13:53:47 +08:00

Further optimize multi-lora inference,LoRA-enabled performance achieves 80%+ of non-LoRA performance (#190 )

2026-02-11 12:04:14 +08:00

[Feature] Merge branch 'Qwen3-Next' into main && Support Qwen-next (#222 )

2026-02-28 11:15:50 +08:00

[Bugfix] cocopod ops can't be finded (#242 )

2026-03-02 15:49:24 +08:00

add vxpu

2026-03-02 18:38:10 +08:00

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

update base image

2026-03-02 18:46:04 +08:00

Initial commit for vLLM-Kunlun Plugin

2025-12-10 12:05:39 +08:00

transformer_utils

[BugFix] Adapt GLM5 config for transformers 4.57 (#207 )

2026-02-25 18:47:26 +08:00

add vxpu

2026-03-02 18:38:10 +08:00

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

__init__.py

add vxpu

2026-03-02 18:38:10 +08:00

utils.py

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

vllm_utils_wrapper.py

[Feature] Merge branch 'Qwen3-Next' into main && Support Qwen-next (#222 )

2026-02-28 11:15:50 +08:00