xc-llm-kunlun

Author	SHA1	Message	Date
Xinyu Dong	bf9369f733	Migrate XTorch operations to Kunlun operations (accelerating iteration) (#177) Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>	2026-02-12 18:13:00 +08:00
fromck	fc48b79ae9	support glm4.7 mtp (#187) Signed-off-by: chengxiaokang <chengxiaokang@baidu.com> Co-authored-by: chengxiaokang <chengxiaokang@baidu.com>	2026-02-11 18:32:30 +08:00
hanhaowen	b015bb76fd	remove qwen2.py llama.py fix llama output	2025-12-31 11:39:37 +08:00
Xinyu Dong	b3c30a3cb9	[Feature] Support XiaoMi MIMO Flash V2 (#62) * [Feature] Support MIMO Flash V2	2025-12-31 10:16:33 +08:00
ldh2020	58c1db5073	[Bugfix] fix the bug of the flash_attention in Qwen3-Next	2025-12-21 10:34:43 +08:00
chenyili	7c22d621fb	Submit the vllm0.11.0 development branch	2025-12-10 17:51:24 +08:00
dongxinyu03	c728e52505	Initial commit for vLLM-Kunlun Plugin	2025-12-10 12:05:39 +08:00