26 Commits

Author SHA1 Message Date
Joeegin
171f664a0f [Doc] Update dependencies (#225)
Signed-off-by: Joeegin <3318329726@qq.com>
2026-03-02 10:50:12 +08:00
Xinyu Dong
d425a0d0e9 [Docs] Add vLLM-Kunlun New Model Adaptation Manual and Update Model Support (#211)
* [Docs] Fix app.readthedocs buliding

Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>

* [Docs] Add vLLM-Kunlun New Model Adaptation Manual and Update Model Support

Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>
2026-02-26 10:06:58 +08:00
Xinyu Dong
a470452871 [Docs] Fix app.readthedocs buliding (#210)
Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>
2026-02-17 16:17:25 +08:00
Li Wei
744719587e [Feature] Support glmx (#194)
Signed-off-by: Li Wei <liwei.109@outlook.com>
Co-authored-by: tangshiwen <tangshiwen@baidu.com>
Co-authored-by: Xinyu Dong <dongxinyu03@baidu.com>
2026-02-12 15:40:42 +08:00
WeiJie_Hong
9b1f25fbe3 [Doc] update xspeedgate_ops (20260130) (#188)
Signed-off-by: WeiJie_Hong <1462519292@qq.com>
2026-02-10 18:05:20 +08:00
WeiJie_Hong
42c7ef2f27 [Doc] add DeepSeek-V3.2-Exp-w8a8 to installation.md and tutorials (#186)
Signed-off-by: WeiJie_Hong <1462519292@qq.com>
2026-02-10 17:18:32 +08:00
WeiJie_Hong
d18df18499 [CI/Build] update .pre-commit-config.yaml && add _pylint.yml && update installation.md (#155)
Signed-off-by: WeiJie_Hong <1462519292@qq.com>
2026-01-28 17:58:46 +08:00
Li Wei
71bd70ad6c [Feature] support compressed-tensors w4a16 quantization (#154)
- native int4 kimi model inference is supported

Signed-off-by: Li Wei <liwei.109@outlook.com>
2026-01-27 19:56:22 +08:00
Shiwen Tang
0711c1abfa [Feature] Support AWQ MoE W4A16 Quantization (#142)
Signed-off-by: tangshiwen <tangshiwen@baidu.com>
Co-authored-by: Li Wei <liwei.109@outlook.com>
2026-01-26 18:56:05 +08:00
WeiJie_Hong
2a998286c0 [Doc] update base image url(1.Replace conda with uv; 2.Integrate xpytorch and ops into the image.) (#146)
Signed-off-by: WeiJie_Hong <1462519292@qq.com>
2026-01-23 18:55:56 +08:00
Lidang Jiang
9e13f23661 [Doc] Optimize the document (#136) 2026-01-22 14:12:44 +08:00
Joeegin
58f570ddea [Docs] Add XPU tutorials for Qwen / InternVL (#140)
Signed-off-by: Joeegin <3318329726@qq.com>
2026-01-22 13:50:49 +08:00
Xinyu Dong
7be26ca617 [Bugs] Fix Docs Build Problem (#97)
* [Bugs] Docs fixed

* Update contributing.md

* Update index.md

* fix lua to text

* fix title size
2026-01-10 05:55:40 +08:00
Xinyu Dong
462c44e2ac [Docs] Fix v0.11.0 Docs config 2026-01-09 17:07:18 +08:00
Li Wei
c403d921ff [doc] update quantization guide doc (#88) 2026-01-07 15:39:51 +08:00
Xinyu Dong
c46c46ef77 [Docs] Update torch and ops for mimo v2 2025-12-31 13:17:06 +08:00
WeiJie_Hong
341dc7f296 [Docs] Update base image path in Installation.md (#63) 2025-12-30 19:10:41 +08:00
hanhaowen
a4b9e92ca1 [Kernel] Replace native torch solve_tril by solve_tril_fwd kernel op 2025-12-22 17:37:19 +08:00
Xinyu Dong
911b886e9d [Docs] Update installation.md 2025-12-20 10:16:57 +08:00
Xinyu Dong
6b5740ad0a [Docs] Fix Docs 2025-12-16 16:04:29 +08:00
Xinyu Dong
8fb42b1c9a [Docs] Update installation.md 2025-12-16 14:49:12 +08:00
chenyili
7c22d621fb 提交vllm0.11.0开发分支 2025-12-10 17:51:24 +08:00
dongxinyu03
1b343812c9 [Doc] Update docs 2025-12-10 14:46:12 +08:00
dongxinyu03
a3d11f9b73 [Doc] Update docs 2025-12-10 14:26:37 +08:00
dongxinyu03
3762e6e3ab [Doc] Update docs 2025-12-10 14:16:10 +08:00
dongxinyu03
c728e52505 Initial commit for vLLM-Kunlun Plugin 2025-12-10 12:05:39 +08:00