Joeegin
171f664a0f
[Doc] Update dependencies ( #225 )
...
Signed-off-by: Joeegin <3318329726@qq.com >
2026-03-02 10:50:12 +08:00
Xinyu Dong
d425a0d0e9
[Docs] Add vLLM-Kunlun New Model Adaptation Manual and Update Model Support ( #211 )
...
* [Docs] Fix app.readthedocs buliding
Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com >
* [Docs] Add vLLM-Kunlun New Model Adaptation Manual and Update Model Support
Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com >
2026-02-26 10:06:58 +08:00
Xinyu Dong
a470452871
[Docs] Fix app.readthedocs buliding ( #210 )
...
Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com >
2026-02-17 16:17:25 +08:00
Li Wei
744719587e
[Feature] Support glmx ( #194 )
...
Signed-off-by: Li Wei <liwei.109@outlook.com >
Co-authored-by: tangshiwen <tangshiwen@baidu.com >
Co-authored-by: Xinyu Dong <dongxinyu03@baidu.com >
2026-02-12 15:40:42 +08:00
WeiJie_Hong
9b1f25fbe3
[Doc] update xspeedgate_ops (20260130) ( #188 )
...
Signed-off-by: WeiJie_Hong <1462519292@qq.com >
2026-02-10 18:05:20 +08:00
WeiJie_Hong
42c7ef2f27
[Doc] add DeepSeek-V3.2-Exp-w8a8 to installation.md and tutorials ( #186 )
...
Signed-off-by: WeiJie_Hong <1462519292@qq.com >
2026-02-10 17:18:32 +08:00
WeiJie_Hong
d18df18499
[CI/Build] update .pre-commit-config.yaml && add _pylint.yml && update installation.md ( #155 )
...
Signed-off-by: WeiJie_Hong <1462519292@qq.com >
2026-01-28 17:58:46 +08:00
Li Wei
71bd70ad6c
[Feature] support compressed-tensors w4a16 quantization ( #154 )
...
- native int4 kimi model inference is supported
Signed-off-by: Li Wei <liwei.109@outlook.com >
2026-01-27 19:56:22 +08:00
Shiwen Tang
0711c1abfa
[Feature] Support AWQ MoE W4A16 Quantization ( #142 )
...
Signed-off-by: tangshiwen <tangshiwen@baidu.com >
Co-authored-by: Li Wei <liwei.109@outlook.com >
2026-01-26 18:56:05 +08:00
WeiJie_Hong
2a998286c0
[Doc] update base image url(1.Replace conda with uv; 2.Integrate xpytorch and ops into the image.) ( #146 )
...
Signed-off-by: WeiJie_Hong <1462519292@qq.com >
2026-01-23 18:55:56 +08:00
Lidang Jiang
9e13f23661
[Doc] Optimize the document ( #136 )
2026-01-22 14:12:44 +08:00
Joeegin
58f570ddea
[Docs] Add XPU tutorials for Qwen / InternVL ( #140 )
...
Signed-off-by: Joeegin <3318329726@qq.com >
2026-01-22 13:50:49 +08:00
Xinyu Dong
7be26ca617
[Bugs] Fix Docs Build Problem ( #97 )
...
* [Bugs] Docs fixed
* Update contributing.md
* Update index.md
* fix lua to text
* fix title size
2026-01-10 05:55:40 +08:00
Xinyu Dong
462c44e2ac
[Docs] Fix v0.11.0 Docs config
2026-01-09 17:07:18 +08:00
Li Wei
c403d921ff
[doc] update quantization guide doc ( #88 )
2026-01-07 15:39:51 +08:00
Xinyu Dong
c46c46ef77
[Docs] Update torch and ops for mimo v2
2025-12-31 13:17:06 +08:00
WeiJie_Hong
341dc7f296
[Docs] Update base image path in Installation.md ( #63 )
2025-12-30 19:10:41 +08:00
tanjunchen
8c23a955a4
update readme.md
...
Signed-off-by: tanjunchen <tanjunchen20@gmail.com >
2025-12-29 21:21:10 +08:00
hanhaowen
a4b9e92ca1
[Kernel] Replace native torch solve_tril by solve_tril_fwd kernel op
2025-12-22 17:37:19 +08:00
Xinyu Dong
911b886e9d
[Docs] Update installation.md
2025-12-20 10:16:57 +08:00
Xinyu Dong
6b5740ad0a
[Docs] Fix Docs
2025-12-16 16:04:29 +08:00
Xinyu Dong
8fb42b1c9a
[Docs] Update installation.md
2025-12-16 14:49:12 +08:00
chenyili
7c22d621fb
提交vllm0.11.0开发分支
2025-12-10 17:51:24 +08:00
dongxinyu03
1b343812c9
[Doc] Update docs
2025-12-10 14:46:12 +08:00
dongxinyu03
a3d11f9b73
[Doc] Update docs
2025-12-10 14:26:37 +08:00
dongxinyu03
3762e6e3ab
[Doc] Update docs
2025-12-10 14:16:10 +08:00
dongxinyu03
c728e52505
Initial commit for vLLM-Kunlun Plugin
2025-12-10 12:05:39 +08:00