6 Commits

Author SHA1 Message Date
tanjunchen
1e1e870a71 update ci workflow (#159)
Signed-off-by: tanjunchen <tanjunchen20@gmail.com>
2026-01-28 20:28:38 +08:00
1916hcc
c0f06d04b1 [Doc] docs: remove internal pip index from requirements (#147)
Signed-off-by: Chenchao Hu <huchenchao@example.com>
Co-authored-by: Chenchao Hu <huchenchao@example.com>
2026-01-23 18:55:34 +08:00
Li Wei
515a4eeda9 [dev] support compressed-tensors w8a8 quantization (#75)
* [dev] support compressed-tensors w8a8 quantization

Co-authored-by: Li Wei <liwei.109@outlook.com>

* [refact]update KunlunScaleMMKernel impl

* [rebase]resolve conflicts and remove redundant code

---------

Co-authored-by: tangshiwen <tangshiwen@baidu.com>
2026-01-06 13:51:53 +08:00
Li Wei
6546323c71 [dev] support AWQ/GPTQ quantization for dense models 2025-12-24 13:46:06 +08:00
chenyili
7c22d621fb Commit the vllm0.11.0 development branch 2025-12-10 17:51:24 +08:00
dongxinyu03
c728e52505 Initial commit for vLLM-Kunlun Plugin 2025-12-10 12:05:39 +08:00