xc-llm-kunlun/quantization at c403d921ffb32790d2c0bc3628d154ce28f54ad3 - xc-llm-kunlun - Gitea: Git with a cup of tea

EngineX/xc-llm-kunlun

Files

History

baoqian426 eb40e8a07a [Bugfix] fix can not import compressed_tensors (#87 )

Co-authored-by: root <root@rdtest-node1150.bcc-zwlt.baidu.com>

2026-01-07 11:32:10 +08:00

..

compressed_tensors

[Bugfix] fix can not import compressed_tensors (#87 )

2026-01-07 11:32:10 +08:00

[fix]update compressed-tensors scheme

2026-01-06 22:30:27 +08:00

__init__.py

Initial commit for vLLM-Kunlun Plugin

2025-12-10 12:05:39 +08:00

awq.py

[dev] support AWQ/GPTQ quantization for dense models

2025-12-24 13:46:06 +08:00

gptq.py

[dev] support AWQ/GPTQ quantization for dense models

2025-12-24 13:46:06 +08:00