Commit Graph

4 Commits

Author SHA1 Message Date
Li Wei
71bd70ad6c [Feature] support compressed-tensors w4a16 quantization (#154)
- native int4 kimi model inference is supported

Signed-off-by: Li Wei <liwei.109@outlook.com>
2026-01-27 19:56:22 +08:00
Li Wei
2a2d773ad0 [fix]bias bug in kunlun_scale_mm (#126) 2026-01-20 13:24:52 +08:00
Li Wei
8f56cbf3ed [refactor]update Kunlun classes with monkey patch (#122)
Signed-off-by: Li Wei <liwei.109@outlook.com>
2026-01-19 20:24:19 +08:00
Li Wei
1c1b84d78c [fix]update compressed-tensors scheme
Deepseek v3.2 is supported now

Signed-off-by: Li Wei <liwei.109@outlook.com>
2026-01-06 22:30:27 +08:00