[dev] support compressed-tensors w8a8 quantization (#75)

* [dev] support compressed-tensors w8a8 quantization

Co-authored-by: Li Wei <liwei.109@outlook.com>

* [refact] update KunlunScaleMMKernel impl

* [rebase] resolve conflicts and remove redundant code

---------

Co-authored-by: tangshiwen <tangshiwen@baidu.com>
2026-01-06 13:51:53 +08:00
parent ee0f50e68f
commit 515a4eeda9
8 changed files with 952 additions and 523 deletions
--- a/requirements.txt
+++ b/requirements.txt
@@ -9,7 +9,7 @@ blake3==1.0.5
 cachetools==6.1.0
 cbor2==5.7.0
 cloudpickle==3.1.1
-compressed-tensors==0.11.0
+compressed-tensors==0.13.0
 diskcache==5.6.3
 gguf==0.17.1
 mistral_common==1.8.3