[dev] support compressed-tensors w8a8 quantization (#75)
* [dev] support compressed-tensors w8a8 quantization Co-authored-by: Li Wei <liwei.109@outlook.com> * [refact]update KunlunScaleMMKernel impl * [rebase]resolve conflicts and remove redundant code --------- Co-authored-by: tangshiwen <tangshiwen@baidu.com>
This commit is contained in:
@@ -9,7 +9,7 @@ blake3==1.0.5
|
||||
cachetools==6.1.0
|
||||
cbor2==5.7.0
|
||||
cloudpickle==3.1.1
|
||||
compressed-tensors==0.11.0
|
||||
compressed-tensors==0.13.0
|
||||
diskcache==5.6.3
|
||||
gguf==0.17.1
|
||||
mistral_common==1.8.3
|
||||
|
||||
Reference in New Issue
Block a user