This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX
/
xc-llm-kunlun
Watch
3
Star
0
Fork
0
You've already forked xc-llm-kunlun
Code
Issues
Pull Requests
Actions
Projects
Releases
Wiki
Activity
Files
77dbc2ddeb4aa884199291afc028c458cfdc9e30
xc-llm-kunlun
/
vllm_kunlun
/
ops
/
quantization
/
compressed_tensors
History
Xinyu Dong
bf9369f733
Migrate XTorch operations to Kunlun operations (accelerating iteration) (
#177
)
...
Signed-off-by: dongxinyu03 <
dongxinyu03@baidu.com
>
2026-02-12 18:13:00 +08:00
..
__init__.py
[Bugfix] fix can not import compressed_tensors (
#87
)
2026-01-07 11:32:10 +08:00
compressed_tensors_moe.py
Migrate XTorch operations to Kunlun operations (accelerating iteration) (
#177
)
2026-02-12 18:13:00 +08:00
compressed_tensors.py
[Feature] support compressed-tensors w4a16 quantization (
#154
)
2026-01-27 19:56:22 +08:00