xc-llm-kunlun/vllm_kunlun at 6546323c71f419cfd74ade0e52050580ff5232ef - xc-llm-kunlun - Gitea: Git with a cup of tea

EngineX/xc-llm-kunlun

Files

History

Li Wei 6546323c71 [dev] support AWQ/GPTQ quantization for dense models

2025-12-24 13:46:06 +08:00

..

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

Initial commit for vLLM-Kunlun Plugin

2025-12-10 12:05:39 +08:00

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

[Kernel] Optimize the selection and update OP of ssm state

2025-12-21 15:45:32 +08:00

[dev] support AWQ/GPTQ quantization for dense models

2025-12-24 13:46:06 +08:00

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

Initial commit for vLLM-Kunlun Plugin

2025-12-10 12:05:39 +08:00

[Bugfix] fix the bug of the flash_attention in Qwen3-Next

2025-12-21 10:34:43 +08:00

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

__init__.py

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

utils.py

提交vllm0.11.0开发分支

2025-12-10 17:51:24 +08:00

vllm_utils_wrapper.py

[dev] support AWQ/GPTQ quantization for dense models

2025-12-24 13:46:06 +08:00