提交vllm0.11.0开发分支

This commit is contained in:
chenyili
2025-12-10 17:51:24 +08:00
parent deab7dd0b6
commit 7c22d621fb
175 changed files with 31856 additions and 8683 deletions

211
README.md
View File

@@ -1,14 +1,18 @@
![vLLM Kunlun Logo](vllm_kunlun/patches/vLLM_Kunlun.jpg)
<p align="center">
<a href="https://vllm-kunlun.readthedocs.io/en/latest/"><b>Documentation</b></a> |
<a href="https://join.slack.com/t/vllm-kunlun/shared_invite/zt-3iinb8u5z-FcqZKbNNdMJ_32fHmipzvw"><b>slack</b></a> |
<a href="./docs/_build/html/documentation.html"><b>Documentation</b></a> |
<a href=""><b>Users Forum</b></a> |
<a href="join.slack.com/t/vllm-kunlun/shared_invite/zt-3iinb8u5z-FcqZKbNNdMJ_32fHmipzvwjoin.slack.com/t/vllm-kunlun/shared_invite/zt-3iinb8u5z-FcqZKbNNdMJ_32fHmipzvw"><b>slack</b></a> |
</p>
---
## Latest News🔥
- [2025/12] Initial release of vLLM Kunlun
- [2025/11]
- [2025/11]
- [2025/11]
- [2025/11] Initial release of vLLM Kunlun
---
@@ -30,28 +34,107 @@ By utilizing the vLLM Kunlun plugin, popular open-source models, including Trans
---
## Supported Models
<style>
table {
width: 100%;
border-collapse: collapse;
background: white;
margin: 20px 0;
box-shadow: 0 2px 8px rgba(0, 0, 0, 0.08);
border-radius: 8px;
overflow: hidden;
}
th {
background: linear-gradient(135deg, #0E7DC6 0%, #0A5BA8 100%);
color: white;
padding: 14px 12px;
text-align: left;
font-weight: 600;
font-size: 13px;
letter-spacing: 0.5px;
border: none;
}
td {
padding: 12px;
border-bottom: 1px solid #e8e8e8;
font-size: 13px;
color: #333;
}
tr:last-child td {
border-bottom: none;
}
tbody tr {
transition: background-color 0.2s ease;
}
tbody tr:hover {
background-color: #f5faff;
}
tbody tr:nth-child(even) {
background-color: #fafbfc;
}
tbody tr:nth-child(even):hover {
background-color: #f0f7fc;
}
.status-support {
color: #22863a;
font-weight: 600;
font-size: 14px;
}
.status-progress {
color: #f6a909;
font-weight: 600;
font-size: 14px;
}
.status-coming {
color: #999;
font-size: 12px;
background-color: #f5f5f5;
padding: 2px 6px;
border-radius: 3px;
display: inline-block;
}
.model-name {
font-weight: 500;
color: #1e40af;
}
h3 {
color: #1e40af;
font-size: 16px;
margin-top: 30px;
margin-bottom: 15px;
font-weight: 600;
}
h3:first-of-type {
margin-top: 0;
}
</style>
<h3>Generaltive Models</h3>
<table>
<thead>
<tr>
<th width="23%">Model</th>
<th width="20%">Model</th>
<th width="12%">Support</th>
<th width="15%">Quantization</th>
<th width="10%">LoRA</th>
<th width="20%">Piecewise Kunlun Graph</th>
<th width="20%">Note</th>
<th width="23%">Note</th>
</tr>
</thead>
<tbody>
<tr>
<td class="model-name">Qwen2/2.5</td>
<td class="status-support"></td>
<td></td>
<td class="status-support"></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">Qwen3</td>
<td class="status-support"></td>
@@ -61,7 +144,7 @@ By utilizing the vLLM Kunlun plugin, popular open-source models, including Trans
<td></td>
</tr>
<tr>
<td class="model-name">Qwen3-Moe/Coder</td>
<td class="model-name">Qwen3-Moe</td>
<td class="status-support"></td>
<td class="status-support"></td>
<td class="status-support"></td>
@@ -69,53 +152,13 @@ By utilizing the vLLM Kunlun plugin, popular open-source models, including Trans
<td></td>
</tr>
<tr>
<td class="model-name">QwQ-32B</td>
<td class="status-support"></td>
<td></td>
<td></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">LLama2/3/3.1</td>
<td class="status-support"></td>
<td></td>
<td></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">GLM-4.5/Air</td>
<td class="model-name">Qwen3-Next</td>
<td class="status-support"></td>
<td class="status-support"></td>
<td class="status-support"></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">Qwen3next</td>
<td class="status-progress">⚠️</td>
<td></td>
<td></td>
<td></td>
<td><span class="status-coming">comming soon</span></td>
</tr>
<tr>
<td class="model-name">Gpt oss</td>
<td class="status-progress">⚠️</td>
<td></td>
<td></td>
<td></td>
<td><span class="status-coming">comming soon</span></td>
</tr>
<tr>
<td class="model-name">Deepseek v3/3.2</td>
<td class="status-progress">⚠️</td>
<td></td>
<td></td>
<td></td>
<td><span class="status-coming">comming soon</span></td>
</tr>
</tbody>
</table>
@@ -133,61 +176,13 @@ By utilizing the vLLM Kunlun plugin, popular open-source models, including Trans
</thead>
<tbody>
<tr>
<td class="model-name">Qianfan-VL</td>
<td class="model-name">Qwen3-VL</td>
<td class="status-support"></td>
<td></td>
<td></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">Qwen2.5VL</td>
<td class="status-support"></td>
<td></td>
<td></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">InternVL2.5/3/3.5</td>
<td class="status-support"></td>
<td></td>
<td></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">InternVL3.5</td>
<td class="status-support"></td>
<td></td>
<td></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">InternS1</td>
<td class="status-support"></td>
<td></td>
<td></td>
<td class="status-support"></td>
<td></td>
</tr>
<tr>
<td class="model-name">Qwen2.5 omini</td>
<td class="status-progress">⚠️</td>
<td></td>
<td></td>
<td></td>
<td><span class="status-coming">comming soon</span></td>
</tr>
<tr>
<td class="model-name">Qwen3vl</td>
<td class="status-progress">⚠️</td>
<td></td>
<td></td>
<td></td>
<td><span class="status-coming">comming soon</span></td>
</tr>
</tbody>
</table>
@@ -207,17 +202,17 @@ Please use the following recommended versions to get started quickly:
| Version | Release type | Doc |
|----------|---------------|-----|
| v0.10.1.1 | Latest stable version | [QuickStart](https://vllm-kunlun.readthedocs.io/en/latest/quick_start.html) and [Installation](https://vllm-kunlun.readthedocs.io/en/latest/installation.html) for more details |
| v0.11.0 | Latest stable version | [QuickStart](./docs/_build/html/quick_start.html) and [Installation](./docs/_build/html/installation.html) for more details |
---
## Contributing
See [CONTRIBUTING](https://vllm-kunlun.readthedocs.io/en/latest/developer_guide/contribution/index.html) for more details, which is a step-by-step guide to help you set up the development environment, build, and test.
See [CONTRIBUTING]() for more details, which is a step-by-step guide to help you set up the development environment, build, and test.
We welcome and value any contributions and collaborations:
- Open an [Issue](https://github.com/baidu/vLLM-Kunlun/issues) if you find a bug or have a feature request
- Open an [Issue]() if you find a bug or have a feature request
## License
Apache License 2.0, as found in the [LICENSE](https://github.com/baidu/vLLM-Kunlun/blob/main/LICENSE.txt) file.
Apache License 2.0, as found in the [LICENSE](./LICENSE) file.