Xinyu Dong
|
b3c30a3cb9
|
[Feature] Support XiaoMi MIMO Flash V2 (#62)
* [Feature] Support MIMO Flash V2
|
2025-12-31 10:16:33 +08:00 |
|
ldh2020
|
58c1db5073
|
[Bugfix] Fix the flash_attention bug in Qwen3-Next
|
2025-12-21 10:34:43 +08:00 |
|
chenyili0619
|
2e2933d217
|
[Bug] Fixed the issue where an error occurred when the request included a seed.
|
2025-12-18 13:03:34 +08:00 |
|
baoqian426
|
fae22c2e62
|
Merge pull request #3 from xyDong0223/main
[Kernel] Enable fast random sample on Kunlun3 Platform
|
2025-12-11 11:47:30 +08:00 |
|
xyDong0223
|
af2cd6097f
|
[Kernel] Fix missing import of os
|
2025-12-11 11:17:28 +08:00 |
|
xyDong0223
|
670c2397b8
|
[Kernel] Enable fast random sample on Kunlun P
|
2025-12-10 21:52:48 +08:00 |
|
chenyili
|
7c22d621fb
|
Commit the vLLM 0.11.0 development branch
|
2025-12-10 17:51:24 +08:00 |
|
dongxinyu03
|
c728e52505
|
Initial commit for vLLM-Kunlun Plugin
|
2025-12-10 12:05:39 +08:00 |
|