10 Commits

Author SHA1 Message Date
Xinyu Dong
bf9369f733 Migrate XTorch operations to Kunlun operations (accelerating iteration) (#177)
Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>
2026-02-12 18:13:00 +08:00
yuqilinaa
c9f00c132c [Kernel] Enable fast random sample on Kunlun3 Platform with generators (#73)
Co-authored-by: Xinyu Dong <dongxinyu03@baidu.com>
2026-01-20 21:49:33 +08:00
Jin Hanyu
bd90350968 [Bug] Fix no apply_top_k_top_p issue. (#101) 2026-01-12 16:38:03 +08:00
baoqian426
ee0f50e68f [Feature] support deepseek v3/r1/v3.2 (#78)
* [Feature] support deepseek v3/r1/v3.2

* fix gpt_oss

* update readme

* update readme

---------

Co-authored-by: hanhaowen <hanhaowen@baidu.com>
2026-01-05 22:55:35 +08:00
chenyili0619
2e2933d217 [Bug] Fixed the issue where an error occurred when the request included a seed. 2025-12-18 13:03:34 +08:00
baoqian426
fae22c2e62 Merge pull request #3 from xyDong0223/main
[Kernel] Enable fast random sample on Kunlun3 Platform
2025-12-11 11:47:30 +08:00
xyDong0223
af2cd6097f [Kernel] Fix missing import os 2025-12-11 11:17:28 +08:00
xyDong0223
670c2397b8 [Kernel] Enable fast random sample on Kunlun P 2025-12-10 21:52:48 +08:00
chenyili
7c22d621fb Commit the vLLM 0.11.0 development branch 2025-12-10 17:51:24 +08:00
dongxinyu03
c728e52505 Initial commit for vLLM-Kunlun Plugin 2025-12-10 12:05:39 +08:00