277 Commits

Author SHA1 Message Date
yudian0504
81372f3bef Fix fused_moe triton configs (#7029) 2025-06-09 23:23:03 -07:00
Wenxuan Tan
a968c888c0 Fix torchvision version for Blackwell (#7015) 2025-06-09 15:50:19 -07:00
Yineng Zhang
56ccd3c22c chore: upgrade flashinfer v0.2.6.post1 jit (#6958)
Co-authored-by: alcanderian <alcanderian@gmail.com>
Co-authored-by: Qiaolin Yu <qy254@cornell.edu>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>
Co-authored-by: Mick <mickjagger19@icloud.com>
Co-authored-by: ispobock <ispobaoke@gmail.com>
2025-06-09 09:22:39 -07:00
Yineng Zhang
23881fa60c chore: upgrade sgl-kernel v0.1.6.post1 (#6957) 2025-06-07 17:18:55 -07:00
JieXin Liang
6153f2ff6e chore: upgrade sgl-kernel v0.1.6 (#6945) 2025-06-07 02:53:26 -07:00
Zaili Wang
562f279a2d [CPU] enable CI for PRs, add Dockerfile and auto build task (#6458)
Co-authored-by: diwei sun <diwei.sun@intel.com>
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-06-05 13:43:54 -07:00
Yineng Zhang
34c63731fc chore: upgrade sgl-kernel v0.1.5 (#6795) 2025-05-31 18:32:00 -07:00
Qiaolin Yu
7dc0e39442 Bump torch to 2.7.0 (#6788) 2025-05-31 14:43:12 -07:00
Yineng Zhang
7eb9d8e594 chore: upgrade transformers 4.52.3 (#6575)
Co-authored-by: Mick <mickjagger19@icloud.com>
2025-05-25 22:49:58 -07:00
Lifu Huang
022012aae8 Support Phi-4 Multi-Modal (text + vision only) (#6494) 2025-05-24 21:43:38 -07:00
Yineng Zhang
7e257cd666 chore: bump v0.4.6.post5 (#6566) 2025-05-24 00:48:05 -07:00
Yineng Zhang
0b07c4a99f chore: upgrade sgl-kernel v0.1.4 (#6532) 2025-05-22 13:28:16 -07:00
Trevor Morris
7adf245ba2 [Metrics] Add KV events publishing (#6098) 2025-05-19 14:19:54 -07:00
Yineng Zhang
f07c6a009b chore: upgrade sgl-kernel v0.1.3 (#6377) 2025-05-17 19:47:05 -07:00
Lianmin Zheng
4bb816d444 Fix CI tests (#6362) 2025-05-17 19:16:45 -07:00
Lianmin Zheng
dcc0a45618 Fix amd ci (#6360) 2025-05-16 15:33:10 -07:00
Baizhou Zhang
839fb31e5f [Fix] Improve dependencies for Blackwell image (#6334) 2025-05-16 12:38:22 -07:00
Lianmin Zheng
e07a6977e7 Minor improvements of TokenizerManager / health check (#6327) 2025-05-15 15:29:25 -07:00
Yineng Zhang
16267d4fa7 chore: bump v0.4.6.post4 (#6245) 2025-05-13 01:57:51 -07:00
Stefan He
1ab14c4c5c [VERL Use Case] Add torch_memory_saver into deps (#6247) 2025-05-12 19:09:03 -07:00
Lianmin Zheng
e8e18dcdcc Revert "fix some typos" (#6244) 2025-05-12 12:53:26 -07:00
applesaucethebun
d738ab52f8 fix some typos (#6209)
Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>
2025-05-13 01:42:38 +08:00
Yineng Zhang
230106304d chore: upgrade sgl-kernel v0.1.2.post1 (#6196)
Co-authored-by: alcanderian <alcanderian@gmail.com>
2025-05-11 22:41:37 +08:00
applesaucethebun
2ce8793519 Add typo checker in pre-commit (#6179)
Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca>
2025-05-11 12:55:00 +08:00
Yineng Zhang
678d8cc987 chore: bump v0.4.6.post3 (#6165) 2025-05-09 15:38:47 -07:00
Yixin Dong
911f3ba6f4 upgrade xgrammar to 0.1.19 (#6129) 2025-05-08 14:42:02 -07:00
JieXin Liang
f1ff736d68 [fix] fix pyproject.toml dependencies (#6119) 2025-05-08 02:14:36 -07:00
Song Zhang
00c2c1f08b [Feature] Support for Ascend NPU backend (#3853)
Signed-off-by: Song Zhang <gepin.zs@antgroup.com>
Co-authored-by: 22dimensions <waitingwind@foxmail.com>
2025-05-06 20:32:53 -07:00
Yineng Zhang
9858113c33 chore: bump v0.4.6.post2 (#5939) 2025-04-30 22:04:40 -07:00
Yineng Zhang
9a6ad8916d chore: upgrade sgl-kernel 0.1.1 (#5933) 2025-04-30 16:13:30 -07:00
liwenju0
8fefdd32c7 [Feature] add support kimi vl model (#5383)
Co-authored-by: wenju.li <wenju.li@deepctr.cn>
2025-04-29 21:31:19 -07:00
Baizhou Zhang
799789afed Bump Flashinfer to 0.2.5 (#5870)
Co-authored-by: Yuhao Chen <yxckeis8@gmail.com>
2025-04-29 19:50:57 -07:00
Yineng Zhang
dcae1fb2cd chore: bump v0.4.6.post1 (#5845) 2025-04-28 12:57:08 -07:00
Yineng Zhang
41ac0c6d48 chore: upgrade sgl-kernel 0.1.0 (#5690) 2025-04-27 21:00:50 -07:00
Baizhou Zhang
84022c0e56 Release v0.4.6 (#5795) 2025-04-27 14:07:05 -07:00
Michał Moskal
bdbe5f816b update llguidance to 0.7.11; adds StructTag (#4870) 2025-04-26 20:13:57 -07:00
Connector Switch
70d040f904 [NFC] Remove duplicate compressed-tensors (#5640) 2025-04-22 09:10:25 -07:00
Yineng Zhang
b9c87e781d chore: bump v0.4.5.post3 (#5611) 2025-04-21 18:16:20 -07:00
lukec
417b44eba8 [Feat] upgrade pytorch2.6 (#5417) 2025-04-20 16:06:34 -07:00
Lianmin Zheng
fbdc94ba59 Release v0.4.5.post2 (#5582) 2025-04-20 14:12:37 -07:00
Yineng Zhang
2c11f9c2eb chore: upgrade sgl-kernel 0.0.9.post2 (#5540) 2025-04-18 21:17:23 -07:00
Yineng Zhang
5b5c7237c8 chore: bump v0.4.5.post1 (#5445) 2025-04-15 23:00:07 -07:00
Yineng Zhang
8ec0bb7d55 chore: upgrade sgl-kernel 0.0.9.post1 (#5436) 2025-04-15 15:45:51 -07:00
Yineng Zhang
8aab7fdb21 chore: upgrade sgl-kernel 0.0.9 (#5401) 2025-04-14 22:37:59 -07:00
Yineng Zhang
f58b929a51 chore: upgrade sgl-kernel 0.0.8.post3 (#5342) 2025-04-13 00:45:59 -07:00
Yineng Zhang
f774a0d275 feat: add blackwell Dockerfile (#5302) 2025-04-11 13:08:53 -07:00
Ke Bao
1078396f47 Update deps for mllama4 (#5215) 2025-04-10 09:12:44 -07:00
Yineng Zhang
57f99608f4 bump v0.4.5 (#5117) 2025-04-07 00:35:00 -07:00
Yineng Zhang
35e0856b90 bump v0.4.4.post4 (#5091) 2025-04-05 15:36:17 -07:00
Yi Zhang
aba5ca154d python transfer custom allreduce from trt kernel to vllm kernel (#5080) 2025-04-05 15:35:55 -07:00