Yi Zhang
|
89f1d4f536
|
update deepep commit to support qwen3-coder (#9066)
|
2025-08-11 10:42:33 -07:00 |
|
Yineng Zhang
|
48b8b4c124
|
fix nvshmem cu126 (#9001)
|
2025-08-09 03:34:54 -07:00 |
|
ishandhanani
|
4e7f025219
|
chore(gb200): update to CUDA 12.9 and improve build process (#8772)
|
2025-08-08 13:42:47 -07:00 |
|
Yineng Zhang
|
41357e511b
|
chore: update flashinfer (#8958)
|
2025-08-08 02:15:22 -07:00 |
|
Yineng Zhang
|
54ea57f245
|
chore: bump sgl-kernel v0.3.3 (#8957)
|
2025-08-08 01:35:37 -07:00 |
|
Yineng Zhang
|
cbbd685a46
|
chore: use torch 2.8 stable (#8880)
|
2025-08-06 15:51:40 -07:00 |
|
Mick
|
01c99a9959
|
chore: update Dockerfile (#8872)
Co-authored-by: zhyncs <me@zhyncs.com>
|
2025-08-06 09:30:33 -07:00 |
|
Yineng Zhang
|
4ef47839ae
|
feat: use py312 (#8832)
|
2025-08-05 13:38:22 -07:00 |
|
Yineng Zhang
|
0a56b721d5
|
chore: bump sgl-kernel v0.2.9 (#8713)
|
2025-08-02 16:21:56 -07:00 |
|
Yineng Zhang
|
43118f5f2a
|
chore: bump sgl-kernel v0.2.8 (#8599)
|
2025-07-30 22:23:52 -07:00 |
|
Charles Chen
|
659bfd1023
|
Add GKE's default CUDA runtime lib location to PATH and LD_LIBRARY_PATH. (#8544)
|
2025-07-30 20:28:07 -07:00 |
|
Yineng Zhang
|
7181ec8cfc
|
fix: upgrade nccl version (#8359)
|
2025-07-25 14:59:02 -07:00 |
|
Shangming Cai
|
70e37b97bf
|
chore: upgrade mooncake 0.3.5 (#8341)
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
|
2025-07-25 01:17:26 -07:00 |
|
Yineng Zhang
|
4c605235aa
|
fix: workaround for deepgemm warmup issue (#8302)
|
2025-07-23 12:01:51 -07:00 |
|
Yineng Zhang
|
429bb0efa2
|
chore: bump sgl-kernel v0.2.6.post1 (#8200)
|
2025-07-20 19:50:28 -07:00 |
|
Yineng Zhang
|
2db6719cc5
|
feat: update nccl 2.27.6 (#8182)
|
2025-07-19 22:55:45 -07:00 |
|
Yineng Zhang
|
f98e88b9fb
|
chore: bump sgl-kernel v0.2.6 (#8165)
|
2025-07-19 00:56:18 -07:00 |
|
Simo Lin
|
8a7a7770e5
|
[ci] limit cmake build nproc (#8100)
|
2025-07-16 18:09:28 -07:00 |
|
Simo Lin
|
d9eb5efc71
|
[misc] update nvshmem and pin deepEP commit hash (#8098)
|
2025-07-16 08:54:55 -07:00 |
|
Yineng Zhang
|
8f3173d0b0
|
chore: bump sgl-kernel v0.2.5 (#7964)
|
2025-07-11 18:24:20 -07:00 |
|
Yineng Zhang
|
f200af0d8c
|
chore: bump sgl-kernel v0.2.4 (#7800)
|
2025-07-05 15:03:31 -07:00 |
|
Yineng Zhang
|
75354d9ae9
|
fix: use nvidia-nccl-cu12 2.27.5 (#7787)
|
2025-07-05 01:28:21 -07:00 |
|
Yineng Zhang
|
4fece12be9
|
chore: bump sgl-kernel v0.2.3 (#7784)
|
2025-07-05 00:05:45 -07:00 |
|
Yineng Zhang
|
aca1101a13
|
chore: bump sgl-kernel 0.2.2 (#7755)
|
2025-07-03 12:49:10 -07:00 |
|
Yineng Zhang
|
637bfee448
|
chore: bump sgl-kernel v0.2.1 (#7675)
|
2025-06-30 22:12:33 -07:00 |
|
Xiaoyu Zhang
|
ff2e9c9479
|
Add small requirements for benchmark/parse_result tools (#7671)
|
2025-06-30 21:52:20 -07:00 |
|
Shangming Cai
|
a07f8ae4b7
|
[CI] Upgrade mooncake to v0.3.4.post2 to fix potential slice failed bug (#7522)
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
|
2025-06-25 01:49:22 -07:00 |
|
Yineng Zhang
|
e846d95ef6
|
chore: bump sgl-kernel v0.2.0 (#7490)
|
2025-06-23 22:29:50 -07:00 |
|
Liangsheng Yin
|
76139bfba0
|
update mooncake in dockerfile (#7480)
|
2025-06-24 02:29:30 +08:00 |
|
Yineng Zhang
|
4d8d9b8efd
|
chore: upgrade mooncake-transfer-engine 0.3.4 (#7401)
|
2025-06-20 16:38:54 -07:00 |
|
ybyang
|
906dbc34f1
|
[Docker] optimize dockerfile remove deepep and blackwell merge it to… (#7343)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
|
2025-06-19 17:42:40 -07:00 |
|
Yineng Zhang
|
cef6655b26
|
fix 24.12 docker (#7045)
|
2025-06-10 02:43:22 -07:00 |
|
Swipe4057
|
27196d4148
|
[Docker] Upgrading base image from 24.04 to 24.12 (#7043)
|
2025-06-10 02:23:05 -07:00 |
|
Yineng Zhang
|
4f723edd3b
|
chore: bump v0.4.7 (#7038)
|
2025-06-10 01:56:20 -07:00 |
|
Yineng Zhang
|
141a459644
|
fix: only upgrade nccl for cu128 (#5986)
|
2025-05-02 11:31:29 -07:00 |
|
Yineng Zhang
|
f4c191a712
|
chore: update Dockerfile (#5894)
|
2025-04-29 12:55:13 -07:00 |
|
lukec
|
417b44eba8
|
[Feat] upgrade pytorch2.6 (#5417)
|
2025-04-20 16:06:34 -07:00 |
|
Kebe
|
4a893d142d
|
Refactor Dockerfile: unify CUDA logic and reduce image size by ~2.6 GB (#3749)
Signed-off-by: Kebe <mail@kebe7jun.com>
|
2025-03-08 03:01:13 -08:00 |
|
Shenggui Li
|
c9565e49e7
|
[docker] added rdma support (#3619)
|
2025-02-17 15:36:16 +08:00 |
|
Yineng Zhang
|
ac963be234
|
update flashinfer-python (#3557)
|
2025-02-14 09:52:56 +08:00 |
|
Yineng Zhang
|
fa1b40e00d
|
use nvcr.io/nvidia/tritonserver:24.04-py3-min as base image (#3457)
|
2025-02-10 13:52:33 +08:00 |
|
Yineng Zhang
|
6186a8f889
|
update flashinfer install index url (#3293)
|
2025-02-05 00:44:35 +08:00 |
|
Yineng Zhang
|
b49d6d0fee
|
support 12.5 CUDA runtime (#3231)
|
2025-01-31 20:31:38 +08:00 |
|
Yineng Zhang
|
cf142b6eb8
|
fix: update Dockerfile for cu118 (#3181)
|
2025-01-27 23:46:44 +08:00 |
|
Yineng Zhang
|
4e1e3cff20
|
fix #2528 (#2541)
|
2024-12-22 00:14:41 +08:00 |
|
Yineng Zhang
|
fae4e5e99a
|
chore: bump v0.3.6.post3 (#2259)
|
2024-11-30 01:41:16 +08:00 |
|
Lianmin Zheng
|
39bb49d156
|
Update dockerfile to include datamodel_code_generator (#1492)
|
2024-09-22 04:49:16 -07:00 |
|
William Arnold
|
58cafe23a7
|
Add libibverbs-dev to Dockerfile (#1427)
|
2024-09-15 15:40:31 +09:00 |
|
Yineng Zhang
|
6aa8ad14f8
|
fix: resolve Python.h header missing (#1119)
|
2024-08-16 15:46:43 +10:00 |
|
Yineng Zhang
|
0076f11541
|
fix: use devel for Triton's compiler requirements (#1074)
|
2024-08-13 04:08:43 -07:00 |
|