48 Commits

Author SHA1 Message Date
Yineng Zhang
7181ec8cfc fix: upgrade nccl version (#8359) 2025-07-25 14:59:02 -07:00
Shangming Cai
70e37b97bf chore: upgrade mooncake 0.3.5 (#8341) 2025-07-25 01:17:26 -07:00
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
Yineng Zhang
4c605235aa fix: workaround for deepgemm warmup issue (#8302) 2025-07-23 12:01:51 -07:00
Yineng Zhang
429bb0efa2 chore: bump sgl-kernel v0.2.6.post1 (#8200) 2025-07-20 19:50:28 -07:00
Yineng Zhang
2db6719cc5 feat: update nccl 2.27.6 (#8182) 2025-07-19 22:55:45 -07:00
Yineng Zhang
f98e88b9fb chore: bump sgl-kernel v0.2.6 (#8165) 2025-07-19 00:56:18 -07:00
Simo Lin
8a7a7770e5 [ci] limit cmake build nproc (#8100) 2025-07-16 18:09:28 -07:00
Simo Lin
d9eb5efc71 [misc] update nvshmem and pin deepEP commit hash (#8098) 2025-07-16 08:54:55 -07:00
Yineng Zhang
8f3173d0b0 chore: bump sgl-kernel v0.2.5 (#7964) 2025-07-11 18:24:20 -07:00
Yineng Zhang
f200af0d8c chore: bump sgl-kernel v0.2.4 (#7800) 2025-07-05 15:03:31 -07:00
Yineng Zhang
75354d9ae9 fix: use nvidia-nccl-cu12 2.27.5 (#7787) 2025-07-05 01:28:21 -07:00
Yineng Zhang
4fece12be9 chore: bump sgl-kernel v0.2.3 (#7784) 2025-07-05 00:05:45 -07:00
Yineng Zhang
aca1101a13 chore: bump sgl-kernel 0.2.2 (#7755) 2025-07-03 12:49:10 -07:00
Yineng Zhang
637bfee448 chore: bump sgl-kernel v0.2.1 (#7675) 2025-06-30 22:12:33 -07:00
Xiaoyu Zhang
ff2e9c9479 Add small requirements for benchmark/parse_result tools (#7671) 2025-06-30 21:52:20 -07:00
Shangming Cai
a07f8ae4b7 [CI] Upgrade mooncake to v0.3.4.post2 to fix potential slice failed bug (#7522) 2025-06-25 01:49:22 -07:00
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
Yineng Zhang
e846d95ef6 chore: bump sgl-kernel v0.2.0 (#7490) 2025-06-23 22:29:50 -07:00
Liangsheng Yin
76139bfba0 update mooncake in dockerfile (#7480) 2025-06-24 02:29:30 +08:00
Yineng Zhang
4d8d9b8efd chore: upgrade mooncake-transfer-engine 0.3.4 (#7401) 2025-06-20 16:38:54 -07:00
ybyang
906dbc34f1 [Docker] optimize dockerfile remove deepep and blackwell merge it to… (#7343) 2025-06-19 17:42:40 -07:00
Co-authored-by: Yineng Zhang <me@zhyncs.com>
Yineng Zhang
cef6655b26 fix 24.12 docker (#7045) 2025-06-10 02:43:22 -07:00
Swipe4057
27196d4148 [Docker] Upgrading base image from 24.04 to 24.12 (#7043) 2025-06-10 02:23:05 -07:00
Yineng Zhang
4f723edd3b chore: bump v0.4.7 (#7038) 2025-06-10 01:56:20 -07:00
Yineng Zhang
141a459644 fix: only upgrade nccl for cu128 (#5986) 2025-05-02 11:31:29 -07:00
Yineng Zhang
f4c191a712 chore: update Dockerfile (#5894) 2025-04-29 12:55:13 -07:00
lukec
417b44eba8 [Feat] upgrade pytorch2.6 (#5417) 2025-04-20 16:06:34 -07:00
Kebe
4a893d142d Refactor Dockerfile: unify CUDA logic and reduce image size by ~2.6 GB (#3749) 2025-03-08 03:01:13 -08:00
Signed-off-by: Kebe <mail@kebe7jun.com>
Shenggui Li
c9565e49e7 [docker] added rdma support (#3619) 2025-02-17 15:36:16 +08:00
Yineng Zhang
ac963be234 update flashinfer-python (#3557) 2025-02-14 09:52:56 +08:00
Yineng Zhang
fa1b40e00d use nvcr.io/nvidia/tritonserver:24.04-py3-min as base image (#3457) 2025-02-10 13:52:33 +08:00
Yineng Zhang
6186a8f889 update flashinfer install index url (#3293) 2025-02-05 00:44:35 +08:00
Yineng Zhang
b49d6d0fee support 12.5 CUDA runtime (#3231) 2025-01-31 20:31:38 +08:00
Yineng Zhang
cf142b6eb8 fix: update Dockerfile for cu118 (#3181) 2025-01-27 23:46:44 +08:00
Yineng Zhang
4e1e3cff20 fix #2528 (#2541) 2024-12-22 00:14:41 +08:00
Yineng Zhang
fae4e5e99a chore: bump v0.3.6.post3 (#2259) 2024-11-30 01:41:16 +08:00
Lianmin Zheng
39bb49d156 Update dockerfile to include datamodel_code_generator (#1492) 2024-09-22 04:49:16 -07:00
William Arnold
58cafe23a7 Add libibverbs-dev to Dockerfile (#1427) 2024-09-15 15:40:31 +09:00
Yineng Zhang
6aa8ad14f8 fix: resolve Python.h header missing (#1119) 2024-08-16 15:46:43 +10:00
Yineng Zhang
0076f11541 fix: use devel for Triton's compiler requirements (#1074) 2024-08-13 04:08:43 -07:00
Yineng Zhang
cb99ba4fc6 feat: update Dockerfile (#1033) 2024-08-12 16:24:06 +10:00
Co-authored-by: vhain <vhain6512@gmail.com>
Yineng Zhang
c31f084c71 chore: update vllm to 0.5.4 (#966) 2024-08-07 21:15:41 +10:00
Ying Sheng
001b0bdd08 Update the base image of the docker (#900) 2024-08-02 21:54:57 -07:00
Yineng Zhang
8628ab9c8b feat: add docker workflow (#751) 2024-07-27 03:54:51 +10:00
Ying Sheng
7802df1e2b Update readme 2024-07-25 08:45:06 -07:00
Liangsheng Yin
04ec6ba2ac Fix dockerfile and triton cache manager (#720) 2024-07-25 03:04:21 -07:00
Liangsheng Yin
8832ecb1e4 Reduce docker size (#632) 2024-07-16 16:12:12 -07:00
Ying Sheng
f6b29f6920 Update docker file (#629) 2024-07-16 01:12:37 -07:00
Ying Sheng
ac11388756 Add docker file (#588) 2024-07-04 00:53:49 -07:00
Co-authored-by: Ying Sheng <ying.sheng@databricks.com>