Commit Graph

102 Commits

Author SHA1 Message Date
nvjullin
9a71500cfb Fixed aarch64 flash-mla (#12009) 2025-10-23 17:47:04 -07:00
fzyzcjy
8612811d85 Bump grace blackwell DeepEP version (#11990) 2025-10-22 21:08:12 -07:00
Baizhou Zhang
ebff4ee648 Update sgl-kernel and remove fast hadamard dependency (#11844) 2025-10-21 13:13:54 -07:00
fzyzcjy
9e3be1fa2a Tiny bump DeepEP version in ARM blackwell (#11810) 2025-10-20 08:15:14 +08:00
kyleliang-nv
fda0cb2a30 Fix Dockerfile not installing correct version of DeepEP for arm build (#11773) 2025-10-18 15:06:05 -07:00
Yineng Zhang
86373b9e48 fix: Update SGL_KERNEL_VERSION to 0.3.15 (#11633) 2025-10-14 14:45:28 -07:00
Baizhou Zhang
8b85926a6e Remove tilelang dependency in Dockerfile (#11455) 2025-10-10 23:17:53 -07:00
gongwei-130
4aeb193fbd disable sm100 for FlashMLA and fast-hadamard-transform in cuda12.6.1 (#11274) 2025-10-06 14:48:31 -07:00
Baizhou Zhang
292a867ad9 Add flashmla and fast hadamard transform to Dockerfile (#11235) 2025-10-05 21:31:28 -07:00
gongwei-130
0618ad6dd5 fix: shouldn't include CUDA_ARCH 100 and 120 for cuda12.6.1 (#11176) 2025-10-02 13:24:23 -07:00
ishandhanani
47488cc353 docker: x86 dev builds for hopper and blackwell (#11075) 2025-10-01 00:06:38 -07:00
ishandhanani
adba172fd1 ci: free space on workers for build (#10786)
Co-authored-by: zhyncs <me@zhyncs.com>
2025-09-24 02:58:22 -07:00
ishandhanani
1c82d9db28 feat: unify dockerfiles (#10705)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-09-22 23:23:48 -07:00
Shangming Cai
74cd6e3902 chore: upgrade mooncake 0.3.6.post1 to fix gb200 dockerfile (#10681)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-09-20 00:12:26 -07:00
Baizhou Zhang
3fa3c22ae2 Fix fast decode plan for flashinfer v0.4.0rc1 and upgrade sgl-kernel 0.3.11 (#10634)
Co-authored-by: zhyncs <me@zhyncs.com>
2025-09-19 01:25:29 -07:00
Yi Zhang
e07b21ceaf update deepep version for qwen3-next deepep moe (#10624) 2025-09-18 11:35:22 -07:00
Shangming Cai
60fc5b51f6 chore: upgrade mooncake 0.3.6 (#10596)
Signed-off-by: Shangming Cai <csmthu@gmail.com>
2025-09-18 00:19:30 -07:00
Yineng Zhang
c0c6f543e4 chore: upgrade sgl-kernel 0.3.10 (#10500) 2025-09-16 02:00:53 -07:00
Yineng Zhang
b0d25e72c4 chore: bump v0.5.2 (#10221) 2025-09-11 16:09:20 -07:00
Yineng Zhang
bfe01a5eef chore: upgrade v0.3.9.post2 sgl-kernel (#10297) 2025-09-11 04:10:29 -07:00
Lianmin Zheng
bcf1955f7e Revert "chore: upgrade v0.3.9 sgl-kernel" (#10245) 2025-09-09 19:05:20 -07:00
Yineng Zhang
d3ee70985f chore: upgrade v0.3.9 sgl-kernel (#10220) 2025-09-09 03:16:25 -07:00
Liangsheng Yin
6e95f5e5bd Simplify Router arguments passing and build it in docker image (#9964) 2025-09-05 12:13:55 +08:00
JieXin Liang
1db649ac02 [feat] apply deep_gemm compile_mode to skip launch (#9879) 2025-09-02 03:20:30 -07:00
Yineng Zhang
9970e3bf32 chore: upgrade sgl-kernel 0.3.7.post1 with deepgemm fix (#9822) 2025-08-30 04:02:25 -07:00
Yineng Zhang
9c99949ef3 chore: update Dockerfile (#9820) 2025-08-30 03:08:14 -07:00
Yineng Zhang
bc80dc4ce0 chore: bump v0.5.1.post3 (#9716) 2025-08-27 15:42:42 -07:00
fzyzcjy
44ffe2cb72 Install py-spy by default for containers for easier debugging (#9649) 2025-08-26 10:40:52 -07:00
Yineng Zhang
bf863e3bbf fix: use sgl-kernel 0.3.5 (#9565) 2025-08-24 15:46:47 -07:00
gongwei-130
fb107cfd75 feat: allow use local branch to build image (#9546) 2025-08-23 16:38:30 -07:00
Yineng Zhang
b6b2287e4b chore: bump sgl-kernel v0.3.6.post2 (#9475) 2025-08-21 23:02:08 -07:00
Yineng Zhang
a1c7f742f9 chore: bump sgl-kernel v0.3.6.post1 (#9286) 2025-08-17 16:26:17 -07:00
Yineng Zhang
87dab54824 Revert "chore: bump sgl-kernel v0.3.6 (#9220)" (#9247) 2025-08-15 17:24:36 -07:00
Yineng Zhang
c186feed7f chore: bump sgl-kernel v0.3.6 (#9220) 2025-08-15 02:50:50 -07:00
fzyzcjy
f8644a5632 Tiny update tmux history limit on dev container (#9218) 2025-08-15 00:22:08 -07:00
fzyzcjy
392de007cb Minor fix docker container DeepEP on multi platforms (#9205) 2025-08-14 17:41:49 -07:00
Yineng Zhang
1fea998a45 chore: bump sgl-kernel v0.3.5 (#9185) 2025-08-14 03:20:48 -07:00
fzyzcjy
b3363cc1aa Fix docker container DeepEP error on Blackwell (#9171) 2025-08-13 21:06:48 -07:00
Yineng Zhang
71fb8c9527 feat: update fa3 (#9126) 2025-08-13 20:07:08 +08:00
Yineng Zhang
924827c3de chore: use cp310 (#9130) 2025-08-12 15:33:22 -07:00
Yineng Zhang
c81daf838d fix: update Dockerfile (#9129) 2025-08-12 15:01:29 -07:00
Yineng Zhang
305b27c124 fix: update Dockerfile (#9125) 2025-08-12 13:23:10 -07:00
Yineng Zhang
3a9afe2a42 chore: bump sgl-kernel v0.3.4 (#9103) 2025-08-12 01:48:47 -07:00
Yi Zhang
89f1d4f536 update deepep commit to support qwen3-coder (#9066) 2025-08-11 10:42:33 -07:00
Yineng Zhang
48b8b4c124 fix nvshmem cu126 (#9001) 2025-08-09 03:34:54 -07:00
ishandhanani
4e7f025219 chore(gb200): update to CUDA 12.9 and improve build process (#8772) 2025-08-08 13:42:47 -07:00
Yineng Zhang
41357e511b chore: update flashinfer (#8958) 2025-08-08 02:15:22 -07:00
Yineng Zhang
54ea57f245 chore: bump sgl-kernel v0.3.3 (#8957) 2025-08-08 01:35:37 -07:00
Yineng Zhang
cbbd685a46 chore: use torch 2.8 stable (#8880) 2025-08-06 15:51:40 -07:00
Mick
01c99a9959 chore: update Dockerfile (#8872)
Co-authored-by: zhyncs <me@zhyncs.com>
2025-08-06 09:30:33 -07:00