HAI | 44426e54be | Update REVIEWERS (#9063) | 2025-08-11 11:04:39 -07:00 |
| Zhiqiang Xie | b32792516a | REVIEWERS.md typo fix (#9048) | 2025-08-10 22:33:37 -07:00 |
| Lianmin Zheng | 6beeff41c5 | Update REVIEWERS.md (#9046) | 2025-08-10 21:11:14 -07:00 |
| Lianmin Zheng | 2e8e7e353b | Improve docs and developer guide (#9044) | 2025-08-10 21:05:18 -07:00 |
| Lianmin Zheng | 2449a0afe2 | Refactor the docs (#9031) | 2025-08-10 19:49:45 -07:00 |
| Lianmin Zheng | 0f229c07f1 | Update release-docs.yml (#9037) | 2025-08-10 18:52:11 -07:00 |
| Lianmin Zheng | 2c7f01bc89 | Reorganize CI and test files (#9027) | 2025-08-10 12:30:06 -07:00 |
| Simo Lin | 3817a37d87 | [router] upgrade to latest sgl kernel for router ci (#9019) | 2025-08-09 21:49:18 -07:00 |
| Lianmin Zheng | ef48d5547e | Fix CI (#9013) | 2025-08-09 16:00:10 -07:00 |
| Lianmin Zheng | 9a44b643c6 | Fix CI (#9012) | 2025-08-09 13:33:42 -07:00 |
| ishandhanani | de8b8b6e5c | chore(deps): update minimum python to 3.10 (#8984) | 2025-08-09 00:30:23 -07:00 |
| DiweiSun | 7c0db868a1 | Molly/ci gnr server (#8667) | 2025-08-08 20:01:16 -07:00 |
| Lianmin Zheng | 706bd69cc5 | Clean up server_args.py to have a dedicated function for model specific adjustments (#8983) | 2025-08-08 19:56:50 -07:00 |
| Lianmin Zheng | 6642e3a295 | [Fix] Add a workflow to cancel all pending CI runs (#8988) | 2025-08-08 16:09:50 -07:00 |
| Lianmin Zheng | 67a7d1f699 | Create cancel-all-pr-test-runs (#8986) | 2025-08-08 15:53:51 -07:00 |
| ishandhanani | 7d3af603e7 | chore(ci): update Python version from 3.9 to 3.10 in sgl-kernel workflow (#8981) | 2025-08-08 14:03:17 -07:00 |
| ishandhanani | 4e7f025219 | chore(gb200): update to CUDA 12.9 and improve build process (#8772) | 2025-08-08 13:42:47 -07:00 |
| Yineng Zhang | 1ac16add8b | chore: support blackwell cu129 image (#8928) | 2025-08-07 14:24:57 -07:00 |
| Zhiyu | c3a5fb3b28 | codeowner updates for modelopt related files (#8925) | 2025-08-07 14:21:41 -07:00 |
| Simo Lin | 16a4c66d25 | [router] update pd router ci summary step with new threshold (#8916) | 2025-08-07 07:15:38 -07:00 |
| Simo Lin | 89e6521c61 | [router] re-enable pd router benchmark CI (#8912) | 2025-08-07 06:29:36 -07:00 |
| fzyzcjy | b114a8105b | Support B200 in CI (#8861) | 2025-08-06 21:42:44 +08:00 |
| Yineng Zhang | aeac900ca2 | fix: resolve ci issue (#8859) | 2025-08-06 02:28:14 -07:00 |
| Yineng Zhang | 3ae8e3ea8f | chore: upgrade torch 2.8.0 (#8836) | 2025-08-05 17:32:01 -07:00 |
| kk | 32d9e39a29 | Fix potential memory fault issue and ncclSystemError in CI test (#8681); Co-authored-by: wunhuang <wunhuang@amd.com> | 2025-08-05 12:19:37 -07:00 |
| Yineng Zhang | 194561f27a | feat: support sgl-kernel cu129 (#8800) | 2025-08-05 02:33:47 -07:00 |
| Even Zhou | fee0ab0fba | [CI] Ascend NPU CI enhancement (#8294); Co-authored-by: ronnie_zheng <zl19940307@163.com> | 2025-08-03 22:16:38 -07:00 |
| Liangsheng Yin | 7a27e798ca | [CI] Do not trigger pd-disaggregation CI in draft PR (#8737) | 2025-08-04 05:12:20 +08:00 |
| Yineng Zhang | 5ce5093b97 | chore: bump sgl-kernel 0.3.0 with torch 2.8.0 (#8718) | 2025-08-03 02:31:50 -07:00 |
| Lianmin Zheng | 07e46ecaad | Update CODEOWNERS (#8686) | 2025-08-01 15:09:44 -07:00 |
| li chaoran | fe5086fd8b | chore: speedup NPU CI by cache (#8270); Signed-off-by: mywaaagh_admin <pkwarcraft@gmail.com>; Co-authored-by: ronnie_zheng <zl19940307@163.com> | 2025-07-31 17:29:50 -07:00 |
| Simo Lin | aee0ef52f5 | [router] update router pypi version (#8628) | 2025-07-31 11:24:12 -07:00 |
| Simo Lin | ae807774f5 | [ci] fix genai-bench execution cmd (#8629) | 2025-07-31 10:40:54 -07:00 |
| yihong | 09f1a247ce | fix: fork should not run pypi router (#8604); Signed-off-by: yihong0618 <zouzou0208@gmail.com> | 2025-07-31 02:37:13 -07:00 |
| Shangming Cai | 55ecdc0a8e | Update CODEOWNERS (#8562); Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com> | 2025-07-30 01:05:57 -07:00 |
| Simo Lin | a9fd80336d | [router] allow longer time out for router e2e (#8560) | 2025-07-29 23:43:37 -07:00 |
| Lianmin Zheng | 263c9236a0 | Always trigger pr-test (#8527) | 2025-07-29 04:05:19 -07:00 |
| Lianmin Zheng | 69712e6f55 | Rename the last step in pr-test.yml as pr-test-finish (#8486) | 2025-07-28 19:06:13 -07:00 |
| Lianmin Zheng | 001bffca62 | Update CODEOWNERS (#8485) | 2025-07-28 17:57:23 -07:00 |
| Keyang Ru | 7c9697178e | [CI]Add genai-bench Performance Validation for PD Router (#8477); Co-authored-by: key4ng <rukeyang@gamil.com> | 2025-07-28 16:58:23 -07:00 |
| Lianmin Zheng | bd51694906 | Update codeowner (#8476) | 2025-07-28 16:03:49 -07:00 |
| Ke Bao | b582159246 | Update PR template (#8465) | 2025-07-28 22:12:36 +08:00 |
| kyleliang-nv | 5922c0cbf6 | Remove zstd compression for building Dockerfile.gb200 (#8442) | 2025-07-27 22:58:53 -07:00 |
| Stefan He | 4ad9737045 | chore: bump transformer to 4.54.0 (#8416); Co-authored-by: Binyao Jiang <byjiang1996@gmail.com>; Co-authored-by: Lifu Huang <lifu.hlf@gmail.com> | 2025-07-27 21:27:25 -07:00 |
| kyleliang-nv | bb81daefb8 | Fix docker buildx push error (#8425) | 2025-07-27 17:59:38 -07:00 |
| kyleliang-nv | 95217a9b4d | Change to use native arm runner (#8414) | 2025-07-27 12:48:12 -07:00 |
| Yineng Zhang | bf3352c559 | chore: update CODEOWNERS (#8407) | 2025-07-27 01:39:36 -07:00 |
| kyleliang-nv | 62a6b7c773 | Add docker release flow for gb200 (#8394) | 2025-07-26 21:25:07 -07:00 |
| Lifu Huang | 5c705b1dce | Add perf tests for LoRA (#8314) | 2025-07-26 14:55:22 -07:00 |
| Shangming Cai | 70e37b97bf | chore: upgrade mooncake 0.3.5 (#8341); Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com> | 2025-07-25 01:17:26 -07:00 |
|