| Author | Commit | Message | Date |
|---|---|---|---|
| Yineng Zhang | 88defc4d89 | fix: solve release issue (#5434) | 2025-04-15 12:58:11 -07:00 |
| Lianmin Zheng | 838fa0f218 | [minor] cleanup cmakelists.txt (#5420) | 2025-04-15 07:07:07 -07:00 |
| Yineng Zhang | 11421a3f44 | fix: update pr-test-sgl-kernel (#5399) | 2025-04-14 21:14:59 -07:00 |
| yhyang201 | 072df75354 | Support for Qwen2.5-VL Model in bitsandbytes Format (#5003) | 2025-04-14 02:03:40 -07:00 |
| Yineng Zhang | b62e7e99b8 | feat: adapt merge_state (#5337) | 2025-04-12 21:14:04 -07:00 |
| Yineng Zhang | 75015bb688 | ci: update release node (#5333) | 2025-04-12 14:22:45 -07:00 |
| Yineng Zhang | 812e82f35e | fix: solve cu118 issue for cutlass mla (#5331) | 2025-04-12 12:51:09 -07:00 |
| Yineng Zhang | 6f8593799b | feat: add blackwell workflow (#5303) | 2025-04-11 13:42:00 -07:00 |
| Yineng Zhang | b75275b6f2 | feat: add cu128 identifier for sgl-kernel (#5287) | 2025-04-11 01:58:46 -07:00 |
| saienduri | 7f875f1293 | update grok test (#5171) | 2025-04-09 11:09:47 -07:00 |
| saienduri | 3033c11a21 | Add dummy grok test to amd CI. (#5115) | 2025-04-08 07:44:59 +00:00 |
| Yineng Zhang | 3289c1207d | Update the retry count (#5051) | 2025-04-03 17:07:38 -07:00 |
| renxin | cccfc10e9c | Feature/revise docs ci (#5009) | 2025-04-02 20:08:56 -07:00 |
| Yuhong Guo | 87fafa0105 | Revert PR 4764 & 4813 related to R1 RoPE (#4959) | 2025-03-31 20:56:58 -07:00 |
| Lianmin Zheng | f842853a40 | Fix the timeout for unit-test-2-gpu in pr-test.yml (#4927) | 2025-03-30 12:15:40 -07:00 |
| Adarsh Shirawalmath | 9fccda3111 | [Feature] use pytest for sgl-kernel (#4896) | 2025-03-30 10:36:52 -07:00 |
| Lianmin Zheng | 4ede6770cd | Fix retract for page size > 1 (#4914) | 2025-03-30 02:57:15 -07:00 |
| Yineng Zhang | 400ad66019 | Update CODEOWNERS (#4889) | 2025-03-29 09:56:51 -07:00 |
| Yineng Zhang | 72549263c6 | update sgl-kernel test ci (#4866) | 2025-03-28 11:42:41 -07:00 |
| Lianmin Zheng | 74e0ac1dbd | Clean up import vllm in quantization/__init__.py (#4834) | 2025-03-28 10:34:10 -07:00 |
| warjiang | 18317ddc13 | ci: add condition for daily docker build (#4487) | 2025-03-27 21:44:37 -07:00 |
| fzyzcjy | 0d3e3072ee | Fix CI of test_patch_torch (#4844) | 2025-03-27 21:22:45 -07:00 |
| Yineng Zhang | 5fa3058f01 | fix the release doc dependency issue (#4828) | 2025-03-27 13:28:12 -07:00 |
| strgrb | 668ecc6c5b | Fix ut mla-test-1-gpu-amd (#4813); Co-authored-by: Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com> | 2025-03-27 08:27:51 -07:00 |
| Yineng Zhang | 8bf6d7f406 | support cmake for sgl-kernel (#4706); Co-authored-by: hebiao064 <hebiaobuaa@gmail.com>; Co-authored-by: yinfan98 <1106310035@qq.com> | 2025-03-27 01:42:28 -07:00 |
| Xiaoyu Zhang | 04e3ff6975 | Support compressed tensors fp8w8a8 (#4743) | 2025-03-26 13:21:25 -07:00 |
| fzyzcjy | 26f07294f1 | Warn users when release_memory_occupation is called without memory saver enabled (#4566) | 2025-03-26 00:18:14 -07:00 |
| fzyzcjy | 15ddd84322 | Add retry for flaky tests in CI (#4755) | 2025-03-25 16:53:12 -07:00 |
| fzyzcjy | e45ae444db | Revert "Add DeepEP tests into CI (#4737)" (#4751) | 2025-03-25 00:44:01 -07:00 |
| Yineng Zhang | 9b7cf9ee6c | support cu128 sgl-kernel (#4744) | 2025-03-24 20:53:23 -07:00 |
| fzyzcjy | 64129fa632 | Add DeepEP tests into CI (#4737) | 2025-03-24 19:54:31 -07:00 |
| aoshen524 | 588865f0e0 | [Feature] Support Tensor Parallelism and Weight Slicing for Lora (#4274); Co-authored-by: ShenAo1111 <1377693092@qq.com>; Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com> | 2025-03-18 20:33:07 -07:00 |
| Yineng Zhang | c787298547 | use sgl custom all reduce (#4441) | 2025-03-18 00:46:41 -07:00 |
| Lianmin Zheng | 82dec1f70b | Remove redundant type conversion (#4513) | 2025-03-17 05:57:35 -07:00 |
| Lianmin Zheng | 5493c3343e | Fix data parallel + tensor parallel (#4499) | 2025-03-17 05:13:16 -07:00 |
| Lianmin Zheng | 754a0e8278 | Update CODEOWNERS (#4484) | 2025-03-16 17:10:15 -07:00 |
| Lianmin Zheng | 06d12b39d3 | Remove filter for pr-tests (#4468) | 2025-03-16 00:57:26 -07:00 |
| Lianmin Zheng | c30976fb41 | Fix finish step for pr tests and notebook tests (#4467) | 2025-03-16 00:52:06 -07:00 |
| Yineng Zhang | ad1ae7f7cd | use topk_softmax with sgl-kernel (#4439) | 2025-03-14 15:59:06 -07:00 |
| Yineng Zhang | 977d7cd26a | cleanup deps 1/n (#4400); Co-authored-by: sleepcoo <sleepcoo@gmail.com> | 2025-03-14 00:00:33 -07:00 |
| Lianmin Zheng | bb37855653 | Update CODEOWNERS (#4403) | 2025-03-13 17:54:40 -07:00 |
| Lianmin Zheng | a5a892ffd3 | Fix auto merge & add back get_flat_data_by_layer (#4393) | 2025-03-13 08:46:25 -07:00 |
| HandH1998 | 2ac189edc8 | Amd test fp8 (#4261) | 2025-03-10 10:12:09 -07:00 |
| Lianmin Zheng | 5a6400eec5 | Test no vllm custom allreduce (#4256) | 2025-03-10 10:08:25 -07:00 |
| Lianmin Zheng | 3d56585a97 | increase the timeout of nightly-test.yml (#4262) | 2025-03-10 05:07:03 -07:00 |
| Lianmin Zheng | aa957102a9 | Simplify tests & Fix trtllm custom allreduce registration (#4252) | 2025-03-10 01:24:22 -07:00 |
| Lianmin Zheng | e8a69e4d0c | Clean up fp8 support (#4230) | 2025-03-09 21:46:35 -07:00 |
| Lianmin Zheng | fbd560028a | Auto balance CI tests (#4238) | 2025-03-09 21:05:55 -07:00 |
| Lianmin Zheng | 8abf74e3c9 | Rename files in sgl kernel to avoid nested folder structure (#4213); Co-authored-by: zhyncs <me@zhyncs.com> | 2025-03-08 22:54:51 -08:00 |
| Yineng Zhang | ee132a4515 | use latest sgl-kernel for mla test (#4222) | 2025-03-08 22:27:47 -08:00 |