sglang-bot | 1053e1be17 | chore: bump SGLang version to 0.5.4 (#12027); Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com> | 2025-10-23 18:01:40 -07:00 |
Gaurav Verma | 6f9b66bdda | [AMD] Update wave-lang to 3.8.0 (#11878); Signed-off-by: xintin <gaurav.verma@amd.com> | 2025-10-20 23:11:09 -07:00 |
Lianmin Zheng | 67e34c56d7 | Fix install instructions and pyproject.tomls (#11781) | 2025-10-18 01:08:01 -07:00 |
sglang-bot | 85ebeecf06 | chore: bump SGLang version to 0.5.3.post3 (#11693); Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com> | 2025-10-16 13:14:55 -07:00 |
sglang-bot | baf277a9bf | chore: bump SGLang version to 0.5.3.post2 (#11680); Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com> | 2025-10-15 16:49:14 -07:00 |
Sahithi Chigurupati | e9e120ac7a | fix: upgrade transformers to 4.57.1 (#11628); Signed-off-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com>; Co-authored-by: zhyncs <me@zhyncs.com> | 2025-10-14 18:35:05 -07:00 |
Yineng Zhang | 4299aebdbb | chore: update pyproject (#11420) | 2025-10-10 00:56:30 -07:00 |
sglang-bot | 758b887ad1 | chore: bump SGLang version to 0.5.3.post1 (#11324) | 2025-10-09 15:19:59 -07:00 |
Yineng Zhang | 44cb060785 | chore: upgrade flashinfer 0.4.0 (#11364) | 2025-10-09 14:17:54 -07:00 |
Lifu Huang | edefab0c64 | [2/2] Support MHA prefill with FlashAttention 4. (#10937); Co-authored-by: Hieu Pham <hyhieu@gmail.com> | 2025-10-08 00:54:20 -07:00 |
DarkSharpness | 832c84fba9 | [Chore] Update xgrammar 0.1.24 -> 0.1.25 (#10710) | 2025-10-07 18:22:28 -07:00 |
Lianmin Zheng | eb30b888db | Remove env var warnings for release (#11262) | 2025-10-06 10:09:17 -07:00 |
sglang-bot | a4a3d82393 | chore: bump SGLang version to 0.5.3 (#11263) | 2025-10-06 20:07:02 +08:00 |
sglang-bot | 0b13cbb7c9 | chore: bump SGLang version to 0.5.3rc2 (#11259); Co-authored-by: sglang-bot <sglang-bot@users.noreply.github.com> | 2025-10-06 01:12:10 -07:00 |
Lianmin Zheng | f8924ad74b | update sgl kernel version to 0.3.14.post1 (#11242) | 2025-10-05 20:30:40 -07:00 |
Matt Nappo | 8c57490210 | [Feature] Option to save model weights to CPU when memory saver mode is enabled (#10873); Co-authored-by: molocule <34072934+molocule@users.noreply.github.com> | 2025-10-03 16:48:19 +08:00 |
eigen | ac1f2928ae | feat: add fast_decode_plan from flashinfer, flashinfer to 0.4.0rc3 (#10760); Co-authored-by: Zihao Ye <yezihhhao@gmail.com>; Co-authored-by: Sleepcoo <Sleepcoo@gmail.com> | 2025-10-01 02:56:13 -07:00 |
Yineng Zhang | 3a641d9085 | chore: upgrade sgl-kernel 0.3.13 (#11056) | 2025-09-29 02:22:25 -07:00 |
Yineng Zhang | 5942fdb480 | chore: upgrade cutedsl 4.2.1 (#11054) | 2025-09-29 00:24:17 -07:00 |
Swipe4057 | 0035f1cefa | fix env flashinfer (#10910) | 2025-09-25 15:44:48 -07:00 |
Yineng Zhang | 8c1ef0f914 | chore: upgrade sgl-kernel 0.3.12 (#10782) | 2025-09-23 00:18:54 -07:00 |
Baizhou Zhang | 3fa3c22ae2 | Fix fast decode plan for flashinfer v0.4.0rc1 and upgrade sgl-kernel 0.3.11 (#10634); Co-authored-by: zhyncs <me@zhyncs.com> | 2025-09-19 01:25:29 -07:00 |
Yineng Zhang | c0c6f543e4 | chore: upgrade sgl-kernel 0.3.10 (#10500) | 2025-09-16 02:00:53 -07:00 |
Yineng Zhang | 86a32bb5cd | chore: bump v0.5.3rc0 (#10468) | 2025-09-15 03:55:18 -07:00 |
Yineng Zhang | 5afd036533 | feat: support pip install sglang (#10465) | 2025-09-15 03:09:17 -07:00 |