Lianmin Zheng
|
1fc84cf60b
|
Update readme (#2500)
Co-authored-by: Ravi Theja <ravi03071991@gmail.com>
Co-authored-by: “yixin-huang1” <yixinhuang1@berkeley.edu>
|
2024-12-17 04:33:36 -08:00 |
|
Yineng Zhang
|
f68175967c
|
docs: update adoption (Meituan) (#2373)
|
2024-12-06 01:59:26 -08:00 |
|
Yineng Zhang
|
eb0c1f5373
|
docs: add SGLang v0.4 blog (#2341)
|
2024-12-05 01:24:51 +08:00 |
|
Yineng Zhang
|
de3b67b77d
|
docs: update adoption (#2204)
|
2024-11-26 12:57:16 -08:00 |
|
Lianmin Zheng
|
8912b7637f
|
Fix docs (#2164)
|
2024-11-24 08:25:56 -08:00 |
|
Lianmin Zheng
|
c211e7b669
|
Simplify batch update (#2154)
|
2024-11-24 04:47:10 -08:00 |
|
James Xu
|
f6f713797b
|
Add support for Qwen2-VL-based embedding models (#2055)
|
2024-11-21 14:24:25 -08:00 |
|
Yineng Zhang
|
aaf0a3156e
|
docs: add slides link in README (#1997)
|
2024-11-11 05:03:16 -08:00 |
|
Lianmin Zheng
|
760552e068
|
Update README.md (#1974)
|
2024-11-09 11:32:13 -08:00 |
|
Kursat Aktas
|
d9aada9db1
|
Introducing SGLang Guru on Gurubase.io (#1745)
|
2024-11-09 11:29:26 -08:00 |
|
Chayenne
|
e3126e3c5f
|
Update README.md's Slack invitation link (#1962)
|
2024-11-08 11:46:25 -08:00 |
|
Chayenne
|
704f8e8ed1
|
Add Reward API Docs etc (#1910)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
|
2024-11-03 22:33:03 -08:00 |
|
Lianmin Zheng
|
2565cb0f40
|
Update docs and workflow (#1881)
|
2024-11-01 20:29:41 -07:00 |
|
Chayenne
|
61cf00e112
|
change file tree (#1859)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
|
2024-10-31 20:10:16 -07:00 |
|
Lianmin Zheng
|
0ab7bcaf66
|
Simplify documentation in README.md (#1851)
|
2024-10-30 21:57:49 -07:00 |
|
Lianmin Zheng
|
3184aa95a7
|
Update README.md (#1840)
|
2024-10-30 03:16:43 -07:00 |
|
Lianmin Zheng
|
9084a86445
|
Update links (#1805)
|
2024-10-26 04:46:01 -07:00 |
|
Lianmin Zheng
|
30643fed7f
|
Release v0.3.4.post2 (#1796)
Co-authored-by: DarkSharpness <76582120+DarkSharpness@users.noreply.github.com>
|
2024-10-25 11:07:19 -07:00 |
|
Lianmin Zheng
|
e646c5901e
|
Fix logprob in the overlapped mode (#1795)
|
2024-10-25 11:06:57 -07:00 |
|
yizhang2077
|
def55bc876
|
Qwen2vl support cuda graph and disable radix cache (#1780)
|
2024-10-25 10:45:17 -04:00 |
|
Lianmin Zheng
|
b7d0559496
|
Update docs (#1768)
Co-authored-by: Chayenne Zhao <zhaochenyang20@gmail.com>
Co-authored-by: Chayenne <zhaochen20@outlook.com>
|
2024-10-23 11:28:48 -07:00 |
|
Lianmin Zheng
|
1f26e8b8e4
|
Release v0.3.4.post1 (#1749)
|
2024-10-21 21:16:43 -07:00 |
|
Lianmin Zheng
|
09603c6dc9
|
Maintain seq_lens_sum to make more FlashInfer operations non-blocking (#1741)
|
2024-10-21 01:43:16 -07:00 |
|
sixgod
|
45d5af2416
|
Add GLM-4 TextGeneration Model support for SGLang (#1736)
|
2024-10-21 04:08:30 +00:00 |
|
Ying Sheng
|
95946271af
|
Update README.md
|
2024-10-19 22:29:12 -07:00 |
|
Ying Sheng
|
5c4ce65631
|
Update README.md (#1722)
|
2024-10-19 22:27:38 -07:00 |
|
Lianmin Zheng
|
b6cd903604
|
Update readme and workflow (#1716)
|
2024-10-19 13:01:44 -07:00 |
|
Lianmin Zheng
|
087257ea03
|
Release v0.3.4 (#1714)
|
2024-10-19 08:17:41 -07:00 |
|
Lianmin Zheng
|
736f04025d
|
Update README.md (#1713)
|
2024-10-19 07:11:02 -07:00 |
|
Lianmin Zheng
|
d19cc0b9c9
|
Update README.md (#1689)
|
2024-10-16 18:36:24 -07:00 |
|
Ying Sheng
|
e4b367baa8
|
[Event] Add online meetup meeting link (#1686)
|
2024-10-16 10:58:14 -07:00 |
|
Byron Hsu
|
cd0be7489f
|
[doc] improve engine doc and add to readme (#1670)
|
2024-10-14 19:56:21 -07:00 |
|
Lianmin Zheng
|
00c7e6368b
|
Release v0.3.3.post1 (#1636)
|
2024-10-11 07:56:16 -07:00 |
|
Janumala Akhilendra
|
81c3327402
|
Added a "Back To Top" Button (#1633)
|
2024-10-11 06:25:30 -07:00 |
|
Lianmin Zheng
|
5476ccad8f
|
Update README.md
|
2024-10-11 01:59:49 -07:00 |
|
Lianmin Zheng
|
b040ed71f7
|
Update README.md (#1629)
|
2024-10-11 01:58:25 -07:00 |
|
Kushal Agrawal
|
c9e6658699
|
Update README.md (#1625)
|
2024-10-11 01:57:42 -07:00 |
|
Lianmin Zheng
|
7b69d91b4f
|
Release v0.3.3 (#1605)
|
2024-10-08 12:58:41 -07:00 |
|
Lianmin Zheng
|
f7cce751f9
|
Update README.md (#1591)
|
2024-10-06 15:14:29 -07:00 |
|
Ying Sheng
|
1c1bdc7699
|
[Event] Update README.md (#1572)
|
2024-10-05 11:16:47 -07:00 |
|
Ikko Eltociear Ashimine
|
f8fb4ce9b0
|
chore: update README.md (#1580)
|
2024-10-05 11:05:57 -07:00 |
|
Theresa Barton
|
2c7d0a5b8b
|
[Fix] Fix all the Huggingface paths (#1553)
|
2024-10-02 10:12:07 -07:00 |
|
Lianmin Zheng
|
048685430d
|
Improve process creation (#1534)
|
2024-09-29 02:36:12 -07:00 |
|
Lianmin Zheng
|
4e4459b91f
|
Multiple minor fixes (#1530)
|
2024-09-28 14:43:35 -07:00 |
|
Kylin
|
f42e9bfb52
|
[bugfix] Add modelscope package to avoid docker image without modelscope (#1520)
|
2024-09-28 12:43:22 -07:00 |
|
Ying Sheng
|
b1e330bcb0
|
[Event] Update meeting link (#1529)
|
2024-09-27 13:30:04 -07:00 |
|
Ying Sheng
|
37c5899fc2
|
Release v0.3.2 (#1512)
|
2024-09-25 14:17:09 +08:00 |
|
TianyiQ
|
3c93187caf
|
Add support for tie_word_embeddings when loading weights + support for SmolLM (#1508)
|
2024-09-24 21:50:20 -07:00 |
|
Lianmin Zheng
|
167591e864
|
Better unit tests for adding a new model (#1488)
|
2024-09-22 01:50:37 -07:00 |
|
Yineng Zhang
|
82136eb0b5
|
chore: bump v0.3.1.post3 (#1483)
|
2024-09-21 11:17:45 +08:00 |
|