Commit Graph

207 Commits

Author SHA1 Message Date
Lianmin Zheng
1fc84cf60b Update readme (#2500)
Co-authored-by: Ravi Theja <ravi03071991@gmail.com>
Co-authored-by: “yixin-huang1” <yixinhuang1@berkeley.edu>
2024-12-17 04:33:36 -08:00
Yineng Zhang
f68175967c docs: update adoption (Meituan) (#2373) 2024-12-06 01:59:26 -08:00
Yineng Zhang
eb0c1f5373 docs: add SGLang v0.4 blog (#2341) 2024-12-05 01:24:51 +08:00
Yineng Zhang
de3b67b77d docs: update adoption (#2204) 2024-11-26 12:57:16 -08:00
Lianmin Zheng
8912b7637f Fix docs (#2164) 2024-11-24 08:25:56 -08:00
Lianmin Zheng
c211e7b669 Simplify batch update (#2154) 2024-11-24 04:47:10 -08:00
James Xu
f6f713797b Add support for Qwen2-VL-based embedding models (#2055) 2024-11-21 14:24:25 -08:00
Yineng Zhang
aaf0a3156e docs: add slides link in README (#1997) 2024-11-11 05:03:16 -08:00
Lianmin Zheng
760552e068 Update README.md (#1974) 2024-11-09 11:32:13 -08:00
Kursat Aktas
d9aada9db1 Introducing SGLang Guru on Gurubase.io (#1745) 2024-11-09 11:29:26 -08:00
Chayenne
e3126e3c5f Update README.md's Slack invitation link (#1962) 2024-11-08 11:46:25 -08:00
Chayenne
704f8e8ed1 Add Reward API Docs etc (#1910)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-11-03 22:33:03 -08:00
Lianmin Zheng
2565cb0f40 Update docs and workflow (#1881) 2024-11-01 20:29:41 -07:00
Chayenne
61cf00e112 change file tree (#1859)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-10-31 20:10:16 -07:00
Lianmin Zheng
0ab7bcaf66 Simplify documentation in README.md (#1851) 2024-10-30 21:57:49 -07:00
Lianmin Zheng
3184aa95a7 Update README.md (#1840) 2024-10-30 03:16:43 -07:00
Lianmin Zheng
9084a86445 Update links (#1805) 2024-10-26 04:46:01 -07:00
Lianmin Zheng
30643fed7f Release v0.3.4.post2 (#1796)
Co-authored-by: DarkSharpness <76582120+DarkSharpness@users.noreply.github.com>
2024-10-25 11:07:19 -07:00
Lianmin Zheng
e646c5901e Fix logprob in the overlapped mode (#1795) 2024-10-25 11:06:57 -07:00
yizhang2077
def55bc876 Qwen2vl support cuda graph and disable radix cache (#1780) 2024-10-25 10:45:17 -04:00
Lianmin Zheng
b7d0559496 Update docs (#1768)
Co-authored-by: Chayenne Zhao <zhaochenyang20@gmail.com>
Co-authored-by: Chayenne <zhaochen20@outlook.com>
2024-10-23 11:28:48 -07:00
Lianmin Zheng
1f26e8b8e4 Release v0.3.4.post1 (#1749) 2024-10-21 21:16:43 -07:00
Lianmin Zheng
09603c6dc9 Maintain seq_lens_sum to make more FlashInfer operations non-blocking (#1741) 2024-10-21 01:43:16 -07:00
sixgod
45d5af2416 Add GLM-4 TextGeneration Model support for SGLang (#1736) 2024-10-21 04:08:30 +00:00
Ying Sheng
95946271af Update README.md 2024-10-19 22:29:12 -07:00
Ying Sheng
5c4ce65631 Update README.md (#1722) 2024-10-19 22:27:38 -07:00
Lianmin Zheng
b6cd903604 Update readme and workflow (#1716) 2024-10-19 13:01:44 -07:00
Lianmin Zheng
087257ea03 Release v0.3.4 (#1714) 2024-10-19 08:17:41 -07:00
Lianmin Zheng
736f04025d Update README.md (#1713) 2024-10-19 07:11:02 -07:00
Lianmin Zheng
d19cc0b9c9 Update README.md (#1689) 2024-10-16 18:36:24 -07:00
Ying Sheng
e4b367baa8 [Event] Add online meetup meeting link (#1686) 2024-10-16 10:58:14 -07:00
Byron Hsu
cd0be7489f [doc] improve engine doc and add to readme (#1670) 2024-10-14 19:56:21 -07:00
Lianmin Zheng
00c7e6368b Release v0.3.3.post1 (#1636) 2024-10-11 07:56:16 -07:00
Janumala Akhilendra
81c3327402 Added a "Back To Top" Button (#1633) 2024-10-11 06:25:30 -07:00
Lianmin Zheng
5476ccad8f Update README.md 2024-10-11 01:59:49 -07:00
Lianmin Zheng
b040ed71f7 Update README.md (#1629) 2024-10-11 01:58:25 -07:00
Kushal Agrawal
c9e6658699 Update README.md (#1625) 2024-10-11 01:57:42 -07:00
Lianmin Zheng
7b69d91b4f Release v0.3.3 (#1605) 2024-10-08 12:58:41 -07:00
Lianmin Zheng
f7cce751f9 Update README.md (#1591) 2024-10-06 15:14:29 -07:00
Ying Sheng
1c1bdc7699 [Event] Update README.md (#1572) 2024-10-05 11:16:47 -07:00
Ikko Eltociear Ashimine
f8fb4ce9b0 chore: update README.md (#1580) 2024-10-05 11:05:57 -07:00
Theresa Barton
2c7d0a5b8b [Fix] Fix all the Huggingface paths (#1553) 2024-10-02 10:12:07 -07:00
Lianmin Zheng
048685430d Improve process creation (#1534) 2024-09-29 02:36:12 -07:00
Lianmin Zheng
4e4459b91f Multiple minor fixes (#1530) 2024-09-28 14:43:35 -07:00
Kylin
f42e9bfb52 [bugfix] Add modelscope package to avoid docker image without modelscope (#1520) 2024-09-28 12:43:22 -07:00
Ying Sheng
b1e330bcb0 [Event] Update meeting link (#1529) 2024-09-27 13:30:04 -07:00
Ying Sheng
37c5899fc2 Release v0.3.2 (#1512) 2024-09-25 14:17:09 +08:00
TianyiQ
3c93187caf Add support for tie_word_embeddings when loading weights + support for SmolLM (#1508) 2024-09-24 21:50:20 -07:00
Lianmin Zheng
167591e864 Better unit tests for adding a new model (#1488) 2024-09-22 01:50:37 -07:00
Yineng Zhang
82136eb0b5 chore: bump v0.3.1.post3 (#1483) 2024-09-21 11:17:45 +08:00