Commit Graph

106 Commits

Author SHA1 Message Date
Ying Sheng
141e8c71a3 Bump version to 0.2.10 (#923) 2024-08-04 16:52:51 -07:00
Ying Sheng
8c5382e62c Update README.md 2024-08-03 12:58:41 -07:00
Ying Sheng
b906c01592 Bump version to 0.2.9.post1 (#899) 2024-08-02 12:08:00 -07:00
Ying Sheng
30a9b2ef20 Bump version to v0.2.9 (#890) 2024-08-02 01:45:48 -07:00
Ying Sheng
e4d3333c6c bump to 0.2.8 (#877) 2024-08-01 14:18:26 -07:00
Ikko Eltociear Ashimine
7d5ed7c6ee docs: update README.md (#843) 2024-07-31 12:48:18 +10:00
Yineng Zhang
1edd4e07d6 chore: bump v0.2.7 (#830) 2024-07-30 20:41:10 +10:00
Yineng Zhang
bece265f5a docs: update README (#819) 2024-07-30 16:17:50 +10:00
Ying Sheng
db6089e6f3 Revert "Organize public APIs" (#815) 2024-07-29 19:40:28 -07:00
Liangsheng Yin
c8e9fed87a Organize public APIs (#809) 2024-07-29 15:34:16 -07:00
ObjectNotFound
8f6274c82b Add role documentation, add system begin & end tokens (#793) 2024-07-28 23:02:49 -07:00
Ying Sheng
5bd899243b Update README.md (#792) 2024-07-28 21:57:23 -07:00
Yineng Zhang
1f013d64eb docs: make badges center (#789) 2024-07-28 22:27:52 +10:00
Yineng Zhang
628e1fa760 docs: update README (#788) 2024-07-28 22:24:27 +10:00
Ying Sheng
bcb6611a46 Update README.md 2024-07-28 01:00:06 -07:00
Yineng Zhang
948625799e docs: init readthedocs support (#783) 2024-07-28 16:50:31 +10:00
Lianmin Zheng
f64b2a9bc0 Add slack invitation link. 2024-07-27 06:29:15 -07:00
Ying Sheng
9f95dcc64f Update readme (#769)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
2024-07-27 06:12:16 -07:00
Liangsheng Yin
ba29504b21 Update supported models (#763) 2024-07-27 15:53:53 +10:00
Yineng Zhang
3e455b016e misc: replace deprecated variable HUGGING_FACE_HUB_TOKEN with HF_TOKEN (#752) 2024-07-27 04:19:30 +10:00
Yineng Zhang
01fbb11bb7 docs: fix typo (#742) 2024-07-26 21:05:53 +10:00
Yineng Zhang
05d216da32 docs: add llama 3.1 405b instruction (#739)
Co-authored-by: Ying1123 <sqy1415@gmail.com>
2024-07-26 21:03:20 +10:00
Ying Sheng
7f6f2f0f09 Update readme (#731) 2024-07-25 09:16:10 -07:00
Ying Sheng
7802df1e2b Update readme 2024-07-25 08:45:06 -07:00
Ying Sheng
8fbba3de3d Fix bugs (fp8 checkpoints, triton cache manager) (#729) 2024-07-25 07:42:00 -07:00
Yineng Zhang
926ac01b64 fix: resolve the logo display issue on the PyPI page (#726) 2024-07-25 20:47:46 +10:00
Yineng Zhang
fded67441d misc: update bulid instruction (#724) 2024-07-25 17:08:11 +10:00
Yineng Zhang
d5146baec9 docs: update supported models (#719) 2024-07-25 09:34:01 +10:00
Ying Sheng
08a3bd19cc docs: update doc (#716) 2024-07-24 20:44:03 +00:00
zhyncs
7b597475f2 docs: update README (#692) 2024-07-22 03:41:20 +10:00
Lianmin Zheng
5a4ef2b5c8 update readme 2024-07-21 02:58:57 -07:00
zhyncs
9dab947d56 docs: update README (#688) 2024-07-21 18:32:58 +10:00
Lianmin Zheng
33ee97b0bf Allow disabling streaming in bench (#687) 2024-07-21 01:12:34 -07:00
Lianmin Zheng
77e592e8e0 support non-streaming benchmark (#682) 2024-07-20 18:36:42 -07:00
Ying Sheng
2b4c646277 Update version to 0.1.22 (#677) 2024-07-20 03:39:50 -07:00
Ying Sheng
50a53887be Update docs 2024-07-19 11:40:06 -07:00
Ying Sheng
11c8efff73 Add benchmark instructions (#663) 2024-07-19 11:12:23 -07:00
Ying Sheng
e87c7fd501 Improve docs (#662) 2024-07-19 10:58:03 -07:00
Ying Sheng
51fda1439f Update Readme (#660)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2024-07-19 09:54:01 -07:00
Ying Sheng
5f90e0769c Update README.md 2024-07-16 19:18:54 -07:00
Liangsheng Yin
8832ecb1e4 Reduce docker size (#632) 2024-07-16 16:12:12 -07:00
Ying Sheng
6a2941f4d0 Improve tensor parallel performance (#625)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
2024-07-15 07:10:51 -07:00
Lianmin Zheng
da2e5d6546 Fix the default argument of OpenAI Chat completion (#605) 2024-07-09 02:04:43 -07:00
胡译文
02b7258658 [Feat] Expose logprob options to sgl.gen API (#503)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2024-07-09 00:35:39 -07:00
Tommy Yang
b38687226a Make sglang compat with vllm 0.5.1 (#598) 2024-07-08 23:44:22 -07:00
Liangsheng Yin
5304b4ef58 Add --enable-p2p-check option (#599) 2024-07-06 23:34:10 -07:00
Lianmin Zheng
d737da5f17 Update README.md 2024-07-04 00:56:58 -07:00
Ying Sheng
ac11388756 Add docker file (#588)
Co-authored-by: Ying Sheng <ying.sheng@databricks.com>
2024-07-04 00:53:49 -07:00
Lianmin Zheng
dc8cef1d0c Update README.md 2024-07-04 00:05:40 -07:00
Lianmin Zheng
c7709d3abe Update install commands (#583) 2024-07-03 02:10:59 -07:00