Ying Sheng
|
141e8c71a3
|
Bump version to 0.2.10 (#923)
|
2024-08-04 16:52:51 -07:00 |
|
Ying Sheng
|
8c5382e62c
|
Update README.md
|
2024-08-03 12:58:41 -07:00 |
|
Ying Sheng
|
b906c01592
|
Bump version to 0.2.9.post1 (#899)
|
2024-08-02 12:08:00 -07:00 |
|
Ying Sheng
|
30a9b2ef20
|
Bump version to v0.2.9 (#890)
|
2024-08-02 01:45:48 -07:00 |
|
Ying Sheng
|
e4d3333c6c
|
bump to 0.2.8 (#877)
|
2024-08-01 14:18:26 -07:00 |
|
Ikko Eltociear Ashimine
|
7d5ed7c6ee
|
docs: update README.md (#843)
|
2024-07-31 12:48:18 +10:00 |
|
Yineng Zhang
|
1edd4e07d6
|
chore: bump v0.2.7 (#830)
|
2024-07-30 20:41:10 +10:00 |
|
Yineng Zhang
|
bece265f5a
|
docs: update README (#819)
|
2024-07-30 16:17:50 +10:00 |
|
Ying Sheng
|
db6089e6f3
|
Revert "Organize public APIs" (#815)
|
2024-07-29 19:40:28 -07:00 |
|
Liangsheng Yin
|
c8e9fed87a
|
Organize public APIs (#809)
|
2024-07-29 15:34:16 -07:00 |
|
ObjectNotFound
|
8f6274c82b
|
Add role documentation, add system begin & end tokens (#793)
|
2024-07-28 23:02:49 -07:00 |
|
Ying Sheng
|
5bd899243b
|
Update README.md (#792)
|
2024-07-28 21:57:23 -07:00 |
|
Yineng Zhang
|
1f013d64eb
|
docs: make badges center (#789)
|
2024-07-28 22:27:52 +10:00 |
|
Yineng Zhang
|
628e1fa760
|
docs: update README (#788)
|
2024-07-28 22:24:27 +10:00 |
|
Ying Sheng
|
bcb6611a46
|
Update README.md
|
2024-07-28 01:00:06 -07:00 |
|
Yineng Zhang
|
948625799e
|
docs: init readthedocs support (#783)
|
2024-07-28 16:50:31 +10:00 |
|
Lianmin Zheng
|
f64b2a9bc0
|
Add slack invitation link.
|
2024-07-27 06:29:15 -07:00 |
|
Ying Sheng
|
9f95dcc64f
|
Update readme (#769)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
|
2024-07-27 06:12:16 -07:00 |
|
Liangsheng Yin
|
ba29504b21
|
Update supported models (#763)
|
2024-07-27 15:53:53 +10:00 |
|
Yineng Zhang
|
3e455b016e
|
misc: replace deprecated variable HUGGING_FACE_HUB_TOKEN with HF_TOKEN (#752)
|
2024-07-27 04:19:30 +10:00 |
|
Yineng Zhang
|
01fbb11bb7
|
docs: fix typo (#742)
|
2024-07-26 21:05:53 +10:00 |
|
Yineng Zhang
|
05d216da32
|
docs: add llama 3.1 405b instruction (#739)
Co-authored-by: Ying1123 <sqy1415@gmail.com>
|
2024-07-26 21:03:20 +10:00 |
|
Ying Sheng
|
7f6f2f0f09
|
Update readme (#731)
|
2024-07-25 09:16:10 -07:00 |
|
Ying Sheng
|
7802df1e2b
|
Update readme
|
2024-07-25 08:45:06 -07:00 |
|
Ying Sheng
|
8fbba3de3d
|
Fix bugs (fp8 checkpoints, triton cache manager) (#729)
|
2024-07-25 07:42:00 -07:00 |
|
Yineng Zhang
|
926ac01b64
|
fix: resolve the logo display issue on the PyPI page (#726)
|
2024-07-25 20:47:46 +10:00 |
|
Yineng Zhang
|
fded67441d
|
misc: update bulid instruction (#724)
|
2024-07-25 17:08:11 +10:00 |
|
Yineng Zhang
|
d5146baec9
|
docs: update supported models (#719)
|
2024-07-25 09:34:01 +10:00 |
|
Ying Sheng
|
08a3bd19cc
|
docs: update doc (#716)
|
2024-07-24 20:44:03 +00:00 |
|
zhyncs
|
7b597475f2
|
docs: update README (#692)
|
2024-07-22 03:41:20 +10:00 |
|
Lianmin Zheng
|
5a4ef2b5c8
|
update readme
|
2024-07-21 02:58:57 -07:00 |
|
zhyncs
|
9dab947d56
|
docs: update README (#688)
|
2024-07-21 18:32:58 +10:00 |
|
Lianmin Zheng
|
33ee97b0bf
|
Allow disabling streaming in bench (#687)
|
2024-07-21 01:12:34 -07:00 |
|
Lianmin Zheng
|
77e592e8e0
|
support non-streaming benchmark (#682)
|
2024-07-20 18:36:42 -07:00 |
|
Ying Sheng
|
2b4c646277
|
Update version to 0.1.22 (#677)
|
2024-07-20 03:39:50 -07:00 |
|
Ying Sheng
|
50a53887be
|
Update docs
|
2024-07-19 11:40:06 -07:00 |
|
Ying Sheng
|
11c8efff73
|
Add benchmark instructions (#663)
|
2024-07-19 11:12:23 -07:00 |
|
Ying Sheng
|
e87c7fd501
|
Improve docs (#662)
|
2024-07-19 10:58:03 -07:00 |
|
Ying Sheng
|
51fda1439f
|
Update Readme (#660)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-07-19 09:54:01 -07:00 |
|
Ying Sheng
|
5f90e0769c
|
Update README.md
|
2024-07-16 19:18:54 -07:00 |
|
Liangsheng Yin
|
8832ecb1e4
|
Reduce docker size (#632)
|
2024-07-16 16:12:12 -07:00 |
|
Ying Sheng
|
6a2941f4d0
|
Improve tensor parallel performance (#625)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
|
2024-07-15 07:10:51 -07:00 |
|
Lianmin Zheng
|
da2e5d6546
|
Fix the default argument of OpenAI Chat completion (#605)
|
2024-07-09 02:04:43 -07:00 |
|
胡译文
|
02b7258658
|
[Feat] Expose logprob options to sgl.gen API (#503)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-07-09 00:35:39 -07:00 |
|
Tommy Yang
|
b38687226a
|
Make sglang compat with vllm 0.5.1 (#598)
|
2024-07-08 23:44:22 -07:00 |
|
Liangsheng Yin
|
5304b4ef58
|
Add --enable-p2p-check option (#599)
|
2024-07-06 23:34:10 -07:00 |
|
Lianmin Zheng
|
d737da5f17
|
Update README.md
|
2024-07-04 00:56:58 -07:00 |
|
Ying Sheng
|
ac11388756
|
Add docker file (#588)
Co-authored-by: Ying Sheng <ying.sheng@databricks.com>
|
2024-07-04 00:53:49 -07:00 |
|
Lianmin Zheng
|
dc8cef1d0c
|
Update README.md
|
2024-07-04 00:05:40 -07:00 |
|
Lianmin Zheng
|
c7709d3abe
|
Update install commands (#583)
|
2024-07-03 02:10:59 -07:00 |
|