zhyncs
|
7b597475f2
|
docs: update README (#692)
|
2024-07-22 03:41:20 +10:00 |
|
Lianmin Zheng
|
5a4ef2b5c8
|
update readme
|
2024-07-21 02:58:57 -07:00 |
|
zhyncs
|
9dab947d56
|
docs: update README (#688)
|
2024-07-21 18:32:58 +10:00 |
|
Lianmin Zheng
|
33ee97b0bf
|
Allow disabling streaming in bench (#687)
|
2024-07-21 01:12:34 -07:00 |
|
Lianmin Zheng
|
77e592e8e0
|
support non-streaming benchmark (#682)
|
2024-07-20 18:36:42 -07:00 |
|
Ying Sheng
|
2b4c646277
|
Update version to 0.1.22 (#677)
|
2024-07-20 03:39:50 -07:00 |
|
Ying Sheng
|
50a53887be
|
Update docs
|
2024-07-19 11:40:06 -07:00 |
|
Ying Sheng
|
11c8efff73
|
Add benchmark instructions (#663)
|
2024-07-19 11:12:23 -07:00 |
|
Ying Sheng
|
e87c7fd501
|
Improve docs (#662)
|
2024-07-19 10:58:03 -07:00 |
|
Ying Sheng
|
51fda1439f
|
Update Readme (#660)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-07-19 09:54:01 -07:00 |
|
Ying Sheng
|
5f90e0769c
|
Update README.md
|
2024-07-16 19:18:54 -07:00 |
|
Liangsheng Yin
|
8832ecb1e4
|
Reduce docker size (#632)
|
2024-07-16 16:12:12 -07:00 |
|
Ying Sheng
|
6a2941f4d0
|
Improve tensor parallel performance (#625)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
|
2024-07-15 07:10:51 -07:00 |
|
Lianmin Zheng
|
da2e5d6546
|
Fix the default argument of OpenAI Chat completion (#605)
|
2024-07-09 02:04:43 -07:00 |
|
胡译文
|
02b7258658
|
[Feat] Expose logprob options to sgl.gen API (#503)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-07-09 00:35:39 -07:00 |
|
Tommy Yang
|
b38687226a
|
Make sglang compat with vllm 0.5.1 (#598)
|
2024-07-08 23:44:22 -07:00 |
|
Liangsheng Yin
|
5304b4ef58
|
Add --enable-p2p-check option (#599)
|
2024-07-06 23:34:10 -07:00 |
|
Lianmin Zheng
|
d737da5f17
|
Update README.md
|
2024-07-04 00:56:58 -07:00 |
|
Ying Sheng
|
ac11388756
|
Add docker file (#588)
Co-authored-by: Ying Sheng <ying.sheng@databricks.com>
|
2024-07-04 00:53:49 -07:00 |
|
Lianmin Zheng
|
dc8cef1d0c
|
Update README.md
|
2024-07-04 00:05:40 -07:00 |
|
Lianmin Zheng
|
c7709d3abe
|
Update install commands (#583)
|
2024-07-03 02:10:59 -07:00 |
|
Ying Sheng
|
9380f50ff9
|
Turn on flashinfer by default (#578)
|
2024-07-02 02:25:07 -07:00 |
|
Lianmin Zheng
|
26294b2f3d
|
Update README.md
|
2024-07-01 09:54:08 -07:00 |
|
Lianmin Zheng
|
1fa15099d8
|
Add LlamaForClassification (#559)
|
2024-06-22 00:49:31 -07:00 |
|
Lianmin Zheng
|
f6dbd24043
|
Improve doc strings (#518)
|
2024-06-08 02:39:32 -07:00 |
|
Lianmin Zheng
|
c0ae70c8ed
|
Improve logging & fix litellm dependency. (#512)
|
2024-06-07 13:10:32 -07:00 |
|
Lianmin Zheng
|
9f009261f2
|
Improve docs
|
2024-06-01 17:46:08 -05:00 |
|
Lianmin Zheng
|
adc974268a
|
Update docs (#486)
|
2024-05-27 22:51:05 -07:00 |
|
Yuanhan Zhang
|
44c998fcb5
|
Add the instruction link to the LLaVA-NeXT-Video at README (#463)
|
2024-05-24 03:38:20 -07:00 |
|
Yuanhan Zhang
|
0992d85f92
|
support llava video (#426)
|
2024-05-13 16:57:00 -07:00 |
|
Lianmin Zheng
|
455c9ccc4a
|
Update readme (#434)
|
2024-05-13 00:17:02 -07:00 |
|
Lianmin Zheng
|
aee4f523cf
|
Fix logit processor bugs (#427)
|
2024-05-12 04:54:07 -07:00 |
|
Ying Sheng
|
ca4f1ab89c
|
Update model support in readme (#370)
|
2024-04-17 00:16:32 -07:00 |
|
Ikko Eltociear Ashimine
|
c93293c57e
|
Update README.md (#358)
|
2024-04-09 23:39:30 +08:00 |
|
Lianmin Zheng
|
faba293a0d
|
Improve gemma and documentations (#278)
|
2024-03-11 04:43:39 -07:00 |
|
Lianmin Zheng
|
a833de05d3
|
Add logo (#275)
|
2024-03-10 18:51:47 -07:00 |
|
Ikko Eltociear Ashimine
|
ce3b261053
|
Update README.md (#207)
|
2024-02-19 09:09:03 -08:00 |
|
Yaya Sy
|
c97fdae4aa
|
correct a mistake on the README.md (#182)
|
2024-02-11 13:25:57 -08:00 |
|
Ying Sheng
|
79e6b84bec
|
Update README.md
|
2024-02-06 23:14:59 -08:00 |
|
Ying Sheng
|
cb8e1982f8
|
Update README.md
|
2024-02-06 18:45:38 -08:00 |
|
Lianmin Zheng
|
ee1df26a77
|
Update README.md
|
2024-02-06 11:35:42 -08:00 |
|
Lianmin Zheng
|
8ff870bf3e
|
improve docs
|
2024-02-05 11:29:08 +00:00 |
|
Lianmin Zheng
|
8fb7459e08
|
update json decoding docs
|
2024-02-03 17:44:02 -08:00 |
|
Ying Sheng
|
f6bfe3aaff
|
Release 0.1.11 (#134)
|
2024-02-03 02:50:13 -08:00 |
|
Lianmin Zheng
|
03e04b2331
|
update docs for Yi-VL
|
2024-02-01 22:44:59 +00:00 |
|
Lianmin Zheng
|
a49dc52bfa
|
release v0.1.10
|
2024-01-30 15:37:52 +00:00 |
|
Keith Stevens
|
1d0fbe8e43
|
[Feature] Adds basic support for image content in OpenAI chat routes (#113)
|
2024-01-30 06:12:33 -08:00 |
|
Lianmin Zheng
|
97aa9b3284
|
Improve docs & Add JSON decode example (#121)
|
2024-01-30 05:45:27 -08:00 |
|
Lianmin Zheng
|
0617528632
|
Update quick start examples (#120)
|
2024-01-30 04:29:32 -08:00 |
|
Lianmin Zheng
|
4ea92f8307
|
Format code (#118)
|
2024-01-29 17:08:12 -08:00 |
|