Commit Graph

77 Commits

Author SHA1 Message Date
zhyncs
7b597475f2 docs: update README (#692) 2024-07-22 03:41:20 +10:00
Lianmin Zheng
5a4ef2b5c8 update readme 2024-07-21 02:58:57 -07:00
zhyncs
9dab947d56 docs: update README (#688) 2024-07-21 18:32:58 +10:00
Lianmin Zheng
33ee97b0bf Allow disabling streaming in bench (#687) 2024-07-21 01:12:34 -07:00
Lianmin Zheng
77e592e8e0 support non-streaming benchmark (#682) 2024-07-20 18:36:42 -07:00
Ying Sheng
2b4c646277 Update version to 0.1.22 (#677) 2024-07-20 03:39:50 -07:00
Ying Sheng
50a53887be Update docs 2024-07-19 11:40:06 -07:00
Ying Sheng
11c8efff73 Add benchmark instructions (#663) 2024-07-19 11:12:23 -07:00
Ying Sheng
e87c7fd501 Improve docs (#662) 2024-07-19 10:58:03 -07:00
Ying Sheng
51fda1439f Update Readme (#660)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2024-07-19 09:54:01 -07:00
Ying Sheng
5f90e0769c Update README.md 2024-07-16 19:18:54 -07:00
Liangsheng Yin
8832ecb1e4 Reduce docker size (#632) 2024-07-16 16:12:12 -07:00
Ying Sheng
6a2941f4d0 Improve tensor parallel performance (#625)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
2024-07-15 07:10:51 -07:00
Lianmin Zheng
da2e5d6546 Fix the default argument of OpenAI Chat completion (#605) 2024-07-09 02:04:43 -07:00
胡译文
02b7258658 [Feat] Expose logprob options to sgl.gen API (#503)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2024-07-09 00:35:39 -07:00
Tommy Yang
b38687226a Make sglang compat with vllm 0.5.1 (#598) 2024-07-08 23:44:22 -07:00
Liangsheng Yin
5304b4ef58 Add --enable-p2p-check option (#599) 2024-07-06 23:34:10 -07:00
Lianmin Zheng
d737da5f17 Update README.md 2024-07-04 00:56:58 -07:00
Ying Sheng
ac11388756 Add docker file (#588)
Co-authored-by: Ying Sheng <ying.sheng@databricks.com>
2024-07-04 00:53:49 -07:00
Lianmin Zheng
dc8cef1d0c Update README.md 2024-07-04 00:05:40 -07:00
Lianmin Zheng
c7709d3abe Update install commands (#583) 2024-07-03 02:10:59 -07:00
Ying Sheng
9380f50ff9 Turn on flashinfer by default (#578) 2024-07-02 02:25:07 -07:00
Lianmin Zheng
26294b2f3d Update README.md 2024-07-01 09:54:08 -07:00
Lianmin Zheng
1fa15099d8 Add LlamaForClassification (#559) 2024-06-22 00:49:31 -07:00
Lianmin Zheng
f6dbd24043 Improve doc strings (#518) 2024-06-08 02:39:32 -07:00
Lianmin Zheng
c0ae70c8ed Improve logging & fix litellm dependency. (#512) 2024-06-07 13:10:32 -07:00
Lianmin Zheng
9f009261f2 Improve docs 2024-06-01 17:46:08 -05:00
Lianmin Zheng
adc974268a Update docs (#486) 2024-05-27 22:51:05 -07:00
Yuanhan Zhang
44c998fcb5 Add the instruction link to the LLaVA-NeXT-Video at README (#463) 2024-05-24 03:38:20 -07:00
Yuanhan Zhang
0992d85f92 support llava video (#426) 2024-05-13 16:57:00 -07:00
Lianmin Zheng
455c9ccc4a Update readme (#434) 2024-05-13 00:17:02 -07:00
Lianmin Zheng
aee4f523cf Fix logit processor bugs (#427) 2024-05-12 04:54:07 -07:00
Ying Sheng
ca4f1ab89c Update model support in readme (#370) 2024-04-17 00:16:32 -07:00
Ikko Eltociear Ashimine
c93293c57e Update README.md (#358) 2024-04-09 23:39:30 +08:00
Lianmin Zheng
faba293a0d Improve gemma and documentations (#278) 2024-03-11 04:43:39 -07:00
Lianmin Zheng
a833de05d3 Add logo (#275) 2024-03-10 18:51:47 -07:00
Ikko Eltociear Ashimine
ce3b261053 Update README.md (#207) 2024-02-19 09:09:03 -08:00
Yaya Sy
c97fdae4aa correct a mistake on the README.md (#182) 2024-02-11 13:25:57 -08:00
Ying Sheng
79e6b84bec Update README.md 2024-02-06 23:14:59 -08:00
Ying Sheng
cb8e1982f8 Update README.md 2024-02-06 18:45:38 -08:00
Lianmin Zheng
ee1df26a77 Update README.md 2024-02-06 11:35:42 -08:00
Lianmin Zheng
8ff870bf3e improve docs 2024-02-05 11:29:08 +00:00
Lianmin Zheng
8fb7459e08 update json decoding docs 2024-02-03 17:44:02 -08:00
Ying Sheng
f6bfe3aaff Release 0.1.11 (#134) 2024-02-03 02:50:13 -08:00
Lianmin Zheng
03e04b2331 update docs for Yi-VL 2024-02-01 22:44:59 +00:00
Lianmin Zheng
a49dc52bfa release v0.1.10 2024-01-30 15:37:52 +00:00
Keith Stevens
1d0fbe8e43 [Feature] Adds basic support for image content in OpenAI chat routes (#113) 2024-01-30 06:12:33 -08:00
Lianmin Zheng
97aa9b3284 Improve docs & Add JSON decode example (#121) 2024-01-30 05:45:27 -08:00
Lianmin Zheng
0617528632 Update quick start examples (#120) 2024-01-30 04:29:32 -08:00
Lianmin Zheng
4ea92f8307 Format code (#118) 2024-01-29 17:08:12 -08:00