Commit Graph

61 Commits

Author SHA1 Message Date
Liangsheng Yin
5304b4ef58 Add --enable-p2p-check option (#599) 2024-07-06 23:34:10 -07:00
Lianmin Zheng
d737da5f17 Update README.md 2024-07-04 00:56:58 -07:00
Ying Sheng
ac11388756 Add docker file (#588)
Co-authored-by: Ying Sheng <ying.sheng@databricks.com>
2024-07-04 00:53:49 -07:00
Lianmin Zheng
dc8cef1d0c Update README.md 2024-07-04 00:05:40 -07:00
Lianmin Zheng
c7709d3abe Update install commands (#583) 2024-07-03 02:10:59 -07:00
Ying Sheng
9380f50ff9 Turn on flashinfer by default (#578) 2024-07-02 02:25:07 -07:00
Lianmin Zheng
26294b2f3d Update README.md 2024-07-01 09:54:08 -07:00
Lianmin Zheng
1fa15099d8 Add LlamaForClassification (#559) 2024-06-22 00:49:31 -07:00
Lianmin Zheng
f6dbd24043 Improve doc strings (#518) 2024-06-08 02:39:32 -07:00
Lianmin Zheng
c0ae70c8ed Improve logging & fix litellm dependency. (#512) 2024-06-07 13:10:32 -07:00
Lianmin Zheng
9f009261f2 Improve docs 2024-06-01 17:46:08 -05:00
Lianmin Zheng
adc974268a Update docs (#486) 2024-05-27 22:51:05 -07:00
Yuanhan Zhang
44c998fcb5 Add the instruction link to the LLaVA-NeXT-Video at README (#463) 2024-05-24 03:38:20 -07:00
Yuanhan Zhang
0992d85f92 support llava video (#426) 2024-05-13 16:57:00 -07:00
Lianmin Zheng
455c9ccc4a Update readme (#434) 2024-05-13 00:17:02 -07:00
Lianmin Zheng
aee4f523cf Fix logit processor bugs (#427) 2024-05-12 04:54:07 -07:00
Ying Sheng
ca4f1ab89c Update model support in readme (#370) 2024-04-17 00:16:32 -07:00
Ikko Eltociear Ashimine
c93293c57e Update README.md (#358) 2024-04-09 23:39:30 +08:00
Lianmin Zheng
faba293a0d Improve gemma and documentations (#278) 2024-03-11 04:43:39 -07:00
Lianmin Zheng
a833de05d3 Add logo (#275) 2024-03-10 18:51:47 -07:00
Ikko Eltociear Ashimine
ce3b261053 Update README.md (#207) 2024-02-19 09:09:03 -08:00
Yaya Sy
c97fdae4aa correct a mistake on the README.md (#182) 2024-02-11 13:25:57 -08:00
Ying Sheng
79e6b84bec Update README.md 2024-02-06 23:14:59 -08:00
Ying Sheng
cb8e1982f8 Update README.md 2024-02-06 18:45:38 -08:00
Lianmin Zheng
ee1df26a77 Update README.md 2024-02-06 11:35:42 -08:00
Lianmin Zheng
8ff870bf3e improve docs 2024-02-05 11:29:08 +00:00
Lianmin Zheng
8fb7459e08 update json decoding docs 2024-02-03 17:44:02 -08:00
Ying Sheng
f6bfe3aaff Release 0.1.11 (#134) 2024-02-03 02:50:13 -08:00
Lianmin Zheng
03e04b2331 update docs for Yi-VL 2024-02-01 22:44:59 +00:00
Lianmin Zheng
a49dc52bfa release v0.1.10 2024-01-30 15:37:52 +00:00
Keith Stevens
1d0fbe8e43 [Feature] Adds basic support for image content in OpenAI chat routes (#113) 2024-01-30 06:12:33 -08:00
Lianmin Zheng
97aa9b3284 Improve docs & Add JSON decode example (#121) 2024-01-30 05:45:27 -08:00
Lianmin Zheng
0617528632 Update quick start examples (#120) 2024-01-30 04:29:32 -08:00
Lianmin Zheng
4ea92f8307 Format code (#118) 2024-01-29 17:08:12 -08:00
Lianmin Zheng
93414c8238 Add a link to HF paper page 2024-01-24 22:25:33 -08:00
Lianmin Zheng
ed7c7eca0e Update README.md 2024-01-24 16:52:21 -08:00
Lianmin Zheng
9a16fea012 Return logprob for choices (#87) 2024-01-23 05:07:30 -08:00
Lianmin Zheng
9e037c822c Update README.md 2024-01-23 03:43:19 -08:00
0xWe11es.eth
9076386d90 Fix SRT endpoint api json syntax (#84) 2024-01-23 00:25:26 -08:00
Arcmoon
63e97e5e4c Suppport qwen model and solve some problems (#75) 2024-01-22 20:14:51 -08:00
Lianmin Zheng
cd3ccb2ed7 Add a note about triton version for older GPUs (#72) 2024-01-21 16:51:45 -08:00
Lianmin Zheng
007eeb4eb9 Fix the error message and dependency of openai backend (#71) 2024-01-21 14:56:25 -08:00
Ying Sheng
e8f2b155fe Update README.md 2024-01-21 02:45:58 -08:00
Ikko Eltociear Ashimine
0b2efc2adc Update README.md (#58) 2024-01-19 21:00:29 -08:00
Lianmin Zheng
199e82a15d Format code & Improve readme (#52) 2024-01-18 23:51:19 -08:00
Cody Yu
23471f9aa3 Support v1/chat/completions (#50) 2024-01-18 23:43:09 -08:00
Cody Yu
61d4c93962 Support stream=True in v1/completions (#49) 2024-01-18 17:00:56 -08:00
Lianmin Zheng
05b4c398df Document sampling parameters (#45) 2024-01-18 11:49:27 -08:00
Lianmin Zheng
70528762bf update readme 2024-01-17 10:42:55 -08:00
Ying Sheng
71d30d6ddc Update README.md 2024-01-17 09:49:53 -08:00