38 Commits

Author SHA1 Message Date
Ying Sheng
cb8e1982f8 Update README.md 2024-02-06 18:45:38 -08:00
Lianmin Zheng
ee1df26a77 Update README.md 2024-02-06 11:35:42 -08:00
Lianmin Zheng
8ff870bf3e improve docs 2024-02-05 11:29:08 +00:00
Lianmin Zheng
8fb7459e08 update json decoding docs 2024-02-03 17:44:02 -08:00
Ying Sheng
f6bfe3aaff Release 0.1.11 (#134) 2024-02-03 02:50:13 -08:00
Lianmin Zheng
03e04b2331 update docs for Yi-VL 2024-02-01 22:44:59 +00:00
Lianmin Zheng
a49dc52bfa release v0.1.10 2024-01-30 15:37:52 +00:00
Keith Stevens
1d0fbe8e43 [Feature] Adds basic support for image content in OpenAI chat routes (#113) 2024-01-30 06:12:33 -08:00
Lianmin Zheng
97aa9b3284 Improve docs & Add JSON decode example (#121) 2024-01-30 05:45:27 -08:00
Lianmin Zheng
0617528632 Update quick start examples (#120) 2024-01-30 04:29:32 -08:00
Lianmin Zheng
4ea92f8307 Format code (#118) 2024-01-29 17:08:12 -08:00
Lianmin Zheng
93414c8238 Add a link to HF paper page 2024-01-24 22:25:33 -08:00
Lianmin Zheng
ed7c7eca0e Update README.md 2024-01-24 16:52:21 -08:00
Lianmin Zheng
9a16fea012 Return logprob for choices (#87) 2024-01-23 05:07:30 -08:00
Lianmin Zheng
9e037c822c Update README.md 2024-01-23 03:43:19 -08:00
0xWe11es.eth
9076386d90 Fix SRT endpoint api json syntax (#84) 2024-01-23 00:25:26 -08:00
Arcmoon
63e97e5e4c Suppport qwen model and solve some problems (#75) 2024-01-22 20:14:51 -08:00
Lianmin Zheng
cd3ccb2ed7 Add a note about triton version for older GPUs (#72) 2024-01-21 16:51:45 -08:00
Lianmin Zheng
007eeb4eb9 Fix the error message and dependency of openai backend (#71) 2024-01-21 14:56:25 -08:00
Ying Sheng
e8f2b155fe Update README.md 2024-01-21 02:45:58 -08:00
Ikko Eltociear Ashimine
0b2efc2adc Update README.md (#58) 2024-01-19 21:00:29 -08:00
Lianmin Zheng
199e82a15d Format code & Improve readme (#52) 2024-01-18 23:51:19 -08:00
Cody Yu
23471f9aa3 Support v1/chat/completions (#50) 2024-01-18 23:43:09 -08:00
Cody Yu
61d4c93962 Support stream=True in v1/completions (#49) 2024-01-18 17:00:56 -08:00
Lianmin Zheng
05b4c398df Document sampling parameters (#45) 2024-01-18 11:49:27 -08:00
Lianmin Zheng
70528762bf update readme 2024-01-17 10:42:55 -08:00
Ying Sheng
71d30d6ddc Update README.md 2024-01-17 09:49:53 -08:00
Lianmin Zheng
bf51ddc6e5 Improve docs & Rename Gemini -> VertexAI (#19) 2024-01-17 02:54:41 -08:00
Lianmin Zheng
c4707f1bb5 Improve docs (#17) 2024-01-16 19:53:55 -08:00
Ying Sheng
ffe4aaee1d Fix for T4 GPUs (#16) 2024-01-16 15:49:03 -08:00
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
Lianmin Zheng
e71d4ab3f9 Update docs (#12) 2024-01-16 06:00:48 -08:00
Lianmin Zheng
fbf42263f1 Update Readme (#11) 2024-01-16 10:48:12 +00:00
Lianmin Zheng
46b7ea7c85 Improve Readme (#10) 2024-01-16 05:53:06 +00:00
Lianmin Zheng
30720e732c Add install with pip (#3) 2024-01-09 12:43:40 -08:00
Lianmin Zheng
93eeb543ba Update readme.md 2024-01-08 21:22:44 +00:00
Liangsheng Yin
ead5b39f82 Add flashinfer && Oultines (#1) 2024-01-08 08:26:18 -08:00
Lianmin Zheng
22085081bb release initial code 2024-01-08 04:37:50 +00:00
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
Co-authored-by: Zhiqiang Xie <xiezhq@stanford.edu>
Co-authored-by: parasol-aser <3848358+parasol-aser@users.noreply.github.com>
Co-authored-by: LiviaSun <33578456+ChuyueSun@users.noreply.github.com>
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
Ying Sheng
f6d40df0ee Initial commit 2023-10-09 15:41:15 -07:00