Commit Graph

34 Commits

Author SHA1 Message Date
Lianmin Zheng
e0ae5d42ec Update version to 0.1.16 (#438) 2024-05-13 17:29:17 -07:00
Yuanhan Zhang
0992d85f92 support llava video (#426) 2024-05-13 16:57:00 -07:00
Lianmin Zheng
72bb344388 Update version to 0.1.15 (#431) 2024-05-12 14:22:33 -07:00
Lianmin Zheng
2d580e7a89 Fix flashinfer (#430) 2024-05-12 08:18:53 -07:00
Qubitium
33b242df30 Compat with latest VLLM 0.4.2 main + fork.number rename + Flashinfer 0.0.4 (#380)
Co-authored-by: ZX <zx@lbx.dev>
Co-authored-by: ZhouXingg <165115237+ZhouXingg@users.noreply.github.com>
2024-05-11 16:37:49 -07:00
Lianmin Zheng
a511a2d089 restrict vllm version 2024-05-09 15:49:29 -07:00
Jani Monoses
30d17840fc Update dependencies (#326) 2024-03-23 10:15:58 -07:00
Lianmin Zheng
51104cd405 Update version to v0.1.14 (#324) 2024-03-22 13:42:22 -07:00
Jani Monoses
e57f079275 Use Anthropic messages API (#304) 2024-03-22 13:23:31 -07:00
Lianmin Zheng
4aa5dd2c5f Update version to v0.1.13 (#280) 2024-03-11 05:49:27 -07:00
Liangsheng Yin
a7ace9c88d Fix qwen config (#261) 2024-03-10 18:54:18 -07:00
Cody Yu
3c2c5869ad Support outlines > 0.0.31 (#219) 2024-02-24 15:06:17 +08:00
Liangsheng Yin
91e036334f Adjust outlines version. (#200)
Co-authored-by: comaniac <hao.yu.cody@gmail.com>
2024-02-17 13:40:39 +08:00
Cody Yu
2a74748b2f Pin outlines version (#196) 2024-02-16 13:01:40 -08:00
Lianmin Zheng
624b21e742 Update version to 0.1.12 (#178) 2024-02-11 06:43:45 -08:00
Liangsheng Yin
37b42297f8 import outlines (#168) 2024-02-09 10:13:02 +08:00
Ying Sheng
f6bfe3aaff Release 0.1.11 (#134) 2024-02-03 02:50:13 -08:00
Lianmin Zheng
a49dc52bfa release v0.1.10 2024-01-30 15:37:52 +00:00
Lianmin Zheng
97aa9b3284 Improve docs & Add JSON decode example (#121) 2024-01-30 05:45:27 -08:00
Cody Yu
3a581e9949 Dynamic model class loading (#101) 2024-01-25 15:29:07 -08:00
Lianmin Zheng
6dceab4d17 bump version to 0.1.9 2024-01-24 11:37:25 +00:00
Lianmin Zheng
c70b3cfa9e Bump the version to v0.1.8 (#93) 2024-01-24 03:33:34 -08:00
shiyi.c_98
c6576e820c Llava-hd Support (#92)
Co-authored-by: Haotian Liu <liuhaotian.cn@gmail.com>
2024-01-24 01:51:21 -08:00
Lianmin Zheng
007eeb4eb9 Fix the error message and dependency of openai backend (#71) 2024-01-21 14:56:25 -08:00
Lianmin Zheng
723f042163 release v0.1.7 & fix bugs 2024-01-21 10:31:02 +00:00
Lianmin Zheng
cc3ada983f Bump version to 0.1.6 (#68) 2024-01-21 01:45:02 -08:00
Liangsheng Yin
ca13f3b8c5 Disk FSM cache and adjust code. (#63) 2024-01-20 21:26:11 -08:00
Cody Yu
61d4c93962 Support stream=True in v1/completions (#49) 2024-01-18 17:00:56 -08:00
Lianmin Zheng
501f944445 Bump version to 0.1.5 (#33) 2024-01-17 21:14:31 -08:00
Ying Sheng
ffe4aaee1d Fix for T4 GPUs (#16)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
2024-01-16 15:49:03 -08:00
Lianmin Zheng
2ccd9fd8c5 update version to 0.1.3 2024-01-16 05:55:25 +00:00
Lianmin Zheng
4bd8233f2c Fix test cases (#6) 2024-01-15 01:15:53 -08:00
Liangsheng Yin
331848de9d Add SRT json decode example (#2) 2024-01-09 12:35:44 -08:00
Lianmin Zheng
22085081bb release initial code
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
Co-authored-by: Zhiqiang Xie <xiezhq@stanford.edu>
Co-authored-by: parasol-aser <3848358+parasol-aser@users.noreply.github.com>
Co-authored-by: LiviaSun <33578456+ChuyueSun@users.noreply.github.com>
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
2024-01-08 04:37:50 +00:00