Commit Graph

180 Commits

Author SHA1 Message Date
Liangsheng Yin
9acc6e3504 add .isort.cfg (#378) 2024-04-22 22:38:09 +08:00
Lianmin Zheng
65501a9cf1 Fix commandr import; format code 2024-04-16 18:10:12 +00:00
ZhouXingg
db611066ad support command-r (#369) 2024-04-16 10:36:51 -07:00
Ying Sheng
b0890631a0 fix gemma import error 2024-04-01 07:36:52 +00:00
Liangsheng Yin
2af565b3bb [model] DBRX-instruct support (#337) 2024-03-28 10:05:19 -07:00
Jani Monoses
b57abe1663 Add StableLM model. (#301) 2024-03-22 13:24:08 -07:00
Lianmin Zheng
faba293a0d Improve gemma and documentations (#278) 2024-03-11 04:43:39 -07:00
Liangsheng Yin
89885b31ef Gemma Support (#256) 2024-03-11 12:14:27 +08:00
Geary.Z
64fe311593 replace skip_embed with input_embeds (#222) 2024-03-10 19:04:52 -07:00
Liangsheng Yin
a7ace9c88d Fix qwen config (#261) 2024-03-10 18:54:18 -07:00
Lianmin Zheng
c51020cf0c Fix the chat template for llava-v1.6-34b & format code (#177) 2024-02-11 05:50:13 -08:00
Lianmin Zheng
23f05005fd Format code & move functions (#155) 2024-02-06 13:27:46 -08:00
Arcmoon
3ae78a09b3 Add gptq quantization model support (#141) 2024-02-06 11:35:04 -08:00
Christopher Chou
864425300f Yi-VL Model (#112) 2024-02-01 08:33:22 -08:00
Lianmin Zheng
ad82bac6f5 Fix model loading & format code (#125) 2024-01-30 23:49:52 -08:00
Lianmin Zheng
0617528632 Update quick start examples (#120) 2024-01-30 04:29:32 -08:00
Lianmin Zheng
4ea92f8307 Format code (#118) 2024-01-29 17:08:12 -08:00
Junyang Lin
6b0af2853c Add qwen2 (#114) 2024-01-29 17:06:02 -08:00
Lianmin Zheng
6f560c761b Improve the control of streaming and improve the first token latency in streaming (#117) 2024-01-29 17:05:42 -08:00
Cody Yu
cd6872334e Fix Mistral model loading (#108)
Co-authored-by: johndun <dunavent.jm@gmail.com>
2024-01-26 09:38:43 -08:00
Cody Yu
3a581e9949 Dynamic model class loading (#101) 2024-01-25 15:29:07 -08:00
shiyi.c_98
0147f940dd fix batch error for llava-hd (#98) 2024-01-25 07:56:25 -08:00
Lianmin Zheng
bef0b35902 Fix llava & Fix multiprocessing 2024-01-24 10:35:31 +00:00
shiyi.c_98
c6576e820c Llava-hd Support (#92)
Co-authored-by: Haotian Liu <liuhaotian.cn@gmail.com>
2024-01-24 01:51:21 -08:00
Lianmin Zheng
94e05770db Fix after QWen support (#82) 2024-01-22 21:17:05 -08:00
Arcmoon
63e97e5e4c Suppport qwen model and solve some problems (#75) 2024-01-22 20:14:51 -08:00
isaac-vidas
e08bca2840 Support load fine-tuned LLaVA model (#80) 2024-01-22 18:15:48 -08:00
shiyi.c_98
fd7c479239 Gemini Backend (#9)
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
2024-01-16 22:29:37 -08:00
Lianmin Zheng
70359bf31a Update benchmark scripts (#8) 2024-01-15 16:12:57 -08:00
Lianmin Zheng
22085081bb release initial code
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
Co-authored-by: Zhiqiang Xie <xiezhq@stanford.edu>
Co-authored-by: parasol-aser <3848358+parasol-aser@users.noreply.github.com>
Co-authored-by: LiviaSun <33578456+ChuyueSun@users.noreply.github.com>
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
2024-01-08 04:37:50 +00:00