sglang

Author	SHA1	Message	Date
Liangsheng Yin	9acc6e3504	add `.isort.cfg` (#378)	2024-04-22 22:38:09 +08:00
Lianmin Zheng	65501a9cf1	Fix commandr import; format code	2024-04-16 18:10:12 +00:00
ZhouXingg	db611066ad	support `command-r` (#369)	2024-04-16 10:36:51 -07:00
Ying Sheng	b0890631a0	fix gemma import error	2024-04-01 07:36:52 +00:00
Liangsheng Yin	2af565b3bb	[model] DBRX-instruct support (#337)	2024-03-28 10:05:19 -07:00
Jani Monoses	b57abe1663	Add StableLM model. (#301)	2024-03-22 13:24:08 -07:00
Lianmin Zheng	faba293a0d	Improve gemma and documentations (#278)	2024-03-11 04:43:39 -07:00
Liangsheng Yin	89885b31ef	Gemma Support (#256)	2024-03-11 12:14:27 +08:00
Geary.Z	64fe311593	replace skip_embed with input_embeds (#222)	2024-03-10 19:04:52 -07:00
Liangsheng Yin	a7ace9c88d	Fix qwen config (#261)	2024-03-10 18:54:18 -07:00
Lianmin Zheng	c51020cf0c	Fix the chat template for llava-v1.6-34b & format code (#177)	2024-02-11 05:50:13 -08:00
Lianmin Zheng	23f05005fd	Format code & move functions (#155)	2024-02-06 13:27:46 -08:00
Arcmoon	3ae78a09b3	Add gptq quantization model support (#141)	2024-02-06 11:35:04 -08:00
Christopher Chou	864425300f	Yi-VL Model (#112)	2024-02-01 08:33:22 -08:00
Lianmin Zheng	ad82bac6f5	Fix model loading & format code (#125)	2024-01-30 23:49:52 -08:00
Lianmin Zheng	0617528632	Update quick start examples (#120)	2024-01-30 04:29:32 -08:00
Lianmin Zheng	4ea92f8307	Format code (#118)	2024-01-29 17:08:12 -08:00
Junyang Lin	6b0af2853c	Add qwen2 (#114)	2024-01-29 17:06:02 -08:00
Lianmin Zheng	6f560c761b	Improve the control of streaming and improve the first token latency in streaming (#117)	2024-01-29 17:05:42 -08:00
Cody Yu	cd6872334e	Fix Mistral model loading (#108) Co-authored-by: johndun <dunavent.jm@gmail.com>	2024-01-26 09:38:43 -08:00
Cody Yu	3a581e9949	Dynamic model class loading (#101)	2024-01-25 15:29:07 -08:00
shiyi.c_98	0147f940dd	fix batch error for llava-hd (#98)	2024-01-25 07:56:25 -08:00
Lianmin Zheng	bef0b35902	Fix llava & Fix multiprocessing	2024-01-24 10:35:31 +00:00
shiyi.c_98	c6576e820c	Llava-hd Support (#92) Co-authored-by: Haotian Liu <liuhaotian.cn@gmail.com>	2024-01-24 01:51:21 -08:00
Lianmin Zheng	94e05770db	Fix after QWen support (#82)	2024-01-22 21:17:05 -08:00
Arcmoon	63e97e5e4c	Suppport qwen model and solve some problems (#75)	2024-01-22 20:14:51 -08:00
isaac-vidas	e08bca2840	Support load fine-tuned LLaVA model (#80)	2024-01-22 18:15:48 -08:00
shiyi.c_98	fd7c479239	Gemini Backend (#9) Co-authored-by: Ying Sheng <sqy1415@gmail.com>	2024-01-16 22:29:37 -08:00
Lianmin Zheng	70359bf31a	Update benchmark scripts (#8)	2024-01-15 16:12:57 -08:00
Lianmin Zheng	22085081bb	release initial code Co-authored-by: Ying Sheng <sqy1415@gmail.com> Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com> Co-authored-by: Zhiqiang Xie <xiezhq@stanford.edu> Co-authored-by: parasol-aser <3848358+parasol-aser@users.noreply.github.com> Co-authored-by: LiviaSun <33578456+ChuyueSun@users.noreply.github.com> Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>	2024-01-08 04:37:50 +00:00

180 Commits