Lianmin Zheng
|
501f944445
|
Bump version to 0.1.5 (#33)
|
2024-01-17 21:14:31 -08:00 |
|
Lianmin Zheng
|
22ec7bc2a1
|
Expose more arguments to control the scheduling policy (#32)
|
2024-01-17 18:37:02 -08:00 |
|
Christopher Chou
|
c0454b323c
|
Add option to return metadata in async streaming (#18)
|
2024-01-17 18:15:02 -08:00 |
|
Lianmin Zheng
|
8024fc5eec
|
Fix streaming (#30)
|
2024-01-17 16:38:20 -08:00 |
|
Lianmin Zheng
|
70528762bf
|
update readme
|
2024-01-17 10:42:55 -08:00 |
|
Ying Sheng
|
71d30d6ddc
|
Update README.md
|
2024-01-17 09:49:53 -08:00 |
|
Lianmin Zheng
|
f9d723816a
|
Teak mem fraction (#20)
|
2024-01-17 04:43:17 -08:00 |
|
Lianmin Zheng
|
bf51ddc6e5
|
Improve docs & Rename Gemini -> VertexAI (#19)
|
2024-01-17 02:54:41 -08:00 |
|
shiyi.c_98
|
fd7c479239
|
Gemini Backend (#9)
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
|
2024-01-16 22:29:37 -08:00 |
|
Lianmin Zheng
|
c4707f1bb5
|
Improve docs (#17)
|
2024-01-16 19:53:55 -08:00 |
|
Ying Sheng
|
ffe4aaee1d
|
Fix for T4 GPUs (#16)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-01-16 15:49:03 -08:00 |
|
Christopher Chou
|
5b27a1dce4
|
Rename image_url to image_file (#15)
|
2024-01-16 15:41:30 -08:00 |
|
Lianmin Zheng
|
e71d4ab3f9
|
Update docs (#12)
|
2024-01-16 06:00:48 -08:00 |
|
Lianmin Zheng
|
fbf42263f1
|
Update Readme (#11)
|
2024-01-16 10:48:12 +00:00 |
|
Lianmin Zheng
|
2ccd9fd8c5
|
update version to 0.1.3
|
2024-01-16 05:55:25 +00:00 |
|
Lianmin Zheng
|
46b7ea7c85
|
Improve Readme (#10)
|
2024-01-16 05:53:06 +00:00 |
|
Lianmin Zheng
|
70359bf31a
|
Update benchmark scripts (#8)
|
2024-01-15 16:12:57 -08:00 |
|
Liangsheng Yin
|
01ca82d765
|
fix radix cache match (#7)
|
2024-01-15 09:42:46 -08:00 |
|
Lianmin Zheng
|
4bd8233f2c
|
Fix test cases (#6)
|
2024-01-15 01:15:53 -08:00 |
|
Liangsheng Yin
|
08ab2a1655
|
Json Decode && Mutl-Turns (#4)
|
2024-01-15 00:49:29 -08:00 |
|
hnyls2002
|
f652494df1
|
fix typo
|
2024-01-10 04:21:17 +00:00 |
|
Lianmin Zheng
|
30720e732c
|
Add install with pip (#3)
|
2024-01-09 12:43:40 -08:00 |
|
Liangsheng Yin
|
331848de9d
|
Add SRT json decode example (#2)
|
2024-01-09 12:35:44 -08:00 |
|
Lianmin Zheng
|
93eeb543ba
|
Update readme.md
|
2024-01-08 21:22:44 +00:00 |
|
Liangsheng Yin
|
ead5b39f82
|
Add flashinfer && Oultines (#1)
|
2024-01-08 08:26:18 -08:00 |
|
Lianmin Zheng
|
22085081bb
|
release initial code
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
Co-authored-by: Zhiqiang Xie <xiezhq@stanford.edu>
Co-authored-by: parasol-aser <3848358+parasol-aser@users.noreply.github.com>
Co-authored-by: LiviaSun <33578456+ChuyueSun@users.noreply.github.com>
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
|
2024-01-08 04:37:50 +00:00 |
|
Ying Sheng
|
f6d40df0ee
|
Initial commit
|
2023-10-09 15:41:15 -07:00 |
|