Byron Hsu
|
6cc9c52521
|
[doc] fix quick start link (#1282)
|
2024-08-31 22:54:34 -07:00 |
|
Lianmin Zheng
|
79ece2c51f
|
Report median instead of mean in bench_latency.py (#1269)
|
2024-08-30 06:05:01 -07:00 |
|
김종곤
|
55f5976b42
|
Update README.md - Supported Models add Exaone 3.0 (#1267)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
|
2024-08-30 18:49:07 +10:00 |
|
Yineng Zhang
|
13ac95b894
|
chore: bump v0.2.14.post2 (#1250)
|
2024-08-28 18:46:33 +00:00 |
|
Lianmin Zheng
|
bf53bf5142
|
[Fix] Fix llava on multi images (#1247)
|
2024-08-28 06:33:05 -07:00 |
|
Yineng Zhang
|
f25f4dfde5
|
hotfix: revert sampler CUDA Graph (#1242)
|
2024-08-28 21:16:47 +10:00 |
|
Lianmin Zheng
|
184ae1c683
|
Update README.md (#1239)
|
2024-08-28 02:15:52 -07:00 |
|
Yineng Zhang
|
198974cd1a
|
feat: support sm75 with FlashInfer v0.1.6 (#1233)
|
2024-08-28 18:39:12 +10:00 |
|
Dr. Artificial曾小健
|
c8a9e79186
|
Fix readme (#1236)
|
2024-08-27 23:51:41 -07:00 |
|
Yineng Zhang
|
c5fe11a8e1
|
chore: bump v0.2.14 (#1155)
|
2024-08-27 00:28:24 +10:00 |
|
Chayenne
|
30b4f771b0
|
Support Alibaba-NLP/gte-Qwen2-7B-instruct embedding Model (#1186)
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
|
2024-08-25 10:29:12 -07:00 |
|
Lianmin Zheng
|
b20daf982a
|
Update README.md (#1198)
|
2024-08-24 14:50:05 -07:00 |
|
Lianmin Zheng
|
f6af3a6561
|
Cleanup readme, llava examples, usage examples and nccl init (#1194)
|
2024-08-24 08:02:23 -07:00 |
|
Kaichen Zhang - NTU
|
a5b14ad043
|
[Feat/WIP] add llava-onevision, with support for (1) siglip encoder, (2) qwen2 decoder (3) openai api compatible server. (#1123)
Co-authored-by: Bo Li <drluodian@gmail.com>
|
2024-08-23 14:11:16 -07:00 |
|
Zhanghao Wu
|
ac1b74fa85
|
[Docs] Fix rendering of details in README (#1179)
|
2024-08-22 07:05:33 +08:00 |
|
Yineng Zhang
|
350a81609b
|
fix: resolve README render (#1166)
|
2024-08-21 03:23:52 +10:00 |
|
Lianmin Zheng
|
a8ae640328
|
Improve docs and warnings (#1164)
|
2024-08-20 08:31:29 -07:00 |
|
Zhanghao Wu
|
d8627ed16d
|
[Docs] Add instruction for running on clouds and kubernetes with SkyPilot (#1144)
Co-authored-by: Zongheng Yang <zongheng.y@gmail.com>
|
2024-08-19 14:01:55 +08:00 |
|
Yineng Zhang
|
5bd953749b
|
chore: bump v0.2.13 (#1111)
|
2024-08-16 03:50:43 +10:00 |
|
Yineng Zhang
|
fe5024325b
|
docs: update README (#1098)
|
2024-08-14 04:40:05 -07:00 |
|
Lucien
|
312e849255
|
Example file for docker compose and k8s (#1006)
|
2024-08-13 15:07:57 -07:00 |
|
Yineng Zhang
|
b0ad0c1bc8
|
chore: bump v0.2.12 (#1048)
|
2024-08-12 20:59:38 +10:00 |
|
Lianmin Zheng
|
41598e0d8e
|
Add longer accuracy test on CI (#1049)
|
2024-08-12 09:21:38 +00:00 |
|
Lianmin Zheng
|
a97df79124
|
Clean up readme and arguments of chunked prefill (#1022)
|
2024-08-11 01:18:52 -07:00 |
|
Lianmin Zheng
|
54fb1c80c0
|
Clean up unit tests (#1020)
|
2024-08-10 15:09:03 -07:00 |
|
liuyhwangyh
|
b91a4cb1b1
|
support models from www.modelscope.cn (#994)
Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
|
2024-08-09 02:52:14 -07:00 |
|
Yineng Zhang
|
dc9d06d886
|
chore: bump v0.2.11 (#970)
|
2024-08-07 20:47:53 +08:00 |
|
Yineng Zhang
|
c31f084c71
|
chore: update vllm to 0.5.4 (#966)
|
2024-08-07 21:15:41 +10:00 |
|
Yineng Zhang
|
fde8340550
|
docs: update README (#935)
|
2024-08-05 20:06:06 +10:00 |
|
Ying Sheng
|
399cad91f3
|
Update README.md (#927)
|
2024-08-04 23:01:35 -07:00 |
|
Ying Sheng
|
3bc99e6fe4
|
Test openai vision api (#925)
|
2024-08-05 13:51:55 +10:00 |
|
Ying Sheng
|
141e8c71a3
|
Bump version to 0.2.10 (#923)
|
2024-08-04 16:52:51 -07:00 |
|
Ying Sheng
|
8c5382e62c
|
Update README.md
|
2024-08-03 12:58:41 -07:00 |
|
Ying Sheng
|
b906c01592
|
Bump version to 0.2.9.post1 (#899)
|
2024-08-02 12:08:00 -07:00 |
|
Ying Sheng
|
30a9b2ef20
|
Bump version to v0.2.9 (#890)
|
2024-08-02 01:45:48 -07:00 |
|
Ying Sheng
|
e4d3333c6c
|
bump to 0.2.8 (#877)
|
2024-08-01 14:18:26 -07:00 |
|
Ikko Eltociear Ashimine
|
7d5ed7c6ee
|
docs: update README.md (#843)
|
2024-07-31 12:48:18 +10:00 |
|
Yineng Zhang
|
1edd4e07d6
|
chore: bump v0.2.7 (#830)
|
2024-07-30 20:41:10 +10:00 |
|
Yineng Zhang
|
bece265f5a
|
docs: update README (#819)
|
2024-07-30 16:17:50 +10:00 |
|
Ying Sheng
|
db6089e6f3
|
Revert "Organize public APIs" (#815)
|
2024-07-29 19:40:28 -07:00 |
|
Liangsheng Yin
|
c8e9fed87a
|
Organize public APIs (#809)
|
2024-07-29 15:34:16 -07:00 |
|
ObjectNotFound
|
8f6274c82b
|
Add role documentation, add system begin & end tokens (#793)
|
2024-07-28 23:02:49 -07:00 |
|
Ying Sheng
|
5bd899243b
|
Update README.md (#792)
|
2024-07-28 21:57:23 -07:00 |
|
Yineng Zhang
|
1f013d64eb
|
docs: make badges center (#789)
|
2024-07-28 22:27:52 +10:00 |
|
Yineng Zhang
|
628e1fa760
|
docs: update README (#788)
|
2024-07-28 22:24:27 +10:00 |
|
Ying Sheng
|
bcb6611a46
|
Update README.md
|
2024-07-28 01:00:06 -07:00 |
|
Yineng Zhang
|
948625799e
|
docs: init readthedocs support (#783)
|
2024-07-28 16:50:31 +10:00 |
|
Lianmin Zheng
|
f64b2a9bc0
|
Add slack invitation link.
|
2024-07-27 06:29:15 -07:00 |
|
Ying Sheng
|
9f95dcc64f
|
Update readme (#769)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
|
2024-07-27 06:12:16 -07:00 |
|
Liangsheng Yin
|
ba29504b21
|
Update supported models (#763)
|
2024-07-27 15:53:53 +10:00 |
|