Commit Graph

122 Commits

Author SHA1 Message Date
Yineng Zhang
350a81609b fix: resolve README render (#1166) 2024-08-21 03:23:52 +10:00
Lianmin Zheng
a8ae640328 Improve docs and warnings (#1164) 2024-08-20 08:31:29 -07:00
Zhanghao Wu
d8627ed16d [Docs] Add instruction for running on clouds and kubernetes with SkyPilot (#1144)
Co-authored-by: Zongheng Yang <zongheng.y@gmail.com>
2024-08-19 14:01:55 +08:00
Yineng Zhang
5bd953749b chore: bump v0.2.13 (#1111) 2024-08-16 03:50:43 +10:00
Yineng Zhang
fe5024325b docs: update README (#1098) 2024-08-14 04:40:05 -07:00
Lucien
312e849255 Example file for docker compose and k8s (#1006) 2024-08-13 15:07:57 -07:00
Yineng Zhang
b0ad0c1bc8 chore: bump v0.2.12 (#1048) 2024-08-12 20:59:38 +10:00
Lianmin Zheng
41598e0d8e Add longer accuracy test on CI (#1049) 2024-08-12 09:21:38 +00:00
Lianmin Zheng
a97df79124 Clean up readme and arguments of chunked prefill (#1022) 2024-08-11 01:18:52 -07:00
Lianmin Zheng
54fb1c80c0 Clean up unit tests (#1020) 2024-08-10 15:09:03 -07:00
liuyhwangyh
b91a4cb1b1 support models from www.modelscope.cn (#994)
Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
2024-08-09 02:52:14 -07:00
Yineng Zhang
dc9d06d886 chore: bump v0.2.11 (#970) 2024-08-07 20:47:53 +08:00
Yineng Zhang
c31f084c71 chore: update vllm to 0.5.4 (#966) 2024-08-07 21:15:41 +10:00
Yineng Zhang
fde8340550 docs: update README (#935) 2024-08-05 20:06:06 +10:00
Ying Sheng
399cad91f3 Update README.md (#927) 2024-08-04 23:01:35 -07:00
Ying Sheng
3bc99e6fe4 Test openai vision api (#925) 2024-08-05 13:51:55 +10:00
Ying Sheng
141e8c71a3 Bump version to 0.2.10 (#923) 2024-08-04 16:52:51 -07:00
Ying Sheng
8c5382e62c Update README.md 2024-08-03 12:58:41 -07:00
Ying Sheng
b906c01592 Bump version to 0.2.9.post1 (#899) 2024-08-02 12:08:00 -07:00
Ying Sheng
30a9b2ef20 Bump version to v0.2.9 (#890) 2024-08-02 01:45:48 -07:00
Ying Sheng
e4d3333c6c bump to 0.2.8 (#877) 2024-08-01 14:18:26 -07:00
Ikko Eltociear Ashimine
7d5ed7c6ee docs: update README.md (#843) 2024-07-31 12:48:18 +10:00
Yineng Zhang
1edd4e07d6 chore: bump v0.2.7 (#830) 2024-07-30 20:41:10 +10:00
Yineng Zhang
bece265f5a docs: update README (#819) 2024-07-30 16:17:50 +10:00
Ying Sheng
db6089e6f3 Revert "Organize public APIs" (#815) 2024-07-29 19:40:28 -07:00
Liangsheng Yin
c8e9fed87a Organize public APIs (#809) 2024-07-29 15:34:16 -07:00
ObjectNotFound
8f6274c82b Add role documentation, add system begin & end tokens (#793) 2024-07-28 23:02:49 -07:00
Ying Sheng
5bd899243b Update README.md (#792) 2024-07-28 21:57:23 -07:00
Yineng Zhang
1f013d64eb docs: make badges center (#789) 2024-07-28 22:27:52 +10:00
Yineng Zhang
628e1fa760 docs: update README (#788) 2024-07-28 22:24:27 +10:00
Ying Sheng
bcb6611a46 Update README.md 2024-07-28 01:00:06 -07:00
Yineng Zhang
948625799e docs: init readthedocs support (#783) 2024-07-28 16:50:31 +10:00
Lianmin Zheng
f64b2a9bc0 Add slack invitation link. 2024-07-27 06:29:15 -07:00
Ying Sheng
9f95dcc64f Update readme (#769)
Co-authored-by: Mingyi <wisclmy0611@gmail.com>
2024-07-27 06:12:16 -07:00
Liangsheng Yin
ba29504b21 Update supported models (#763) 2024-07-27 15:53:53 +10:00
Yineng Zhang
3e455b016e misc: replace deprecated variable HUGGING_FACE_HUB_TOKEN with HF_TOKEN (#752) 2024-07-27 04:19:30 +10:00
Yineng Zhang
01fbb11bb7 docs: fix typo (#742) 2024-07-26 21:05:53 +10:00
Yineng Zhang
05d216da32 docs: add llama 3.1 405b instruction (#739)
Co-authored-by: Ying1123 <sqy1415@gmail.com>
2024-07-26 21:03:20 +10:00
Ying Sheng
7f6f2f0f09 Update readme (#731) 2024-07-25 09:16:10 -07:00
Ying Sheng
7802df1e2b Update readme 2024-07-25 08:45:06 -07:00
Ying Sheng
8fbba3de3d Fix bugs (fp8 checkpoints, triton cache manager) (#729) 2024-07-25 07:42:00 -07:00
Yineng Zhang
926ac01b64 fix: resolve the logo display issue on the PyPI page (#726) 2024-07-25 20:47:46 +10:00
Yineng Zhang
fded67441d misc: update bulid instruction (#724) 2024-07-25 17:08:11 +10:00
Yineng Zhang
d5146baec9 docs: update supported models (#719) 2024-07-25 09:34:01 +10:00
Ying Sheng
08a3bd19cc docs: update doc (#716) 2024-07-24 20:44:03 +00:00
zhyncs
7b597475f2 docs: update README (#692) 2024-07-22 03:41:20 +10:00
Lianmin Zheng
5a4ef2b5c8 update readme 2024-07-21 02:58:57 -07:00
zhyncs
9dab947d56 docs: update README (#688) 2024-07-21 18:32:58 +10:00
Lianmin Zheng
33ee97b0bf Allow disabling streaming in bench (#687) 2024-07-21 01:12:34 -07:00
Lianmin Zheng
77e592e8e0 support non-streaming benchmark (#682) 2024-07-20 18:36:42 -07:00