Zhang, Liangang
|
5d638c92f5
|
[Feature, Hardware] Enable SGLang on XPU GPUs via PyTorch (#1480)
|
2024-10-12 18:10:32 +00:00 |
|
Lianmin Zheng
|
00c7e6368b
|
Release v0.3.3.post1 (#1636)
|
2024-10-11 07:56:16 -07:00 |
|
Lianmin Zheng
|
7b69d91b4f
|
Release v0.3.3 (#1605)
|
2024-10-08 12:58:41 -07:00 |
|
Kylin
|
f42e9bfb52
|
[bugfix] Add modelscope package to avoid docker image without modelscope (#1520)
|
2024-09-28 12:43:22 -07:00 |
|
Ying Sheng
|
37c5899fc2
|
Release v0.3.2 (#1512)
|
2024-09-25 14:17:09 +08:00 |
|
Yineng Zhang
|
82136eb0b5
|
chore: bump v0.3.1.post3 (#1483)
|
2024-09-21 11:17:45 +08:00 |
|
Lianmin Zheng
|
5ce55aee15
|
Release v0.3.1.post2 (#1470)
|
2024-09-19 02:03:38 -07:00 |
|
Lianmin Zheng
|
90a26be31c
|
Release 0.3.1.post1 (#1445)
|
2024-09-17 01:47:31 -07:00 |
|
Lianmin Zheng
|
e79f6cd73d
|
Release v0.3.1 (#1430)
|
2024-09-15 23:03:16 +09:00 |
|
Ying Sheng
|
712216928f
|
[Feature] Initial support for multi-LoRA serving (#1307)
|
2024-09-12 16:46:14 -07:00 |
|
Jerry Zhang
|
a7c47e0f02
|
Add torchao quant (int4/int8/fp8) to llama models (#1341)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-09-09 05:32:41 -07:00 |
|
Yineng Zhang
|
a63c8275c6
|
chore: bump v0.3.0 (#1320)
|
2024-09-04 04:32:18 +08:00 |
|
Lianmin Zheng
|
9999442756
|
Release v0.2.15 (#1295)
|
2024-09-01 22:22:38 -07:00 |
|
Yineng Zhang
|
13ac95b894
|
chore: bump v0.2.14.post2 (#1250)
|
2024-08-28 18:46:33 +00:00 |
|
Yineng Zhang
|
f25f4dfde5
|
hotfix: revert sampler CUDA Graph (#1242)
|
2024-08-28 21:16:47 +10:00 |
|
Yineng Zhang
|
c5fe11a8e1
|
chore: bump v0.2.14 (#1155)
|
2024-08-27 00:28:24 +10:00 |
|
Liangsheng Yin
|
632d506d0b
|
minor: improve CI and dependencies (#1212)
|
2024-08-26 04:26:31 +00:00 |
|
Kaichen Zhang - NTU
|
a5b14ad043
|
[Feat/WIP] add llava-onevision, with support for (1) siglip encoder, (2) qwen2 decoder (3) openai api compatible server. (#1123)
Co-authored-by: Bo Li <drluodian@gmail.com>
|
2024-08-23 14:11:16 -07:00 |
|
Yineng Zhang
|
5bd953749b
|
chore: bump v0.2.13 (#1111)
|
2024-08-16 03:50:43 +10:00 |
|
Yineng Zhang
|
b0ad0c1bc8
|
chore: bump v0.2.12 (#1048)
|
2024-08-12 20:59:38 +10:00 |
|
Yineng Zhang
|
dc9d06d886
|
chore: bump v0.2.11 (#970)
|
2024-08-07 20:47:53 +08:00 |
|
Yineng Zhang
|
c31f084c71
|
chore: update vllm to 0.5.4 (#966)
|
2024-08-07 21:15:41 +10:00 |
|
min-xu-et
|
ebf69964cd
|
latency test enhancement - final part (#921)
|
2024-08-04 18:15:23 -07:00 |
|
Ying Sheng
|
141e8c71a3
|
Bump version to 0.2.10 (#923)
|
2024-08-04 16:52:51 -07:00 |
|
Ying Sheng
|
995af5a54b
|
Improve the structure of CI (#911)
|
2024-08-03 23:09:21 -07:00 |
|
min-xu-et
|
539856455d
|
latency test enhancement - part 1 (#909)
|
2024-08-03 22:44:58 -07:00 |
|
Ying Sheng
|
b906c01592
|
Bump version to 0.2.9.post1 (#899)
|
2024-08-02 12:08:00 -07:00 |
|
Yineng Zhang
|
046c2b339e
|
chore: add multipart dep for fastapi (#895)
|
2024-08-03 00:50:19 +10:00 |
|
Ying Sheng
|
30a9b2ef20
|
Bump version to v0.2.9 (#890)
|
2024-08-02 01:45:48 -07:00 |
|
Ying Sheng
|
e4d3333c6c
|
bump to 0.2.8 (#877)
|
2024-08-01 14:18:26 -07:00 |
|
Yineng Zhang
|
1edd4e07d6
|
chore: bump v0.2.7 (#830)
|
2024-07-30 20:41:10 +10:00 |
|
Lianmin Zheng
|
bc1154c399
|
Bump version to 0.2.6 (#779)
|
2024-07-27 20:29:33 -07:00 |
|
Yineng Zhang
|
5bd06b4599
|
fix: use REPO_TOKEN (#755)
|
2024-07-27 05:56:30 +10:00 |
|
Yineng Zhang
|
9a61182732
|
fix: add release tag workflow (#754)
|
2024-07-27 05:48:38 +10:00 |
|
Yineng Zhang
|
eeb2482186
|
feat: add release tag workflow (#753)
|
2024-07-27 05:37:02 +10:00 |
|
Yineng Zhang
|
8628ab9c8b
|
feat: add docker workflow (#751)
|
2024-07-27 03:54:51 +10:00 |
|
Yineng Zhang
|
1b77670f39
|
chore: bump v0.2.1 (#740)
|
2024-07-26 21:27:41 +10:00 |
|
Ying Sheng
|
1a491d00cb
|
Bump version to 0.2.0 (#730)
|
2024-07-25 08:03:36 -07:00 |
|
Yineng Zhang
|
926ac01b64
|
fix: resolve the logo display issue on the PyPI page (#726)
|
2024-07-25 20:47:46 +10:00 |
|
Yineng Zhang
|
25c881a005
|
chore: bump v0.1.25 (#725)
|
2024-07-25 20:04:35 +10:00 |
|
Ying Sheng
|
459abad261
|
Bump version to 0.1.24 (#718)
|
2024-07-24 15:55:01 -07:00 |
|
Ying Sheng
|
9f94728f5a
|
bump version to 0.1.23 (#706)
|
2024-07-23 13:53:19 -07:00 |
|
Ying Sheng
|
444a02441a
|
Update vllm version to support llama3.1 (#705)
|
2024-07-23 13:49:34 -07:00 |
|
Ying Sheng
|
2b4c646277
|
Update version to 0.1.22 (#677)
|
2024-07-20 03:39:50 -07:00 |
|
zhyncs
|
dc4e4a6acc
|
misc: update SGLang package description (#659)
|
2024-07-19 09:27:39 -07:00 |
|
Mingyi
|
d774acad5c
|
Remove the dependency of rpyc (#646)
|
2024-07-18 02:13:54 -07:00 |
|
Ying Sheng
|
56f5fc4ab5
|
Bump version to 0.1.21 (#626)
|
2024-07-15 13:10:53 -07:00 |
|
Lianmin Zheng
|
5d264a90ac
|
Bump version to 0.1.20 (#618)
|
2024-07-13 17:27:55 -07:00 |
|
Lianmin Zheng
|
ad872feb14
|
bump version to 0.1.19
|
2024-07-09 02:23:14 -07:00 |
|
Tommy Yang
|
b38687226a
|
Make sglang compat with vllm 0.5.1 (#598)
|
2024-07-08 23:44:22 -07:00 |
|