Commit Graph

69 Commits

Author SHA1 Message Date
Lianmin Zheng
f64eae3a29 [Fix] Reduce memory usage for loading llava model & Remove EntryClassRemapping (#1308) 2024-09-02 21:44:45 -07:00
Yineng Zhang
2561ed012c feat: update nightly gsm8k eval (#1304) 2024-09-03 01:18:41 +10:00
Yineng Zhang
6487ef64c6 ci: add nightly eval (#1291) 2024-09-02 03:19:49 +10:00
Lianmin Zheng
761b2cebd6 [CI] merge all ci tests into one file (#1289) 2024-09-01 02:36:56 -07:00
Lianmin Zheng
1b5d56f7f8 [CI] Add more multi-gpu tests (#1280) 2024-09-01 00:27:25 -07:00
Lianmin Zheng
6c49831394 Add sglang.bench_latency to CI (#1243) 2024-08-28 21:20:54 +10:00
Yineng Zhang
f25f4dfde5 hotfix: revert sampler CUDA Graph (#1242) 2024-08-28 21:16:47 +10:00
Liangsheng Yin
1ece2cda3d Fix bench latency benchmark (#1225) 2024-08-28 00:37:32 -07:00
Mingyi
97589a60a2 [CI] Parallelize unit tests in CI (#1219) 2024-08-26 04:54:02 +00:00
Liangsheng Yin
632d506d0b minor: improve CI and dependencies (#1212) 2024-08-26 04:26:31 +00:00
Lianmin Zheng
d3efcb3930 Update workflow files (#1214) 2024-08-25 17:45:35 -07:00
Lianmin Zheng
61bb223e0f Update CI runner docs (#1213) 2024-08-25 17:31:52 -07:00
Lianmin Zheng
15f1a49d2d Update CI workflows (#1210) 2024-08-25 16:43:07 -07:00
Chayenne
30b4f771b0 Support Alibaba-NLP/gte-Qwen2-7B-instruct embedding Model (#1186)
Co-authored-by: Ying Sheng <sqy1415@gmail.com>
2024-08-25 10:29:12 -07:00
Liangsheng Yin
5d0d40d0eb Fix CI accuracy && time out limit (#1133) 2024-08-16 21:41:11 -07:00
Yineng Zhang
26e9c12c15 ci: compatible with fork repo (#1115) 2024-08-16 04:26:44 +10:00
Lianmin Zheng
e86b1ccbf0 Enable chunked prefill by default (#1040) 2024-08-14 21:56:20 -07:00
Yineng Zhang
f14569f64a ci: remove workflow path trigger (#1096) 2024-08-14 20:36:24 +10:00
Yineng Zhang
c8423ca311 ci: update timeout and retry (#1086)
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
2024-08-14 00:27:35 -07:00
Yineng Zhang
cebd78d83e ci: add accuracy timeout (#1078) 2024-08-13 22:12:58 +10:00
Yineng Zhang
f7fb68d292 ci: add moe test (#1053) 2024-08-13 18:43:23 +10:00
Yineng Zhang
396a13e6ad ci: add cancel pr workflow (#1070) 2024-08-13 18:16:50 +10:00
Lianmin Zheng
c877292cc1 Re-organize CI tests (#1052) 2024-08-12 03:39:01 -07:00
Lianmin Zheng
41598e0d8e Add longer accuracy test on CI (#1049) 2024-08-12 09:21:38 +00:00
Yineng Zhang
cb99ba4fc6 feat: update Dockerfile (#1033)
Co-authored-by: vhain <vhain6512@gmail.com>
2024-08-12 16:24:06 +10:00
Lianmin Zheng
8207637029 Improve end-to-end throughput test and its coverage (#1039) 2024-08-11 18:27:33 -07:00
Lianmin Zheng
54fb1c80c0 Clean up unit tests (#1020) 2024-08-10 15:09:03 -07:00
Yineng Zhang
e712837d38 misc: update test config (#990) 2024-08-11 04:20:30 +10:00
Ying Sheng
e040a2450b Add e5-mistral embedding model - step 3/3 (#988) 2024-08-08 16:31:19 -07:00
Liangsheng Yin
4d929107ae Run purge-cache only in sgl-project (#976) 2024-08-07 13:16:36 -07:00
Liangsheng Yin
fbe0c818c2 Purge self-runner's pip cache weekly (#975) 2024-08-07 12:43:12 -07:00
Yineng Zhang
c31f084c71 chore: update vllm to 0.5.4 (#966) 2024-08-07 21:15:41 +10:00
Liangsheng Yin
bb66cc4c52 Fix CI && python3.8 compatible (#920) 2024-08-04 16:02:05 -07:00
Yineng Zhang
4f00525057 fix: use e2e and unit test only for original repo or pr (#912) 2024-08-04 16:34:50 +10:00
Ying Sheng
995af5a54b Improve the structure of CI (#911) 2024-08-03 23:09:21 -07:00
Ying Sheng
70cc0749ce Add model accuracy test - step 1 (#866) 2024-08-03 18:20:50 -07:00
Liangsheng Yin
947402c829 Reorder CI unit tests. (#908) 2024-08-03 16:18:50 -07:00
Ying Sheng
001b0bdd08 Update the base image of the docker (#900) 2024-08-02 21:54:57 -07:00
Yineng Zhang
046c2b339e chore: add multipart dep for fastapi (#895) 2024-08-03 00:50:19 +10:00
Yineng Zhang
2e218b9e04 fix: set env in runner (#891) 2024-08-02 20:48:56 +10:00
Ying Sheng
4c8093c8db Update workflow name (#883) 2024-08-01 21:29:46 -07:00
Ying Sheng
ae7ee01a8e Add accuracy test to CI: MMLU (#882) 2024-08-01 21:20:17 -07:00
Ying Sheng
76e59088d8 Add more unit tests to CI (#880) 2024-08-01 18:14:33 -07:00
Liangsheng Yin
12ce3befb6 Update runner docs (#879) 2024-08-01 17:37:47 -07:00
Liangsheng Yin
70c78cfb03 Update runner docs (#876) 2024-08-01 15:32:33 -07:00
Ying Sheng
72b6ea88b4 Make scripts under /test/srt as unit tests (#875) 2024-08-01 14:34:55 -07:00
Ying Sheng
6f221d4ca0 Fix unit tests for the frontend language part (#872) 2024-08-01 12:39:12 -07:00
Yineng Zhang
aba6f51f88 misc: update unit test config (#873) 2024-08-02 05:27:05 +10:00
Yineng Zhang
7f6c690b67 misc: use pip cache purge and add unit test ci (#871) 2024-08-02 05:12:20 +10:00
Ying Sheng
40e6f5131a Fix openai CI tests (#870) 2024-08-01 09:39:09 -07:00