Lianmin Zheng
|
63a395b985
|
Update nightly-eval.yml (#2100)
|
2024-11-19 22:15:02 -08:00 |
|
Lianmin Zheng
|
c1f401fc58
|
Revert "chore: update torch v2.5.1" (#2063)
|
2024-11-17 15:29:38 -08:00 |
|
Yineng Zhang
|
3b878863f7
|
chore: update torch v2.5.1 (#1849)
|
2024-11-18 00:06:00 +08:00 |
|
Ke Bao
|
976bc302e5
|
Support DP MLA (#1970)
|
2024-11-16 09:01:43 +00:00 |
|
Lianmin Zheng
|
befc6beb86
|
Fix a typo in io_struct.py (#2008)
|
2024-11-11 16:34:10 -08:00 |
|
Byron Hsu
|
3e33574374
|
run rust test on ubuntu instead of 1-gpu-runner (#2003)
|
2024-11-11 14:46:08 -08:00 |
|
Byron Hsu
|
e728258d34
|
release router from py38 to py312 (#2002)
|
2024-11-11 14:30:25 -08:00 |
|
Byron Hsu
|
239eafbd2e
|
Fix rust unit test and pypi token (#2001)
|
2024-11-11 14:18:21 -08:00 |
|
Byron Hsu
|
00ffde206f
|
setup router python binding ci (#1999)
|
2024-11-11 12:19:32 -08:00 |
|
Lianmin Zheng
|
3d043319aa
|
[CI] Balance unit tests (#1988)
|
2024-11-10 11:45:01 -08:00 |
|
Lianmin Zheng
|
520f0094e4
|
[CI] balance unit tests (#1977)
|
2024-11-09 16:46:14 -08:00 |
|
Lianmin Zheng
|
a1f32867ca
|
Update pr-test-rust.yml to add a "finish" step (#1975)
|
2024-11-09 13:53:35 -08:00 |
|
HAI
|
f9a377f650
|
[Release, ROCm] release ROCm docker build for AMD MI GPUs (#1957)
|
2024-11-08 00:14:15 -08:00 |
|
Lianmin Zheng
|
691808d587
|
Add a timeout for execute-notebook.yml (#1951)
|
2024-11-08 10:28:29 +08:00 |
|
Xuehai Pan
|
a5e0defb5a
|
minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926)
|
2024-11-06 13:46:04 +00:00 |
|
Byron Hsu
|
96766101b4
|
[rust] refactor server and router (#1922)
|
2024-11-06 00:02:02 -08:00 |
|
Byron Hsu
|
530ff541cf
|
[router] Impl radix tree and set up CI (#1893)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-11-04 10:56:52 -08:00 |
|
Chayenne
|
704f8e8ed1
|
Add Reward API Docs etc (#1910)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
|
2024-11-03 22:33:03 -08:00 |
|
Chayenne
|
f4cd804073
|
Fix ci and link error (#1892)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
|
2024-11-02 19:08:49 -07:00 |
|
Chayenne
|
3b60558dd7
|
Native api (#1886)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
|
2024-11-02 01:02:17 -07:00 |
|
Lianmin Zheng
|
2565cb0f40
|
Update docs and workflow (#1881)
|
2024-11-01 20:29:41 -07:00 |
|
Yineng Zhang
|
104bf2609b
|
minor: update nightly eval (#1867)
|
2024-11-01 21:38:29 +08:00 |
|
Yineng Zhang
|
d86a2d6562
|
minor: add human eval (#1754)
|
2024-11-01 14:29:20 +08:00 |
|
Chayenne
|
61cf00e112
|
change file tree (#1859)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
|
2024-10-31 20:10:16 -07:00 |
|
Liangsheng Yin
|
b9fd178f1b
|
Fix retraction + overlap (#1860)
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
|
2024-10-31 18:27:42 -07:00 |
|
Lianmin Zheng
|
a2e0424abf
|
Fix memory leak for chunked prefill 2 (#1858)
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
|
2024-10-31 14:51:51 -07:00 |
|
Chayenne
|
6e13b650a9
|
Fix docs deploy ci (#1821)
|
2024-10-27 21:03:41 -07:00 |
|
Chayenne
|
51c81e339b
|
Add openAI compatible API (#1810)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
|
2024-10-27 10:51:42 -07:00 |
|
Chayenne
|
9d6fb08457
|
Fix docs ci (#1808)
|
2024-10-26 11:23:51 -07:00 |
|
Chayenne
|
ced362f7c6
|
Simplify our docs with complicated functions into utils (#1807)
Co-authored-by: Chayenne <zhaochenyang@ucla.edu>
|
2024-10-26 17:44:11 +00:00 |
|
Lianmin Zheng
|
9084a86445
|
Update links (#1805)
|
2024-10-26 04:46:01 -07:00 |
|
Lianmin Zheng
|
6aa94b967c
|
Update ci workflows (#1804)
|
2024-10-26 04:32:36 -07:00 |
|
Chayenne
|
715b16c140
|
Add support for ipynb (#1786)
|
2024-10-25 20:48:35 -07:00 |
|
Lianmin Zheng
|
1701b0db31
|
Enhance the test case for chunked prefill (#1785)
|
2024-10-24 21:23:09 -07:00 |
|
Yineng Zhang
|
cbbc82b7b8
|
Support qwen2 vl model (#1721)
Co-authored-by: yizhang2077 <1109276519@qq.com>
Co-authored-by: ispobock <ISPObaoke@163.com>
|
2024-10-19 21:44:38 -07:00 |
|
Lianmin Zheng
|
b6cd903604
|
Update readme and workflow (#1716)
|
2024-10-19 13:01:44 -07:00 |
|
Lianmin Zheng
|
d17d19e5b8
|
Fix mixed batch for multi modal models (#1702)
|
2024-10-17 10:27:26 -07:00 |
|
Lianmin Zheng
|
02f7f3e488
|
Update the transformers version in CI (#1690)
|
2024-10-16 19:03:55 -07:00 |
|
Lianmin Zheng
|
6790240cc3
|
Fix unit test order to balance the tasks in CI (#1665)
|
2024-10-14 02:01:44 -07:00 |
|
Lianmin Zheng
|
69aa937aa5
|
Fix unit tests and type annotations (#1648)
|
2024-10-12 14:49:24 -07:00 |
|
Lianmin Zheng
|
00c7e6368b
|
Release v0.3.3.post1 (#1636)
|
2024-10-11 07:56:16 -07:00 |
|
Lianmin Zheng
|
23cc66f7b6
|
Add back data parallelism (#1635)
|
2024-10-11 07:22:48 -07:00 |
|
Ying Sheng
|
04b262cd91
|
[Fix] Fix major performance bug in certain cases (#1563)
Co-authored-by: hnyls2002 <hnyls2002@gmail.com>
|
2024-10-04 08:51:11 +00:00 |
|
Lianmin Zheng
|
048685430d
|
Improve process creation (#1534)
|
2024-09-29 02:36:12 -07:00 |
|
Ying Sheng
|
9aa6553d2a
|
[Feature] Support reward model LxzGordon/URM-LLaMa-3.1-8B (#1525)
|
2024-09-27 23:32:11 -07:00 |
|
Lianmin Zheng
|
bc068e9618
|
[CI] Move AMD test to a separate file (#1500)
|
2024-09-24 02:06:28 -07:00 |
|
Yineng Zhang
|
42a2d82ba7
|
minor: add mla fp8 test (#1494)
|
2024-09-23 20:40:17 +08:00 |
|
Ying Sheng
|
6f3cf1297e
|
[CI, AMD] Add AMD tests to CI (#1491)
|
2024-09-22 04:45:10 -07:00 |
|
Lianmin Zheng
|
13f1357ef0
|
Add a unit test for data parallelism (#1489)
|
2024-09-22 02:21:05 -07:00 |
|
Ke Bao
|
b8ccaf4d73
|
Add MLA gsm8k eval (#1484)
|
2024-09-21 11:16:13 +08:00 |
|