Commit Graph

27 Commits

Author SHA1 Message Date
Lianmin Zheng
8496701934 [Misc] Fix metrics, weight update lock, request logging (#2543) 2024-12-22 06:27:22 -08:00
Lianmin Zheng
21e9e63ad5 Print progress bar during cuda graph capture (#2502) 2024-12-17 06:33:46 -08:00
Fred Reiss
993956c6b1 Add support for IBM Granite 3.x models (#2437) 2024-12-11 06:30:23 -08:00
SangBin Cho
1f09e84b9a nit: Remove busy waiting on scheduler (#2382) 2024-12-08 01:06:15 -08:00
Lianmin Zheng
0e7409adb6 Fix the overlap for xgrammar (#2377) 2024-12-06 05:49:29 -08:00
vchzls
3cde5eb629 docs: Improve instructions for supporting new models (#2363)
Co-authored-by: zhaohoulong <zhaohoulong@xiaomi.com>
2024-12-06 04:27:17 -08:00
bjmsong
91e5dbf554 add profile in offline benchmark & update doc (#2123)
Co-authored-by: root <bjmsong@126.com>
2024-11-27 14:57:13 -08:00
Rin Intachuen
1aea19f64b Input_embeds support (#2052) 2024-11-25 16:35:04 -08:00
Lianmin Zheng
c211e7b669 Simplify batch update (#2154) 2024-11-24 04:47:10 -08:00
Lianmin Zheng
dfec7fca06 Rename sglang.bench_latency to sglang.bench_one_batch (#2118) 2024-11-21 20:07:48 -08:00
Tanjiro
8c280cee55 add phi-3 small support (#2062)
Co-authored-by: Tushar Goel <114812108+AI-Tushar@users.noreply.github.com>
2024-11-17 18:47:43 -08:00
Xiaoyu Zhang
023d0a73df fix small typos in docs (#2047) 2024-11-15 11:09:10 -08:00
ws
29ebe3dff4 fix: align enable_overlap_scheduler naming between code and docs (#2038) 2024-11-15 03:39:10 -08:00
RangiLyu
f18b9c7252 support internlm2-reward (#1994) 2024-11-11 15:09:58 -08:00
aqweteddy
f16eb15d0d Gemma2 reward model support (#1954) 2024-11-07 22:42:27 -08:00
Yudi Xue
5bc2508b80 Monitoring documentation (#1933) 2024-11-07 22:14:16 -08:00
Lianmin Zheng
1ae270c5d0 [Doc] fix docs (#1949) 2024-11-07 18:20:41 -08:00
Xuehai Pan
a5e0defb5a minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926) 2024-11-06 13:46:04 +00:00
Lianmin Zheng
f5113e50ae [Doc] improve relative links and structure (#1924) 2024-11-05 01:12:10 -08:00
Lianmin Zheng
1853c3523b Fix regex docs (#1909) 2024-11-03 14:18:16 -08:00
Lianmin Zheng
838dcda162 Simplify tokenizer manager (#1899) 2024-11-03 03:52:38 -08:00
Lianmin Zheng
be7986e005 Fix docs (#1890) 2024-11-02 13:26:32 -07:00
Lianmin Zheng
7b394e5f2b Fix docs (#1889) 2024-11-02 11:46:00 -07:00
Lianmin Zheng
2134f0898c Fix links in the docs (#1878) 2024-11-01 18:25:55 -07:00
Lianmin Zheng
a54f278d44 Add a FAQ documentation (#1877) 2024-11-01 18:16:29 -07:00
Lianmin Zheng
d1b31b0684 Improve docs and fix the broken links (#1875) 2024-11-01 17:47:44 -07:00
Chayenne
61cf00e112 change file tree (#1859)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-10-31 20:10:16 -07:00