Commit Graph

46 Commits

Author SHA1 Message Date
Chayenne
0ffcfdf474 Docs: Only use X-Grammar in structed output (#2991) 2025-01-19 20:22:47 -08:00
Enrique Shockwave
3bcf5ecea7 support regex in xgrammar backend (#2983) 2025-01-20 04:34:41 +08:00
Lianmin Zheng
8b6ce52e92 Support multi-node DP attention (#2925)
Co-authored-by: dhou-xai <dhou@x.ai>
2025-01-16 11:15:00 -08:00
Shi Shuai
c4f9707e16 Improve: Token-In Token-Out Usage for RLHF (#2843) 2025-01-11 15:14:26 -08:00
Chayenne
2e6346fc2e Docs:Update the style of llma 3.1 405B docs (#2789) 2025-01-08 01:07:54 -08:00
mlmz
977f785dad Docs: Rewrite docs for LLama 405B and ModelSpace (#2773)
Co-authored-by: Chayenne <zhaochen20@outlook.com>
2025-01-08 00:02:59 -08:00
Shi Shuai
062c48d2bd [Docs] Add Support for Pydantic Structured Output Format (#2697) 2025-01-01 15:08:43 -08:00
Shi Shuai
0a765bbccc Docs: Refactor Contribution Guide (#2690) 2024-12-31 14:11:00 -08:00
Lianmin Zheng
bdd2827a80 Update structured_outputs.ipynb (#2666) 2024-12-30 00:46:41 -08:00
Lianmin Zheng
8c3b420eec [Docs] clean up structured outputs docs (#2654) 2024-12-29 23:57:16 -08:00
Adarsh Shirawalmath
fd34f2da35 [Docs] Add EBNF to sampling params docs (#2609) 2024-12-29 00:05:00 -08:00
Tanjiro
8ee9a8501a [Feature] Function Calling (#2544)
Co-authored-by: Haoyu Wang <120358163+HaoyuWang4188@users.noreply.github.com>
2024-12-28 21:58:52 -08:00
Shi Shuai
333e3bfde5 [docs]Refactor constrained decoding tutorial (#2633) 2024-12-28 07:00:38 -08:00
Shi Shuai
239c9d4d3a Docs: Add constrained decoding tutorial (#2614)
Co-authored-by: Chayenne <zhaochen20@outlook.com>
2024-12-27 23:54:28 -08:00
Lianmin Zheng
773951548d Fix logprob_start_len for multi modal models (#2597)
Co-authored-by: libra <lihu723@gmail.com>
Co-authored-by: fzyzcjy <ch271828n@outlook.com>
Co-authored-by: Wang, Haoyu <haoyu.wang@intel.com>
2024-12-26 06:27:45 -08:00
Shi Shuai
25e5d589e3 Doc: Update Grammar Backend (#2545)
Co-authored-by: Chayenne <zhaochen20@outlook.com>
2024-12-22 17:14:40 -08:00
Chayenne
786be44da5 Fix Docs CI When Compile Error (#2323) 2024-12-04 11:19:46 -08:00
Chayenne
7d5d1d3d29 udate weights from disk (#2265) 2024-11-30 01:17:00 +00:00
Lianmin Zheng
4f2ee48ed1 Update backend.md (#2251) 2024-11-28 23:18:07 -08:00
Lianmin Zheng
71ff2728a1 Update backend.md (#2250) 2024-11-28 23:14:36 -08:00
Lianmin Zheng
8912b7637f Fix docs (#2164) 2024-11-24 08:25:56 -08:00
Lianmin Zheng
c211e7b669 Simplify batch update (#2154) 2024-11-24 04:47:10 -08:00
Henry Hyeonmok Ko
dbe1729395 Merged three native APIs into one: get_server_info (#2152) 2024-11-24 01:37:58 -08:00
Henry Hyeonmok Ko
c35cd1f8c7 Expose max total num tokens from Runtime & Engine API (#2092) 2024-11-22 15:10:10 -08:00
Xuehai Pan
72f87b723b feat(pre-commit): trim unnecessary notebook metadata from git history (#2127) 2024-11-22 13:04:51 -08:00
Xiaoyu Zhang
023d0a73df fix small typos in docs (#2047) 2024-11-15 11:09:10 -08:00
ws
29ebe3dff4 fix: align enable_overlap_scheduler naming between code and docs (#2038) 2024-11-15 03:39:10 -08:00
Chayenne
c77c1e05ba fix black in pre-commit (#1940) 2024-11-08 07:42:47 +08:00
Xuehai Pan
a5e0defb5a minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926) 2024-11-06 13:46:04 +00:00
Lianmin Zheng
f5113e50ae [Doc] improve relative links and structure (#1924) 2024-11-05 01:12:10 -08:00
Chayenne
02755768d3 Change judge to classify & Modify make file (#1920) 2024-11-04 23:53:44 -08:00
Chayenne
704f8e8ed1 Add Reward API Docs etc (#1910)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-11-03 22:33:03 -08:00
Lianmin Zheng
1853c3523b Fix regex docs (#1909) 2024-11-03 14:18:16 -08:00
Chayenne
908dd7f9aa Add engine api (#1894) 2024-11-02 22:03:38 -07:00
Chayenne
f4cd804073 Fix ci and link error (#1892)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-11-02 19:08:49 -07:00
Lianmin Zheng
be7986e005 Fix docs (#1890) 2024-11-02 13:26:32 -07:00
Chayenne
5a5f18432f Fix docs ci (#1888) 2024-11-02 11:57:22 -07:00
Lianmin Zheng
7b394e5f2b Fix docs (#1889) 2024-11-02 11:46:00 -07:00
Chayenne
3b60558dd7 Native api (#1886)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-11-02 01:02:17 -07:00
Chayenne
72e979bfb5 add native api docs (#1883)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-11-02 00:17:30 -07:00
Lianmin Zheng
660ecb731f Fix doc links (#1882) 2024-11-01 20:42:30 -07:00
Lianmin Zheng
066e8a4ef0 Update docs title (#1879) 2024-11-01 20:00:41 -07:00
Lianmin Zheng
2134f0898c Fix links in the docs (#1878) 2024-11-01 18:25:55 -07:00
Lianmin Zheng
d1b31b0684 Improve docs and fix the broken links (#1875) 2024-11-01 17:47:44 -07:00
Chayenne
3bf3d011ed Add vlm document (#1866)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-11-01 00:51:15 -07:00
Chayenne
61cf00e112 change file tree (#1859)
Co-authored-by: Chayenne <zhaochenyang@g.ucla.edu>
2024-10-31 20:10:16 -07:00