| Simo Lin | 6d6e24bcc4 | [router] Add builder pattern for RouterConfig with zero duplication (#12030) | 2025-10-23 16:46:10 -07:00 |
| Chang Su | 28b8a4064d | [router][CI] Clean up imports and prints statements in sgl-router/py_test (#12024) | 2025-10-23 11:56:57 -07:00 |
| Simo Lin | a4b637d87a | [router] change ci names and update log level in ci (#12021) | 2025-10-23 10:36:19 -07:00 |
| Arthur Cheng | 53c2934dce | [Router] Consolidate ConnectionMode enum to core module (#11937) | 2025-10-23 05:15:49 -07:00 |
| Keyang Ru | e321c97113 | [router] Add comprehensive E2E tests for Response API (#11988) | 2025-10-23 05:13:51 -07:00 |
| Simo Lin | 5dccf69713 | [router] create worker removal step and clean up worker manager (#11921) | 2025-10-22 13:26:06 -07:00 |
| Keyang Ru | 77258ce039 | [router] Support multiple worker URLs for OpenAI router (#11723) | 2025-10-22 09:27:58 -07:00 |
| Chang Su | 590bc4b7a7 | [router][grpc] Fix background tasks stored with wrong id (#11945) | 2025-10-21 18:38:51 -07:00 |
| Keyang Ru | 63cfe1b032 | [router] Add gRPC E2E test suite (#11790) | 2025-10-21 17:51:21 -07:00 |
| Chang Su | 70f6309cd4 | [router][grpc] Support v1/responses API (#11926) | 2025-10-21 17:41:48 -07:00 |
| Keyang Ru | 87a92e459a | Fix openai input_text type compatibility (#11935) | 2025-10-21 16:10:35 -07:00 |
| Simo Lin | 8a801ee38d | [router] release router 0.2.1 (#11885) | 2025-10-20 21:08:45 -07:00 |
| Simo Lin | 1111030395 | [router] clean up workflow logs to debug for implementation details logs (#11886) | 2025-10-20 18:24:55 -07:00 |
| Tien Nguyen | 28ddfb37d7 | fix(sql-router): fix conflict port in test (#11826) Co-authored-by: Simo Lin <linsimo.mark@gmail.com> | 2025-10-20 18:06:34 -07:00 |
| Chang Su | e69094df64 | [router][grpc] Remove continue_final_message in ChatTemplateParams and add minijinja-contrib (#11882) | 2025-10-20 18:03:09 -07:00 |
| Simo Lin | b4948512b8 | [router] remove encoding header for oai router (#11881) | 2025-10-20 17:39:00 -07:00 |
| Simo Lin | ddcba74b4d | [router] Worker Management Workflow Engine (#11868) | 2025-10-20 17:00:22 -07:00 |
| ybyang | d513ee93ef | [2/2] [feature] support openai like classification api in router (#11670) | 2025-10-18 19:31:08 -07:00 |
| Simo Lin | a7ae61ed77 | [router] Add Configurable L0 and L1 Tokenizer Caching (#11688) | 2025-10-18 18:33:53 -07:00 |
| Chang Su | ca240eefb4 | [router][grpc] Support parallel queue puts in grpc_request_manager and remove mutex for grpc_client (#11798) | 2025-10-17 20:49:43 -07:00 |
| Chang Su | d1984e218c | [router][grpc] Remove timeout for connections and remove max_tokens deprecation warning log (#11775) | 2025-10-17 12:36:36 -07:00 |
| Keyang Ru | 2bc3fcd420 | [doc] update router document (#11767) | 2025-10-17 10:26:54 -07:00 |
| Simo Lin | a5978a20f0 | [router] fix grpc client time out to 1h (#11768) | 2025-10-17 10:26:12 -07:00 |
| Simo Lin | e483c1eae5 | [router] Fix UTF-8 Boundary Panic in Stop Sequence Decoder (#11766) | 2025-10-17 10:21:00 -07:00 |
| Keyang Ru | 7780230a15 | Revert "[router] fix get_models endpoint for openai router (#11687)" (#11740) | 2025-10-16 18:36:53 -07:00 |
| Chang Su | dc01313da1 | [router] Add rustfmt and set group imports by default (#11732) | 2025-10-16 17:33:29 -07:00 |
| Keyang Ru | 7a7f99beb7 | [router] add spec.rs to enables tests under spec folder (#11734) | 2025-10-16 16:07:26 -07:00 |
| Chang Su | c7962868c1 | [router] Fix tool_choice normalization in ChatCompletionRequest and fix ut (#11731) | 2025-10-16 14:20:13 -07:00 |
| Simo Lin | 64affab495 | [router] fix p and d worker filtering and bootstrap port handling (#11729) | 2025-10-16 14:19:39 -07:00 |
| Keyang Ru | 4c9bcb9d56 | [Router] Refactor protocol definitions: split spec.rs into modular files (#11677) Co-authored-by: Chang Su <chang.s.su@oracle.com> | 2025-10-16 13:44:44 -07:00 |
| Keyang Ru | 0975ba99bc | [router] fix get_models endpoint for openai router (#11687) | 2025-10-16 09:00:08 -07:00 |
| Simo Lin | f5d30dae89 | [router] Refactor StopSequenceDecoder to Use Sequence for Incremental Decoding (#11676) | 2025-10-15 16:31:03 -07:00 |
| Chang Su | 2479b89405 | [router][grpc] Simplify model_id determination (#11684) | 2025-10-15 15:56:58 -07:00 |
| Keyang Ru | d2478cd4ff | [router] Fix response api related spec (#11621) | 2025-10-15 09:59:38 -07:00 |
| Simo Lin | 74737b2863 | [router] upgrade to 0.2.0 (#11642) | 2025-10-14 22:10:30 -04:00 |
| Simo Lin | 40e0082d8d | [router] add worker self discovery for metadata (#11638) | 2025-10-14 22:07:25 -04:00 |
| Simo Lin | 49345a68cf | [router] update router readme to latest features (#11619) | 2025-10-14 11:47:38 -07:00 |
| Simo Lin | 9e8a15a74c | [router] add chang and keyang to sgl router author (#11620) | 2025-10-14 11:10:49 -07:00 |
| Simo Lin | 3962e39d7c | [router] cleanup app context and move to startup (#11617) | 2025-10-14 10:19:28 -07:00 |
| Keyang Ru | eb8cac6fe2 | [router] add py binding and readme for openai router and history backend (#11453) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> | 2025-10-14 09:42:34 -07:00 |
| Simo Lin | a04efc4933 | [router] when given both local tokenizer and chat template, log all (#11601) | 2025-10-14 02:22:58 -07:00 |
| Simo Lin | da7fac1b75 | [router] allow router launch server to use grpc mode (#11600) | 2025-10-14 01:42:43 -07:00 |
| Simo Lin | 28ad2297a0 | [router] delete useless table content comment in spec (#11597) | 2025-10-14 01:08:18 -07:00 |
| Simo Lin | 4b62af92ef | [router] change worker api to async instead of sync (#11566) | 2025-10-14 00:32:21 -07:00 |
| Simo Lin | 0b9915c132 | [router] update generate spec to align with sgl io struct (#11591) | 2025-10-14 02:51:33 -04:00 |
| Chang Su | 27ef1459e6 | [router][protocols] Add Axum validate extractor and use it for /v1/chat/completions endpoint (#11588) | 2025-10-13 22:51:15 -07:00 |
| Chang Su | 887c2b4575 | [router][grpc] Add serve_grpc to launch_server and log id for HealthCheck (#11564) | 2025-10-13 16:07:19 -07:00 |
| Chang Su | 4b694e7d5a | [router][grpc] Add error handling to generate_tool_constraints (#11562) | 2025-10-13 12:26:09 -07:00 |
| Jonah Bernard | f4aa78801e | [router] Add Rust CLI flags for queue size, timeout, and rate limit for token bucket rate limiter (#11483) Co-authored-by: Simo Lin <linsimo.mark@gmail.com> | 2025-10-13 11:08:48 -07:00 |
| Simo Lin | 728af88781 | [router] allow user to specify chat template path (#11549) | 2025-10-13 10:47:57 -07:00 |