18 Commits

Author SHA1 Message Date
Simo Lin
728af88781 [router] allow user to specify chat template path (#11549) 2025-10-13 10:47:57 -07:00
Simo Lin
7c94eaeeb0 [router] allow tokenizer path to be dir (#11530) 2025-10-13 09:30:09 -04:00
Chang Su
a1080b72a0 [router] Fix all unused_qualifications (#11341) 2025-10-08 13:55:27 -07:00
Chang Su
64582caa84 [router][grpc] Refactor chat template content format detection (#11288) 2025-10-07 08:38:51 -07:00
Simo Lin
5519766a4d [router] fix chat template loading and tokenizer path (#10999) 2025-09-27 23:54:12 -04:00
Simo Lin
aae7ead2d0 [router] remove old/oudated/useless comments across code base (#10968) 2025-09-26 10:48:50 -07:00
Simo Lin
458c0219a6 [router] simplify tokenizer dev doc (#10895) 2025-09-24 22:15:56 -07:00
Chang Su
4a87ba217f router-grpc: Add tools processing and other paramters for apply_chat_template (#10877) 2025-09-24 15:23:06 -07:00
Chang Su
9209b209be router-grpc: Support jinja chat template content format detection (#10832) 2025-09-24 11:45:01 -07:00
Chang Su
7ff93e613f router(grpc): Implement route for chat_cmpl endpoint (#10761) 2025-09-23 11:26:33 -07:00
Chang Su
598c0bc19d [router] add tokenizer download support from hf hub (#9882) 2025-09-01 10:40:37 -07:00
Simo Lin
49f9d02538 [router] tokenizer arch doc (#9513) 2025-08-22 09:52:33 -07:00
Chang Su
53e2cd464f [router] remove all tokenizer metrics for performance (#9474) 2025-08-21 18:35:24 -07:00
Simo Lin
5fbad308cd [router] add tokenizer chat template support (#9370) 2025-08-19 20:14:02 -07:00
Co-authored-by: Chang Su <chang.s.su@oracle.com>
Simo Lin
0b95a01a8f [router] add tiktokenizer and sequence in router (#9354) 2025-08-19 10:46:28 -07:00
Co-authored-by: Chang Su <chang.s.su@oracle.com>
Simo Lin
24247b4168 [router] add tokenizer metrics (#9307) 2025-08-18 09:25:51 -07:00
Co-authored-by: Chang Su <chang.s.su@oracle.com>
Simo Lin
d08663eec1 [router] tokenizer factory, hf tokenizer, and stop sequence detector (#9293) 2025-08-17 22:38:38 -07:00
Co-authored-by: Chang Su <chang.s.su@oracle.com>
Simo Lin
ff0cf51c8e [router] introducing tokenizer trait (#9287) 2025-08-17 16:30:01 -07:00