8 Commits

Author          SHA1        Message                                                           Date
Liangsheng Yin  7de6034534  Fix the prefix indices (#1037)                                    2024-08-11 17:57:02 -07:00
Lianmin Zheng   9dae407812  Improve type annotation (#1029)                                   2024-08-11 02:44:59 -07:00
Liangsheng Yin  43fbb6d919  Fix input_ids && rename to fill_ids (#1021)                       2024-08-10 16:24:12 -07:00
Liangsheng Yin  62757db6f0  Reduce the overhead when cache is disabled (#1010)                2024-08-09 16:36:57 -07:00
Liangsheng Yin  7623091d97  RadixCache method adjust (#977)                                   2024-08-07 15:52:24 -07:00
Zhiqiang Xie    6db27f7b3b  misc: correct the int data type for token ids and indices (#969)  2024-08-08 04:40:07 +08:00
Liangsheng Yin  c020f9ceda  Support chunked prefill when radix cache is disabled (#811)       2024-08-01 00:29:01 -07:00
Liangsheng Yin  cdcbde5fc3  Code structure refactor (#807)                                    2024-07-29 23:04:48 -07:00