Fix logprob_start_len for multi modal models (#2597)
Co-authored-by: libra <lihu723@gmail.com> Co-authored-by: fzyzcjy <ch271828n@outlook.com> Co-authored-by: Wang, Haoyu <haoyu.wang@intel.com>
This commit is contained in:
@@ -565,7 +565,7 @@ class Scheduler:
|
||||
|
||||
if req.logprob_start_len == -1:
|
||||
# By default, only return the logprobs for output tokens
|
||||
req.logprob_start_len = len(recv_req.input_ids) - 1
|
||||
req.logprob_start_len = len(req.origin_input_ids) - 1
|
||||
|
||||
# Truncate prompts that are too long
|
||||
if len(req.origin_input_ids) > self.max_req_input_len:
|
||||
|
||||
Reference in New Issue
Block a user