[Bugfix][V1] Fix deepseek with v1 (#958)
### What this PR does / why we need it? Fix deepseek with v1, this error is introdeced by https://github.com/vllm-project/vllm-ascend/pull/945. and this pr fix the block table of mla ### How was this patch tested? CI passed with new addedtest. Signed-off-by: Mengqing Cao <cmq0113@163.com>
This commit is contained in:
@@ -239,10 +239,8 @@ class AscendMLAMetadataBuilder:
|
||||
# it blocks on all previous kernels.
|
||||
device = self.runner.device
|
||||
|
||||
block_table = self.runner.input_batch.block_table[0].get_device_tensor(
|
||||
)
|
||||
block_table[:num_reqs, :self.runner.max_num_blocks_per_req] = (
|
||||
block_table[:num_reqs])
|
||||
block_table = (self.runner.input_batch.block_table[0].
|
||||
get_device_tensor()[:num_reqs])
|
||||
slot_mapping = self.runner.slot_mapping_cpu[:num_actual_tokens].to(
|
||||
device, non_blocking=True)
|
||||
input_positions = self.runner.positions_cpu[:num_actual_tokens].to(
|
||||
|
||||
Reference in New Issue
Block a user