[Cleanup] Remove unused attn_metadata parameter from Proposer classes (#4862)
The `attn_metadata` parameter is not used by any draft proposer, so this
change removes it.
- vLLM version: v0.12.0
- vLLM main: ad32e3e19c
---------
Signed-off-by: Jade Zheng <zheng.shoujian@outlook.com>
@@ -1383,7 +1383,7 @@ class NPUModelRunner(GPUModelRunner):
         draft_token_ids = self.drafter.generate_token_ids(
             valid_sampled_token_ids, sampling_metadata, scheduler_output,
             spec_decode_metadata, positions, num_scheduled_tokens,
-            hidden_states, attn_metadata, aux_hidden_states)
+            hidden_states, aux_hidden_states)
         return draft_token_ids
 
     def _select_moe_comm_method(self,
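
The hunk above shows only the call site in `NPUModelRunner`. As a rough illustration of the matching proposer-side change, here is a minimal sketch of a `generate_token_ids` signature without `attn_metadata`. The class `ExampleProposer`, the default values, the type hints, and the placeholder body are all assumptions made for this example; only the method name and the argument names and order are taken from the call site above.

```python
from typing import Any, Optional


class ExampleProposer:
    """Hypothetical stand-in for a draft proposer (not a vllm-ascend class)."""

    def generate_token_ids(
        self,
        valid_sampled_token_ids: list[list[int]],
        sampling_metadata: Any = None,
        scheduler_output: Any = None,
        spec_decode_metadata: Any = None,
        positions: Any = None,
        num_scheduled_tokens: int = 0,
        hidden_states: Any = None,
        aux_hidden_states: Optional[Any] = None,
    ) -> list[list[int]]:
        # attn_metadata is no longer part of the signature.
        # Placeholder logic (assumption): re-propose each request's last
        # sampled token. A real proposer would run its drafting method here.
        return [ids[-1:] for ids in valid_sampled_token_ids]


drafter = ExampleProposer()

# New-style call: eight positional arguments, mirroring the updated call site.
draft_token_ids = drafter.generate_token_ids(
    [[1, 2, 3], [7]], None, None, None, None, 4, None, None)


def old_style_call_fails() -> bool:
    """Return True if a call that still passes attn_metadata is rejected."""
    try:
        # Nine positional arguments, as in the pre-change call site.
        drafter.generate_token_ids(
            [[1, 2, 3]], None, None, None, None, 4, None, "attn_metadata", None)
    except TypeError:
        return True
    return False
```

Because the arguments are passed positionally, a call site that still supplies `attn_metadata` would raise a `TypeError` against the shortened signature, which is why the proposer classes and the runner call need to change together.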