Files

LICO67373 2a6d95c389 [Cleanup] Remove dead code make_attention_mask function (#5818)

### What this PR does / why we need it?

This PR removes the unused `make_attention_mask` function from
`vllm_ascend/worker/v2/attn_utils.py`.

**Why it's dead code:**
- After PR #4870 (attention mask unification refactor), attention mask
generation has been centralized in the `AttentionMaskBuilder` singleton
class
- The mask is now generated directly by metadata builders when needed
(e.g., `AscendAttentionMetadataBuilder`, `AscendMLAMetadataBuilder`)
- The `make_attention_mask` function is no longer called anywhere in the
codebase
- The related parameters (including `attn_mask` and `spec_attn_mask`)
were also removed from `build_attn_metadata` in the same refactor

**Changes:**
- Remove `make_attention_mask` function (24 lines) from
`vllm_ascend/worker/v2/attn_utils.py`

### Does this PR introduce _any_ user-facing change?

No. This is a code cleanup that removes dead code. No user-facing
behavior changes.

### How was this patch tested?

- Verified that `make_attention_mask` is not called anywhere in the
codebase (via `grep`)
- CI tests pass, confirming no regressions
- The function has been unused since PR #4870 was merged
- vLLM version: v0.13.0
- vLLM main: 2f4e6548ef
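
The `grep` check above matches plain text, so it can also hit comments and docstrings. As a rough complement, here is a minimal sketch of a syntax-aware reference scan using Python's standard `ast` module. The helper name `find_references` is hypothetical and is not part of vllm-ascend; it only illustrates one way to confirm that a function such as `make_attention_mask` has no remaining callers or imports.

```python
import ast
from pathlib import Path


def find_references(root, name):
    """Return sorted (relative_path, line) pairs where `name` is used.

    Hypothetical helper for illustration only. The `def` of the function
    itself is not reported, because a FunctionDef is not a Name node.
    """
    root = Path(root)
    hits = []
    for path in root.rglob("*.py"):
        try:
            tree = ast.parse(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            continue  # skip files that cannot be parsed
        rel = path.relative_to(root).as_posix()
        for node in ast.walk(tree):
            # Bare usage, e.g. make_attention_mask(...)
            if isinstance(node, ast.Name) and node.id == name:
                hits.append((rel, node.lineno))
            # Attribute usage, e.g. attn_utils.make_attention_mask(...)
            elif isinstance(node, ast.Attribute) and node.attr == name:
                hits.append((rel, node.lineno))
            # Import usage, e.g. from attn_utils import make_attention_mask
            elif isinstance(node, (ast.Import, ast.ImportFrom)):
                for alias in node.names:
                    if alias.name == name:
                        hits.append((rel, node.lineno))
    return sorted(hits)
```

An empty result for `find_references("vllm_ascend", "make_attention_mask")` would be consistent with the function being dead code, though dynamic lookups such as `getattr` with a string name are not detected by this sketch.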

Signed-off-by: lico67373 <918688502@qq.com>
Co-authored-by: weijinqian0 <1184188277@qq.com>

2026-01-14 16:52:51 +08:00

sample

[Feature] support eager mode in model runner v2 (#5210)

2025-12-29 15:28:34 +08:00

spec_decode

[Feature] implement eagle spec decoding for model runner v2 (#5840)

2026-01-14 09:18:05 +08:00

__init__.py

implement model runner v2 basic framework (#5051)

2025-12-18 15:51:54 +08:00

aclgraph_utils.py

[Feature] support eager mode in model runner v2 (#5210)