[Test] quick fix mla ut (#4318)

### What this PR does / why we need it?

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.11.0
- vLLM main:
2918c1b49c

Signed-off-by: GDzhu01 <809721801@qq.com>
This commit is contained in:
Zhu Yi Lin
2025-11-20 23:06:12 +08:00
committed by GitHub
parent 5c9f4a40c6
commit d96d5fa971

View File

@@ -211,5 +211,3 @@ class TestAscendMultiHeadLatentAttention(TestBase):
output = attn.forward(positions, hidden_states)
self.assertEqual(output.shape, (3, self.hidden_size))
self.assertTrue(
torch.allclose(output, output.view(-1, self.hidden_size)))