[Test] quick fix mla ut (#4318)
### What this PR does / why we need it?
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.11.0
- vLLM main:
2918c1b49c
Signed-off-by: GDzhu01 <809721801@qq.com>
This commit is contained in:
@@ -211,5 +211,3 @@ class TestAscendMultiHeadLatentAttention(TestBase):
|
||||
output = attn.forward(positions, hidden_states)
|
||||
|
||||
self.assertEqual(output.shape, (3, self.hidden_size))
|
||||
self.assertTrue(
|
||||
torch.allclose(output, output.view(-1, self.hidden_size)))
|
||||
|
||||
Reference in New Issue
Block a user