[Test] Add tests for the multi-node DeepSeek-V2-Lite network in GE Graph (#4039)
### What this PR does / why we need it?
Add tests for the multi-node DeepSeek-V2-Lite network in GE Graph mode,
and supplement the end-to-end (e2e) tests for the MLA and NZ features of
this network.
- vLLM version: v0.11.0
- vLLM main:
2918c1b49c
---------
Signed-off-by: CodeNine-CJ <chenjian343@huawei.com>
This commit is contained in:
@@ -288,3 +288,14 @@ def test_e2e_deepseekv2lite_with_torchair_v1scheduler():
|
||||
}
|
||||
_deepseek_v2_lite_torchair_test_fixure(additional_config,
|
||||
use_v1_schduler=True)
|
||||
|
||||
|
||||
# kv_cache enable e2e test
|
||||
def test_e2e_deepseekv2lite_with_nz():
|
||||
additional_config = {
|
||||
"torchair_graph_config": {
|
||||
"enabled": True,
|
||||
"enable_kv_nz": True,
|
||||
},
|
||||
}
|
||||
_deepseek_v2_lite_torchair_test_fixure(additional_config)
|
||||
|
||||
Reference in New Issue
Block a user