[Fix] DeepEP Compatibility with Low Latency (#5068)

Co-authored-by: ch-wan <cwan39@gatech.edu>
This commit is contained in:
Jinyan Chen
2025-04-09 11:31:31 +08:00
committed by GitHub
parent aac531c53b
commit bc3f6db2dd
4 changed files with 146 additions and 118 deletions

View File

@@ -72,7 +72,7 @@ class ForwardMode(IntEnum):
DUMMY_FIRST = auto()
def is_prefill(self):
return self == ForwardMode.PREFILL
return self.is_extend()
def is_extend(self):
return (