[feat] support minimum token load balance in dp attention (#7379)
This commit is contained in:
@@ -732,6 +732,7 @@ def _launch_subprocesses(
|
||||
pp_rank,
|
||||
None,
|
||||
writer,
|
||||
None,
|
||||
),
|
||||
)
|
||||
|
||||
|
||||
Reference in New Issue
Block a user