Fix nan value generated after custom all reduce (#8663)

Co-authored-by: wunhuang <wunhuang@amd.com>
This commit is contained in:
kk
2025-08-16 03:33:54 +08:00
committed by GitHub
parent 9c3e95d98b
commit 983aa4967b
2 changed files with 20 additions and 1 deletions

View File

@@ -398,7 +398,7 @@ class CustomAllreduce:
else:
# If warm up, mimic the allocation pattern since custom
# allreduce is out-of-place.
return torch.empty_like(input)
return torch.zeros_like(input)
else:
if _is_hip:
# note: outside of cuda graph context,