Fix nightly (#5413)

### What this PR does / why we need it?
This patch mainly does the following:
1. Fixes a bug in the multi_node_tests logs: log names must be unique
when uploading logs.
2. Optimizes the `get_cluster_ips` logic and increases the maximum number
of retries for robustness.
3. Temporarily stops using the existing gh-proxy until it is stable
enough.
### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: release/v0.13.0
- vLLM main: 81786c8774

---------

Signed-off-by: wangli <wangli858794774@gmail.com>
Li Wang
2025-12-27 18:16:46 +08:00
committed by GitHub
parent e91e11d3b0
commit 1d81bfaed1
5 changed files with 8 additions and 9 deletions

@@ -24,7 +24,7 @@ def temp_env(env_dict):
os.environ[k] = v
-def dns_resolver(retries: int = 20, base_delay: float = 0.5):
+def dns_resolver(retries: int = 240, base_delay: float = 0.5):
     # We should resolve DNS with retries to avoid transient network issues.
     # When the pod is just started, DNS resolution may fail.
     def resolve(dns: str):
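
The hunk above cuts off before the body of `resolve`, so the actual retry logic is not visible here. As a rough illustration of what a retrying resolver with this signature might look like, here is a minimal sketch. Only `dns_resolver`, `retries`, `base_delay`, and `resolve` come from the diff; the capped exponential backoff, the `max_delay`, `lookup`, and `sleep` parameters, and the error handling are assumptions made for this example and may differ from the real implementation.

```python
import socket
import time
from typing import Callable


def dns_resolver(
    retries: int = 240,
    base_delay: float = 0.5,
    max_delay: float = 5.0,  # hypothetical cap, not in the original signature
    lookup: Callable[[str], str] = socket.gethostbyname,  # injectable for tests
    sleep: Callable[[float], None] = time.sleep,  # injectable for tests
):
    """Return a resolve(dns) function that retries failed DNS lookups."""

    def resolve(dns: str) -> str:
        last_error = None
        for attempt in range(retries):
            try:
                return lookup(dns)
            except socket.gaierror as exc:
                # DNS may not be ready right after a pod starts; remember the
                # error and try again unless this was the last attempt.
                last_error = exc
                if attempt < retries - 1:
                    # Assumed policy: exponential backoff capped at max_delay.
                    delay = min(base_delay * 2 ** min(attempt, 16), max_delay)
                    sleep(delay)
        raise RuntimeError(
            f"could not resolve {dns!r} after {retries} attempts"
        ) from last_error

    return resolve
```

Under these assumptions, `dns_resolver()("some-host")` keeps retrying for up to 240 attempts before raising. With the 5-second cap, that bounds the total wait at roughly 20 minutes, which is the kind of slack a slowly starting multi-node cluster might need.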