[Nightly] Optimize nightly CI (#4509)
### What this PR does / why we need it? 1. Optimize multi-node waiting logic 2. Remove the `tee` pipeline for logs, which will lead to hang issue ### How was this patch tested? - vLLM version: v0.12.0 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.12.0 Signed-off-by: wangli <wangli858794774@gmail.com>
This commit is contained in:
@@ -84,8 +84,6 @@ jobs:
|
||||
pip config set global.index-url https://mirrors.tuna.tsinghua.edu.cn/pypi/web/simple
|
||||
pip install jinja2-cli
|
||||
|
||||
#apt-get update -y && apt-get install -y git curl
|
||||
|
||||
- name: Install kubectl
|
||||
run: |
|
||||
# Install kubectl
|
||||
@@ -112,6 +110,8 @@ jobs:
|
||||
run: |
|
||||
# prepare for lws entrypoint scripts
|
||||
install -D tests/e2e/nightly/multi_node/scripts/run.sh /root/.cache/tests/run.sh
|
||||
# clear log directory
|
||||
rm -fr $RESULT_FILE
|
||||
|
||||
- name: Clear resources
|
||||
run: |
|
||||
@@ -263,5 +263,5 @@ jobs:
|
||||
- name: Post process
|
||||
if: always()
|
||||
run: |
|
||||
kubectl get pods -n $NAMESPACE
|
||||
kubectl delete -f ./lws.yaml
|
||||
kubectl get pods -n $NAMESPACE --ignore-not-found=true
|
||||
kubectl delete -f ./lws.yaml --ignore-not-found=true || true
|
||||
|
||||
@@ -61,9 +61,6 @@ jobs:
|
||||
- name: multi-node-qwenw8a8-2node
|
||||
config_file_path: Qwen3-235B-W8A8.yaml
|
||||
size: 2
|
||||
- name: multi-node-glm-2node
|
||||
config_file_path: GLM-4_5.yaml
|
||||
size: 2
|
||||
- name: multi-node-dpsk3.2-exp-2node
|
||||
config_file_path: DeepSeek-V3_2-Exp-bf16.yaml
|
||||
size: 2
|
||||
@@ -134,9 +131,6 @@ jobs:
|
||||
- name: deepseek3_2-exp-w8a8
|
||||
os: linux-aarch64-a3-16
|
||||
tests: tests/e2e/nightly/models/test_deepseek_v3_2_exp_w8a8.py
|
||||
- name: glm-4-5
|
||||
os: linux-aarch64-a3-16
|
||||
tests: tests/e2e/nightly/models/test_glm4_5.py
|
||||
uses: ./.github/workflows/_e2e_nightly_single_node.yaml
|
||||
with:
|
||||
vllm: v0.12.0
|
||||
|
||||
Reference in New Issue
Block a user