xc-llm-ascend/tools/send_request.py

from typing import Any

import requests


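def send_v1_completions(prompt, model, server, request_args=None):
    """Send a non-chat request to /v1/completions and assert a non-empty reply.

    `server` must expose `url_for(*parts)`; `request_args` is merged into the
    payload last, so its keys override `model` and `prompt`.
    """
    data: dict[str, Any] = {"model": model, "prompt": prompt}
    if request_args:
        data.update(request_args)
    url = server.url_for("v1", "completions")
    response = requests.post(url, json=data)
    print(f"Status Code: {response.status_code}")
    response_json = response.json()
    print(f"Response json: {response_json}")
    # The completions endpoint returns the generated text under choices[0]["text"].
    response_text = response_json["choices"][0]["text"]
    print(f"Response: {response_text}")
    assert response_text, "empty response"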


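def send_v1_chat_completions(prompt, model, server, request_args=None):
    """Send a single-turn chat request to /v1/chat/completions and assert a non-empty reply.

    `server` must expose `url_for(*parts)`; `request_args` is merged into the
    payload last, so its keys override `model` and `messages`.
    """
    data: dict[str, Any] = {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": prompt,
            }
        ],
    }
    if request_args:
        data.update(request_args)
    url = server.url_for("v1", "chat", "completions")
    response = requests.post(url, json=data)
    print(f"Status Code: {response.status_code}")
    response_json = response.json()
    print(f"Response json: {response_json}")
    # The chat endpoint nests the generated text under choices[0]["message"]["content"].
    response_text = response_json["choices"][0]["message"]["content"]
    print(f"Response: {response_text}")
    assert response_text, "empty response"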


# File history (from the blame view):
# - 2025-11-11 17:39:58 +08:00: [TEST]Update nightly cases and add mtpx (#4111).
#   Updates some nightly test cases and adds mtpx cases so they are tested daily.
#   No user-facing change. Tested by running the test.
#   vLLM version: v0.11.0
#   vLLM main: https://github.com/vllm-project/vllm/commit/83f478bb19489b41e9d208b47b4bb5a95ac171ac
#   Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>
# - 2025-12-26 18:04:17 +08:00: [TEST]Add sending request with and without chat (#5286).
#   Adds the methods for sending chat and non-chat requests, needed to test many following cases.
#   No user-facing change. Tested by running the test.
#   vLLM version: release/v0.13.0
#   vLLM main: https://github.com/vllm-project/vllm/commit/ad32e3e19ccf0526cb6744a5fed09a138a5fb2f9
#   Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>
# - 2026-01-13 15:29:34 +08:00: [Lint]Style: Convert `root`, `benchmarks`, `tools` and `docs` to `ruff format` (#5843).
#   Fixes linting issues in the root directory, benchmarks/, tools/ and docs/ to align with the
#   project's Ruff configuration, as part of a gradual effort to enable full linting coverage.
#   The corresponding paths have been removed from the exclude list in pyproject.toml.
#   vLLM version: v0.13.0
#   vLLM main: https://github.com/vllm-project/vllm/commit/2f4e6548efec402b913ffddc8726230d9311948d
#   Signed-off-by: root <root@LAPTOP-VQKDDVMG.localdomain>
#   Co-authored-by: root <root@LAPTOP-VQKDDVMG.localdomain>
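
The sketch below illustrates, without a live server, how the two helpers assemble their endpoint URL and request payload. `FakeServer` and `build_chat_payload` are hypothetical names introduced only for this illustration; the real `server` argument is whatever object the test harness passes in, and the only assumption made here is that it exposes `url_for(*parts)` and joins the parts onto a base URL. The host, port, and model names are placeholders.

```python
from typing import Any


class FakeServer:
    """Hypothetical stand-in for the `server` argument (an assumption, not the real class)."""

    def __init__(self, host: str = "localhost", port: int = 8000) -> None:
        self.base_url = f"http://{host}:{port}"

    def url_for(self, *parts: str) -> str:
        # Join the path segments onto the base URL, e.g. ("v1", "completions").
        return "/".join([self.base_url, *parts])


def build_chat_payload(prompt: str, model: str, request_args=None) -> dict[str, Any]:
    """Mirror the payload construction in send_v1_chat_completions (no network call)."""
    data: dict[str, Any] = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    if request_args:
        # Merged last, so keys in request_args override the defaults above.
        data.update(request_args)
    return data


server = FakeServer()
url = server.url_for("v1", "chat", "completions")
payload = build_chat_payload(
    "Hello", "placeholder-model", {"max_tokens": 16, "model": "override-model"}
)
print(url)
print(payload)
```

Because `request_args` is applied with `dict.update` after the defaults are set, a caller can add sampling parameters such as `max_tokens` and can also replace `model` or `messages` outright, which is worth keeping in mind when reusing one argument dictionary across test cases.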