xc-llm-ascend/benchmarks/tests/throughput-tests.json

[
  {
    "test_name": "throughput_qwen3_8B_tp1",
    "parameters": {
      "model": "Qwen/Qwen3-8B",
      "tensor_parallel_size": 1,
      "load_format": "dummy",
      "dataset_path": "/github/home/.cache/datasets/ShareGPT_V3_unfiltered_cleaned_split.json",
      "num_prompts": 200,
      "backend": "vllm"
    }
  },
  {
    "test_name": "throughput_qwen2_5vl_7B_tp1",
    "parameters": {
      "model": "Qwen/Qwen2.5-VL-7B-Instruct",
      "tensor_parallel_size": 1,
      "backend": "vllm-chat",
      "dataset_name": "hf",
      "hf_split": "train",
      "max_model_len": 16384,
      "dataset_path": "lmarena-ai/vision-arena-bench-v0.1",
      "num_prompts": 200
    }
  },
  {
    "test_name": "throughput_qwen2_5_7B_tp1",
    "parameters": {
      "model": "Qwen/Qwen2.5-7B-Instruct",
      "tensor_parallel_size": 1,
      "load_format": "dummy",
      "dataset_path": "/github/home/.cache/datasets/ShareGPT_V3_unfiltered_cleaned_split.json",
      "num_prompts": 200,
      "backend": "vllm"
    }
  }
]
[Doc]Add benchmark scripts (#74) ### What this PR does / why we need it? The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code . --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-03-21 15:54:34 +08:00			`[`
			`{`
[CI] Add benchmark workflows (#1014) ### What this PR does / why we need it? Add benchmark workflows ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Run locally --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-05-30 22:42:44 +08:00			`"test_name": "throughput_qwen3_8B_tp1",`
[Doc]Add benchmark scripts (#74) ### What this PR does / why we need it? The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code . --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-03-21 15:54:34 +08:00			`"parameters": {`
[CI] Add benchmark workflows (#1014) ### What this PR does / why we need it? Add benchmark workflows ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Run locally --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-05-30 22:42:44 +08:00			`"model": "Qwen/Qwen3-8B",`
[Doc]Add benchmark scripts (#74) ### What this PR does / why we need it? The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code . --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-03-21 15:54:34 +08:00			`"tensor_parallel_size": 1,`
			`"load_format": "dummy",`
[CI][Benchmark] Optimize performance benchmark workflow (#1039) ### What this PR does / why we need it? This is a post patch of #1014, for some convenience optimization - Set cached dataset path for speed - Use pypi to install escli-tool - Add benchmark results convert script to have a developer-friendly result - Patch the `benchmark_dataset.py` to disable streaming load for internet - Add more trigger ways for different purpose, `pr` for debug, `schedule` for daily test, `dispatch` and `pr-labled` for manual testing of a single(current) commit - Disable latency test for `qwen-2.5-vl`, (This script does not support multi-modal yet) ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-06-03 23:38:34 +08:00			`"dataset_path": "/github/home/.cache/datasets/ShareGPT_V3_unfiltered_cleaned_split.json",`
[Doc]Add benchmark scripts (#74) ### What this PR does / why we need it? The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code . --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-03-21 15:54:34 +08:00			`"num_prompts": 200,`
			`"backend": "vllm"`
			`}`
[Benchmarks] Add qwen2.5-7b test (#763) ### What this PR does / why we need it? - Add qwen2.5-7b test - Optimize the documentation to be more developer-friendly Signed-off-by: xuedinge233 <damow890@gmail.com> Co-authored-by: xuedinge233 <damow890@gmail.com> 2025-05-10 09:47:42 +08:00			`},`
			`{`
[CI] Add benchmark workflows (#1014) ### What this PR does / why we need it? Add benchmark workflows ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Run locally --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-05-30 22:42:44 +08:00			`"test_name": "throughput_qwen2_5vl_7B_tp1",`
[Benchmarks] Add qwen2.5-7b test (#763) ### What this PR does / why we need it? - Add qwen2.5-7b test - Optimize the documentation to be more developer-friendly Signed-off-by: xuedinge233 <damow890@gmail.com> Co-authored-by: xuedinge233 <damow890@gmail.com> 2025-05-10 09:47:42 +08:00			`"parameters": {`
[CI] Add benchmark workflows (#1014) ### What this PR does / why we need it? Add benchmark workflows ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Run locally --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-05-30 22:42:44 +08:00			`"model": "Qwen/Qwen2.5-VL-7B-Instruct",`
[Benchmarks] Add qwen2.5-7b test (#763) ### What this PR does / why we need it? - Add qwen2.5-7b test - Optimize the documentation to be more developer-friendly Signed-off-by: xuedinge233 <damow890@gmail.com> Co-authored-by: xuedinge233 <damow890@gmail.com> 2025-05-10 09:47:42 +08:00			`"tensor_parallel_size": 1,`
[CI] Add benchmark workflows (#1014) ### What this PR does / why we need it? Add benchmark workflows ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Run locally --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-05-30 22:42:44 +08:00			`"backend": "vllm-chat",`
			`"dataset_name": "hf",`
			`"hf_split": "train",`
			`"max_model_len": 16384,`
			`"dataset_path": "lmarena-ai/vision-arena-bench-v0.1",`
			`"num_prompts": 200`
[Benchmarks] Add qwen2.5-7b test (#763) ### What this PR does / why we need it? - Add qwen2.5-7b test - Optimize the documentation to be more developer-friendly Signed-off-by: xuedinge233 <damow890@gmail.com> Co-authored-by: xuedinge233 <damow890@gmail.com> 2025-05-10 09:47:42 +08:00			`}`
[CI][Benchmark] Add new model and v1 test to perf benchmarks (#1099) ### What this PR does / why we need it? - Add qwen2.5-7b-instruct test - Add v1 test --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-06-12 10:46:41 +08:00			`},`
			`{`
			`"test_name": "throughput_qwen2_5_7B_tp1",`
			`"parameters": {`
			`"model": "Qwen/Qwen2.5-7B-Instruct",`
			`"tensor_parallel_size": 1,`
			`"load_format": "dummy",`
			`"dataset_path": "/github/home/.cache/datasets/ShareGPT_V3_unfiltered_cleaned_split.json",`
			`"num_prompts": 200,`
			`"backend": "vllm"`
			`}`
[Doc]Add benchmark scripts (#74) ### What this PR does / why we need it? The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code . --------- Signed-off-by: wangli <wangli858794774@gmail.com> 2025-03-21 15:54:34 +08:00			`}`
			`]`