xc-llm-ascend/benchmarks/tests/throughput-tests.json

[
  {
    "test_name": "throughput_llama8B_tp1",
    "parameters": {
      "model": "meta-llama/Llama-3.1-8B-Instruct",
      "tensor_parallel_size": 1,
      "load_format": "dummy",
      "dataset_path": "./ShareGPT_V3_unfiltered_cleaned_split.json",
      "num_prompts": 200,
      "backend": "vllm"
    }
  }
]
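The test file above is a list of named parameter sets. A minimal sketch of how such an entry could be turned into command-line flags, assuming the runner maps each key to a `--flag` with underscores replaced by dashes. The helper `params_to_args` and that mapping are hypothetical illustrations, not the repository's actual script:

```python
import json
import shlex

# Inline copy of the test definition so the sketch is self-contained.
TESTS_JSON = """
[
  {
    "test_name": "throughput_llama8B_tp1",
    "parameters": {
      "model": "meta-llama/Llama-3.1-8B-Instruct",
      "tensor_parallel_size": 1,
      "load_format": "dummy",
      "dataset_path": "./ShareGPT_V3_unfiltered_cleaned_split.json",
      "num_prompts": 200,
      "backend": "vllm"
    }
  }
]
"""


def params_to_args(parameters: dict) -> list[str]:
    """Convert a parameters dict to a flat list of CLI-style arguments.

    Assumption: each key becomes `--key-with-dashes`; booleans become
    bare flags when true and are omitted when false.
    """
    args: list[str] = []
    for key, value in parameters.items():
        flag = "--" + key.replace("_", "-")
        if isinstance(value, bool):  # check bool before other types
            if value:
                args.append(flag)
            continue
        args.extend([flag, str(value)])
    return args


tests = json.loads(TESTS_JSON)
first_args = params_to_args(tests[0]["parameters"])

for test in tests:
    # Print the test name followed by its shell-quoted argument string.
    print(test["test_name"], shlex.join(params_to_args(test["parameters"])))
```

Keeping the parameters in JSON and deriving the flags mechanically means a new test case only needs a new list entry, with no change to the runner.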
[Doc] Add benchmark scripts (#74)

### What this PR does / why we need it?
This PR adds benchmark scripts for NPU so that developers can run performance tests on their own machines with one line of code.

Signed-off-by: wangli <wangli858794774@gmail.com>
2025-03-21 15:54:34 +08:00