52 lines
1.7 KiB
Markdown
52 lines
1.7 KiB
Markdown
## benchmark测试脚本使用方式
|
|
|
|
Torch-MLU-Ops benchmark测试脚本为用户提供了进行算子性能测试的便捷入口。
|
|
用户可通过以下命令获取各个参数的含义。
|
|
|
|
```bash
|
|
# 测试命令帮助
|
|
python3 benchmark_xxx.py --help
|
|
```
|
|
各个参数含义如下:
|
|
|
|
`options`:
|
|
- -h, --help show this help message and exit
|
|
- --repeat_times REPEAT_TIMES repeat times for testing
|
|
- --csv write the report data to csv
|
|
- -o O specify the output folder name under --csv mode
|
|
|
|
```bash
|
|
# 测试命令示例如下
|
|
python3 benchmark_active.py --repeat_times 10 --csv -o './active/'
|
|
```
|
|
支持如下算子:
|
|
|
|
| op_name |
|
|
| ---------------------------------|
|
|
| active |
|
|
| apply_rotary |
|
|
| attention_project |
|
|
| ffn |
|
|
| flash_attn |
|
|
| fused_layer_norm |
|
|
| fused_moe |
|
|
| fused_norm_attention_project |
|
|
| fused_norm_residual_ffn |
|
|
| fused_rms_norm |
|
|
| group_gemm |
|
|
| matmul |
|
|
| offline_quant_to_linear_cache |
|
|
| per_token_smooth_quantize |
|
|
| preload |
|
|
| quantize |
|
|
| reshape_linear_cache |
|
|
| quant_to_linear_cache |
|
|
| reshape_paged_cache |
|
|
| single_query_cached_kv_attn |
|
|
| smooth_quant_matmul |
|
|
| weight_only_quant_matmul |
|
|
| moe_gen_idx |
|
|
| moe_expand_input |
|
|
| moe_softmax_topk |
|
|
| moe_combine_result |
|