Files
2026-02-04 17:39:32 +08:00
..
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00
2026-02-04 17:39:32 +08:00

benchmark测试脚本使用方式

Torch-MLU-Ops benchmark测试脚本为用户提供了进行算子性能测试的便捷入口。 用户可通过以下命令获取各个参数的含义。

# 测试命令帮助
python3 benchmark_xxx.py --help

各个参数含义如下:

options:

  • -h, --help show this help message and exit
  • --repeat_times REPEAT_TIMES repeat times for testing
  • --csv write the report data to csv
  • -o O specify the output folder name under --csv mode
# 测试命令示例如下
python3 benchmark_active.py --repeat_times 10 --csv -o './active/'

支持如下算子:

op_name
active
apply_rotary
attention_project
ffn
flash_attn
fused_layer_norm
fused_moe
fused_norm_attention_project
fused_norm_residual_ffn
fused_rms_norm
group_gemm
matmul
offline_quant_to_linear_cache
per_token_smooth_quantize
preload
quantize
reshape_linear_cache
quant_to_linear_cache
reshape_paged_cache
single_query_cached_kv_attn
smooth_quant_matmul
weight_only_quant_matmul
moe_gen_idx
moe_expand_input
moe_softmax_topk
moe_combine_result