* benchmark-matmul: fix command-line parsing, replace macros with functions, report results in GFLOPS