* ggml : refactor forward_dup for cpu backend * clean up a bit * add quant/dequant perf test
The note is not visible to the blocked user.