
Loops Index

28 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). Together they represent about 0.04% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additional parameter --force-static-analysis.

Loop id                                              2416
Source Location                                      libggml-cpu.so - quants.c:1089-1112 [...]
Source Function                                      ggml_vec_dot_q8_0_q8_0
Level                                                Single
Exclusive Coverage armclang_3 (%)                    1.07
Inclusive Coverage armclang_3 (%)                    1.07
Max Exclusive Time Over Threads armclang_3 (s)       0.61
Max Inclusive Time Over Threads armclang_3 (s)       0.61
Exclusive Time w.r.t. Wall Time armclang_3 (s)       0.18
Inclusive Time w.r.t. Wall Time armclang_3 (s)       0.18
Nb Threads armclang_3                                72
Vectorization Ratio (%)                              53.85
Vector Length Use (%)                                63.46
Speedup If No Scalar Integer                         1.2
Speedup If FP Vectorized                             1.08
Speedup If Fully Vectorized                          1.42
Speedup If Perfect Load Balancing armclang_3         3.52
Stride 0                                             0
Stride 1                                             0
Stride n                                             0
Stride Unknown                                       0
Stride Indirect                                      0
Array Access Efficiency                              0.00
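
To put the per-loop speedup figures in perspective, the sketch below applies Amdahl's law to loop 2416. It assumes that the exclusive coverage (1.07%) can be read as the fraction of total run time spent in the loop, and that "Speedup If Fully Vectorized" (1.42) applies to that fraction only. Both are assumptions for illustration, not definitions taken from the report, and the function name amdahl_speedup is hypothetical.

```python
def amdahl_speedup(fraction: float, local_speedup: float) -> float:
    """Whole-application speedup when `fraction` of run time is sped up by `local_speedup`.

    Amdahl's law: 1 / ((1 - f) + f / s).
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    if local_speedup <= 0.0:
        raise ValueError("local_speedup must be positive")
    return 1.0 / ((1.0 - fraction) + fraction / local_speedup)


if __name__ == "__main__":
    # Values read from the loop 2416 row; treating coverage as a time fraction is an assumption.
    coverage = 1.07 / 100.0   # Exclusive Coverage (%)
    fully_vectorized = 1.42   # Speedup If Fully Vectorized
    print(f"Estimated application speedup: {amdahl_speedup(coverage, fully_vectorized):.4f}")
```

Under these assumptions, a loop that covers about 1% of the run cannot move the whole-application time by more than about 1%, whatever its local speedup.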