options

Loops Index

25 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 0.03% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis

Columns Filter

Level Exclusive Coverage orig_0 (%) Inclusive Coverage orig_0 (%) Max Exclusive Time Over Threads orig_0 (s) Max Inclusive Time Over Threads orig_0 (s) Exclusive Time w.r.t. Wall Time orig_0 (s) Inclusive Time w.r.t. Wall Time orig_0 (s) Nb Threads orig_0 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing orig_0 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency
Loop idSource LocationSource FunctionLevelExclusive Coverage orig_0 (%)Inclusive Coverage orig_0 (%)Max Exclusive Time Over Threads orig_0 (s)Max Inclusive Time Over Threads orig_0 (s)Exclusive Time w.r.t. Wall Time orig_0 (s)Inclusive Time w.r.t. Wall Time orig_0 (s)Nb Threads orig_0Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing orig_0Stride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency
2221libggml-cpu.so - quants.c:979-1002 [...]ggml_vec_dot_q8_0_q8_0Single1.221.220.870.870.240.2472NANANANANA3.73NANANANANA0.00
×