options

Loops Index

69 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 1.12% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis

Columns Filter

Level Exclusive Coverage gcc_4 (%) Inclusive Coverage gcc_4 (%) Max Exclusive Time Over Threads gcc_4 (s) Max Inclusive Time Over Threads gcc_4 (s) Exclusive Time w.r.t. Wall Time gcc_4 (s) Inclusive Time w.r.t. Wall Time gcc_4 (s) Nb Threads gcc_4 GFLOPS gcc_4 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing gcc_4 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency
Loop idSource LocationSource FunctionLevelExclusive Coverage gcc_4 (%)Inclusive Coverage gcc_4 (%)Max Exclusive Time Over Threads gcc_4 (s)Max Inclusive Time Over Threads gcc_4 (s)Exclusive Time w.r.t. Wall Time gcc_4 (s)Inclusive Time w.r.t. Wall Time gcc_4 (s)Nb Threads gcc_4GFLOPS gcc_4Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing gcc_4Stride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency
400libggml-cpu.so - mmq.cpp:1597-1597 [...](anonymous namespace)::tinygemm_kernel_vnni<block_q8_0, block_q8_0, float, 1, 64, 32>::apply(int, void const*, void const*, float*, int)Single86.7986.7924.7324.7324.1424.1419289.5287.6785.7111.071.040440177.78
2154libggml-cpu.so - quants.c:298-321 [...]quantize_row_q8_0Single0.030.030.310.310.010.0161046.7359.6529.281.031.42.861.23NANANANANA0.00
×