Loops Index

47 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). Together they represent about 0.22% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additional parameter --force-static-analysis.

| Loop id | Source Location | Source Function | Level | Exclusive Coverage gcc_0 (%) | Inclusive Coverage gcc_0 (%) | Max Exclusive Time Over Threads gcc_0 (s) | Max Inclusive Time Over Threads gcc_0 (s) | Exclusive Time w.r.t. Wall Time gcc_0 (s) | Inclusive Time w.r.t. Wall Time gcc_0 (s) | Nb Threads gcc_0 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing gcc_0 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Array Access Efficiency |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2041 | libggml-cpu.so - quants.c:979-1000 [...] | ggml_vec_dot_q8_0_q8_0 | Innermost | 10.60 | 10.60 | 3.49 | 3.49 | 2.72 | 2.72 | 72 | NA | NA | NA | NA | NA | 3.12 | NA | NA | NA | NA | NA | 0.00 |
| 756 | libggml-cpu.so - vec.cpp:385-387 [...] | ggml_vec_swiglu_f32 | Single | 0.05 | 0.05 | 0.39 | 0.39 | 0.01 | 0.01 | 4 | 80 | 97.66 | 1 | 1 | 1.03 | 3.85 | 0 | 0 | 0 | 3 | 0 | 50.00 |
| 3483 | libllama.so - stl_algo.h:1594-1595 [...] | llama_token_data_array_partial_sort_inplace(llama_token_data_array*, int) | Innermost | 0.05 | 0.05 | 0.40 | 0.40 | 0.01 | 0.01 | 1 | 0 | 30.36 | 2.17 | 1 | 3.06 | 1 | 1.5 | 0.5 | 0.5 | 0 | 0 | 93.75 |