options

Loops Index

69 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 1.20% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis

Columns Filter

Level Exclusive Coverage orig_0 (%) Inclusive Coverage orig_0 (%) Max Exclusive Time Over Threads orig_0 (s) Max Inclusive Time Over Threads orig_0 (s) Exclusive Time w.r.t. Wall Time orig_0 (s) Inclusive Time w.r.t. Wall Time orig_0 (s) Nb Threads orig_0 GFLOPS orig_0 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing orig_0 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency
Loop idSource LocationSource FunctionLevelExclusive Coverage orig_0 (%)Inclusive Coverage orig_0 (%)Max Exclusive Time Over Threads orig_0 (s)Max Inclusive Time Over Threads orig_0 (s)Exclusive Time w.r.t. Wall Time orig_0 (s)Inclusive Time w.r.t. Wall Time orig_0 (s)Nb Threads orig_0GFLOPS orig_0Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing orig_0Stride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency
452libggml-cpu.so - mmq.cpp:1573-1597 [...]ggml_backend_amx_mul_mat(ggml_compute_params const*, ggml_tensor*)::{lambda(int, int)#2}::operator()(int, int) const::{lambda()#1}::operator()() constSingle84.980.0024.600.0024.020.0019289.29NANANANANA1.04NANANANANA0.00
2925libggml-cpu.so - quants.c:298-355 [...]quantize_row_q8_0Single0.030.030.280.280.010.0161147.0660.729.6611.342.741.2202000100.00
×