Loops Index

73 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). They represent about 1.56% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additional parameter --force-static-analysis.
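The cutoff described in the notice can be illustrated with a short Python sketch. This is only an illustration of the filtering rule, not the profiler's own code: the function name `split_by_coverage` and the `(loop id, coverage)` tuple layout are hypothetical, and the sample data are the two loops listed in this index.

```python
# Illustrative sketch only: not the profiler's implementation.
# The function name and data layout are assumptions made for this example.
from typing import List, Tuple

Loop = Tuple[int, float]  # (loop id, exclusive coverage in %)


def split_by_coverage(loops: List[Loop], threshold_pct: float) -> Tuple[List[Loop], List[Loop]]:
    """Return (kept, discarded); a loop is discarded when its coverage is below the threshold."""
    kept = [loop for loop in loops if loop[1] >= threshold_pct]
    discarded = [loop for loop in loops if loop[1] < threshold_pct]
    return kept, discarded


# The two loops reported in this index, with the 0.01% threshold from the notice.
loops = [(563, 84.92), (3339, 0.03)]
kept, discarded = split_by_coverage(loops, threshold_pct=0.01)
print([loop_id for loop_id, _ in kept])
```

Both listed loops sit above the 0.01% threshold, so neither is discarded; raising the threshold above 0.03 would drop loop 3339 under this rule.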

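Loop id: 563
Source Location: libggml-cpu.so - mmq.cpp:1570-1597 [...]
Source Function: ggml_backend_amx_mul_mat(ggml_compute_params const*, ggml_tensor*)::$_1::operator()(int, int) const::{lambda()#1}::operator()() const
Level: Single
Exclusive Coverage aocc_0 (%): 84.92
Inclusive Coverage aocc_0 (%): 84.92
Max Exclusive Time Over Threads aocc_0 (s): 24.50
Max Inclusive Time Over Threads aocc_0 (s): 24.50
Exclusive Time w.r.t. Wall Time aocc_0 (s): 23.97
Inclusive Time w.r.t. Wall Time aocc_0 (s): 23.97
Nb Threads aocc_0, GFLOPS aocc_0: 19289.54
Vectorization Ratio (%): NA
Vector Length Use (%): NA
Speedup If No Scalar Integer: NA
Speedup If FP Vectorized: NA
Speedup If Fully Vectorized: NA
Speedup If Perfect Load Balancing aocc_0: 1.04
Stride 0: NA
Stride 1: NA
Stride n: NA
Stride Unknown: NA
Stride Indirect: NA
Array Access Efficiency: 0.00

Loop id: 3339
Source Location: libggml-cpu.so - quants.c:298-355 [...]
Source Function: quantize_row_q8_0
Level: Single
Exclusive Coverage aocc_0 (%): 0.03
Inclusive Coverage aocc_0 (%): 0.03
Max Exclusive Time Over Threads aocc_0 (s): 0.29
Max Inclusive Time Over Threads aocc_0 (s): 0.29
Exclusive Time w.r.t. Wall Time aocc_0 (s): 0.01
Inclusive Time w.r.t. Wall Time aocc_0 (s): 0.01
Nb Threads aocc_0 through Speedup If Perfect Load Balancing aocc_0: 61046.8158.3328.7511.382.831.14
Stride 0: NA
Stride 1: NA
Stride n: NA
Stride Unknown: NA
Stride Indirect: NA
Array Access Efficiency: 0.00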