options

Loops Index

78 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 1.61% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis

Columns Filter

Level Exclusive Coverage aocc_6 (%) Inclusive Coverage aocc_6 (%) Max Exclusive Time Over Threads aocc_6 (s) Max Inclusive Time Over Threads aocc_6 (s) Exclusive Time w.r.t. Wall Time aocc_6 (s) Inclusive Time w.r.t. Wall Time aocc_6 (s) Nb Threads aocc_6 GFLOPS aocc_6 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing aocc_6 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency
Loop idSource LocationSource FunctionLevelExclusive Coverage aocc_6 (%)Inclusive Coverage aocc_6 (%)Max Exclusive Time Over Threads aocc_6 (s)Max Inclusive Time Over Threads aocc_6 (s)Exclusive Time w.r.t. Wall Time aocc_6 (s)Inclusive Time w.r.t. Wall Time aocc_6 (s)Nb Threads aocc_6GFLOPS aocc_6Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing aocc_6Stride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency
511libggml-cpu.so - mmq.cpp:1570-1597 [...]ggml_backend_amx_mul_mat(ggml_compute_params const*, ggml_tensor*)::$_1::operator()(int, int) const::{lambda()#1}::operator()() constSingle85.2585.2524.5824.5823.9223.9219290.49NANANANANA1.04NANANANANA0.00
×