| Loop id | Source Location | Source Function | Level | Exclusive Coverage icx_10 (%) | Inclusive Coverage icx_10 (%) | Max Exclusive Time Over Threads icx_10 (s) | Max Inclusive Time Over Threads icx_10 (s) | Exclusive Time w.r.t. Wall Time icx_10 (s) | Inclusive Time w.r.t. Wall Time icx_10 (s) | Nb Threads icx_10 | GFLOPS icx_10 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing icx_10 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Array Access Efficiency |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 322 | libggml-cpu.so - mmq.cpp:1573-1597 [...] | ggml_backend_amx_mul_mat(ggml_compute_params const*, ggml_tensor*)::{lambda(int, int)#2}::operator()(int, int) const::{lambda()#1}::operator()() const | Single | 84.90 | 84.90 | 24.63 | 24.63 | 24.01 | 24.01 | 192 | 89.34 | NA | NA | NA | NA | NA | 1.05 | NA | NA | NA | NA | NA | 0.00 |
| 2663 | libggml-cpu.so - quants.c:298-355 [...] | quantize_row_q8_0 | Single | 0.03 | 0.03 | 0.35 | 0.35 | 0.01 | 0.01 | 6 | 1043.67 | 60.7 | 29.66 | 1 | 1.34 | 2.74 | 1.39 | 0 | 2 | 0 | 0 | 0 | 100.00 |