| Loop id | Source Location | Source Function | Level | Exclusive Coverage gcc_0 (%) | Inclusive Coverage gcc_0 (%) | Max Exclusive Time Over Threads gcc_0 (s) | Max Inclusive Time Over Threads gcc_0 (s) | Exclusive Time w.r.t. Wall Time gcc_0 (s) | Inclusive Time w.r.t. Wall Time gcc_0 (s) | Nb Threads gcc_0 | GFLOPS gcc_0 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing gcc_0 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Array Access Efficiency |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 405 | libggml-cpu.so - mmq.cpp:1597-1597 [...] | `(anonymous namespace)::tinygemm_kernel_vnni<block_q8_0, block_q8_0, float, 1, 64, 32>::apply(int, void const*, void const*, float*, int)` | Single | 86.35 | 86.35 | 24.65 | 24.65 | 24.14 | 24.14 | 192 | 89.25 | 87.67 | 85.7 | 1 | 1 | 1.07 | 1.04 | 0 | 4 | 2 | 0 | 1 | 78.57 |
| 2122 | libggml-cpu.so - quants.c:298-321 [...] | `quantize_row_q8_0` | Single | 0.03 | 0.03 | 0.36 | 0.36 | 0.01 | 0.01 | 6 | 1070.90 | 60 | 28.75 | 1 | 1.4 | 2.84 | 1.48 | NA | NA | NA | NA | NA | 0.00 |