| Run orig_default | Run icx_default | Run aocc_10 | Run icx_3 |
| Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/simd-mappings.h: 130-130
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 108-109
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 129-131
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 1033-1042
| Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/simd-mappings.h: 130-130
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 108-109
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 129-131
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 1033-1042
| Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/simd-mappings.h: 130-130
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 108-109
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 129-131
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 1033-1042
| Loop Source Regions | |
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
| 2537 | 8.11 | 6.22 | 70.35 | 0 | 0 | 2929 | 8.13 | 6.21 | 70.08 | 0 | 0 | 1809 | 8.17 | 6.23 | 70.01 | 0 | 0 | |
| | | |
| Sum on 1 analyzed binary loop (libggml-cpu.so - 2537) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2929) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1809) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Run orig_default | Run icx_default | Run aocc_10 | Run icx_3 |
| Loop Source Regions | | Loop Source Regions | | Loop Source Regions | | Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/simd-mappings.h: 130-130
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/arch/x86/quants.c: 1066-1073
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
| | | 4223 | 8.31 | 6.34 | 69.85 | 88.89 | 20.14 |
| | | |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 4223) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| | | | | | Loop Computation Issues | |
| | | | | | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
| | | | | | Data Access Issues | |
| | | | | | Presence of indirect access | 1 |
| | | | | | Presence of special instructions executing on a single port | 1 |
| | | | | | Vectorization Roadblocks | |
| | | | | | Presence of indirect access | 1 |
| | | | | | Inefficient Vectorization | |
| | | | | | Presence of special instructions executing on a single port | 1 |
| Run orig_default | Run icx_default | Run aocc_10 | Run icx_3 |
| Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 138-138
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 818-846
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 1038-1038
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 1044-1044
| Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 138-138
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 818-821
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 827-846
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 966-966
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 1038-1038
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 1044-1044
| Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 138-138
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 826-846
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 966-966
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 1038-1038
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/llamafile/sgemm.cpp: 1044-1044
| Loop Source Regions | |
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
| 2364 | 0.08 | 0.05 | 0.56 | 0 | 0 | 2688 | 0.08 | 0.05 | 0.56 | 0 | 0 | 1659 | 0.06 | 0.03 | 0.35 | 0 | 0 | |
| | | |
| Sum on 1 analyzed binary loop (libggml-cpu.so - 2364) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2688) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1659) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Run orig_default | Run icx_default | Run aocc_10 | Run icx_3 |
| Loop Source Regions | | Loop Source Regions | | Loop Source Regions | | Loop Source Regions | |
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
| 2482 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2392 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2126 | 0.00 | 0.00 | 0.00 | 0 | 0 | 4189 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 2640 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2504 | 0.00 | 0.00 | 0.00 | 0 | 0 | 887 | 0.00 | 0.00 | 0.00 | 0 | 0 | 5015 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 2738 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1023 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2125 | 0.00 | 0.00 | 0.00 | 0 | 0 | 4515 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 2842 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2395 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2214 | 0.00 | 0.00 | 0.00 | 0 | 0 | 4520 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 2450 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2294 | 0.01 | 0.00 | 0.00 | 0 | 0 | 311 | 0.00 | 0.00 | 0.00 | 0 | 0 | 4518 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 2741 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2291 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2002 | 0.00 | 0.00 | 0.00 | 0 | 0 | 4524 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 99 | 0.03 | 0.01 | 0.06 | 0 | 0 | 2298 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2211 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1668 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 83 | 0.00 | 0.00 | 0.00 | 0 | 0 | 444 | 0.01 | 0.00 | 0.00 | 0 | 0 | 960 | 0.01 | 0.00 | 0.00 | 0 | 0 | 731 | 0.02 | 0.00 | 0.02 | 0 | 0 |
| 484 | 0.02 | 0.00 | 0.00 | 0 | 0 | 114 | 0.01 | 0.00 | 0.02 | 0 | 0 | 449 | 0.01 | 0.00 | 0.00 | 0 | 0 | 3862 | 0.00 | 0.00 | 0.00 | 0 | 0 |
| 90 | 0.01 | 0.00 | 0.02 | 0 | 0 | 106 | 0.01 | 0.00 | 0.00 | 0 | 0 | 37 | 0.01 | 0.00 | 0.03 | 0 | 0 | 1768 | 0.03 | 0.00 | 0.00 | 0 | 0 |
| 82 | 0.03 | 0.01 | 0.07 | 0 | 0 | 866 | 0.05 | 0.00 | 0.05 | 0 | 0 | 365 | 0.01 | 0.00 | 0.00 | 0 | 0 | 759 | 0.04 | 0.00 | 0.00 | 0 | 0 |
| 398 | 0.04 | 0.00 | 0.01 | 0 | 0 | 124 | 0.03 | 0.01 | 0.08 | 0 | 0 | 1796 | 0.02 | 0.00 | 0.05 | 0 | 0 | 115 | 0.02 | 0.01 | 0.07 | 0 | 0 |
| 85 | 0.01 | 0.00 | 0.00 | 0 | 0 | 108 | 0.00 | 0.00 | 0.00 | 0 | 0 | 40 | 0.01 | 0.00 | 0.00 | 0 | 0 | 3825 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 1709 | 0.01 | 0.00 | 0.01 | 0 | 0 | 2099 | 0.02 | 0.00 | 0.01 | 0 | 0 | 393 | 0.07 | 0.00 | 0.01 | 0 | 0 | 3808 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 1462 | 0.02 | 0.00 | 0.04 | 0 | 0 | 2109 | 0.03 | 0.00 | 0.05 | 0 | 0 | 1167 | 0.01 | 0.00 | 0.01 | 0 | 0 | 125 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 1456 | 0.01 | 0.00 | 0.00 | 0 | 0 | 1418 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1164 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2231 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 87 | 0.01 | 0.00 | 0.00 | 0 | 0 | 1707 | 0.01 | 0.00 | 0.00 | 0 | 0 | 1178 | 0.00 | 0.00 | 0.00 | 0 | 0 | 2710 | 0.02 | 0.00 | 0.00 | 0 | 0 |
| 1723 | 0.04 | 0.01 | 0.06 | 0 | 0 | 2111 | 0.01 | 0.00 | 0.00 | 0 | 0 | 1657 | 0.00 | 0.00 | 0.00 | 0 | 0 | 127 | 0.01 | 0.00 | 0.03 | 0 | 0 |
| 1195 | 0.02 | 0.00 | 0.00 | 0 | 0 | 102 | 0.02 | 0.00 | 0.02 | 0 | 0 | 36 | 0.00 | 0.00 | 0.00 | 0 | 0 | 119 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 1704 | 0.00 | 0.00 | 0.00 | 0 | 0 | 558 | 0.02 | 0.00 | 0.00 | 0 | 0 | 1027 | 0.02 | 0.01 | 0.06 | 0 | 0 | 654 | 0.01 | 0.00 | 0.01 | 0 | 0 |
| 5 | 0.01 | 0.00 | 0.00 | 0 | 0 | 2918 | 0.02 | 0.00 | 0.06 | 0 | 0 | 911 | 0.01 | 0.00 | 0.00 | 0 | 0 | 1023 | 0.02 | 0.00 | 0.00 | 0 | 0 |
| 761 | 0.02 | 0.00 | 0.04 | 0 | 0 | 2094 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1176 | 0.04 | 0.01 | 0.06 | 0 | 0 | 3142 | 0.02 | 0.00 | 0.02 | 0 | 0 |
| 765 | 0.05 | 0.00 | 0.01 | 0 | 0 | 6 | 0.01 | 0.00 | 0.00 | 0 | 0 | 648 | 0.03 | 0.00 | 0.00 | 0 | 0 | 3 | 0.01 | 0.00 | 0.01 | 0 | 0 |
| 2527 | 0.02 | 0.01 | 0.07 | 0 | 0 | 112 | 0.00 | 0.00 | 0.00 | 0 | 0 | 50 | 0.03 | 0.01 | 0.06 | 0 | 0 | 144 | 0.02 | 0.00 | 0.05 | 0 | 0 |
| 358 | 0.01 | 0.00 | 0.00 | 0 | 0 | 873 | 0.08 | 0.00 | 0.01 | 0 | 0 | 3 | 0.01 | 0.00 | 0.01 | 0 | 0 | 121 | 0.01 | 0.00 | 0.01 | 0 | 0 |
| 1734 | 0.03 | 0.00 | 0.03 | 0 | 0 | 43 | 0.01 | 0.00 | 0.02 | 0 | 0 | 3118 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 401 | 0.01 | 0.00 | 0.00 | 0 | 0 | 644 | 0.03 | 0.00 | 0.04 | 0 | 0 | |
| 1948 | 0.00 | 0.00 | 0.00 | 0 | 0 | 1024 | 0.01 | 0.00 | 0.01 | 0 | 0 | |
| | 38 | 0.01 | 0.00 | 0.00 | 0 | 0 | |
| | | |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| Run orig_default | Run icx_default | Run aocc_10 | Run icx_3 |
| Loop Source Regions | | Loop Source Regions | | Loop Source Regions | | Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/vec.h: 508-509
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/simd-mappings.h: 130-130
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/./ggml-impl.h: 346-346
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/./ggml-impl.h: 389-389
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/./ggml-impl.h: 399-404
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
| | | 3822 | 0.09 | 0.02 | 0.22 | 85.05 | 20.79 |
| | | |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 3822) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| | | | | | Loop Computation Issues | |
| | | | | | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
| | | | | | Data Access Issues | |
| | | | | | Presence of indirect access | 1 |
| | | | | | Presence of special instructions executing on a single port | 1 |
| | | | | | Vectorization Roadblocks | |
| | | | | | Presence of indirect access | 1 |
| | | | | | Inefficient Vectorization | |
| | | | | | Presence of special instructions executing on a single port | 1 |
| Run orig_default | Run icx_default | Run aocc_10 | Run icx_3 |
| Loop Source Regions | | Loop Source Regions | | Loop Source Regions | | Loop Source Regions | - /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/simd-mappings.h: 130-130
- /beegfs/hackathon/users/eoseret/qaas_runs_test/gmz12.benchmarkcenter.megware.com/176-976-6240/llama.cpp/build/llama.cpp/ggml/src/ggml-cpu/vec.cpp: 331-332
|
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
| | | 1754 | 0.07 | 0.01 | 0.16 | 5.88 | 8.82 |
| | | |
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1754) |
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count |
| | | | | | Loop Computation Issues | |
| | | | | | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 |
| | | | | | Data Access Issues | |
| | | | | | Presence of indirect access | 1 |
| | | | | | Vectorization Roadblocks | |
| | | | | | Presence of indirect access | 1 |