Loops
▶repack.cpp: 125 - 134.97 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2593 | 0.42 | 0.47 | 45.27 | 99.51 | 83.19 | 2985 | 0.01 | 0.00 | 0.11 | 99.19 | 85.93 | 2712 | 0.00 | 0.00 | 0.26 | 99.23 | 80.75 | ||||||
| 2590 | 0.01 | 0.00 | 0.19 | 99.23 | 80.75 | 2990 | 0.42 | 0.53 | 45.49 | 99.48 | 85.43 | 2715 | 0.42 | 0.46 | 43.65 | 99.51 | 83.19 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 2593) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2990) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2715) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||
| More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | ||||||||||||||||||
| Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | |||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||
| Use of masked instructions | 1 | Use of masked instructions | 1 | Use of masked instructions | 1 | ||||||||||||||||||
▶repack.cpp: 153 - 22.27 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2586 | 0.10 | 0.07 | 6.98 | 99.37 | 47.56 | 2980 | 0.12 | 0.09 | 7.73 | 0 | 0 | 2708 | 0.11 | 0.08 | 7.56 | 99.37 | 47.56 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 2586) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2980) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2708) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | Presence of special instructions executing on a single port | 1 | |||||||||||||||||||
| More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | More than 20% of the loads are accessing the stack | 1 | |||||||||||||||||||
| Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | |||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | Presence of special instructions executing on a single port | 1 | |||||||||||||||||||
| Use of masked instructions | 1 | Use of masked instructions | Use of masked instructions | 1 | |||||||||||||||||||
▶vec.h: 1084 - 8.40 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 765 | 0.03 | 0.02 | 2.30 | 98 | 98.13 | 873 | 0.04 | 0.03 | 2.43 | 100 | 100 | 820 | 0.04 | 0.04 | 3.67 | 98 | 98.13 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 765) | Sum on 1 analyzed binary loop (libggml-cpu.so - 873) | Sum on 1 analyzed binary loop (libggml-cpu.so - 820) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||||||||
| Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||||||
| Control Flow Issues | Control Flow Issues | Control Flow Issues | |||||||||||||||||||||
| Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | ||||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 0 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||
| More than 10% of the vector loads instructions are unaligned | 1 | More than 10% of the vector loads instructions are unaligned | 1 | More than 10% of the vector loads instructions are unaligned | 1 | ||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 0 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||||||||
| Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 1 | ||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | Presence of constant non-unit stride data access | 0 | Presence of constant non-unit stride data access | 1 | ||||||||||||||||||
| Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | |||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 0 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||
| Use of masked instructions | 1 | Use of masked instructions | 1 | Use of masked instructions | 1 | ||||||||||||||||||
▶repack.cpp: 190 - 7.97 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2562 | 0.04 | 0.02 | 2.24 | 89.71 | 38.68 | 2954 | 0.04 | 0.03 | 2.81 | 61.47 | 31.66 | 2684 | 0.05 | 0.03 | 2.92 | 89.6 | 38.84 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 2562) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2954) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2684) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||||||||
| Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 0 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 0 | ||||||||||||||||||
| Control Flow Issues | Control Flow Issues | Control Flow Issues | |||||||||||||||||||||
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | ||||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||
| More than 10% of the vector loads instructions are unaligned | 0 | More than 10% of the vector loads instructions are unaligned | 1 | More than 10% of the vector loads instructions are unaligned | 0 | ||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||
| More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 0 | More than 20% of the loads are accessing the stack | 1 | ||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||||||||
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | Presence of more than 4 paths | 1 | ||||||||||||||||||
| Inefficient Vectorization | Inefficient Vectorization | Inefficient Vectorization | |||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | ||||||||||||||||||
| Use of masked instructions | 1 | Use of masked instructions | 0 | Use of masked instructions | 1 | ||||||||||||||||||
▶binary-ops.cpp: 10 - 2.43 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 398 | 0.03 | 0.01 | 0.70 | 0 | 6.25 | 444 | 0.02 | 0.01 | 0.87 | 0 | 6.25 | 397 | 0.02 | 0.01 | 0.86 | 0 | 6.25 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 398) | Sum on 1 analyzed binary loop (libggml-cpu.so - 444) | Sum on 1 analyzed binary loop (libggml-cpu.so - 397) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||
▶ops.cpp: 6220 - 2.13 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1462 | 0.02 | 0.01 | 0.78 | 6.67 | 7.5 | 1734 | 0.02 | 0.01 | 0.65 | 15.56 | 9.17 | 1560 | 0.02 | 0.01 | 0.70 | 0 | 6.25 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 1462) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1734) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1560) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||||||||
| Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | Presence of expensive FP instructions | 1 | ||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 0 | Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||
| Control Flow Issues | Control Flow Issues | Control Flow Issues | |||||||||||||||||||||
| Presence of calls | 1 | Presence of calls | 1 | Presence of calls | 1 | ||||||||||||||||||
| Data Access Issues | Data Access Issues | Data Access Issues | |||||||||||||||||||||
| More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | More than 20% of the loads are accessing the stack | 1 | ||||||||||||||||||
| Vectorization Roadblocks | Vectorization Roadblocks | Vectorization Roadblocks | |||||||||||||||||||||
| Presence of calls | 1 | Presence of calls | 1 | Presence of calls | 1 | ||||||||||||||||||
▶ops.cpp: 4325 - 2.04 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1195 | 0.03 | 0.01 | 0.68 | 0 | 7.81 | 1418 | 0.02 | 0.01 | 0.70 | 0 | 7.81 | 1278 | 0.03 | 0.01 | 0.67 | 100 | 62.5 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 1195) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1418) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1278) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | Loop Computation Issues | Loop Computation Issues | |||||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||
▶vec.h: 491 - 1.75 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1723 | 0.01 | 0.00 | 0.46 | 100 | 75 | 2109 | 0.01 | 0.01 | 0.47 | 100 | 75 | 1825 | 0.03 | 0.01 | 0.82 | 100 | 75 | ||||||
| Sum on 1 analyzed binary loop (libggml-cpu.so - 1723) | Sum on 1 analyzed binary loop (libggml-cpu.so - 2109) | Sum on 1 analyzed binary loop (libggml-cpu.so - 1825) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶binary-ops.cpp: 18 - 1.17 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 484 | 0.01 | 0.00 | 0.41 | 0 | 6.25 | 558 | 0.01 | 0.00 | 0.26 | 0 | 6.25 | 483 | 0.02 | 0.01 | 0.50 | 0 | 6.25 | ||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶<unknown>: 0 - 1.13 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2640 | 0.00 | 0.00 | 0.01 | 0 | 0 | 1023 | 0.01 | 0.00 | 0.01 | 0 | 0 | 2686 | 0.01 | 0.00 | 0.01 | 0 | 0 | 4502 | 0.00 | 0.00 | 0.00 | 0 | 0 |
| 2458 | 0.00 | 0.00 | 0.01 | 0 | 0 | 2395 | 0.01 | 0.00 | 0.01 | 0 | 0 | 2786 | 0.01 | 0.00 | 0.02 | 0 | 0 | 4505 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 2842 | 0.00 | 0.00 | 0.01 | 0 | 0 | 2504 | 0.02 | 0.00 | 0.02 | 0 | 0 | 2888 | 0.00 | 0.00 | 0.01 | 0 | 0 | 4527 | 0.00 | 0.00 | 0.00 | 0 | 0 |
| 2628 | 0.00 | 0.00 | 0.01 | 0 | 0 | 2294 | 0.01 | 0.00 | 0.01 | 0 | 0 | 99 | 0.00 | 0.00 | 0.01 | 0 | 0 | 1896 | 0.00 | 0.00 | 0.00 | 0 | 0 |
| 2741 | 0.00 | 0.00 | 0.01 | 0 | 0 | 2298 | 0.01 | 0.00 | 0.01 | 0 | 0 | 1832 | 0.00 | 0.00 | 0.06 | 0 | 0 | 726 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 2583 | 0.00 | 0.00 | 0.01 | 0 | 0 | 192 | 0.01 | 0.00 | 0.01 | 0 | 0 | 357 | 0.00 | 0.00 | 0.01 | 0 | 0 | 328 | 0.00 | 0.00 | 0.00 | 0 | 0 |
| 1709 | 0.01 | 0.00 | 0.09 | 0 | 0 | 1948 | 0.00 | 0.00 | 0.02 | 0 | 0 | 1806 | 0.00 | 0.00 | 0.01 | 0 | 0 | 735 | 0.04 | 0.00 | 0.00 | 0 | 0 |
| 1456 | 0.01 | 0.00 | 0.09 | 0 | 0 | 368 | 0.00 | 0.00 | 0.01 | 0 | 0 | 2705 | 0.00 | 0.00 | 0.02 | 0 | 0 | 3844 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 99 | 0.00 | 0.00 | 0.01 | 0 | 0 | 124 | 0.01 | 0.00 | 0.02 | 0 | 0 | 2666 | 0.00 | 0.00 | 0.10 | 0 | 0 | 122 | 0.01 | 0.00 | 0.00 | 0 | 0 |
| 358 | 0.00 | 0.00 | 0.01 | 0 | 0 | 1605 | 0.00 | 0.00 | 0.01 | 0 | 0 | 91 | 0.01 | 0.00 | 0.00 | 0 | 0 | ||||||
| 1704 | 0.00 | 0.00 | 0.01 | 0 | 0 | 2236 | 0.01 | 0.00 | 0.01 | 0 | 0 | 997 | 0.01 | 0.00 | 0.02 | 0 | 0 | ||||||
| 1730 | 0.01 | 0.00 | 0.05 | 0 | 0 | 2094 | 0.00 | 0.00 | 0.03 | 0 | 0 | 3742 | 0.01 | 0.00 | 0.00 | 0 | 0 | ||||||
| 826 | 0.00 | 0.00 | 0.01 | 0 | 0 | 2099 | 0.01 | 0.00 | 0.05 | 0 | 0 | 605 | 0.01 | 0.01 | 0.05 | 0 | 0 | ||||||
| 2111 | 0.01 | 0.00 | 0.06 | 0 | 0 | 103 | 0.00 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 2237 | 0.00 | 0.00 | 0.01 | 0 | 0 | 3746 | 0.01 | 0.00 | 0.00 | 0 | 0 | ||||||||||||
| 97 | 0.00 | 0.00 | 0.00 | 0 | 0 | ||||||||||||||||||
| 3112 | 0.02 | 0.01 | 0.04 | 0 | 0 | ||||||||||||||||||
| 2679 | 0.01 | 0.00 | 0.01 | 0 | 0 | ||||||||||||||||||
| 2681 | 0.02 | 0.00 | 0.03 | 0 | 0 | ||||||||||||||||||
| 735 | 0.02 | 0.00 | 0.03 | 0 | 0 | ||||||||||||||||||
| 1742 | 0.03 | 0.01 | 0.04 | 0 | 0 | ||||||||||||||||||
| 3 | 0.01 | 0.00 | 0.02 | 0 | 0 | ||||||||||||||||||
| 3759 | 0.01 | 0.00 | 0.01 | 0 | 0 | ||||||||||||||||||
| 3088 | 0.01 | 0.00 | 0.01 | 0 | 0 | ||||||||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶repack.cpp: 1074 - 1.06 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 233 | 0.22 | 0.00 | 0.31 | 37.5 | 19.86 | 257 | 0.33 | 0.01 | 0.47 | 43.33 | 32.81 | 232 | 0.20 | 0.00 | 0.28 | 37.5 | 19.86 | ||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 257) | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | |||||||||||||||||||||||
| Presence of a large number of scalar integer instructions | 1 | ||||||||||||||||||||||
| Data Access Issues | |||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
| Presence of expensive instructions: scatter/gather | 1 | ||||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||
| More than 20% of the loads are accessing the stack | 1 | ||||||||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||||||||
| Presence of constant non-unit stride data access | 1 | ||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
| Inefficient Vectorization | |||||||||||||||||||||||
| Presence of expensive instructions: scatter/gather | 1 | ||||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||
▶quants.c: 682 - 0.96 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4148 | 0.15 | 0.12 | 0.96 | 93.02 | 21.37 | ||||||||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 4148) | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | |||||||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||
| Data Access Issues | |||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
| Inefficient Vectorization | |||||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||
▶ggml-cpu.c: 3204 - 0.67 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 0.01 | 0.00 | 0.26 | 100 | 100 | 6 | 0.01 | 0.00 | 0.19 | 100 | 100 | 5 | 0.01 | 0.00 | 0.22 | 100 | 100 | ||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶vec.cpp: 311 - 0.66 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 761 | 0.01 | 0.00 | 0.12 | 100 | 66.67 | 866 | 0.01 | 0.00 | 0.26 | 100 | 66.67 | 812 | 0.01 | 0.00 | 0.28 | 100 | 66.67 | ||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶quants.c: 2150 - 0.63 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | ||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2544 | 0.01 | 0.00 | 0.36 | 97.09 | 43.87 | 2936 | 0.01 | 0.00 | 0.28 | 96.81 | 44.08 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶vec.h: 508 - 0.54 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3756 | 0.12 | 0.07 | 0.54 | 85.05 | 20.79 | ||||||||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 3756) | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | |||||||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||
| Data Access Issues | |||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
| Inefficient Vectorization | |||||||||||||||||||||||
| Presence of special instructions executing on a single port | 1 | ||||||||||||||||||||||
▶ops.cpp: 6446 - 0.42 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1707 | 0.01 | 0.00 | 0.27 | 0 | 6.25 | 1554 | 0.01 | 0.00 | 0.15 | 0 | 6.25 | ||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶vec.h: 677 - 0.38 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | |||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1196 | 0.01 | 0.00 | 0.12 | 100 | 100 | 1412 | 0.01 | 0.00 | 0.12 | 100 | 100 | 1279 | 0.01 | 0.00 | 0.14 | 100 | 100 | ||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
▶vec.cpp: 331 - 0.25 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1728 | 0.06 | 0.03 | 0.25 | 5.88 | 8.82 | ||||||||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | Sum on 1 analyzed binary loop (libggml-cpu.so - 1728) | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||
| Loop Computation Issues | |||||||||||||||||||||||
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | ||||||||||||||||||||||
| Data Access Issues | |||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
| Vectorization Roadblocks | |||||||||||||||||||||||
| Presence of indirect access | 1 | ||||||||||||||||||||||
▶ops.cpp: 8885 - 0.12 %
| Run orig_default | Run icx_default | Run aocc_2 | Run icx_2 | ||||||||||||||||||||
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1811 | 0.01 | 0.00 | 0.12 | 0 | 6.25 | ||||||||||||||||||
| No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | No Optimizer analysis found for any assembly loop. More loops can be analyzed using option --optimizer-loop-count. | ||||||||||||||||||||
| Analysis | Count | Analysis | Count | Analysis | Count | Analysis | Count | ||||||||||||||||

