- r_1 - engine_NEON1M11-0001_o1_m96_acfl_bycore/ - 30 analyzed loop(s)
  - Loop 5290 - engine_linuxa64_ompi
  - Loop 10194 - engine_linuxa64_ompi
  - Loop 9850 - engine_linuxa64_ompi
  - Loop 38994 - engine_linuxa64_ompi
  - Loop 5587 - engine_linuxa64_ompi
  - Loop 38981 - engine_linuxa64_ompi
  - Loop 9938 - engine_linuxa64_ompi
  - Loop 47760 - engine_linuxa64_ompi
  - Loop 9877 - engine_linuxa64_ompi
  - Loop 9933 - engine_linuxa64_ompi
  - Loop 10130 - engine_linuxa64_ompi
  - Loop 5670 - engine_linuxa64_ompi
  - Loop 47929 - engine_linuxa64_ompi
  - Loop 10078 - engine_linuxa64_ompi
  - Loop 36824 - engine_linuxa64_ompi
  - Loop 9888 - engine_linuxa64_ompi
  - Loop 47848 - engine_linuxa64_ompi
  - Loop 39163 - engine_linuxa64_ompi
  - Loop 47726 - engine_linuxa64_ompi
  - Loop 10055 - engine_linuxa64_ompi
  - Loop 36822 - engine_linuxa64_ompi
  - Loop 5674 - engine_linuxa64_ompi
  - Loop 9851 - engine_linuxa64_ompi
  - Loop 4592 - engine_linuxa64_ompi
  - Loop 39158 - engine_linuxa64_ompi
  - Loop 38957 - engine_linuxa64_ompi
  - Loop 9890 - engine_linuxa64_ompi
  - Loop 39157 - engine_linuxa64_ompi
  - Loop 38992 - engine_linuxa64_ompi
  - Loop 4593 - engine_linuxa64_ompi
- r_2 - engine_NEON1M11-0001_o1_m96_gnu_bycore/ - 30 analyzed loop(s)
  - Loop 6195 - engine_linuxa64_gf_ompi
  - Loop 10581 - engine_linuxa64_gf_ompi
  - Loop 10282 - engine_linuxa64_gf_ompi
  - Loop 6410 - engine_linuxa64_gf_ompi
  - Loop 10533 - engine_linuxa64_gf_ompi
  - Loop 37963 - engine_linuxa64_gf_ompi
  - Loop 6475 - engine_linuxa64_gf_ompi
  - Loop 10312 - engine_linuxa64_gf_ompi
  - Loop 10472 - engine_linuxa64_gf_ompi
  - Loop 10349 - engine_linuxa64_gf_ompi
  - Loop 29120 - engine_linuxa64_gf_ompi
  - Loop 10457 - engine_linuxa64_gf_ompi
  - Loop 10304 - engine_linuxa64_gf_ompi
  - Loop 29118 - engine_linuxa64_gf_ompi
  - Loop 38138 - engine_linuxa64_gf_ompi
  - Loop 31053 - engine_linuxa64_gf_ompi
  - Loop 6477 - engine_linuxa64_gf_ompi
  - Loop 10347 - engine_linuxa64_gf_ompi
  - Loop 39915 - engine_linuxa64_gf_ompi
  - Loop 38106 - engine_linuxa64_gf_ompi
  - Loop 30875 - engine_linuxa64_gf_ompi
  - Loop 10283 - engine_linuxa64_gf_ompi
  - Loop 30872 - engine_linuxa64_gf_ompi
  - Loop 5672 - engine_linuxa64_gf_ompi
  - Loop 29119 - engine_linuxa64_gf_ompi
  - Loop 30849 - engine_linuxa64_gf_ompi
  - Loop 30878 - engine_linuxa64_gf_ompi
  - Loop 5677 - engine_linuxa64_gf_ompi
  - Loop 31047 - engine_linuxa64_gf_ompi
  - Loop 7665 - engine_linuxa64_gf_ompi

| Category | Analysis | Count | Percentage | Weighted Count |
|---|---|---|---|---|
| Loop Computation Issues | (category total) | 94 | | |
| Loop Computation Issues | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 49 | 81.67 | 0.63 |
| Loop Computation Issues | Presence of a large number of scalar integer instructions | 28 | 46.67 | 0.42 |
| Loop Computation Issues | Presence of expensive FP instructions | 13 | 21.67 | 0.11 |
| Loop Computation Issues | Large loop body over microp cache size | 2 | 3.33 | 0.03 |
| Loop Computation Issues | Bottleneck in the front-end | 2 | 3.33 | 0.03 |
| Control Flow Issues | (category total) | 25 | | |
| Control Flow Issues | Presence of 2 to 4 paths | 14 | 23.33 | 0.13 |
| Control Flow Issues | Presence of calls | 4 | 6.67 | 0.04 |
| Control Flow Issues | Non-innermost loop | 4 | 6.67 | 0.04 |
| Control Flow Issues | Presence of more than 4 paths | 3 | 5.00 | 0.02 |
| Data Access Issues | (category total) | 39 | | |
| Data Access Issues | Presence of constant non-unit stride data access | 27 | 45.00 | 0.50 |
| Data Access Issues | Presence of indirect access | 12 | 20.00 | 0.22 |
| Vectorization Roadblocks | (category total) | 69 | | |
| Vectorization Roadblocks | Presence of constant non-unit stride data access | 27 | 45.00 | 0.50 |
| Vectorization Roadblocks | Presence of 2 to 4 paths | 14 | 23.33 | 0.13 |
| Vectorization Roadblocks | Presence of indirect access | 12 | 20.00 | 0.22 |
| Vectorization Roadblocks | Presence of more than 4 paths | 8 | 13.33 | 0.07 |
| Vectorization Roadblocks | Non-innermost loop | 4 | 6.67 | 0.04 |
| Vectorization Roadblocks | Presence of calls | 4 | 6.67 | 0.04 |
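
The Percentage column appears to be each count divided by the 60 analyzed loops (30 in r_1 plus 30 in r_2). The report does not state this, so treat it as an assumption. The sketch below checks it against a few rows copied from the table above. The names `TOTAL_LOOPS`, `reported`, and `percentage` are illustrative and are not part of the tool. The Weighted Count formula is not given in the report and is not reproduced here.

```python
# Assumption: percentage = count / total analyzed loops across both runs.
TOTAL_LOOPS = 30 + 30  # r_1 and r_2 each list 30 analyzed loops

# (count, reported percentage) pairs copied from the summary table.
reported = {
    "FMA usage below 10%": (49, 81.67),
    "Large number of scalar integer instructions": (28, 46.67),
    "Expensive FP instructions": (13, 21.67),
    "Constant non-unit stride data access": (27, 45.00),
    "Indirect access": (12, 20.00),
    "More than 4 paths (vectorization roadblock)": (8, 13.33),
}


def percentage(count, total=TOTAL_LOOPS):
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100.0 * count / total, 2)


if __name__ == "__main__":
    for name, (count, pct) in reported.items():
        status = "matches" if abs(percentage(count) - pct) < 0.005 else "differs"
        print(f"{name}: {count}/{TOTAL_LOOPS} = {percentage(count):.2f}% ({status})")
```

Because the denominator is the number of loops rather than the number of reported issues, the percentages within a category need not sum to 100.
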

| Category | Analysis | r_1 | r_2 |
|---|---|---|---|
| Loop Computation Issues | Presence of expensive FP instructions | 6 | 7 |
| Loop Computation Issues | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 24 | 25 |
| Loop Computation Issues | Large loop body over microp cache size | 1 | 1 |
| Loop Computation Issues | Presence of a large number of scalar integer instructions | 16 | 12 |
| Loop Computation Issues | Bottleneck in the front-end | 1 | 1 |
| Control Flow Issues | Presence of calls | 1 | 3 |
| Control Flow Issues | Presence of 2 to 4 paths | 7 | 7 |
| Control Flow Issues | Presence of more than 4 paths | 2 | 1 |
| Control Flow Issues | Non-innermost loop | 1 | 3 |
| Data Access Issues | Presence of constant non-unit stride data access | 12 | 15 |
| Data Access Issues | Presence of indirect access | 5 | 7 |
| Vectorization Roadblocks | Presence of calls | 1 | 3 |
| Vectorization Roadblocks | Presence of 2 to 4 paths | 7 | 7 |
| Vectorization Roadblocks | Presence of more than 4 paths | 3 | 5 |
| Vectorization Roadblocks | Non-innermost loop | 1 | 3 |
| Vectorization Roadblocks | Presence of constant non-unit stride data access | 12 | 15 |
| Vectorization Roadblocks | Presence of indirect access | 5 | 7 |