- r_1 - O2 - 10 analyzed loop(s)
  - Loop 36711 - engine_linuxa64_ompi
  - Loop 5796 - engine_linuxa64_ompi
  - Loop 13063 - engine_linuxa64_ompi
  - Loop 41148 - engine_linuxa64_ompi
  - Loop 10784 - engine_linuxa64_ompi
  - Loop 36691 - engine_linuxa64_ompi
  - Loop 6086 - engine_linuxa64_ompi
  - Loop 48493 - engine_linuxa64_ompi
  - Loop 37051 - engine_linuxa64_ompi
  - Loop 6174 - engine_linuxa64_ompi
- r_2 - O3 - 10 analyzed loop(s)
  - Loop 40764 - engine_linuxa64_ompi
  - Loop 6105 - engine_linuxa64_ompi
  - Loop 14281 - engine_linuxa64_ompi
  - Loop 45327 - engine_linuxa64_ompi
  - Loop 11492 - engine_linuxa64_ompi
  - Loop 40744 - engine_linuxa64_ompi
  - Loop 6415 - engine_linuxa64_ompi
  - Loop 52901 - engine_linuxa64_ompi
  - Loop 41106 - engine_linuxa64_ompi
  - Loop 6507 - engine_linuxa64_ompi
- r_3 - O3+nosve - 10 analyzed loop(s)
  - Loop 40706 - engine_linuxa64_ompi
  - Loop 6042 - engine_linuxa64_ompi
  - Loop 14390 - engine_linuxa64_ompi
  - Loop 45323 - engine_linuxa64_ompi
  - Loop 11606 - engine_linuxa64_ompi
  - Loop 40687 - engine_linuxa64_ompi
  - Loop 41049 - engine_linuxa64_ompi
  - Loop 6407 - engine_linuxa64_ompi
  - Loop 6547 - engine_linuxa64_ompi
  - Loop 31916 - engine_linuxa64_ompi
- r_4 - O3+nosve2 - 10 analyzed loop(s)
  - Loop 40770 - engine_linuxa64_ompi
  - Loop 6105 - engine_linuxa64_ompi
  - Loop 14284 - engine_linuxa64_ompi
  - Loop 45334 - engine_linuxa64_ompi
  - Loop 11495 - engine_linuxa64_ompi
  - Loop 40750 - engine_linuxa64_ompi
  - Loop 6414 - engine_linuxa64_ompi
  - Loop 52911 - engine_linuxa64_ompi
  - Loop 41112 - engine_linuxa64_ompi
  - Loop 6506 - engine_linuxa64_ompi

Category | Analysis | Count | Percentage (%) | Weighted Count |
Loop Computation Issues | Total | 64 | | |
Loop Computation Issues | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 39 | 97.50 | 1.00 |
Loop Computation Issues | Presence of a large number of scalar integer instructions | 25 | 62.50 | 0.64 |
Control Flow Issues | Total | 5 | | |
Control Flow Issues | Non-innermost loop | 4 | 10.00 | 0.08 |
Control Flow Issues | Presence of 2 to 4 paths | 1 | 2.50 | 0.01 |
Data Access Issues | Total | 24 | | |
Data Access Issues | Presence of constant non-unit stride data access | 16 | 40.00 | 0.28 |
Data Access Issues | Presence of indirect access | 8 | 20.00 | 0.16 |
Vectorization Roadblocks | Total | 42 | | |
Vectorization Roadblocks | Presence of constant non-unit stride data access | 16 | 40.00 | 0.28 |
Vectorization Roadblocks | Presence of more than 4 paths | 13 | 32.50 | 0.43 |
Vectorization Roadblocks | Presence of indirect access | 8 | 20.00 | 0.16 |
Vectorization Roadblocks | Non-innermost loop | 4 | 10.00 | 0.08 |
Vectorization Roadblocks | Presence of 2 to 4 paths | 1 | 2.50 | 0.01 |

Category | Analysis | r_1 | r_2 | r_3 | r_4 |
Loop Computation Issues | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 10 | 10 | 9 | 10 |
Loop Computation Issues | Presence of a large number of scalar integer instructions | 6 | 6 | 7 | 6 |
Control Flow Issues | Presence of 2 to 4 paths | 0 | 0 | 1 | 0 |
Control Flow Issues | Non-innermost loop | 1 | 1 | 1 | 1 |
Data Access Issues | Presence of constant non-unit stride data access | 4 | 4 | 4 | 4 |
Data Access Issues | Presence of indirect access | 2 | 2 | 2 | 2 |
Vectorization Roadblocks | Presence of 2 to 4 paths | 0 | 0 | 1 | 0 |
Vectorization Roadblocks | Presence of more than 4 paths | 3 | 3 | 4 | 3 |
Vectorization Roadblocks | Non-innermost loop | 1 | 1 | 1 | 1 |
Vectorization Roadblocks | Presence of constant non-unit stride data access | 4 | 4 | 4 | 4 |
Vectorization Roadblocks | Presence of indirect access | 2 | 2 | 2 | 2 |
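
The Count and Percentage columns of the summary table appear to follow directly from the per-run counts above: Count is the sum over r_1 to r_4, and Percentage is that sum divided by the 40 analyzed loops (4 runs of 10 loops each). The sketch below checks that arithmetic. The dictionary keys are shortened labels and `summarize` is a hypothetical helper, not part of the reporting tool. The Weighted Count column is left out because this section does not state how it is weighted.

```python
# Per-run counts (r_1, r_2, r_3, r_4) copied from the per-run table.
# Keys are shortened labels chosen for this sketch only.
PER_RUN = {
    "FMA below 10%": [10, 10, 9, 10],
    "Many scalar integer instructions": [6, 6, 7, 6],
    "2 to 4 paths": [0, 0, 1, 0],
    "More than 4 paths": [3, 3, 4, 3],
    "Non-innermost loop": [1, 1, 1, 1],
    "Constant non-unit stride access": [4, 4, 4, 4],
    "Indirect access": [2, 2, 2, 2],
}

RUNS = 4
LOOPS_PER_RUN = 10  # each run lists 10 analyzed loops
TOTAL_LOOPS = RUNS * LOOPS_PER_RUN


def summarize(counts, total_loops=TOTAL_LOOPS):
    """Return (count, percentage) for one analysis row (hypothetical helper)."""
    count = sum(counts)
    percentage = round(100.0 * count / total_loops, 2)
    return count, percentage


SUMMARY = {name: summarize(counts) for name, counts in PER_RUN.items()}

if __name__ == "__main__":
    for name, (count, percentage) in SUMMARY.items():
        print(f"{name} | {count} | {percentage:.2f}")
```

Each computed pair can be compared against the corresponding row of the summary table. A mismatch would indicate a transcription error in one of the two tables.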