Total Time (s) | 10.70 |
Profiled Time (s) | 10.69 |
Time in analyzed loops (%) | 100.0 |
Time in analyzed innermost loops (%) | 96.5 |
Time in user code (%) | 100 |
Compilation Options Score (%) | 50 |
Perfect Flow Complexity | 1.00 |
Iterations Count | 1.93 |
Array Access Efficiency (%) | 44.8 |
Perfect OpenMP + MPI + Pthread | 1.00 |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 |
No Scalar Integer | Potential Speedup | 1.32 |
No Scalar Integer | Nb Loops to get 80% | 1 |
FP Vectorised | Potential Speedup | 1.00 |
FP Vectorised | Nb Loops to get 80% | 1 |
Fully Vectorised | Potential Speedup | 1.03 |
Fully Vectorised | Nb Loops to get 80% | 2 |
Data In L1 Cache | Potential Speedup | 1.23 |
Data In L1 Cache | Nb Loops to get 80% | 1 |
FP Arithmetic Only | Potential Speedup | 2.06 |
FP Arithmetic Only | Nb Loops to get 80% | 1 |
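As a reading aid, the potential speedups above can be turned into estimated run times by dividing the total time by each speedup. The sketch below assumes each scenario applies in isolation to the whole 10.70 s run, which is a simplification. The names `POTENTIAL_SPEEDUPS`, `estimated_time`, and `rank_scenarios` are illustrative only and are not part of MAQAO.

```python
# Values copied from the report; the helper names are hypothetical.
TOTAL_TIME_S = 10.70

POTENTIAL_SPEEDUPS = {
    "No Scalar Integer": 1.32,
    "FP Vectorised": 1.00,
    "Fully Vectorised": 1.03,
    "Data In L1 Cache": 1.23,
    "FP Arithmetic Only": 2.06,
}


def estimated_time(total_time_s: float, speedup: float) -> float:
    """Estimated run time if the given speedup were fully achieved."""
    return total_time_s / speedup


def rank_scenarios(total_time_s, speedups):
    """Return (scenario, estimated time, time saved), largest saving first."""
    rows = []
    for name, speedup in speedups.items():
        est = estimated_time(total_time_s, speedup)
        rows.append((name, est, total_time_s - est))
    return sorted(rows, key=lambda row: row[2], reverse=True)


if __name__ == "__main__":
    for name, est, saved in rank_scenarios(TOTAL_TIME_S, POTENTIAL_SPEEDUPS):
        print(f"{name:<20} {est:6.2f} s  (saves {saved:5.2f} s)")
```

Under this assumption, "FP Arithmetic Only" ranks first, since 10.70 / 2.06 is roughly 5.19 s, while "FP Vectorised" at 1.00 leaves the run time unchanged.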
Source Object | Issue |
convf32_avx512 | |
codelet.c | -O2, -O3 or -Ofast is missing. |
codelet.c | -x(target) or -ax(target) is missing. |
Application | ./convf32_avx512 |
Timestamp | 2022-09-14 11:42:31 |
Universal Timestamp | 1663148551 |
Number of processes observed | 1 |
Number of threads observed | 1 |
Experiment Type | Sequential |
Machine | skylake |
Model Name | Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz |
Architecture | x86_64 |
Micro Architecture | SKYLAKE |
Cache Size | 36608 KB |
Number of Cores | 26 |
OS Version | Linux 5.16.9-arch1-1 #1 SMP PREEMPT Fri, 11 Feb 2022 22:42:06 +0000 |
Architecture used during static analysis | x86_64 |
Micro Architecture used during static analysis | SKYLAKE |
Compilation Options | convf32_avx512: clang based Intel(R) oneAPI DPC++/C++ Compiler 2022.0.0 (2022.0.0.20211123) |
Comments | 1E5 iterations, icx -O3 -fno-alias -ansi-alias -qoverride-limits -fp-model fast=2 -xCore-AVX512 |
Dataset | |
Run Command | <executable> |
Number Processes | 1 |
Number Nodes | 1 |
Filter | {type = number ; value = 1 ; } |
Profile Start | {unit = none ; value = 0 ; } |
Maximal Path Number | 4 |