options

Help is available by moving the cursor above any symbol or by checking MAQAO website.

Global Metrics

Metricr0r1
Total Time (s)547.8117.29
Profiled Time (s)546.1117.08
Time in analyzed loops (%)41.417.3
Time in analyzed innermost loops (%)41.316.2
Time in user code (%)41.417.4
Compilation Options Score (%)100100
Array Access Efficiency (%)74.955.6
Potential Speedups
Perfect Flow Complexity1.001.00
Perfect OpenMP + MPI + Pthread1.061.20
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution2.425.78
No Scalar IntegerPotential Speedup1.001.00
Nb Loops to get 80%11
FP VectorisedPotential Speedup1.001.00
Nb Loops to get 80%11
Fully VectorisedPotential Speedup1.001.15
Nb Loops to get 80%21
Only FP ArithmeticPotential Speedup1.571.15
Nb Loops to get 80%11

Cumulated Speedup If No Scalar Integer

Cumulated Speedup If FP Vectorized

Cumulated Speedup If Fully Vectorized

Cumulated Speedup If Only FP Arithmetic

Loop Based Profiles

Innermost / Single Loops

Inbetween Loops

Outermost Loops

Cumulated Coverage With All Loops

Innermost Loop Based Profiles

Coverage

Count

Application Categorization

Time

Coverage

Compilation Options

Source ObjectIssue
permute3d_1.omp.exe
permute3d_1.omp.cpp

Path Count Profiles

Coverage

Count

Low Iteration Count Profiles

Coverage

Count

Experiment Summaries

r0r1
Experiment Name
Application./permute3d_1.omp.exe./permute3d_1.locus440.exe
Timestamp2024-11-07 10:30:172024-11-07 10:00:37
Experiment TypeOpenMP; same as r0
Machineitp06.benchmarkcenter.megware.comsame as r0
Architecturex86_64same as r0
Micro ArchitectureICELAKE_SPsame as r0
Model NameIntel(R) Xeon(R) Platinum 8368 CPU @ 2.40GHzsame as r0
Cache Size58368 KBsame as r0
Number of Cores38same as r0
Maximal Frequency3.4 GHzsame as r0
OS VersionLinux 5.14.0-427.18.1.el9_4.x86_64 #1 SMP PREEMPT_DYNAMIC Tue May 28 06:27:02 EDT 2024same as r0
Architecture used during static analysisx86_64same as r0
Micro Architecture used during static analysisICELAKE_SPsame as r0
Compilation Options permute3d_1.omp.exe: clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --driver-mode=g++ --intel -I . -o permute3d_1.omp.o -c -g -O3 -mprefer-vector-width=512 -march=native -Wall -Wno-unknown-pragmas -fiopenmp -fiopenmp -fopenmp-targets=spir64 permute3d_1.omp.cpp -fveclib=SVML -fheinous-gnu-extensions permute3d_1.locus440.exe: clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --driver-mode=g++ --intel -I . -o permute3d_1.locus440.o -c -g -O3 -mprefer-vector-width=512 -march=native -Wall -Wno-unknown-pragmas -fiopenmp -fiopenmp -fopenmp-targets=spir64 permute3d_1.locus440.cpp -fveclib=SVML -fheinous-gnu-extensions
Number of processes observed1same as r0
Number of threads observed76same as r0
Frequency Driverintel_pstatesame as r0
Frequency Governorperformancesame as r0
Huge Pagesalwayssame as r0
Hyperthreadingonsame as r0
Number of sockets2same as r0
Number of cores per socket38same as r0
MAQAO version2.20.10same as r0
MAQAO build4ac1b2f5b5fdb6964b480406b6b2a13ea0924e38::20241106-170444same as r0
Commentssame as r0
×