Help is available by moving the cursor above any symbol or by checking MAQAO website.
Global Metrics
Total Time (s)
309.57
Profiled Time (s)
129.11
Time in analyzed loops (%)
17.8
Time in analyzed innermost loops (%)
17.1
Time in user code (%)
17.6
Compilation Options Score (%)
87.0
Array Access Efficiency (%)
96.0
Potential Speedups
Perfect Flow Complexity
1.00
Perfect OpenMP + MPI + Pthread
1.02
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution
2.46
No Scalar Integer
Potential Speedup
1.00
Nb Loops to get 80%
8
FP Vectorised
Potential Speedup
1.01
Nb Loops to get 80%
9
Fully Vectorised
Potential Speedup
1.10
Nb Loops to get 80%
9
FP Arithmetic Only
Potential Speedup
1.08
Nb Loops to get 80%
7
OpenMP perfectly balanced
Potential Speedup
1.02
Nb Loops to get 80%
4
CQA Potential Speedups Summary
Loop Based Profile⏎
Innermost Loop Based Profile⏎
Application Categorization⏎
Compilation Options⏎
Source Object
Issue
▼bench_jastrow–
○
-g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target)
▼libqmckl.so.0.0.0–
○qmckl_distance_f.F90
○qmckl_jastrow_champ.c
○qmckl_jastrow_champ_f.F90
Loop Path Count Profile⏎
Cumulated Speedup If No Scalar Integer⏎
Cumulated Speedup If FP Vectorized⏎
Cumulated Speedup If Fully Vectorized⏎
Cumulated Speedup If FP Arithmetic Only⏎
Cumulated Speedup If OpenMP perfetecly balanced⏎
Experiment Summary
Application
./../qmckl_bench/build/bench_jastrow
Timestamp
2024-02-14 14:17:23
Universal Timestamp
1707916643
Number of processes observed
1
Number of threads observed
52
Experiment Type
OpenMP;
Machine
skylake
Model Name
Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz
Architecture
x86_64
Micro Architecture
SKYLAKE
Cache Size
36608 KB
Number of Cores
26
OS Version
Linux 6.5.7-arch1-1 #1 SMP PREEMPT_DYNAMIC Tue, 10 Oct 2023 21:10:21 +0000