Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
---|---|---|---|---|---|---|---|---|
Total Time (s) | 215.96 | 134.50 | 98.14 | 76.60 | 66.72 | 63.69 | 63.67 | |
Profiled Time (s) | 180.83 | 99.33 | 62.49 | 40.71 | 30.51 | 27.31 | 27.43 | |
Time in analyzed loops (%) | 23.7 | 22.9 | 20.1 | 19.4 | 20.4 | 20.9 | 20.3 | |
Time in analyzed innermost loops (%) | 20.8 | 20.1 | 17.9 | 17.5 | 19.1 | 19.9 | 19.7 | |
Time in user code (%) | 20.6 | 19.8 | 17.5 | 16.5 | 16.5 | 16.7 | 16.5 | |
Compilation Options Score (%) | 69.2 | 69.8 | 70.1 | 70.1 | 70.4 | 70.0 | 71.3 | |
Array Access Efficiency (%) | 92.8 | 92.8 | 93.0 | 92.5 | 92.9 | 93.7 | 94.1 | |
Scalability - Gap | 1.00 | 1.25 | 1.82 | 2.84 | 4.94 | 7.67 | 15.33 | |
Potential Speedups | ||||||||
Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.07 | 1.07 | 1.05 | 1.04 | 1.05 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.08 | 1.30 | 1.55 | 1.95 | 2.17 | 2.42 | |
No Scalar Integer | Potential Speedup | 1.01 | 1.01 | 1.01 | 1.01 | 1.00 | 1.00 | 1.00 |
Nb Loops to get 80% | 3 | 3 | 3 | 3 | 4 | 7 | 6 | |
FP Vectorised | Potential Speedup | 1.04 | 1.03 | 1.03 | 1.02 | 1.02 | 1.02 | 1.01 |
Nb Loops to get 80% | 8 | 8 | 9 | 9 | 9 | 9 | 8 | |
Fully Vectorised | Potential Speedup | 1.15 | 1.15 | 1.13 | 1.12 | 1.12 | 1.12 | 1.12 |
Nb Loops to get 80% | 12 | 12 | 11 | 11 | 9 | 8 | 7 | |
Only FP Arithmetic | Potential Speedup | 1.11 | 1.11 | 1.10 | 1.10 | 1.11 | 1.11 | 1.11 |
Nb Loops to get 80% | 7 | 7 | 7 | 6 | 6 | 6 | 6 | |
OpenMP perfectly balanced | Potential Speedup | 1.00 | 1.00 | 1.03 | 1.02 | 1.02 | 1.02 | 1.02 |
Nb Loops to get 80% | 1 | 4 | 3 | 4 | 4 | 4 | 4 |
Source Object | Issue |
---|---|
▼bench_jastrow | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libqmckl.so.0.0.0 | |
▼qmckl_distance_f.F90 | |
○ | |
▼qmckl_jastrow_champ.c | |
○ | |
▼qmckl_jastrow_champ_f.F90 | |
○ |
r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
---|---|---|---|---|---|---|---|
Application | ./../qmckl_bench/build/bench_jastrow | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Timestamp | 2024-02-13 18:27:09 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Experiment Type | Sequential | OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
Machine | skylake | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | SKYLAKE | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Cache Size | 36608 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of Cores | 26 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Maximal Frequency | 2.1 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
OS Version | Linux 6.5.7-arch1-1 #1 SMP PREEMPT_DYNAMIC Tue, 10 Oct 2023 21:10:21 +0000 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | SKYLAKE | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | bench_jastrow: N/A libqmckl.so.0.0.0: Intel(R) C Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.8.0 Build 20221119_000000 -I. -I/home/kcamus/comparative/qmckl/qmckl -I./include -I./src -I./include -I/home/kcamus/comparative/qmckl/qmckl/src -I/home/kcamus/comparative/qmckl/qmckl/include -I/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/ -I/home/kcamus/comparative/qmckl/trexio/_install/include -DHAVE_CONFIG_H -DQMCKL_TEST_DIR=\"/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/\" -march=native -ip -Ofast -ftz -finline -fopenmp -mkl=sequential -g -fno-omit-frame-pointer -fopenmp -MT src/qmckl_jastrow_champ.lo -MD -MP -MF src/.deps/qmckl_jastrow_champ.Tpo -c -fPIC -DPIC -o src/.libs/qmckl_jastrow_champ.o | libqmckl.so.0.0.0: Intel(R) C Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.8.0 Build 20221119_000000 -I. -I/home/kcamus/comparative/qmckl/qmckl -I./include -I./src -I./include -I/home/kcamus/comparative/qmckl/qmckl/src -I/home/kcamus/comparative/qmckl/qmckl/include -I/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/ -I/home/kcamus/comparative/qmckl/trexio/_install/include -DHAVE_CONFIG_H -DQMCKL_TEST_DIR=\"/home/kcamus/comparative/qmckl/qmckl/share/qmckl/test_data/\" -march=native -ip -Ofast -ftz -finline -fopenmp -mkl=sequential -g -fno-omit-frame-pointer -fopenmp -MT src/qmckl_jastrow_champ.lo -MD -MP -MF src/.deps/qmckl_jastrow_champ.Tpo -c -fPIC -DPIC -o src/.libs/qmckl_jastrow_champ.o bench_jastrow: N/A | same as r0 | same as r1 | same as r1 | same as r1 | same as r1 |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 1 | 2 | 4 | 8 | 16 | 26 | 52 |
Frequency Driver | intel_cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | always | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 26 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.19.0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | b37ee48e971324d4eaf9054a5a16e1bfd5003152::20240201-180403 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | - | - | - | - | - | - | - |