Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
---|---|---|---|---|---|---|---|---|---|
Total Time (s) | 65.27 | 32.99 | 16.76 | 8.69 | 4.62 | 2.80 | 2.43 | 2.44 | |
Profiled Time (s) | 64.47 | 32.41 | 16.28 | 8.26 | 4.23 | 2.41 | 2.04 | 2.05 | |
Time in analyzed loops (%) | 63.3 | 63.6 | 62.6 | 62.2 | 61.8 | 59.7 | 55.3 | 57.9 | |
Time in analyzed innermost loops (%) | 16.5 | 16.9 | 16.5 | 16.8 | 16.9 | 17.7 | 18.1 | 18.1 | |
Time in user code (%) | 63.8 | 64.2 | 63.2 | 62.6 | 62.2 | 60.1 | 55.6 | 58.2 | |
Compilation Options Score (%) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
Array Access Efficiency (%) | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | |
Scalability - Gap | 1.00 | 1.01 | 1.03 | 1.06 | 1.13 | 1.37 | 2.38 | 3.00 | |
Potential Speedups | |||||||||
Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.02 | 1.02 | |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.02 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | 1.07 | 1.04 | |
No Scalar Integer | Potential Speedup | 1.34 | 1.34 | 1.33 | 1.33 | 1.33 | 1.31 | 1.26 | 1.28 |
Nb Loops to get 80% | 8 | 8 | 8 | 8 | 9 | 9 | 10 | 9 | |
FP Vectorised | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
Fully Vectorised | Potential Speedup | 1.25 | 1.25 | 1.24 | 1.24 | 1.24 | 1.23 | 1.18 | 1.17 |
Nb Loops to get 80% | 8 | 8 | 8 | 8 | 9 | 10 | 10 | 10 | |
Only FP Arithmetic | Potential Speedup | 1.61 | 1.61 | 1.60 | 1.59 | 1.58 | 1.56 | 1.48 | 1.53 |
Nb Loops to get 80% | 10 | 10 | 10 | 10 | 10 | 10 | 11 | 10 |
r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
---|---|---|---|---|---|---|---|---|
Experiment Name | ||||||||
Application | /users/m23012/camus/code/qmckl/qmckl_bench/bench_aos | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Timestamp | 2024-02-26 18:03:17 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Experiment Type | Sequential | OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
Machine | turpancomp2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | ARM_NEOVERSE_N1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | ||||||||
Cache Size | ||||||||
Number of Cores | ||||||||
Maximal Frequency | 3 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
OS Version | Linux 4.18.0-477.27.1.el8_8.aarch64 #1 SMP Thu Aug 31 11:00:23 EDT 2023 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ARM_NEOVERSE_N1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | libqmckl.so.0.0.0: Arm C/C++/Fortran Compiler version 23.10 (build number 32) (based on LLVM 17.0.0) | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | + [vdso]: N/A libqmckl.so.0.0.0: Arm C/C++/Fortran Compiler version 23.10 (build number 32) (based on LLVM 17.0.0) | same as r6 |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 80 |
Frequency Driver | cppc_cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | never | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 80 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.19.2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | b4419cd98e02cf0e1ddec16d03c3ae3a99469c7b::20240223-153634 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |