Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
---|---|---|---|---|---|---|---|---|
Total Time (s) | 2.03 E3 | 1.04 E3 | 537.54 | 281.95 | 152.59 | 90.88 | 51.46 | |
Profiled Time (s) | 2.03 E3 | 1.03 E3 | 531.72 | 277.66 | 148.80 | 89.32 | 50.05 | |
Time in analyzed loops (%) | 94.3 | 93.4 | 91.6 | 90.3 | 85.5 | 72.9 | 67.7 | |
Time in analyzed innermost loops (%) | 87.5 | 86.5 | 84.7 | 83.1 | 78.6 | 67.0 | 62.1 | |
Time in user code (%) | 0 | 0.56 | 2.09 | 3.11 | 7.04 | 20.7 | 24.6 | |
Compilation Options Score (%) | 75.0 | 75.0 | 75.0 | 75.0 | 75.0 | 75.0 | 75.0 | |
Perfect Flow Complexity | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 | |
Array Access Efficiency (%) | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.02 | |
No Scalar Integer | Potential Speedup | 1.05 | 1.05 | 1.05 | 1.05 | 1.05 | 1.04 | 1.04 |
Nb Loops to get 80% | 9 | 11 | 11 | 12 | 14 | 14 | 14 | |
FP Vectorised | Potential Speedup | 1.42 | 1.41 | 1.40 | 1.39 | 1.36 | 1.29 | 1.26 |
Nb Loops to get 80% | 5 | 5 | 5 | 5 | 5 | 4 | 4 | |
Fully Vectorised | Potential Speedup | 1.91 | 1.92 | 1.89 | 1.87 | 1.79 | 1.60 | 1.54 |
Nb Loops to get 80% | 11 | 12 | 12 | 13 | 13 | 12 | 13 | |
Only FP Arithmetic | Potential Speedup | 1.43 | 1.44 | 1.43 | 1.42 | 1.39 | 1.32 | 1.29 |
Nb Loops to get 80% | 13 | 15 | 15 | 15 | 16 | 15 | 16 | |
Scalability - Gap | 1.00 | 1.02 | 1.06 | 1.11 | 1.20 | 1.43 | 1.62 |
Source Object | Issue |
---|---|
▼libgromacs_mpi.so.7.0.0 | |
▼lincs.cpp | |
○ | -march=(target) is missing. |
▼pbc.cpp | |
○ | -march=(target) is missing. |
▼domdec.cpp | |
○ | -march=(target) is missing. |
▼pme_redistribute.cpp | |
○ | -march=(target) is missing. |
▼fft5d.cpp | |
○ | -march=(target) is missing. |
▼calc_verletbuf.cpp | |
○ | -march=(target) is missing. |
▼stl_vector.h | |
○ | -march=(target) is missing. |
▼threaded_force_buffer.cpp | |
○ | -march=(target) is missing. |
▼bonded.cpp | |
○ | -march=(target) is missing. |
▼pme_pp.cpp | |
○ | -march=(target) is missing. |
▼localtopology.cpp | |
○ | -march=(target) is missing. |
▼pme_grid.cpp | |
○ | -march=(target) is missing. |
▼pme_solve.cpp | |
○ | -march=(target) is missing. |
▼kernel_outer.h | |
○ | -march=(target) is missing. |
▼arm_neon.h | |
○ | -march=(target) is missing. |
▼manage_threading.cpp | |
○ | -march=(target) is missing. |
▼kernel_prune.cpp | |
○ | -march=(target) is missing. |
▼atomdata.cpp | |
○ | -march=(target) is missing. |
▼pairlist.cpp | |
○ | -march=(target) is missing. |
▼settle.cpp | |
○ | -march=(target) is missing. |
▼pairs.cpp | |
○ | -march=(target) is missing. |
▼vec.h | |
○ | -march=(target) is missing. |
▼sim_util.cpp | |
○ | -march=(target) is missing. |
▼grid.cpp | |
○ | -march=(target) is missing. |
▼md_support.cpp | |
○ | -march=(target) is missing. |
▼pme_spread.cpp | |
○ | -march=(target) is missing. |
▼redistribute.cpp | |
○ | -march=(target) is missing. |
▼update.cpp | |
○ | -march=(target) is missing. |
▼mdatoms.cpp | |
○ | -march=(target) is missing. |
▼domdec_constraints.cpp | |
○ | -march=(target) is missing. |
r0 | r1 | r2 | r3 | r4 | r5 | r6 | |
---|---|---|---|---|---|---|---|
Application | /home/eoseret/GROMACS/build/gcc_1/bin/gmx_mpi | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Timestamp | 2023-02-21 16:28:39 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Experiment Type | MPI; | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Machine | ip-172-31-8-114 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture | arm64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | |||||||
Cache Size | |||||||
Number of Cores | |||||||
Maximal Frequency | 0 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
OS Version | Linux 5.15.0-1030-aws #34~20.04.1-Ubuntu SMP Tue Jan 24 15:16:39 UTC 2023 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | arm64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | libgromacs_mpi.so.7.0.0: GNU C++17 11.1.0 -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=c++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fexcess-precision=fast -funroll-all-loops -fopenmp -fasynchronous-unwind-tables -fstack-protector-strong -fstack-clash-protection | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of processes observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 |
Number of threads observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 |
MAQAO version | 2.16.3 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | Build information not available | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | GNU 11.1 (SIMD=Neon), AWS G3 (Neoverse V1), scalability | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |