| Total Time (s) | 374.98 |
| Profiled Time (s) | 374.74 |
| Time in analyzed loops (%) | 93.4 |
| Time in analyzed innermost loops (%) | 85.8 |
| Time in user code (%) | 95.2 |
| Compilation Options Score (%) | 100 |
| Perfect Flow Complexity | 1.03 |
| Array Access Efficiency (%) | Not Available |
| GFLOPS | 0.0 |
| Perfect OpenMP + MPI + Pthread | 1.00 |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 |
| Scenario | Potential Speedup | Nb Loops to get 80% |
| No Scalar Integer | 1.17 | 14 |
| FP Vectorised | 1.61 | 13 |
| Fully Vectorised | 1.67 | 15 |
| FP Arithmetic Only | 1.75 | 15 |
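
One rough way to read the potential speedups above is to convert each into an estimated run time by dividing the total time by the speedup. The sketch below assumes that each speedup applies to the whole-application Total Time of 374.98 s, which the report does not state explicitly. The helper name `estimated_time` is hypothetical and not part of MAQAO.

```python
# Values copied from the report. Treating each speedup as applying to the
# whole run is an assumption, not something the report states.
TOTAL_TIME_S = 374.98

POTENTIAL_SPEEDUPS = {
    "No Scalar Integer": 1.17,
    "FP Vectorised": 1.61,
    "Fully Vectorised": 1.67,
    "FP Arithmetic Only": 1.75,
}


def estimated_time(total_time_s: float, speedup: float) -> float:
    """Return the run time expected if the given speedup were fully achieved."""
    return total_time_s / speedup


if __name__ == "__main__":
    for scenario, speedup in POTENTIAL_SPEEDUPS.items():
        t = estimated_time(TOTAL_TIME_S, speedup)
        print(f"{scenario}: {t:.1f} s (saves {TOTAL_TIME_S - t:.1f} s)")
```

Under this assumption, even the most optimistic scenario (FP Arithmetic Only) would leave more than half of the original run time in place.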
| Source Object | Issue |
| ▼libgromacs_mpi.so.7 | |
| ○lincs.cpp | |
| ○pbc.cpp | |
| ○fft5d.cpp | |
| ○kernel_prune.cpp | |
| ○impl_arm_sve_util_float.h | |
| ○redistribute.cpp | |
| ○threaded_force_buffer.cpp | |
| ○pme_gather.cpp | |
| ○pme_grid.cpp | |
| ○localtopology.cpp | |
| ○pme_spread.cpp | |
| ○pme_solve.cpp | |
| ○kernel_outer.h | |
| ○vec.h | |
| ○manage_threading.cpp | |
| ○calc_verletbuf.cpp | |
| ○settle.cpp | |
| ○atomdata.cpp | |
| ○vcm.cpp | |
| ○pairs.cpp | |
| ○pairlist.cpp | |
| ○sim_util.cpp | |
| ○grid.cpp | |
| ○md_support.cpp | |
| ○update.cpp | |
| ○domdec_constraints.cpp | |
| ○partition.cpp | |
| ○mdatoms.cpp | |
| ○bonded.cpp | |
| Application | /home/eoseret/GROMACS/build/gcc_2/bin/gmx_mpi |
| Timestamp | 2023-08-08 09:21:48 |
| Universal Timestamp | 1691486508 |
| Number of processes observed | 1 |
| Number of threads observed | 1 |
| Experiment Type | MPI; OpenMP; |
| Machine | ip-172-31-47-199 |
| Architecture | aarch64 |
| Micro Architecture | ARM_NEOVERSE_V1 |
| OS Version | Linux 5.15.0-1039-aws #44~20.04.1-Ubuntu SMP Thu Jun 22 12:21:08 UTC 2023 |
| Architecture used during static analysis | aarch64 |
| Micro Architecture used during static analysis | ARM_NEOVERSE_V1 |
| Frequency Driver | NA |
| Frequency Governor | NA |
| Huge Pages | madvise |
| Hyperthreading | off |
| Number of sockets | 1 |
| Number of cores per socket | 64 |
| Compilation Options | libgromacs_mpi.so.7: GNU C++17 11.1.0 -march=armv8.2-a+sve -msve-vector-bits=256 -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=c++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fexcess-precision=fast -funroll-all-loops -fopenmp -fasynchronous-unwind-tables -fstack-protector-strong -fstack-clash-protection |
| Comments | GNU g++ 12.2.0 (SIMD=SVE), AWS G3 (Neoverse V1), 2000 steps, single core |
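
The Timestamp and Universal Timestamp rows above can be cross-checked against each other. The sketch below assumes that Universal Timestamp is a count of seconds since the Unix epoch, which the report does not state. The helper name `to_utc` is hypothetical.

```python
from datetime import datetime, timezone

UNIVERSAL_TIMESTAMP = 1691486508  # "Universal Timestamp" value from the report


def to_utc(epoch_seconds: int) -> datetime:
    """Convert seconds since the Unix epoch to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)


print(to_utc(UNIVERSAL_TIMESTAMP).strftime("%Y-%m-%d %H:%M:%S"))
# prints 2023-08-08 09:21:48
```

The result matches the Timestamp row, which suggests that the Timestamp value is expressed in UTC.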
| Dataset | |
| Run Command | <executable> mdrun -s ion_channel.tpr -nsteps 2000 -pin on -deffnm gcc |
| MPI Command | mpirun -n <number_processes> --bind-to core |
| Number Processes | 1 |
| Number Nodes | 1 |
| Number Processes per Nodes | 1 |
| Filter | Not Used |
| Profile Start | Not Used |
| Maximal Path Number | 4 |