Metric | Value |
---|---|
Total Time (s) | 117.81 |
Max (Thread Active Time) (s) | 95.37 |
Average Active Time (s) | 65.99 |
Time in analyzed loops (%) | 77.2 |
Time in analyzed innermost loops (%) | 38.6 |
Time in user code (%) | 96.8 |
Compilation Options Score (%) | 100 |
Array Access Efficiency (%) | 67.0 |

Potential Speedups | Potential Speedup | Nb Loops to get 80% |
---|---|---|
Perfect Flow Complexity | 1.00 | |
Perfect OpenMP + MPI + Pthread | 1.01 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.46 | |
No Scalar Integer | 1.45 | 5 |
FP Vectorised | 1.14 | 4 |
Fully Vectorised | 1.35 | 9 |
FP Arithmetic Only | 1.75 | 10 |

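
The gap between the maximum and average thread active times reported above can be turned into a rough load-imbalance indicator. The sketch below is only an illustration: the function name `imbalance_ratio` is hypothetical, and the max/average ratio is a generic heuristic, not necessarily the formula MAQAO uses for its "Perfect Load Distribution" speedup.

```python
# Values copied from the global metrics reported above.
max_thread_active_s = 95.37  # Max (Thread Active Time) (s)
avg_thread_active_s = 65.99  # Average Active Time (s)


def imbalance_ratio(max_active: float, avg_active: float) -> float:
    """Return busiest-thread active time divided by the average.

    1.0 means every thread was active for the same duration; larger
    values mean some threads were active for less time than the busiest one.
    This is a generic heuristic (an assumption), not MAQAO's own metric.
    """
    if avg_active <= 0:
        raise ValueError("average active time must be positive")
    return max_active / avg_active


ratio = imbalance_ratio(max_thread_active_s, avg_thread_active_s)
# Share of the busiest thread's active time that an average thread was not active.
idle_fraction = 1.0 - avg_thread_active_s / max_thread_active_s

print(f"max/avg active time: {ratio:.2f}")
print(f"average inactive share relative to busiest thread: {idle_fraction:.1%}")
```

A ratio well above 1.0, as here, suggests that work is unevenly spread across the 72 observed threads, which is consistent with the report listing load distribution among its larger potential speedups.
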
Source Object | Issue |
---|---|
▼libatlab-1.so.0.0.0 | |
○domain.f90 | |
○f_functions.f90 | |
○numerics.f90 | |
▼libPSolver-1.so.9.0.0 | |
○exctx_calculation.f90 | |
○PSolver_Base_new.f90 | |
○PStypes.f90 | |
○PSolver_Core.f90 | |
▼[vdso] | |
○ | -g is missing for some functions (possibly ones added by the compiler); it is needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libfutile-1.so.9.0.0 | |
○fft3d.f90 |

Field | Value |
---|---|
Application | /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/gcc/14.2.0/ucx/1.17.0/cuda/12.5/bin/mpirun |
Timestamp | 2024-12-11 14:20:33 |
Universal Timestamp | 1733923233 |
Number of processes observed | 1 |
Number of threads observed | 72 |
Experiment Type | OpenMP |
Machine | pm6-nod059 |
Architecture | aarch64 |
Micro Architecture | ARM_NEOVERSE_V2 |
OS Version | Linux 5.14.0-362.24.1.el9_3.aarch64+64k #1 SMP PREEMPT_DYNAMIC Thu Feb 15 09:20:29 EST 2024 |
Architecture used during static analysis | aarch64 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V2 |
Frequency Driver | cppc_cpufreq |
Frequency Governor | performance |
Huge Pages | never |
Hyperthreading | off |
Number of sockets | 4 |
Number of cores per socket | 72 |
Compilation Options ([vdso]) | N/A |
Compilation Options (libPSolver-1.so.9.0.0) | GNU Fortran2008 14.2.0 -mcpu=neoverse-v2+nosve -mlittle-endian -mabi=lp64 -g -O3 -funroll-loops -fno-omit-frame-pointer -fopenmp -fPIC -fallow-argument-mismatch -fintrinsic-modules-path /software/cepp/Linux/rhel-9.3/aarch64/compilers/gcc/14.2.0/bin/../lib/gcc/aarch64-pc-linux-gnu/14.2.0/finclude -fpre-include=/usr/include/finclude/math-vector-fortran.h |
Compilation Options (libatlab-1.so.0.0.0) | GNU Fortran2008 14.2.0 -mcpu=neoverse-v2+nosve -mlittle-endian -mabi=lp64 -g -O3 -funroll-loops -fno-omit-frame-pointer -fopenmp -fPIC -fallow-argument-mismatch -fintrinsic-modules-path /software/cepp/Linux/rhel-9.3/aarch64/compilers/gcc/14.2.0/bin/../lib/gcc/aarch64-pc-linux-gnu/14.2.0/finclude -fpre-include=/usr/include/finclude/math-vector-fortran.h |
Compilation Options (libfutile-1.so.9.0.0) | GNU Fortran2008 14.2.0 -mcpu=neoverse-v2+nosve -mlittle-endian -mabi=lp64 -g -O3 -funroll-loops -fno-omit-frame-pointer -fopenmp -fPIC -fallow-argument-mismatch -fintrinsic-modules-path /software/cepp/Linux/rhel-9.3/aarch64/compilers/gcc/14.2.0/bin/../lib/gcc/aarch64-pc-linux-gnu/14.2.0/finclude -fpre-include=/usr/include/finclude/math-vector-fortran.h |
Dataset | |
Run Command | <executable> --bind-to none -n 9 -- /home_nfs/blucidol/rev/scripts/job_run -auto -gnt -bot 0 -v -- /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_gcc14.2.0_nosve/psolver/tests/Fock -g P -n 216 -o 144 -a No |
Number Processes | 1 |
Number Nodes | 1 |
Filter | Not Used |
Profile Start | Not Used |