Help is available by moving the cursor above any symbol or by checking MAQAO website.
Total Time (s) | 165.71 | ||
Max (Thread Active Time) (s) | 83.85 | ||
Average Active Time (s) | 79.50 | ||
Time in analyzed loops (%) | 62.7 | ||
Time in analyzed innermost loops (%) | 48.1 | ||
Time in user code (%) | 80.8 | ||
Compilation Options Score (%) | 99.8 | ||
Array Access Efficiency (%) | 56.7 | ||
Potential Speedups | |||
Perfect Flow Complexity | 1.00 | ||
Perfect OpenMP + MPI + Pthread | 1.02 | ||
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.28 | ||
No Scalar Integer | Potential Speedup | 1.33 | |
Nb Loops to get 80% | 7 | ||
FP Vectorised | Potential Speedup | 1.12 | |
Nb Loops to get 80% | 5 | ||
Fully Vectorised | Potential Speedup | 1.28 | |
Nb Loops to get 80% | 9 | ||
FP Arithmetic Only | Potential Speedup | 1.34 | |
Nb Loops to get 80% | 10 |
Source Object | Issue |
---|---|
▼[vdso] | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libPSolver-1.so.9.0.0 | |
○exctx_calculation.f90 | |
○PSolver_Base_new.f90 | |
○PStypes.f90 | |
○PSolver_Core.f90 | |
▼libatlab-1.so.0.0.0 | |
○domain.f90 | |
○numerics.f90 | |
▼libfutile-1.so.9.0.0 | |
○fft3d.f90 |
Application | /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/acfl/24.04/ucx/1.17.0/cuda/12.5/bin/mpirun | ||||
Timestamp | 2024-12-11 14:17:12 | Universal Timestamp | 1733923032 | ||
Number of processes observed | 1 | Number of threads observed | 72 | ||
Experiment Type | OpenMP; | ||||
Machine | pm6-nod059 | ||||
Architecture | aarch64 | Micro Architecture | ARM_NEOVERSE_V2 | ||
OS Version | Linux 5.14.0-362.24.1.el9_3.aarch64+64k #1 SMP PREEMPT_DYNAMIC Thu Feb 15 09:20:29 EST 2024 | ||||
Architecture used during static analysis | aarch64 | Micro Architecture used during static analysis | ARM_NEOVERSE_V2 | ||
Frequency Driver | cppc_cpufreq | Frequency Governor | performance | ||
Huge Pages | never | Hyperthreading | off | ||
Number of sockets | 4 | Number of cores per socket | 72 | ||
Compilation Options | + [vdso]: N/A libPSolver-1.so.9.0.0: Arm F90 F90 Flang - 1.5 2017-05-01 flang -I . -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/psolver/src -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_acfl24.04_sve2_128/install/include/atlab -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_acfl24.04_sve2_128/install/include/futile -I /software/cepp/Linux/rhel-8.8/aarch64/compilers/acfl/24.04/armpl-24.04.0_RHEL-8_arm-linux-compiler/lib/../include -O3 -g -mcpu=neoverse-v2 -msve-vector-bits=128 -grecord-command-line -funroll-loops -fno-omit-frame-pointer -Wno-int-conversion -fopenmp -fPIC -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_acfl24.04_sve2_128/install/include -I /software/cepp/Linux/rhel-9.3/aarch64//utilities/python3/3.12.4/include -c -o PSolver_Core.o -I /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/acfl/24.04/ucx/1.17.0/cuda/12.5/include -I /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/acfl/24.04/ucx/1.17.0/cuda/12.5/lib libatlab-1.so.0.0.0: Arm F90 F90 Flang - 1.5 2017-05-01 flang -I ./ -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/atlab/src -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_acfl24.04_sve2_128/install/include/futile -I /software/cepp/Linux/rhel-8.8/aarch64/compilers/acfl/24.04/armpl-24.04.0_RHEL-8_arm-linux-compiler/lib/../include -O3 -g -mcpu=neoverse-v2 -msve-vector-bits=128 -grecord-command-line -funroll-loops -fno-omit-frame-pointer -Wno-int-conversion -fopenmp -fPIC -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_acfl24.04_sve2_128/install/include -I /software/cepp/Linux/rhel-9.3/aarch64//utilities/python3/3.12.4/include -c -o domain.o -I /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/acfl/24.04/ucx/1.17.0/cuda/12.5/include -I /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/acfl/24.04/ucx/1.17.0/cuda/12.5/lib libfutile-1.so.9.0.0: Arm F90 F90 Flang - 1.5 2017-05-01 flang -I . -I ../dicts/ -I ../flib/ -I ./mpi -I /software/cepp/Linux/rhel-8.8/aarch64/compilers/acfl/24.04/armpl-24.04.0_RHEL-8_arm-linux-compiler/lib/../include -O3 -g -mcpu=neoverse-v2 -msve-vector-bits=128 -grecord-command-line -funroll-loops -fno-omit-frame-pointer -Wno-int-conversion -fopenmp -fPIC -I /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_acfl24.04_sve2_128/install/include -I /software/cepp/Linux/rhel-9.3/aarch64//utilities/python3/3.12.4/include -c -o fft/fft3d.o -I /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/acfl/24.04/ucx/1.17.0/cuda/12.5/include -I /software/cepp/Linux/rhel-9.3/aarch64/mpi/openmpi/4.1.6/acfl/24.04/ucx/1.17.0/cuda/12.5/lib |
Dataset | |
Run Command | <executable> --bind-to none -n 18 -- /home_nfs/blucidol/rev/scripts/job_run -auto -gnt -bot 0 -v -- /home_nfs/blucidol/src/max/bigdft/bigdft-suite/build_grace_aarch64_acfl24.04_sve2_128/psolver/tests/Fock -g P -n 216 -o 144 -a No |
Number Processes | 1 |
Number Nodes | 1 |
Filter | Not Used |
Profile Start | Not Used |