Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | r9 | r10 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Total Time (s) | 83.93 | 82.66 | 82.60 | 72.15 | 68.10 | 75.77 | 70.47 | 65.40 | 69.33 | 66.54 | 66.49 | |
Max (Thread Active Time) (s) | 67.46 | 67.88 | 68.05 | 52.69 | 53.39 | 58.65 | 56.51 | 51.58 | 51.08 | 51.45 | 51.06 | |
Average Active Time (s) | 66.48 | 66.68 | 66.79 | 51.69 | 52.19 | 57.42 | 55.40 | 50.35 | 50.19 | 50.23 | 50.01 | |
Activity Ratio (%) | 85.2 | 85.1 | 85.2 | 81.2 | 81.5 | 82.8 | 82.4 | 81.4 | 81.1 | 81.3 | 81.2 | |
Average number of active threads | 114.059 | 116.170 | 116.443 | 103.161 | 110.368 | 109.123 | 113.219 | 110.865 | 104.252 | 108.707 | 108.313 | |
Affinity Stability (%) | 98.3 | 98.3 | 98.1 | 97.3 | 97.4 | 97.8 | 97.6 | 97.7 | 97.4 | 97.5 | 97.3 | |
Time in analyzed loops (%) | 79.3 | 79.3 | 79.1 | 75.7 | 76.0 | 69.2 | 72.3 | 75.5 | 75.8 | 75.7 | 75.5 | |
Time in analyzed innermost loops (%) | 50.6 | 54.7 | 54.5 | 56.1 | 61.7 | 56.1 | 59.0 | 53.9 | 54.1 | 54.1 | 54.5 | |
Time in user code (%) | 79.3 | 79.3 | 79.1 | 76.0 | 76.3 | 69.5 | 72.6 | 75.5 | 75.8 | 75.7 | 75.5 | |
Compilation Options Score (%) | 16.7 | 16.7 | 16.7 | 100.0 | 100.0 | 100.0 | 66.7 | 100.0 | 87.5 | 87.5 | 62.5 | |
Array Access Efficiency (%) | 71.7 | 64.1 | 65.6 | 55.6 | 33.8 | 33.8 | 33.8 | 43.8 | 43.8 | 43.8 | 67.3 | |
Potential Speedups | ||||||||||||
Perfect Flow Complexity | 1.01 | 1.01 | 1.01 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
Perfect OpenMP + MPI + Pthread | 1.09 | 1.10 | 1.10 | 1.12 | 1.13 | 1.13 | 1.14 | 1.13 | 1.12 | 1.13 | 1.13 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.18 | 1.18 | 1.19 | 1.21 | 1.20 | 1.32 | 1.26 | 1.21 | 1.20 | 1.21 | 1.21 | |
No Scalar Integer | Potential Speedup | 1.26 | 1.18 | 1.20 | 1.28 | 1.36 | 1.32 | 1.30 | 1.26 | 1.27 | 1.26 | 1.22 |
Nb Loops to get 80% | 1 | 3 | 3 | 2 | 2 | 2 | 2 | 3 | 3 | 3 | 2 | |
FP Vectorised | Potential Speedup | 1.14 | 1.11 | 1.11 | 1.07 | 1.19 | 1.17 | 1.19 | 1.17 | 1.17 | 1.17 | 1.15 |
Nb Loops to get 80% | 1 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | |
Fully Vectorised | Potential Speedup | 1.41 | 1.24 | 1.25 | 1.28 | 1.43 | 1.38 | 1.39 | 1.55 | 1.55 | 1.55 | 1.54 |
Nb Loops to get 80% | 3 | 2 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | |
Only FP Arithmetic | Potential Speedup | 1.66 | 1.84 | 1.82 | 1.52 | 1.49 | 1.43 | 1.38 | 1.30 | 1.30 | 1.30 | 1.28 |
Nb Loops to get 80% | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 2 |
Source Object | Issue |
---|---|
▼[vdso] | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
○ | -O2, -O3 or -Ofast is missing. |
○ | -mcpu=native is missing. |
▼libiPIC3Dlib.so | |
▼EMfields3D.cpp | |
○ | -g is missing for some functions (possibly ones added by the compiler), but debug locations are available. Some analysis may be inaccurate. |
○ | -O2, -O3 or -Ofast is missing. |
○ | -mcpu=native is missing. |
▼Particles3Dcomm.cpp | |
○ | -g is missing for some functions (possibly ones added by the compiler), but debug locations are available. Some analysis may be inaccurate. |
○ | -O2, -O3 or -Ofast is missing. |
○ | -mcpu=native is missing. |
▼BlockCommunicator.h | |
○ | -g is missing for some functions (possibly ones added by the compiler), but debug locations are available. Some analysis may be inaccurate. |
○ | -O2, -O3 or -Ofast is missing. |
○ | -mcpu=native is missing. |
▼Particles3D.cpp | |
○ | -g is missing for some functions (possibly ones added by the compiler), but debug locations are available. Some analysis may be inaccurate. |
○ | -O2, -O3 or -Ofast is missing. |
○ | -mcpu=native is missing. |
▼iPIC3D | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
○ | -O2, -O3 or -Ofast is missing. |
○ | -mcpu=native is missing. |
r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 | r9 | r10 | |
---|---|---|---|---|---|---|---|---|---|---|---|
Experiment Name | |||||||||||
Application | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_nvhpc_25.1_sve2_128_fast/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_nvhpc_25.1_sve2_128/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_nvhpc_25.1_novect/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_llvm_20.1.1_sve2_128_fast/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_llvm_20.1.1_sve2_128/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_llvm_20.1.1_neon/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_llvm_20.1.1_novect/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_gcc_14.2.0_sve2_128_fast/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_gcc_14.2.0_sve2_128/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_gcc_14.2.0_neon/bin/iPIC3D | /home_nfs/blucidol/src/space/ipic/bench/install/aarch64_gcc_14.2.0_novect/bin/iPIC3D |
Timestamp | 2025-04-15 18:33:18 | 2025-04-15 18:36:09 | 2025-04-15 18:39:14 | 2025-04-15 18:42:21 | 2025-04-15 18:45:00 | 2025-04-15 18:47:43 | 2025-04-15 18:50:31 | 2025-04-15 18:53:23 | 2025-04-15 18:56:06 | 2025-04-15 18:58:50 | 2025-04-15 19:01:29 |
Experiment Type | MPI; | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Machine | gw86023 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | |||||||||||
Cache Size | |||||||||||
Number of Cores | |||||||||||
Maximal Frequency | 3.357 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
OS Version | Linux 5.14.0-427.37.1.el9_4.aarch64+64k #1 SMP PREEMPT_DYNAMIC Fri Sep 13 16:56:28 UTC 2024 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | + [vdso]: N/A iPIC3D: N/A libiPIC3Dlib.so: NVC++ 25.1-0 | same as r0 | same as r0 | + [vdso]: N/A iPIC3D: N/A libiPIC3Dlib.so: clang version 20.1.1 /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/bin/clang-20 -Wl,-rpath,/home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/lib --driver-mode=g++ -D LOG_TASKS_TOTAL_TIME -D NO_HDF5 -D iPIC3Dlib_EXPORTS -I /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/include -isystem /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/neoverse_v2/mpi/openmpi-ib-cpu/5.0.7/llvm/20.1.1/ucx/1.18.0/prefix/include -funroll-loops -g -grecord-command-line -fno-omit-frame-pointer -Ofast -ffast-math -mcpu=neoverse-v2 -msve-vector-bits=128 -O3 -D NDEBUG -std=gnu++17 -fPIC -MD -MT CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -MF CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o.d -o CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -c /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/particles/Particles3D.cpp | + [vdso]: N/A iPIC3D: N/A libiPIC3Dlib.so: clang version 20.1.1 /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/bin/clang-20 -Wl,-rpath,/home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/lib --driver-mode=g++ -D LOG_TASKS_TOTAL_TIME -D NO_HDF5 -D iPIC3Dlib_EXPORTS -I /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/include -isystem /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/neoverse_v2/mpi/openmpi-ib-cpu/5.0.7/llvm/20.1.1/ucx/1.18.0/prefix/include -funroll-loops -g -grecord-command-line -fno-omit-frame-pointer -O2 -mcpu=neoverse-v2 -msve-vector-bits=128 -O3 -D NDEBUG -std=gnu++17 -fPIC -MD -MT CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -MF CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o.d -o CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -c /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/particles/Particles3D.cpp | + [vdso]: N/A libiPIC3Dlib.so: clang version 20.1.1 /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/bin/clang-20 -Wl,-rpath,/home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/lib --driver-mode=g++ -D LOG_TASKS_TOTAL_TIME -D NO_HDF5 -D iPIC3Dlib_EXPORTS -I /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/include -isystem /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/neoverse_v2/mpi/openmpi-ib-cpu/5.0.7/llvm/20.1.1/ucx/1.18.0/prefix/include -funroll-loops -g -grecord-command-line -fno-omit-frame-pointer -O2 -mcpu=neoverse-v2+nosve -O3 -D NDEBUG -std=gnu++17 -fPIC -MD -MT CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -MF CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o.d -o CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -c /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/particles/Particles3D.cpp | + [vdso]: N/A libiPIC3Dlib.so: clang version 20.1.1 /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/bin/clang-20 -Wl,-rpath,/home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/compilers/llvm/20.1.1/prefix/lib --driver-mode=g++ -D LOG_TASKS_TOTAL_TIME -D NO_HDF5 -D iPIC3Dlib_EXPORTS -I /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/include -isystem /home_nfs/projects/pro_2023_space/software/Linux/rhel-9.4/aarch64/neoverse_v2/mpi/openmpi-ib-cpu/5.0.7/llvm/20.1.1/ucx/1.18.0/prefix/include -funroll-loops -g -grecord-command-line -fno-omit-frame-pointer -O2 -fno-vectorize -fno-vectorize -fno-slp-vectorize -O3 -D NDEBUG -std=gnu++17 -fPIC -MD -MT CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -MF CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o.d -o CMakeFiles/iPIC3Dlib.dir/particles/Particles3D.cpp.o -c /home_nfs/blucidol/src/space/ipic/ipic3d-cpu-space-coe/particles/Particles3D.cpp | + [vdso]: N/A iPIC3D: N/A libiPIC3Dlib.so: GNU C++17 14.2.0 -mcpu=neoverse-v2 -msve-vector-bits=128 -mlittle-endian -mabi=lp64 -g -Ofast -O3 -std=gnu++17 -funroll-loops -fno-omit-frame-pointer -fPIC | + [vdso]: N/A iPIC3D: N/A libiPIC3Dlib.so: GNU C++17 14.2.0 -mcpu=neoverse-v2 -msve-vector-bits=128 -mlittle-endian -mabi=lp64 -g -O2 -O3 -std=gnu++17 -funroll-loops -fno-omit-frame-pointer -fPIC | + [vdso]: N/A iPIC3D: N/A libiPIC3Dlib.so: GNU C++17 14.2.0 -mcpu=neoverse-v2+nosve -mlittle-endian -mabi=lp64 -g -O2 -O3 -std=gnu++17 -funroll-loops -fno-omit-frame-pointer -fPIC | + [vdso]: N/A iPIC3D: N/A libiPIC3Dlib.so: GNU C++17 14.2.0 -mlittle-endian -mabi=lp64 -g -O2 -O3 -std=gnu++17 -funroll-loops -fno-omit-frame-pointer -fno-tree-vectorize -fno-tree-loop-vectorize -fno-tree-slp-vectorize -fPIC |
Number of processes observed | 144 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 144 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Driver | cppc_cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | never | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 4 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 72 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.21.2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | 605a656e415499abc9db62b5ea50b183e5295485::20250228-110016 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |