Help is available by moving the cursor above any symbol or by checking MAQAO website.
- r0: run_0
- r1: omp_2_threads
- r2: omp_4_threads
- r3: omp_8_threads
- r4: omp_16_threads
- r5: omp_32_threads
- r6: omp_64_threads
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 |
---|
Total Time (s) | 447.72 | 319.62 | 175.32 | 110.72 | 90.27 | 57.43 | 26.62 |
Profiled Time (s) | 447.58 | 319.51 | 175.23 | 110.48 | 89.98 | 57.02 | 26.18 |
Time in analyzed loops (%) | 99.3 | 99.3 | 99.1 | 98.3 | 98.2 | 95.6 | 91.6 |
Time in analyzed innermost loops (%) | 88.4 | 90.8 | 91.3 | 92.0 | 93.9 | 90.6 | 83.0 |
Time in user code (%) | 99.4 | 99.4 | 99.2 | 98.3 | 98.2 | 95.6 | 91.7 |
Compilation Options Score (%) | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
Array Access Efficiency (%) | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available | Not Available |
Scalability - Gap | 1.00 | 1.43 | 1.57 | 1.98 | 3.23 | 4.10 | 3.80 |
|
Potential Speedups |
Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.01 | 1.01 | 1.02 | 1.03 | 1.07 |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.01 | 1.02 | 1.04 | 1.05 | 1.10 | 1.21 |
No Scalar Integer | Potential Speedup | 1.10 | 1.08 | 1.07 | 1.06 | 1.04 | 1.05 | 1.08 |
Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
FP Vectorised | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Fully Vectorised | Potential Speedup | 3.67 | 3.67 | 3.64 | 3.54 | 3.51 | 3.29 | 3.04 |
Nb Loops to get 80% | 3 | 3 | 3 | 3 | 2 | 2 | 3 |
Only FP Arithmetic | Potential Speedup | 1.10 | 1.08 | 1.07 | 1.06 | 1.04 | 1.05 | 1.08 |
Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
OpenMP perfectly balanced | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.03 | 1.05 |
Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
Source Object | Issue |
▼spmxv.exe– | |
▼ooo_cmdline.h– | |
○ | |
▼ooo_cmdline.cpp– | |
○ | |
▼main.cpp– | |
○ | |
Source Object | Issue |
▼spmxv.exe– | |
▼ooo_cmdline.h– | |
○ | |
▼ooo_cmdline.cpp– | |
○ | |
▼main.cpp– | |
○ | |
Source Object | Issue |
▼spmxv.exe– | |
▼ooo_cmdline.h– | |
○ | |
▼ooo_cmdline.cpp– | |
○ | |
▼main.cpp– | |
○ | |
Source Object | Issue |
▼spmxv.exe– | |
▼ooo_cmdline.h– | |
○ | |
▼ooo_cmdline.cpp– | |
○ | |
▼main.cpp– | |
○ | |
Source Object | Issue |
▼spmxv.exe– | |
▼ooo_cmdline.h– | |
○ | |
▼ooo_cmdline.cpp– | |
○ | |
▼main.cpp– | |
○ | |
Source Object | Issue |
▼spmxv.exe– | |
▼ooo_cmdline.h– | |
○ | |
▼ooo_cmdline.cpp– | |
○ | |
▼main.cpp– | |
○ | |
Source Object | Issue |
▼spmxv.exe– | |
▼ooo_cmdline.h– | |
○ | |
▼ooo_cmdline.cpp– | |
○ | |
▼main.cpp– | |
○ | |
| r0 | r1 | r2 | r3 | r4 | r5 | r6 |
Experiment Name | omp_1_thread | omp_1_thread | omp_1_thread | omp_1_thread | omp_1_thread | omp_1_thread | omp_1_thread |
Application | ./spmxv.exe | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Timestamp | 2024-06-27 11:24:35 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Experiment Type | Sequential | OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 |
Machine | ip-172-31-42-13 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | | | | | | | |
Cache Size | | | | | | | |
Number of Cores | | | | | | | |
Maximal Frequency | 0 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
OS Version | Linux 6.5.0-1020-aws #20~22.04.1-Ubuntu SMP Wed May 1 16:38:06 UTC 2024 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options |
spmxv.exe: Arm C/C++/Fortran Compiler version 23.04 (build number 21) (based on LLVM 16.0.0) /opt/arm/arm-linux-compiler-23.04_Ubuntu-20.04/llvm-bin/clang-16 --driver-mode=g++ -I . -I utils -MMD -MP -g -fopenmp -mcpu=native -larmpl -grecord-command-line -lprompt_armclang -O3 -c -o main.o main.cpp | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 |
Frequency Driver | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | madvise | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.20.3 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | bfc89c69b7374f41fdba9d7e1e206b0cf5900829::20240621-165222 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |