| Metric | Value |
| Total Time (s) | 79.39 |
| Max (Thread Active Time) (s) | 64.99 |
| Average Active Time (s) | 10.94 |
| Activity Ratio (%) | 23.2 |
| Average number of active threads | 17.637 |
| Affinity Stability (%) | 88.4 |
| Time in analyzed loops (%) | 0.35 |
| Time in analyzed innermost loops (%) | 0.14 |
| Time in user code (%) | 0.57 |
| Compilation Options Score (%) | 75.0 |
| Array Access Efficiency (%) | 96.4 |
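
As a quick consistency check on the timing metrics above, the reported average number of active threads appears to match the observed thread count (128, from the experiment summary) multiplied by the average active time and divided by the total time. The sketch below uses only values copied from this report. The relationship is an observation about these numbers, not MAQAO's documented definition, and the variable names are illustrative.

```python
# Values copied from this report (thread count is from the experiment summary).
total_time_s = 79.39
avg_active_time_s = 10.94
threads_observed = 128
reported_avg_active_threads = 17.637

# Assumption: average active threads ~= summed active time / wall-clock time.
estimated_avg_active_threads = threads_observed * avg_active_time_s / total_time_s

# Share of the observed threads that are busy on average, under the same assumption.
busy_fraction = estimated_avg_active_threads / threads_observed

print(f"estimated average active threads: {estimated_avg_active_threads:.2f}")
print(f"reported average active threads:  {reported_avg_active_threads}")
print(f"average busy fraction of threads: {busy_fraction:.1%}")
```

If the estimate and the reported value diverged noticeably for another run, the assumed relationship would not hold for that run.
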
| Potential Speedups | Potential Speedup | Nb Loops to get 80% |
| Perfect Flow Complexity | 1.00 | |
| Perfect OpenMP + MPI + Pthread | 1.00 | |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 5.98 | |
| No Scalar Integer | 1.00 | 1 |
| FP Vectorised | 1.00 | 1 |
| Fully Vectorised | 1.00 | 1 |
| FP Arithmetic Only | 1.00 | 1 |
| Source Object | Issue |
| multithreading_assembly_perf_test | |
| ○ sparse_matrix_utilities.hpp | -funroll-loops is missing. |
| ○ finite_elements.hpp | -funroll-loops is missing. |
| ○ sparse_matrix.hpp | -funroll-loops is missing. |
| ○ shared_ptr_base.h | -funroll-loops is missing. |
| ○ graph_tools.hpp | -funroll-loops is missing. |
| Application | ./multithreading_assembly_perf_test |
| Timestamp | 2025-05-19 15:45:40 |
| Universal Timestamp | 1747662340 |
| Number of processes observed | 1 |
| Number of threads observed | 128 |
| Experiment Type | OpenMP |
| Machine | be-par057 |
| Model Name | AMD EPYC 9534 64-Core Processor |
| Architecture | x86_64 |
| Micro Architecture | ZEN_V4 |
| Cache Size | 1024 KB |
| Number of Cores | 64 |
| OS Version | Linux 4.18.0-477.10.1.el8_8.x86_64 #1 SMP Wed Apr 5 13:35:01 EDT 2023 |
| Architecture used during static analysis | x86_64 |
| Micro Architecture used during static analysis | ZEN_V4 |
| Frequency Driver | acpi-cpufreq |
| Frequency Governor | performance |
| Huge Pages | always |
| Hyperthreading | off |
| Number of sockets | 2 |
| Number of cores per socket | 64 |
| Compilation Options | multithreading_assembly_perf_test: GNU C++20 13.2.0 -march=znver4 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -fopenmp |
| Dataset | |
| Run Command | <executable> --max_threads 128 --ncut 200 |
| Number Processes | 1 |
| Number Nodes | 1 |
| Filter | Not Used |
| Profile Start | Not Used |