Metric | r0 | r1 | |
---|---|---|---|
Total Time (s) | 127.00 | 141.08 | |
Max (Thread Active Time) (s) | 63.90 | 70.03 | |
Average Active Time (s) | 12.96 | 15.18 | |
Activity Ratio (%) | 13.6 | 13.9 | |
Average number of active threads | 13.062 | 13.775 | |
Affinity Stability (%) | 61.8 | 56.0 | |
Time in analyzed loops (%) | 53.9 | 57.1 | |
Time in analyzed innermost loops (%) | 28.4 | 28.6 | |
Time in user code (%) | 69.4 | 71.2 | |
Compilation Options Score (%) | 100.0 | 100.0 | |
Array Access Efficiency (%) | 88.1 | 90.2 | |
Potential Speedups | | | |
Perfect Flow Complexity | 1.02 | 1.02 | |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 4.95 | 4.64 | |
No Scalar Integer: Potential Speedup | 1.07 | 1.06 | |
No Scalar Integer: Nb Loops to get 80% | 8 | 8 | |
FP Vectorised: Potential Speedup | 1.02 | 1.01 | |
FP Vectorised: Nb Loops to get 80% | 4 | 5 | |
Fully Vectorised: Potential Speedup | 1.39 | 1.39 | |
Fully Vectorised: Nb Loops to get 80% | 22 | 20 | |
Only FP Arithmetic: Potential Speedup | 1.16 | 1.14 | |
Only FP Arithmetic: Nb Loops to get 80% | 22 | 21 | |
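
To make the r0/r1 differences in the table above easier to read, the relative change of each metric can be computed directly from the two columns. The sketch below is illustrative only: the `metrics` dictionary and the `relative_change` helper are hypothetical names introduced here, not part of MAQAO, and the values are copied by hand from the table.

```python
# Values copied by hand from the r0 and r1 columns of the table above.
# The dictionary layout is an assumption made for this sketch.
metrics = {
    "Total Time (s)": (127.00, 141.08),
    "Max (Thread Active Time) (s)": (63.90, 70.03),
    "Average Active Time (s)": (12.96, 15.18),
    "Affinity Stability (%)": (61.8, 56.0),
    "Array Access Efficiency (%)": (88.1, 90.2),
}


def relative_change(r0: float, r1: float) -> float:
    """Return the change from r0 to r1 as a percentage of r0."""
    return (r1 - r0) / r0 * 100.0


# Relative change of r1 with respect to r0 for every metric.
changes = {name: relative_change(r0, r1) for name, (r0, r1) in metrics.items()}

for name, pct in changes.items():
    print(f"{name:32s} {pct:+6.1f}%")
```

With these inputs, the total time of r1 comes out roughly 11% higher than that of r0, while affinity stability drops by roughly 9% in relative terms.
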
Source Object | Issue |
---|---|
▼libasolve_test_helpers.so | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
○ | -O2, -O3 or -Ofast is missing. |
○ | -march=(target) is missing. |
▼libfinite_elements.so | |
▼AssignEvaluator.h | |
○ | |
▼InverseImpl.h | |
○ | |
▼element_U.tpp | |
○ | |
▼PlainObjectBase.h | |
○ | |
▼TensorMap.h | |
○ | |
▼TensorDeviceDefault.h | |
○ | |
▼GeneralMatrixMatrix.h | |
○ | |
▼GeneralProduct.h | |
○ | |
▼generic_elements.hpp | |
○ | |
▼Matrix.h | |
○ | |
▼stl_vector.h | |
○ | |
▼GeneralMatrixVector.h | |
○ | |
▼MapBase.h | |
○ | |
▼material_brick.hpp | |
○ | |
▼GeneralBlockPanelKernel.h | |
○ | |
▼[vdso] | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
○ | -O2, -O3 or -Ofast is missing. |
○ | -march=(target) is missing. |
▼libdofs.so | |
▼dof.cpp | |
○ | |
▼dof_list.cpp | |
○ | |
▼MapBase.h | |
○ | |
▼stl_vector.h | |
○ | |
▼multithreading_assembly_perf_test | |
▼basic_string.tcc | |
○ | |
▼enumerable_thread_specific.h | |
○ | |
▼assembler.hpp | |
○ | |
▼sparse_matrix_utilities.hpp | |
○ | |
▼partitioner.h | |
○ | |
▼finite_elements.hpp | |
○ | |
▼parallel_for.h | |
○ | |
▼sparse_matrix.hpp | |
○ | |
▼multithreading_assembly_perf_test.cpp | |
○ | |
▼stl_iterator.h | |
○ | |
▼vector.tcc | |
○ | |
▼libfe_space.so | |
▼stl_tree.h | |
○ | |
▼libmesh.so | |
▼ | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
○ | -O2, -O3 or -Ofast is missing. |
○ | -march=(target) is missing. |
Property | r0 | r1 |
---|---|---|
Experiment Name | | |
Application | ./multithreading_assembly_perf_test | same as r0 |
Timestamp | 2025-06-11 09:48:49 | 2025-06-11 10:10:31 |
Experiment Type | OpenMP; | same as r0 |
Machine | be-seq022 | be-seq017 |
Architecture | x86_64 | same as r0 |
Micro Architecture | ZEN_V4 | same as r0 |
Model Name | AMD EPYC 9534 64-Core Processor | same as r0 |
Cache Size | 1024 KB | same as r0 |
Number of Cores | 64 | same as r0 |
Maximal Frequency | 3.718066 GHz | same as r0 |
OS Version | Linux 4.18.0-477.10.1.el8_8.x86_64 #1 SMP Wed Apr 5 13:35:01 EDT 2023 | same as r0 |
Architecture used during static analysis | x86_64 | same as r0 |
Micro Architecture used during static analysis | ZEN_V4 | same as r0 |
Compilation Options | [vdso]: N/A; libasolve_test_helpers.so: N/A; libdofs.so: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops -fPIC; libfe_space.so: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops -fPIC; libfinite_elements.so: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops -fPIC; libmesh.so: N/A; multithreading_assembly_perf_test: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops | [vdso]: N/A; libasolve_test_helpers.so: N/A; libdofs.so: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops -fPIC; libfe_space.so: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops -fPIC; libfinite_elements.so: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops -fPIC; multithreading_assembly_perf_test: GNU C++20 13.2.0 -march=znver3 -g3 -g3 -O3 -std=c++20 -fno-omit-frame-pointer -funroll-loops |
Number of processes observed | 1 | same as r0 |
Number of threads observed | 128 | same as r0 |
Frequency Driver | acpi-cpufreq | same as r0 |
Frequency Governor | performance | same as r0 |
Huge Pages | always | same as r0 |
Hyperthreading | off | same as r0 |
Number of sockets | 2 | same as r0 |
Number of cores per socket | 64 | same as r0 |
MAQAO version | 2025.1.0 | same as r0 |
MAQAO build | b107544c0173fc3785aa7d997ff783dc12b975d2::20250527-133805 | same as r0 |
Comments | | same as r0 |