Help is available by moving the cursor above any
symbol or by checking MAQAO website.
| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
|---|---|---|---|---|---|---|---|---|---|
| Total Time (s) | 480.47 | 294.14 | 224.28 | 157.31 | 99.77 | 66.47 | 50.47 | 48.23 | |
| Max (Thread Active Time) (s) | 447.98 | 268.75 | 199.85 | 133.66 | 79.00 | 44.69 | 28.53 | 26.04 | |
| Average Active Time (s) | 308.26 | 173.61 | 196.64 | 131.52 | 77.61 | 44.15 | 28.31 | 25.81 | |
| Activity Ratio (%) | 65.6 | 61.2 | 87.7 | 83.7 | 77.9 | 66.6 | 56.2 | 53.6 | |
| Average number of active threads | 1.283 | 2.361 | 3.507 | 6.688 | 12.447 | 21.257 | 35.897 | 46.022 | |
| Affinity Stability (%) | 24.6 | 60.1 | 37.5 | 60.9 | 72.7 | 63.0 | 53.9 | 50.4 | |
| Time in analyzed loops (%) | 29.6 | 25.6 | 10.0 | 7.19 | 5.91 | 6.98 | 7.61 | 7.33 | |
| Time in analyzed innermost loops (%) | 11.5 | 10.6 | 6.04 | 5.64 | 5.29 | 6.64 | 7.44 | 7.21 | |
| Time in user code (%) | 29.7 | 26.3 | 12.8 | 11.3 | 10.1 | 10.7 | 10.7 | 10.4 | |
| Compilation Options Score (%) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Array Access Efficiency (%) | 75.5 | 75.1 | 74.1 | 74.2 | 76.3 | 76.8 | 76.5 | 76.5 | |
| Potential Speedups | |||||||||
| Perfect Flow Complexity | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | 1.03 | 1.04 | 1.04 | |
| Perfect OpenMP/MPI/Pthread/TBB | 1.00 | 1.11 | 1.30 | 1.54 | 1.64 | 1.37 | 1.48 | 1.45 | |
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.49 | 1.71 | 1.51 | 1.77 | 2.08 | 2.07 | 2.30 | 2.41 | |
| Scalability - Gap | 1.00 | 1.22 | 0.93 | 1.31 | 1.66 | 2.21 | 3.36 | 4.32 | |
| No Scalar Integer | Potential Speedup | 1.01 | 1.01 | 1.02 | 1.02 | 1.02 | 1.02 | 1.02 | 1.01 |
| Nb Loops to get 80% | 2 | 5 | 4 | 2 | 2 | 2 | 2 | 2 | |
| FP Vectorised | Potential Speedup | 1.11 | 1.09 | 1.03 | 1.02 | 1.01 | 1.01 | 1.01 | 1.01 |
| Nb Loops to get 80% | 2 | 2 | 3 | 3 | 3 | 3 | 2 | 2 | |
| Fully Vectorised | Potential Speedup | 1.36 | 1.30 | 1.10 | 1.07 | 1.06 | 1.07 | 1.07 | 1.07 |
| Nb Loops to get 80% | 3 | 3 | 7 | 8 | 7 | 6 | 5 | 4 | |
| Only FP Arithmetic | Potential Speedup | 1.02 | 1.02 | 1.02 | 1.03 | 1.02 | 1.02 | 1.02 | 1.02 |
| Nb Loops to get 80% | 2 | 5 | 6 | 4 | 4 | 4 | 3 | 3 | |
| Source Object | Issue |
|---|---|
| ▼libmumps_common.so | |
| ▼ | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| ○ | -O2, -O3 or -Ofast is missing. |
| ○ | -march=(target) is missing. |
| ▼libdmumps.so | |
| ▼ | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| ○ | -O2, -O3 or -Ofast is missing. |
| ○ | -march=(target) is missing. |
| ▼[vdso] | |
| ▼ | |
| ○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| ○ | -O2, -O3 or -Ofast is missing. |
| ○ | -march=(target) is missing. |
Enable log scale
Enable log scale
Enable log scale
Enable log scale
Enable log scale
Enable log scale
Enable log scale
Enable log scale
| r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
|---|---|---|---|---|---|---|---|---|
| Experiment Name | ||||||||
| Application | /home/mlkaps_org/kevin/spack/opt/spack/linux-sapphirerapids/mumps-parametrable-launcher-0.1.0-o6hsbww3geaahcxu4yt5wg4eq4q34pyz/bin/mumps-parametrable-launcher | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Timestamp | 2026-04-30 13:26:42 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Experiment Type | MPI; OpenMP; | same as r0 | MPI; | same as r2 | same as r2 | same as r2 | same as r2 | same as r2 |
| Machine | igk-0805 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | GRANITE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | Intel(R) Xeon(R) 6787P | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Cache Size | 344064 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of Cores | 86 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Maximal Frequency | 3.80 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 6.8.0-53-generic #55-Ubuntu SMP PREEMPT_DYNAMIC Fri Jan 17 15:37:52 UTC 2025 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | GRANITE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | libdmumps.so: N/A libmumps_common.so: N/A | same as r0 | same as r0 | same as r0 | + [vdso]: N/A libdmumps.so: N/A libmumps_common.so: N/A | same as r4 | same as r4 | same as r4 |
| Number of processes observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 86 |
| Number of threads observed | 2 | 4 | same as r1 | 8 | 16 | 32 | 64 | 86 |
| Frequency Driver | intel_pstate | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Huge Pages | madvise | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Hyperthreading | on | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 86 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2026.0.0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | 5bd027de85fa695d85760b0d35347d512bc7640d::20260416-150426 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |