| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 |
|---|---|---|---|---|---|---|---|---|---|
| Total Time (s) | 646.23 | 358.95 | 242.40 | 167.79 | 124.71 | 91.22 | 81.92 | 94.29 | 102.05 |
| Max (Thread Active Time) (s) | 607.21 | 321.02 | 197.37 | 125.58 | 84.14 | 50.82 | 41.42 | 53.54 | 59.11 |
| Average Active Time (s) | 607.21 | 320.89 | 195.27 | 116.75 | 77.62 | 46.28 | 39.51 | 51.62 | 57.47 |
| Activity Ratio (%) | 94.0 | 89.4 | 80.6 | 69.6 | 62.3 | 50.8 | 48.3 | 54.8 | 56.5 |
| Average number of active threads | 0.940 | 1.788 | 3.222 | 5.566 | 9.959 | 16.234 | 30.869 | 70.072 | 96.863 |
| Affinity Stability (%) | 43.2 | 68.4 | 41.4 | 49.0 | 61.5 | 51.4 | 49.7 | 54.1 | 43.4 |
| Time in analyzed loops (%) | 8.26 | 8.05 | 7.37 | 7.00 | 5.89 | 5.50 | 4.63 | 3.53 | 3.11 |
| Time in analyzed innermost loops (%) | 8.15 | 7.97 | 7.31 | 6.95 | 5.87 | 5.45 | 4.58 | 3.51 | 3.09 |
| Time in user code (%) | 8.40 | 8.58 | 9.20 | 9.70 | 10.1 | 9.19 | 8.74 | 8.12 | 7.39 |
| Compilation Options Score (%) | 16.7 | 16.7 | 16.7 | 16.7 | 16.7 | 16.7 | 16.7 | 16.7 | 16.7 |
| Array Access Efficiency (%) | 67.1 | 67.0 | 66.8 | 64.8 | 65.9 | 64.3 | 65.2 | 67.5 | 67.7 |
| Potential Speedups | | | | | | | | | |
| Perfect Flow Complexity | 1.04 | 1.04 | 1.03 | 1.02 | 1.02 | 1.01 | 1.01 | 1.01 | 1.01 |
| Perfect OpenMP/MPI/Pthread/TBB | 1.00 | 1.04 | 1.15 | 1.24 | 1.48 | 1.95 | 1.56 | 1.82 | 1.93 |
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.00 | 1.04 | 1.23 | 1.47 | 1.81 | 2.06 | 2.62 | 3.73 | 4.23 |
| Scalability - Gap | 1.00 | 1.11 | 1.50 | 2.08 | 3.09 | 4.52 | 8.11 | 18.68 | 27.16 |
| No Scalar Integer: Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 |
| No Scalar Integer: Nb Loops to get 80% | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 |
| FP Vectorised: Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.01 | 1.01 | 1.01 | 1.01 |
| FP Vectorised: Nb Loops to get 80% | 1 | 3 | 2 | 2 | 1 | 1 | 1 | 1 | 1 |
| Fully Vectorised: Potential Speedup | 1.09 | 1.08 | 1.07 | 1.07 | 1.06 | 1.05 | 1.04 | 1.03 | 1.03 |
| Fully Vectorised: Nb Loops to get 80% | 2 | 3 | 5 | 7 | 7 | 7 | 6 | 5 | 5 |
| Only FP Arithmetic: Potential Speedup | 1.06 | 1.06 | 1.05 | 1.05 | 1.04 | 1.03 | 1.03 | 1.02 | 1.02 |
| Only FP Arithmetic: Nb Loops to get 80% | 2 | 2 | 3 | 4 | 4 | 5 | 5 | 4 | 4 |

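
The activity rows in the metrics table appear to be linked by simple ratios. Dividing Average Active Time by Total Time gives a value within a few tenths of a percentage point of the reported Activity Ratio, and multiplying that ratio by the number of threads observed for the run gives the reported average number of active threads. The sketch below checks this for three runs using values copied from the report. The two formulas are inferred from the numbers, not taken from MAQAO documentation, and the helper name `activity` is hypothetical.

```python
# Values copied from the report:
# run -> (Total Time in s, Average Active Time in s, threads observed)
runs = {
    "r0": (646.23, 607.21, 1),
    "r2": (242.40, 195.27, 4),
    "r8": (102.05, 57.47, 172),
}


def activity(total_s, active_s, threads):
    """Return (activity ratio in %, average number of active threads).

    Assumed relations, inferred from the reported numbers:
      ratio          = average active time / total time
      active threads = ratio * threads observed
    """
    ratio = active_s / total_s
    return 100.0 * ratio, ratio * threads


for name, (total_s, active_s, threads) in runs.items():
    pct, active_threads = activity(total_s, active_s, threads)
    print(f"{name}: activity ratio {pct:.1f} %, active threads {active_threads:.3f}")
```

The small differences against the reported Activity Ratio at higher process counts may come from rounding or from per-process averaging inside the tool; the report does not say which.
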
| Source Object | Issue |
|---|---|
| libmumps_common.so: lr_stats.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libmumps_common.so: mumps_type2_blocking.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libmumps_common.so: mumps_comm_buffer_common.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libmumps_common.so: tools_common.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libmumps_common.so: mumps_load.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libmumps_common.so: fac_descband_data_m.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: darrowheads.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_mem_stack_aux.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_dist_arrowheads_omp.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_front_aux.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_process_blfac_slave.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_front_LU_type1.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_lr.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_asm_master_ELT_m.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dsol_aux.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dstatic_ptr_m.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_par_m.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dsol_lr.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_asm.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dfac_determinant.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dend_driver.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| libdmumps.so: dana_driver.F | For some functions (possibly ones added by the compiler), debug locations are available but not compilation options. Recompile with -grecord-gcc-switches. |
| [vdso] | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
| [vdso] | -O2, -O3 or -Ofast is missing. |
| [vdso] | -march=(target) is missing. |

| | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | r8 |
|---|---|---|---|---|---|---|---|---|---|
| Experiment Name | |||||||||
| Application | /home/mlkaps_org/kevin/spack/opt/spack/linux-sapphirerapids/mumps-parametrable-launcher-0.1.0-o6hsbww3geaahcxu4yt5wg4eq4q34pyz/bin/mumps-parametrable-launcher | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Timestamp | 2026-04-24 16:32:17 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Experiment Type | MPI; | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Machine | igk-0805 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | GRANITE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Model Name | Intel(R) Xeon(R) 6787P | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Cache Size | 344064 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of Cores | 86 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Maximal Frequency | 3.80 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 6.8.0-53-generic #55-Ubuntu SMP PREEMPT_DYNAMIC Fri Jan 17 15:37:52 UTC 2025 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | GRANITE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Compilation Options | libdmumps.so: Intel(R) Fortran 25.0-1601; libmumps_common.so: Intel(R) Fortran 25.0-1601 | same as r0 | same as r0 | same as r0 | same as r0 | + [vdso]: N/A; libdmumps.so: Intel(R) Fortran 25.0-1601; libmumps_common.so: Intel(R) Fortran 25.0-1601 | same as r5 | same as r5 | same as r5 |
| Number of processes observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 172 |
| Number of threads observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 172 |
| Frequency Driver | intel_pstate | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Frequency Governor | performance | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Huge Pages | madvise | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Hyperthreading | on | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 86 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2026.0.0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| MAQAO build | 5bd027de85fa695d85760b0d35347d512bc7640d::20260416-150426 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
| Comments | | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
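
The scaling behaviour across runs can be summarized from the Total Time row and the number of processes observed. The sketch below computes speedup relative to r0, parallel efficiency, and the inverse of efficiency. With the report's values, that inverse matches the "Scalability - Gap" row to two decimals (for example 27.16 for r8), which suggests the gap is N × T_N / T_1. This is an inference from the numbers, not a documented definition, and the function name `scaling` is hypothetical.

```python
# Total Time (s) and number of processes observed, copied from the report (r0..r8).
procs = [1, 2, 4, 8, 16, 32, 64, 128, 172]
total_s = [646.23, 358.95, 242.40, 167.79, 124.71, 91.22, 81.92, 94.29, 102.05]


def scaling(procs, times):
    """Return a list of (processes, speedup, efficiency, gap) relative to the first run."""
    base_p, base_t = procs[0], times[0]
    rows = []
    for p, t in zip(procs, times):
        speedup = base_t / t
        efficiency = speedup / (p / base_p)
        # Assumed definition of the gap: inverse efficiency,
        # i.e. (p * t) / (base_p * base_t).
        gap = 1.0 / efficiency
        rows.append((p, speedup, efficiency, gap))
    return rows


for i, (p, s, e, g) in enumerate(scaling(procs, total_s)):
    print(f"r{i}: {p:>3} procs  speedup {s:5.2f}  efficiency {100 * e:5.1f} %  gap {g:5.2f}")
```

On these numbers the fastest run is r6 (64 processes, 81.92 s, a speedup of about 7.9 over r0); r7 and r8 take longer despite using more processes.
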