Help is available by moving the cursor above any
symbol or by checking MAQAO website.
- r0: n1 - option thread_filter-threshold (0.1%) discards 36 threads, cumulating 1.78 seconds CPU time.
- r1: n2 - option thread_filter-threshold (0.1%) discards 72 threads, cumulating 1.76 seconds CPU time.
- r2: n4 - option thread_filter-threshold (0.1%) discards 106 threads, cumulating 1.50 seconds CPU time.
- r3: n8 - option thread_filter-threshold (0.1%) discards 29 threads, cumulating 0.31 seconds CPU time.
| Metric | r0 | r1 | r2 | r3 |
|---|
| Total Time (s) | 1.87 E3 | 920.96 | 477.69 | 245.89 |
| Max (Thread Active Time) (s) | 1.85 E3 | 910.19 | 470.73 | 241.39 |
| Average Active Time (s) | 1.85 E3 | 908.83 | 469.56 | 240.67 |
| Activity Ratio (%) | 99.0 | 98.7 | 98.3 | 97.9 |
| Average number of active threads | 35.643 | 71.051 | 141.551 | 281.886 |
| Affinity Stability (%) | 99.9 | 99.6 | 99.3 | 98.9 |
| Time in analyzed loops (%) | 78.8 | 78.0 | 74.2 | 70.6 |
| Time in analyzed innermost loops (%) | 53.6 | 52.4 | 49.1 | 46.2 |
| Time in user code (%) | 76.8 | 76.0 | 72.2 | 68.8 |
| Compilation Options Score (%) | 100 | 100 | 100 | 100 |
| Array Access Efficiency (%) | 80.4 | 80.8 | 80.9 | 81.0 |
|
| Potential Speedups |
| Perfect Flow Complexity | 1.01 | 1.01 | 1.01 | 1.01 |
| Perfect OpenMP + MPI + Pthread | 1.02 | 1.03 | 1.03 | 1.06 |
| Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.06 | 1.06 | 1.09 | 1.13 |
| Scalability - Gap | 1.00 | 0.99 | 1.02 | 1.05 |
| No Scalar Integer | Potential Speedup | 1.24 | 1.23 | 1.22 | 1.20 |
| Nb Loops to get 80% | 28 | 28 | 28 | 28 |
| FP Vectorised | Potential Speedup | 1.17 | 1.17 | 1.16 | 1.15 |
| Nb Loops to get 80% | 38 | 37 | 37 | 37 |
| Fully Vectorised | Potential Speedup | 2.23 | 2.19 | 2.06 | 1.96 |
| Nb Loops to get 80% | 41 | 41 | 41 | 41 |
| Only FP Arithmetic | Potential Speedup | 1.71 | 1.68 | 1.62 | 1.56 |
| Nb Loops to get 80% | 41 | 41 | 41 | 41 |
| Source Object | Issue |
| ▼AVBP_V7_dev.KRAKEN– | |
| ▼cons_tens.f90– | |
| ○ | |
| ▼msource_cell.f90– | |
| ○ | |
| ▼get_Y.f90– | |
| ○ | |
| ▼bndflux_les2.f90– | |
| ○ | |
| ▼gradqen.f90– | |
| ○ | |
| ▼mass_product.f90– | |
| ○ | |
| ▼euler_timestep.f90– | |
| ○ | |
| ▼nsflux_les.f90– | |
| ○ | |
| ▼avis_lp_rre.f90– | |
| ○ | |
| ▼specsource_cell.f90– | |
| ○ | |
| ▼efcy_dyn.f90– | |
| ○ | |
| ▼stress_nv2.f90– | |
| ○ | |
| ▼heatflux_nv2.f90– | |
| ○ | |
| ▼mod_pmesh_transfer.f90– | |
| ○ | |
| ▼temperature.f90– | |
| ○ | |
| ▼grad_4obj.f90– | |
| ○ | |
| ▼div.f90– | |
| ○ | |
| ▼FE_add_dw.f90– | |
| ○ | |
| ▼wtowp.f90– | |
| ○ | |
| ▼avis_lp.f90– | |
| ○ | |
| ▼update.f90– | |
| ○ | |
| ▼scatter_grad.f90– | |
| ○ | |
| ▼update_rho.f90– | |
| ○ | |
| ▼mod_pmesh_scatter_add.f90– | |
| ○ | |
| ▼scatter_add.f90– | |
| ○ | |
| ▼rot_2delta.f90– | |
| ○ | |
| ▼specflux_visc_c_nv.f90– | |
| ○ | |
| ▼rrate_cell.f90– | |
| ○ | |
| ▼eflux.f90– | |
| ○ | |
| ▼compute_FE_implicit_residual.f90– | |
| ○ | |
| ▼wale_cell.f90– | |
| ○ | |
| ▼scatter_o_add.f90– | |
| ○ | |
| ▼gather_o_cpy.f90– | |
| ○ | |
| ▼ns_timestep.f90– | |
| ○ | |
| ▼specflux_invc.f90– | |
| ○ | |
| ▼thermo_variables.f90– | |
| ○ | |
| ▼prebound.f90– | |
| ○ | |
| ▼scatter_o_sub.f90– | |
| ○ | |
| ▼central_nv.f90– | |
| ○ | |
| ▼laxwe.f90– | |
| ○ | |
| ▼ave.f90– | |
| ○ | |
| ▼scale.f90– | |
| ○ | |
| ▼boxe_2delta.f90– | |
| ○ | |
| ▼calc_diffus.f90– | |
| ○ | |
| ▼calc_visc_eff.f90– | |
| ○ | |
| ▼central_nv_bnd_generic.f90– | |
| ○ | |
| ▼get_uvwT.f90– | |
| ○ | |
| ▼central.f90– | |
| ○ | |
| ▼savis_Colin_NS.f90– | |
| ○ | |
| ▼mod_copy.f90– | |
| ○ | |
| ▼savis_spec.f90– | |
| ○ | |
| ▼velocity_group.f90– | |
| ○ | |
| ▼cons_tens_cell.f90– | |
| ○ | |
| ▼scheme.f90– | |
| ○ | |
| ▼compute_diffus_max.f90– | |
| ○ | |
| Source Object | Issue |
| ▼AVBP_V7_dev.KRAKEN– | |
| ▼cons_tens.f90– | |
| ○ | |
| ▼msource_cell.f90– | |
| ○ | |
| ▼get_Y.f90– | |
| ○ | |
| ▼bndflux_les2.f90– | |
| ○ | |
| ▼gradqen.f90– | |
| ○ | |
| ▼mass_product.f90– | |
| ○ | |
| ▼euler_timestep.f90– | |
| ○ | |
| ▼nsflux_les.f90– | |
| ○ | |
| ▼avis_lp_rre.f90– | |
| ○ | |
| ▼specsource_cell.f90– | |
| ○ | |
| ▼efcy_dyn.f90– | |
| ○ | |
| ▼stress_nv2.f90– | |
| ○ | |
| ▼heatflux_nv2.f90– | |
| ○ | |
| ▼mod_pmesh_transfer.f90– | |
| ○ | |
| ▼temperature.f90– | |
| ○ | |
| ▼grad_4obj.f90– | |
| ○ | |
| ▼div.f90– | |
| ○ | |
| ▼FE_add_dw.f90– | |
| ○ | |
| ▼wtowp.f90– | |
| ○ | |
| ▼avis_lp.f90– | |
| ○ | |
| ▼update.f90– | |
| ○ | |
| ▼scatter_grad.f90– | |
| ○ | |
| ▼update_rho.f90– | |
| ○ | |
| ▼mod_pmesh_scatter_add.f90– | |
| ○ | |
| ▼scatter_add.f90– | |
| ○ | |
| ▼rot_2delta.f90– | |
| ○ | |
| ▼specflux_visc_c_nv.f90– | |
| ○ | |
| ▼rrate_cell.f90– | |
| ○ | |
| ▼eflux.f90– | |
| ○ | |
| ▼compute_FE_implicit_residual.f90– | |
| ○ | |
| ▼wale_cell.f90– | |
| ○ | |
| ▼scatter_o_add.f90– | |
| ○ | |
| ▼gather_o_cpy.f90– | |
| ○ | |
| ▼ns_timestep.f90– | |
| ○ | |
| ▼specflux_invc.f90– | |
| ○ | |
| ▼thermo_variables.f90– | |
| ○ | |
| ▼prebound.f90– | |
| ○ | |
| ▼scatter_o_sub.f90– | |
| ○ | |
| ▼central_nv.f90– | |
| ○ | |
| ▼laxwe.f90– | |
| ○ | |
| ▼ave.f90– | |
| ○ | |
| ▼scale.f90– | |
| ○ | |
| ▼boxe_2delta.f90– | |
| ○ | |
| ▼calc_diffus.f90– | |
| ○ | |
| ▼calc_visc_eff.f90– | |
| ○ | |
| ▼central_nv_bnd_generic.f90– | |
| ○ | |
| ▼get_uvwT.f90– | |
| ○ | |
| ▼central.f90– | |
| ○ | |
| ▼savis_Colin_NS.f90– | |
| ○ | |
| ▼mod_copy.f90– | |
| ○ | |
| ▼savis_spec.f90– | |
| ○ | |
| ▼velocity_group.f90– | |
| ○ | |
| ▼cons_tens_cell.f90– | |
| ○ | |
| ▼scheme.f90– | |
| ○ | |
| ▼compute_diffus_max.f90– | |
| ○ | |
| Source Object | Issue |
| ▼AVBP_V7_dev.KRAKEN– | |
| ▼cons_tens.f90– | |
| ○ | |
| ▼msource_cell.f90– | |
| ○ | |
| ▼get_Y.f90– | |
| ○ | |
| ▼bndflux_les2.f90– | |
| ○ | |
| ▼gradqen.f90– | |
| ○ | |
| ▼mass_product.f90– | |
| ○ | |
| ▼euler_timestep.f90– | |
| ○ | |
| ▼nsflux_les.f90– | |
| ○ | |
| ▼avis_lp_rre.f90– | |
| ○ | |
| ▼specsource_cell.f90– | |
| ○ | |
| ▼efcy_dyn.f90– | |
| ○ | |
| ▼stress_nv2.f90– | |
| ○ | |
| ▼heatflux_nv2.f90– | |
| ○ | |
| ▼mod_pmesh_transfer.f90– | |
| ○ | |
| ▼temperature.f90– | |
| ○ | |
| ▼grad_4obj.f90– | |
| ○ | |
| ▼div.f90– | |
| ○ | |
| ▼FE_add_dw.f90– | |
| ○ | |
| ▼wtowp.f90– | |
| ○ | |
| ▼avis_lp.f90– | |
| ○ | |
| ▼update.f90– | |
| ○ | |
| ▼scatter_grad.f90– | |
| ○ | |
| ▼update_rho.f90– | |
| ○ | |
| ▼mod_pmesh_scatter_add.f90– | |
| ○ | |
| ▼scatter_add.f90– | |
| ○ | |
| ▼rot_2delta.f90– | |
| ○ | |
| ▼specflux_visc_c_nv.f90– | |
| ○ | |
| ▼rrate_cell.f90– | |
| ○ | |
| ▼eflux.f90– | |
| ○ | |
| ▼compute_FE_implicit_residual.f90– | |
| ○ | |
| ▼wale_cell.f90– | |
| ○ | |
| ▼scatter_o_add.f90– | |
| ○ | |
| ▼gather_o_cpy.f90– | |
| ○ | |
| ▼ns_timestep.f90– | |
| ○ | |
| ▼specflux_invc.f90– | |
| ○ | |
| ▼thermo_variables.f90– | |
| ○ | |
| ▼prebound.f90– | |
| ○ | |
| ▼scatter_o_sub.f90– | |
| ○ | |
| ▼central_nv.f90– | |
| ○ | |
| ▼laxwe.f90– | |
| ○ | |
| ▼ave.f90– | |
| ○ | |
| ▼scale.f90– | |
| ○ | |
| ▼boxe_2delta.f90– | |
| ○ | |
| ▼calc_diffus.f90– | |
| ○ | |
| ▼calc_visc_eff.f90– | |
| ○ | |
| ▼central_nv_bnd_generic.f90– | |
| ○ | |
| ▼get_uvwT.f90– | |
| ○ | |
| ▼central.f90– | |
| ○ | |
| ▼savis_Colin_NS.f90– | |
| ○ | |
| ▼mod_copy.f90– | |
| ○ | |
| ▼savis_spec.f90– | |
| ○ | |
| ▼velocity_group.f90– | |
| ○ | |
| ▼cons_tens_cell.f90– | |
| ○ | |
| ▼scheme.f90– | |
| ○ | |
| ▼compute_diffus_max.f90– | |
| ○ | |
| Source Object | Issue |
| ▼AVBP_V7_dev.KRAKEN– | |
| ▼cons_tens.f90– | |
| ○ | |
| ▼msource_cell.f90– | |
| ○ | |
| ▼get_Y.f90– | |
| ○ | |
| ▼bndflux_les2.f90– | |
| ○ | |
| ▼gradqen.f90– | |
| ○ | |
| ▼mass_product.f90– | |
| ○ | |
| ▼euler_timestep.f90– | |
| ○ | |
| ▼nsflux_les.f90– | |
| ○ | |
| ▼avis_lp_rre.f90– | |
| ○ | |
| ▼specsource_cell.f90– | |
| ○ | |
| ▼efcy_dyn.f90– | |
| ○ | |
| ▼stress_nv2.f90– | |
| ○ | |
| ▼heatflux_nv2.f90– | |
| ○ | |
| ▼mod_pmesh_transfer.f90– | |
| ○ | |
| ▼temperature.f90– | |
| ○ | |
| ▼grad_4obj.f90– | |
| ○ | |
| ▼div.f90– | |
| ○ | |
| ▼FE_add_dw.f90– | |
| ○ | |
| ▼wtowp.f90– | |
| ○ | |
| ▼avis_lp.f90– | |
| ○ | |
| ▼update.f90– | |
| ○ | |
| ▼scatter_grad.f90– | |
| ○ | |
| ▼update_rho.f90– | |
| ○ | |
| ▼mod_pmesh_scatter_add.f90– | |
| ○ | |
| ▼scatter_add.f90– | |
| ○ | |
| ▼rot_2delta.f90– | |
| ○ | |
| ▼specflux_visc_c_nv.f90– | |
| ○ | |
| ▼rrate_cell.f90– | |
| ○ | |
| ▼eflux.f90– | |
| ○ | |
| ▼compute_FE_implicit_residual.f90– | |
| ○ | |
| ▼wale_cell.f90– | |
| ○ | |
| ▼scatter_o_add.f90– | |
| ○ | |
| ▼gather_o_cpy.f90– | |
| ○ | |
| ▼ns_timestep.f90– | |
| ○ | |
| ▼specflux_invc.f90– | |
| ○ | |
| ▼thermo_variables.f90– | |
| ○ | |
| ▼prebound.f90– | |
| ○ | |
| ▼scatter_o_sub.f90– | |
| ○ | |
| ▼central_nv.f90– | |
| ○ | |
| ▼laxwe.f90– | |
| ○ | |
| ▼ave.f90– | |
| ○ | |
| ▼scale.f90– | |
| ○ | |
| ▼boxe_2delta.f90– | |
| ○ | |
| ▼calc_diffus.f90– | |
| ○ | |
| ▼calc_visc_eff.f90– | |
| ○ | |
| ▼central_nv_bnd_generic.f90– | |
| ○ | |
| ▼get_uvwT.f90– | |
| ○ | |
| ▼central.f90– | |
| ○ | |
| ▼savis_Colin_NS.f90– | |
| ○ | |
| ▼mod_copy.f90– | |
| ○ | |
| ▼savis_spec.f90– | |
| ○ | |
| ▼velocity_group.f90– | |
| ○ | |
| ▼cons_tens_cell.f90– | |
| ○ | |
| ▼scheme.f90– | |
| ○ | |
| ▼compute_diffus_max.f90– | |
| ○ | |
| r0 | r1 | r2 | r3 |
| Experiment Name | | | | |
| Application | /home/exter/camus/avbp-dev/HOST/KRAKEN/BIN/AVBP_V7_dev.KRAKEN | same as r0 | same as r0 | same as r0 |
| Timestamp | 2025-03-03 16:12:01 | same as r0 | same as r0 | same as r0 |
| Experiment Type | MPI; | same as r0 | same as r0 | same as r0 |
| Machine | node108,node106,node112,node107,node109,node113,node111,node110 | same as r0 | same as r0 | same as r0 |
| Architecture | x86_64 | same as r0 | same as r0 | same as r0 |
| Micro Architecture | SKYLAKE | same as r0 | same as r0 | same as r0 |
| Model Name | Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz | same as r0 | same as r0 | same as r0 |
| Cache Size | 25344 KB | same as r0 | same as r0 | same as r0 |
| Number of Cores | 18 | same as r0 | same as r0 | same as r0 |
| Maximal Frequency | 3.7 GHz | same as r0 | same as r0 | same as r0 |
| OS Version | Linux 4.18.0-553.el8_10.x86_64 #1 SMP Fri May 24 13:05:10 UTC 2024 | same as r0 | same as r0 | same as r0 |
| Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 |
| Micro Architecture used during static analysis | SKYLAKE | same as r0 | same as r0 | same as r0 |
| Compilation Options |
AVBP_V7_dev.KRAKEN: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/softs/local_intel/phdf5/1.8.20/include -I/softs/local_intel/parmetis/403_64/include -I/softs/local_intel/ptscotch/6.0.5a/include -I. -I../SOURCES/GENERIC/ -IAMR_INTERFACE/ -IBNDY/ -ICFD/ -ICHEM/ -ICHEM/ANALYTIC/ -ICHEM/ANALYTIC/LIB/ -ICHEM/HYB/ -ICHEM/NOX/ -ICHEM/SOOT_ANALYTIC/ -ICOMMON/ -ICOUPLING/ -IGENERIC/ -IIO/ -ILAGRANGE/ -ILAGRANGE/SOOT_EL/ -ILES/ -IMAIN/ -IMAIN/COMPUTE/ -IMAIN/SLAVE/ -INUMERICS/ -IPARSER/ -IPLASMA/ -IPLASMA/CHEMISTRY/ -IPLASMA/CHEMISTRY/CUSTOM_KINETICS_LIB/ -IPLASMA/DRIFTDIFFUSION/ -IPLASMA/DRIFTDIFFUSION/SCHEMES/ -IPLASMA/ELECTROMAG/ -IPLASMA/EULER/ -IPLASMA/FREEZE/ -IPLASMA/PHOTO/ -IPLASMA/THERMO/ -IPMESH/generic/ -IPMESH/interf_avbp/ -IPMESH/interp_tree_search/ -IPMESH/pmeshlib/ -IPMESH/pproc/ -ISMOOTH/ -ITTC/ -ITTC/LES/ -I/softs/intel/oneapi/mpi/2021.10.0//include -I/softs/intel/oneapi/mpi/2021.10.0/include -g -O3 -fpp -traceback -fno-alias -ip -assume byterecl -convert big_endian -align -march=core-avx2 -fma -axCORE-AVX2 -DHAS_PMETIS -DPARMETIS4 -DMETIS5 -DHAS_PTSCOTCH -c -o GENERIC/gather_o_cpy.o | same as r0 | same as r0 | same as r0 |
| Number of processes observed | 36 | 72 | 144 | 288 |
| Number of threads observed | 36 | 72 | 144 | 288 |
| Frequency Driver | intel_cpufreq | same as r0 | same as r0 | same as r0 |
| Frequency Governor | performance | same as r0 | same as r0 | same as r0 |
| Huge Pages | always | same as r0 | same as r0 | same as r0 |
| Hyperthreading | off | same as r0 | same as r0 | same as r0 |
| Number of sockets | 2 | same as r0 | same as r0 | same as r0 |
| Number of cores per socket | 18 | same as r0 | same as r0 | same as r0 |
| MAQAO version | 2.21.1 | same as r0 | same as r0 | same as r0 |
| MAQAO build | 5485021ea6c10887b73ecb44ccd8bc21f8bac10a::20250204-111307 | same as r0 | same as r0 | same as r0 |
| Comments | | same as r0 | same as r0 | same as r0 |