OV - - Global

AVBP_V7_dev.KRAKEN - 2025-06-18 16:16:30 - MAQAO 2025.1.0

Help is available by moving the cursor above any symbol or by checking MAQAO website.

▶Filter Information

There is no filter information to display

Global Metrics

Total Time (s)		84.13
Max (Thread Active Time) (s)		82.72
Average Active Time (s)		82.31
Activity Ratio (%)		98.1
Average number of active threads		35.221
Affinity Stability (%)		99.1
Time in analyzed loops (%)		55.6
Time in analyzed innermost loops (%)		39.4
Time in user code (%)		54.5
Compilation Options Score (%)		100
Array Access Efficiency (%)		81.1

Potential Speedups
Iterations Count		1.07
Perfect Flow Complexity		1.00
Perfect OpenMP + MPI + Pthread		1.14
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution		1.17
No Scalar Integer	Potential Speedup	1.12
No Scalar Integer	Nb Loops to get 80%	18
FP Vectorised	Potential Speedup	1.08
FP Vectorised	Nb Loops to get 80%	20
Fully Vectorised	Potential Speedup	1.34
Fully Vectorised	Nb Loops to get 80%	41
Data In L1 Cache	Potential Speedup	1.02
Data In L1 Cache	Nb Loops to get 80%	3
FP Arithmetic Only	Potential Speedup	1.34
FP Arithmetic Only	Nb Loops to get 80%	41

CQA Potential Speedups Summary

Average Active Threads Count⏎

Loop Based Profile⏎

Innermost Loop Based Profile⏎

Application Categorization⏎

Compilation Options⏎

Source Object	Issue
▼AVBP_V7_dev.KRAKEN–
○cons_tens.f90
○scatter_o_add.f90
○nsflux_les.f90
○ns_timestep.f90
○euler_timestep.f90
○compute_FE_implicit_residual.f90
○specsource_cell.f90
○heatflux_nv2.f90
○temperature.f90
○grad_4obj.f90
○avis_lp.f90
○FE_add_dw.f90
○specflux_invc.f90
○wtowp.f90
○savis_Colin_spec.f90
○scatter_grad.f90
○avis_lp_rre.f90
○ave.f90
○update.f90
○rot_2delta.f90
○specflux_visc_c_nv.f90
○rrate_cell.f90
○mod_pmesh_transfer.f90
○central.f90
○wale_cell.f90
○div.f90
○gather_o_cpy.f90
○mod_pmesh_scatter_add.f90
○scatter_o_sub.f90
○thermo_variables.f90
○prebound.f90
○efcy_dyn.f90
○central_nv.f90
○laxwe.f90
○savis_Colin_NS.f90
○calc_visc_eff.f90
○boxe_2delta.f90
○calc_diffus.f90
○mass_product.f90
○stress_nv2.f90
○eflux.f90
○scheme.f90
○scatter_add.f90
○get_Y.f90
○get_uvwT.f90
○velocity_group.f90
○cons_tens_cell.f90
○scale.f90
○compute_diffus_max.f90

Loop Iteration Count Profile⏎

Loop Path Count Profile⏎

Cumulated Speedup If No Scalar Integer⏎

Cumulated Speedup If FP Vectorized⏎

Cumulated Speedup If Fully Vectorized⏎

Cumulated Speedup If Data In L1⏎

Cumulated Speedup If FP Arithmetic Only⏎

Experiment Summary

Experiment Name
Application	/home/exter/camus/avbp-dev/HOST/KRAKEN/BIN/AVBP_V7_dev.KRAKEN
Timestamp	2025-06-18 16:16:30	Universal Timestamp	1750256190
Number of processes observed	36	Number of threads observed	36
Experiment Type	MPI;
Machine	krakenepyc1.cluster
Model Name	AMD EPYC 7702 64-Core Processor
Architecture	x86_64	Micro Architecture	ZEN_V2
Cache Size	512 KB	Number of Cores	64
OS Version	Linux 4.18.0-553.el8_10.x86_64 #1 SMP Fri May 24 13:05:10 UTC 2024
Architecture used during static analysis	x86_64	Micro Architecture used during static analysis	ZEN_V2
Frequency Driver	acpi-cpufreq	Frequency Governor	performance
Huge Pages	always	Hyperthreading	off
Number of sockets	2	Number of cores per socket	64
Compilation Options	AVBP_V7_dev.KRAKEN: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/softs/local_intel/phdf5/1.8.20/include -I/softs/local_intel/parmetis/403_64/include -I/softs/local_intel/ptscotch/6.0.5a/include -I. -I../SOURCES/GENERIC/ -IAMR_INTERFACE/ -IBNDY/ -ICFD/ -ICHEM/ -ICHEM/ANALYTIC/ -ICHEM/ANALYTIC/LIB/ -ICHEM/HYB/ -ICHEM/NOX/ -ICHEM/SOOT_ANALYTIC/ -ICOMMON/ -ICONDUCTION/ -ICOUPLING/ -IGENERIC/ -IIO/ -ILAGRANGE/ -ILAGRANGE/SOOT_EL/ -ILES/ -IMAIN/ -IMAIN/COMPUTE/ -IMAIN/SLAVE/ -INUMERICS/ -IPARSER/ -IPLASMA/ -IPLASMA/CHEMISTRY/ -IPLASMA/CHEMISTRY/CUSTOM_KINETICS_LIB/ -IPLASMA/DRIFTDIFFUSION/ -IPLASMA/DRIFTDIFFUSION/SCHEMES/ -IPLASMA/ELECTROMAG/ -IPLASMA/EULER/ -IPLASMA/FREEZE/ -IPLASMA/PHOTO/ -IPLASMA/THERMO/ -IPMESH/generic/ -IPMESH/interf_avbp/ -IPMESH/interp_tree_search/ -IPMESH/pmeshlib/ -IPMESH/pproc/ -ISMOOTH/ -ITTC/ -ITTC/LES/ -I/softs/intel/oneapi/mpi/2021.10.0//include -I/softs/intel/oneapi/mpi/2021.10.0/include -g -O3 -fpp -traceback -fno-alias -ip -assume byterecl -convert big_endian -align -march=core-avx2 -fma -axCORE-AVX2 -DHAS_PMETIS -DPARMETIS4 -DMETIS5 -DHAS_PTSCOTCH -c -o GENERIC/gather_o_cpy.o
Comments

Configuration Summary

Dataset
Run Command	<executable>
MPI Command	mpirun -np <number_processes>
Number Processes	36
Number Nodes	1
Number Processes per Nodes	36
Filter	{type = number ; nvprof = 10 ; ndecan = 10 ; }
Profile Start	{unit = none ; value = 0 ; }
Maximal Path Number	4

Report Configuration

AVBP_V7_dev.KRAKEN - 2025-06-18 16:16:30 - MAQAO 2025.1.0

▶Filter Information

Global Metrics

CQA Potential Speedups Summary

Average Active Threads Count⏎

Loop Based Profile⏎

Innermost Loop Based Profile⏎

Application Categorization⏎

Compilation Options⏎

Loop Iteration Count Profile⏎

Loop Path Count Profile⏎

Cumulated Speedup If No Scalar Integer⏎

Cumulated Speedup If FP Vectorized⏎

Cumulated Speedup If Fully Vectorized⏎

Cumulated Speedup If Data In L1⏎

Cumulated Speedup If FP Arithmetic Only⏎

Experiment Summary

Configuration Summary