* Info: Detected 4 Lprof instances in node098.
If this is incorrect, rerun with number-processes-per-node=X
__ ______ _____ __ ________ _
/\ \ / / _ \| __ \ \ \ / /____ || |
/ \ \ / /| |_) | |__) | \ \ / / / /_| | _____ __
/ /\ \ \/ / | _ <| ___/ \ \/ / / / _` |/ _ \ \ / /
/ ____ \ / | |_) | | \ / / / (_| | __/\ V /
/_/ \_\/ |____/|_| \/ /_/ \__,_|\___| \_/
Using branch :
Version date : Mon, 18 Nov 2024 11:40:50 +0100
Commit : b58af1ea20
MPI processes : 4
Computation #1/1
Compilation info : mpiifort -g -O3 -fpp -traceback -fno-alias -ip -assume byterecl -convert big_endian -align -march=core-avx2 -fma -axCORE-AVX2 -I/softs/local_intel/phdf5/1.8.20/include -DHAS_PMETIS -I/softs/local_intel/parmetis/403_64/include -DPARMETIS4 -DMETIS5 -DHAS_PTSCOTCH -I/softs/local_intel/ptscotch/6.0.5a/include
Compilation wrapper info :
Compilation user : camus
Compilation date : 2025-01-13 17:18:55
Compilation MPI version : Intel(R) MPI Library for Linux* OS, Version 2021.10 Build 20230619 (id: c2e19c2f3e)
AVBP version : 7.15.0
Reading input file version : 7.15.0
----> Reading run parameters : .//run.params
----> Using NATURAL reordering
----> command.dat file API is enabled
----> Using TTGC
with UNCLOSED boundary terms
----> Using colin viscosity model
>>>>> WARNING
>>>>> No solution storage required: additional variables are deactivated!
>>>>> WARNING
>>>>> Temporals are not computed!
----> Reading mesh : .//../Mesh/Bench_simple.mesh.h5
Meshfile signature: e736f2c3c25f98227815b524d9568f9e
----> el2part file found: el2part_4.h5
Use existing partitioning
>> Reading ...took 0.084s
----> Initialize the solution writers (4 writers)
>>>>> WARNING
>>>>> No instantaneous solution storage required: the calculation of additional variables is deactivated.
Checking TFLES table parameters...
Flame thickening is applied with fthick = 17.00
----> Reading boundary conditions in asciibound file : .//../Mesh/Bench_simple.asciiBound.key
_______________________________________________________________________________________
| Boundary patches (no reordering) |
|______________________________________________________________________________________|
| Patch number Patch name Boundary condition |
| ------------ ---------- ------------------ |
| 1 INLET INLET_RELAX_UVW_T_Y |
| 2 OUTLET OUTLET_RELAX_P |
| 3 WALL WALL_NOSLIP_ADIAB |
|______________________________________________________________________________________|
______________________________________________________________
| Info on initial grid |
|_____________________________________________________________|
| number of dimensions : 3 |
| number of nodes : 514475 |
| number of cells : 2958592 |
| - tetrahedra : 2958592 |
| number of cell per group : 64 |
| number of boundary nodes : 48048 |
| number of periodic nodes : 0 |
| number of axi-periodic nodes : 0 |
|_____________________________________________________________|
| After partitioning |
|_____________________________________________________________|
| number of nodes : 526504 |
| extra nodes due to partitioning : 12029 [+ 2.34‰] |
|_____________________________________________________________|
______________________________________________________________
| Partitioning Quality |
|_____________________________________________________________|
| Maximum number of neighbors : 3.00 |
| Average number of neighbors : 2.50 |
| Maximum number of exchange nodes : 7799.00 |
| Average number of exchange nodes : 5994.25 |
|_____________________________________________________________|
----> Reading initial solution : .//../Mesh/Bench_simple.h5
----> Reading took 0.146s
______________________________________________________________
| Info on chemistry |
|_____________________________________________________________|
| Kinetic scheme : CH4-AIR-2S-CM2_FLAMMABLE |
| Validity range : 300K/1bar |
| |
| Chemical reaction #1 |
| fthick : 1.70000000E+01 |
| Preexponential / fthick [SI] : 2.00000000E+09 |
| Activation temperature [K] : 1.76130282E+04 |
| |
| Chemical reaction #2 |
| fthick : 1.70000000E+01 |
| Preexponential / fthick [SI] : 2.00000000E+06 |
| Activation temperature [K] : 6.03875251E+03 |
|_____________________________________________________________|
______________________________________________________________
| Info on initial solution |
|_____________________________________________________________|
| number of Navier-Stokes equations : 5 |
| number of species : 6 |
| number of reactions : 2 |
| number of tpf equations : 0 |
| number of fictive species : 0 |
| initial iteration : 0 |
| initial time : 0.00000000E+00 |
|_____________________________________________________________|
----> Reading solutbound : .//../Mesh/Bench_simple.solutBound.h5
- Using 6.X format
----> Reading took 0.011s
----> Initialising metrics
----> Total volume of the mesh [m3] : 1.44077204E-01
----> Smallest cell volume [m3] : 2.97811897E-11
----> Found cached wall distance computation. Checking: ./ywall.h5
> Signatures match
----> Reading cached wall distance computation: ./ywall.h5
----> Reading took 0.004s
----> Boundary MPIs: 4
----> End pre-processing.
________________________________________________________________________________________________________
----> Starts the temporal loop.
----> End computation.
________________________________________________________________________________________________________
____________________________________________________________________________________________
| 4 MPI tasks Elapsed real time [s] [s.cores] [h.cores] |
|___________________________________________________________________________________________|
| AVBP : 634.86 2.5394E+03 7.0540E-01 |
| Temporal loop : 627.89 2.5116E+03 6.9766E-01 |
| Per simulated second : 1.2706E+07 5.0822E+07 1.4117E+04 |
| Per iteration : 3.1395 1.2558E+01 |
|-------------------------------------------------------------------------------------------|
| RCT [s.mpi/node/it] : 2.44090806E-05 |
|___________________________________________________________________________________________|
----> Initial physical time : 0.00000000E+00
Initial iteration : 0
Initial timestep : 2.46481884E-07
----> Final physical time : 4.94188563E-05
Final iteration : 200
Final timestep : 2.47527782E-07
----> Simulated physical time : 4.94188563E-05
Simulated iterations : 200
________________________________________________________________________________________________________
TIMERS
________________________________________________________________________________________________________
Prints relevant timers and breaks down percentage regarding reference timers.
> The 'Total slave simulation' time corresponds to the 1st level, and is measured by slave_timer (sum of pre temporal loop, temporal loop and post temporal loop).
> The 'Computation' time corresponds to the time integration loops, and is measured by rungekutta_timer.
> Levels are depicted using [X.Y.Z. ...] lists. The number of entry in the list corresponds to the level.
> References to the upper level is made to compute the contribution of one sub-level to its parent level.
> The times displayed are those of the master processor.
> For each timer, the minimum, maximum and mean values for all processors are also shown in the 3 right-hand columns.
> A json file 'timers.json' containing all the data is also available in the temporal output directory.
----- 1st level timers
time [s] | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] [ ]
> [0] Total slave simulation : 6.3486E+02 | 100.00% [ 6.3486E+02 6.3486E+02 6.3486E+02 ]
----- 2nd level timers
time [s] | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] [ ]
> > [0.1] Pre temporal loop : 6.9588E+00 | 1.10% [ 6.9577E+00 6.9580E+00 6.9588E+00 ]
> > [0.2] Temporal loop : 6.2789E+02 | 98.90% [ 6.2789E+02 6.2789E+02 6.2789E+02 ]
> > [0.2a] Temporal loop without IO : 6.2789E+02 | 98.90% [ 6.2789E+02 6.2789E+02 6.2789E+02 ]
> > [0.3] Post temporal loop : 5.0439E-03 | 0.00% [ 5.0439E-03 9.5120E-03 1.1661E-02 ]
> > [0.4] Point to Point communications : 6.7393E+00 | 1.06% [ 6.7393E+00 3.4461E+01 7.4534E+01 ]
----- 3rd level timers
time [s] | relative to | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] | upper level [%] [ ]
> > [0.1] Pre temporal loop : 6.9588E+00 | 1.10% [ 6.9577E+00 6.9580E+00 6.9588E+00 ]
> > > [0.1.1] Build online postprocessing objects : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > [0.2] Temporal loop : 6.2789E+02 | 98.90% [ 6.2789E+02 6.2789E+02 6.2789E+02 ]
> > > [0.2.1] Computation : 6.2600E+02 | 98.60% | 99.70% [ 6.2553E+02 6.2626E+02 6.2789E+02 ]
> > > [0.2.2] Temporal post-processing : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > [0.2.3] Instantaneous solution post-processing : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > [0.2.4] Average solution post-processing : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > [0.2.5] Online post-processing compute and storage : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
----- 4th level timers: focus on Computation level (rungekutta_timer)
time [s] | relative to | relative to | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] | computation [%]| upper level [%][ ]
> > > [0.2.1] Computation : 6.2600E+02 | 98.60% | 100.00% [ 6.2553E+02 6.2626E+02 6.2789E+02 ]
> > > > [0.2.1.1] Convective scheme : 7.6094E+01 | 11.99% | 12.16% | 12.16% [ 7.3186E+01 7.5041E+01 7.6540E+01 ]
> > > > [0.2.1.2] Diffusion operator : 1.9897E+02 | 31.34% | 31.79% | 31.79% [ 1.7837E+02 1.9087E+02 2.0120E+02 ]
> > > > [0.2.1.4] Time-step calculation : 1.6518E+01 | 2.60% | 2.64% | 2.64% [ 1.6272E+01 1.6686E+01 1.7065E+01 ]
> > > > [0.2.1.5] Transport calculation : 2.3860E+00 | 0.38% | 0.38% | 0.38% [ 2.0188E+00 2.2318E+00 2.4057E+00 ]
> > > > [0.2.1.6] Thermo calculation : 3.7146E+00 | 0.59% | 0.59% | 0.59% [ 3.2153E+00 3.5294E+00 3.8009E+00 ]
> > > > [0.2.1.7] Gradient calculation : 5.7523E+01 | 9.06% | 9.19% | 9.19% [ 5.4949E+01 5.6827E+01 5.8892E+01 ]
> > > > [0.2.1.8] Boundary : 4.3352E+00 | 0.68% | 0.69% | 0.69% [ 2.9459E+00 5.1182E+00 9.9974E+00 ]
> > > > [0.2.1.9] Turbulent viscosity model : 1.2512E+01 | 1.97% | 2.00% | 2.00% [ 1.2677E+01 1.2677E+01 1.2668E+01 ]
> > > > [0.2.1.10] Combustion (source term + TFLES + efcy + efcy I0) : 5.6530E+01 | 8.90% | 9.03% | 9.03% [ 5.5949E+01 5.6434E+01 5.7060E+01 ]
> > > > > [0.2.1.10.1] Chemical source terms calculation : 4.5723E+01 | 7.20% | 7.30% | 80.88% [ 4.5494E+01 4.5720E+01 4.6019E+01 ]
> > > > > [0.2.1.10.2] TFLES model calculation : 0.0000E+00 | 0.00% | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > > > [0.2.1.10.3] Efficiency function calculation : 1.0806E+01 | 1.70% | 1.73% | 19.12% [ 1.0454E+01 1.0714E+01 1.1040E+01 ]
> > > > > [0.2.1.10.4] Efficiency I0 function calculation : 0.0000E+00 | 0.00% | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > > [0.2.1.11] Artificial viscosity : 7.0354E+01 | 11.08% | 11.24% | 11.24% [ 6.2859E+01 6.7143E+01 7.0354E+01 ]
> > > > [0.2.1.17] Source terms : 0.0000E+00 | 0.00% | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
----> End of AVBP session
----> Found 3 warning messages for this computation, check your output file!
***** Memory usage (system): Max: 720.359 MB (rank:1) Min: 622.836 MB (rank:2) Ave: 670.936 MB Std: 38.572 MB
***** Maximum memory (mod_alloc) : 492612740 B ( 4.697921E+02 MB)
IPL WARN> unknown option 'spread'
Your experiment path is /scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch/exter/camus/DATASET/SIMPLE/Run/test_avbp_200iter_m4/tools/lprof_npsu_run_0 #
###############################################################################################################################################