* Info: Detected 4 Lprof instances in node160.
If this is incorrect, rerun with number-processes-per-node=X
__ ______ _____ __ ________ _
/\ \ / / _ \| __ \ \ \ / /____ || |
/ \ \ / /| |_) | |__) | \ \ / / / /_| | _____ __
/ /\ \ \/ / | _ <| ___/ \ \/ / / / _` |/ _ \ \ / /
/ ____ \ / | |_) | | \ / / / (_| | __/\ V /
/_/ \_\/ |____/|_| \/ /_/ \__,_|\___| \_/
Using branch :
Version date : Mon, 18 Nov 2024 11:40:50 +0100
Commit : b58af1ea20
MPI processes : 4
Computation #1/1
Compilation info : mpiifort -g -O3 -fpp -traceback -fno-alias -ip -assume byterecl -convert big_endian -align -march=core-avx2 -fma -axCORE-AVX2 -I/softs/local_intel/phdf5/1.8.20/include -DHAS_PMETIS -I/softs/local_intel/parmetis/403_64/include -DPARMETIS4 -DMETIS5 -DHAS_PTSCOTCH -I/softs/local_intel/ptscotch/6.0.5a/include
Compilation wrapper info :
Compilation user : camus
Compilation date : 2025-01-13 17:18:55
Compilation MPI version : Intel(R) MPI Library for Linux* OS, Version 2021.10 Build 20230619 (id: c2e19c2f3e)
AVBP version : 7.15.0
Reading input file version : 7.15.0
----> Reading run parameters : .//run.params
----> Using NATURAL reordering
----> command.dat file API is enabled
----> Using TTGC
with UNCLOSED boundary terms
----> Using colin viscosity model
>>>>> WARNING
>>>>> No solution storage required: additional variables are deactivated!
>>>>> WARNING
>>>>> Temporals are not computed!
----> Reading mesh : .//../Mesh/Bench_simple.mesh.h5
Meshfile signature: e736f2c3c25f98227815b524d9568f9e
----> el2part file found: el2part_4.h5
Use existing partitioning
>> Reading ...took 0.090s
----> Initialize the solution writers (4 writers)
>>>>> WARNING
>>>>> No instantaneous solution storage required: the calculation of additional variables is deactivated.
Checking TFLES table parameters...
Flame thickening is applied with fthick = 17.00
----> Reading boundary conditions in asciibound file : .//../Mesh/Bench_simple.asciiBound.key
_______________________________________________________________________________________
| Boundary patches (no reordering) |
|______________________________________________________________________________________|
| Patch number Patch name Boundary condition |
| ------------ ---------- ------------------ |
| 1 INLET INLET_RELAX_UVW_T_Y |
| 2 OUTLET OUTLET_RELAX_P |
| 3 WALL WALL_NOSLIP_ADIAB |
|______________________________________________________________________________________|
______________________________________________________________
| Info on initial grid |
|_____________________________________________________________|
| number of dimensions : 3 |
| number of nodes : 514475 |
| number of cells : 2958592 |
| - tetrahedra : 2958592 |
| number of cell per group : 64 |
| number of boundary nodes : 48048 |
| number of periodic nodes : 0 |
| number of axi-periodic nodes : 0 |
|_____________________________________________________________|
| After partitioning |
|_____________________________________________________________|
| number of nodes : 526504 |
| extra nodes due to partitioning : 12029 [+ 2.34‰] |
|_____________________________________________________________|
______________________________________________________________
| Partitioning Quality |
|_____________________________________________________________|
| Maximum number of neighbors : 3.00 |
| Average number of neighbors : 2.50 |
| Maximum number of exchange nodes : 7799.00 |
| Average number of exchange nodes : 5994.25 |
|_____________________________________________________________|
----> Reading initial solution : .//../Mesh/Bench_simple.h5
----> Reading took 0.159s
______________________________________________________________
| Info on chemistry |
|_____________________________________________________________|
| Kinetic scheme : CH4-AIR-2S-CM2_FLAMMABLE |
| Validity range : 300K/1bar |
| |
| Chemical reaction #1 |
| fthick : 1.70000000E+01 |
| Preexponential / fthick [SI] : 2.00000000E+09 |
| Activation temperature [K] : 1.76130282E+04 |
| |
| Chemical reaction #2 |
| fthick : 1.70000000E+01 |
| Preexponential / fthick [SI] : 2.00000000E+06 |
| Activation temperature [K] : 6.03875251E+03 |
|_____________________________________________________________|
______________________________________________________________
| Info on initial solution |
|_____________________________________________________________|
| number of Navier-Stokes equations : 5 |
| number of species : 6 |
| number of reactions : 2 |
| number of tpf equations : 0 |
| number of fictive species : 0 |
| initial iteration : 0 |
| initial time : 0.00000000E+00 |
|_____________________________________________________________|
----> Reading solutbound : .//../Mesh/Bench_simple.solutBound.h5
- Using 6.X format
----> Reading took 0.012s
----> Initialising metrics
----> Total volume of the mesh [m3] : 1.44077204E-01
----> Smallest cell volume [m3] : 2.97811897E-11
----> Found cached wall distance computation. Checking: ./ywall.h5
> Signatures match
----> Reading cached wall distance computation: ./ywall.h5
----> Reading took 0.004s
----> Boundary MPIs: 4
----> End pre-processing.
________________________________________________________________________________________________________
----> Starts the temporal loop.
----> End computation.
________________________________________________________________________________________________________
____________________________________________________________________________________________
| 4 MPI tasks Elapsed real time [s] [s.cores] [h.cores] |
|___________________________________________________________________________________________|
| AVBP : 10.52 4.2078E+01 1.1688E-02 |
| Temporal loop : 3.20 1.2782E+01 3.5504E-03 |
| Per simulated second : 1.2964E+07 5.1856E+07 1.4404E+04 |
| Per iteration : 3.1954 1.2782E+01 |
|-------------------------------------------------------------------------------------------|
| RCT [s.mpi/node/it] : 2.48438120E-05 |
|___________________________________________________________________________________________|
----> Initial physical time : 0.00000000E+00
Initial iteration : 0
Initial timestep : 2.46481884E-07
----> Final physical time : 2.46481884E-07
Final iteration : 1
Final timestep : 2.46481884E-07
----> Simulated physical time : 2.46481884E-07
Simulated iterations : 1
________________________________________________________________________________________________________
TIMERS
________________________________________________________________________________________________________
Prints relevant timers and breaks down percentage regarding reference timers.
> The 'Total slave simulation' time corresponds to the 1st level, and is measured by slave_timer (sum of pre temporal loop, temporal loop and post temporal loop).
> The 'Computation' time corresponds to the time integration loops, and is measured by rungekutta_timer.
> Levels are depicted using [X.Y.Z. ...] lists. The number of entry in the list corresponds to the level.
> References to the upper level is made to compute the contribution of one sub-level to its parent level.
> The times displayed are those of the master processor.
> For each timer, the minimum, maximum and mean values for all processors are also shown in the 3 right-hand columns.
> A json file 'timers.json' containing all the data is also available in the temporal output directory.
----- 1st level timers
time [s] | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] [ ]
> [0] Total slave simulation : 1.0519E+01 | 100.00% [ 1.0519E+01 1.0523E+01 1.0525E+01 ]
----- 2nd level timers
time [s] | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] [ ]
> > [0.1] Pre temporal loop : 7.3190E+00 | 69.58% [ 7.3180E+00 7.3183E+00 7.3190E+00 ]
> > [0.2] Temporal loop : 3.1954E+00 | 30.38% [ 3.1954E+00 3.1954E+00 3.1954E+00 ]
> > [0.2a] Temporal loop without IO : 3.1954E+00 | 30.38% [ 3.1954E+00 3.1954E+00 3.1954E+00 ]
> > [0.3] Post temporal loop : 4.9880E-03 | 0.05% [ 4.9880E-03 8.8326E-03 1.0924E-02 ]
> > [0.4] Point to Point communications : 4.9582E-02 | 0.47% [ 4.9582E-02 2.0242E-01 4.0238E-01 ]
----- 3rd level timers
time [s] | relative to | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] | upper level [%] [ ]
> > [0.1] Pre temporal loop : 7.3190E+00 | 69.58% [ 7.3180E+00 7.3183E+00 7.3190E+00 ]
> > > [0.1.1] Build online postprocessing objects : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > [0.2] Temporal loop : 3.1954E+00 | 30.38% [ 3.1954E+00 3.1954E+00 3.1954E+00 ]
> > > [0.2.1] Computation : 3.1889E+00 | 30.31% | 99.80% [ 3.1835E+00 3.1879E+00 3.1952E+00 ]
> > > [0.2.2] Temporal post-processing : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > [0.2.3] Instantaneous solution post-processing : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > [0.2.4] Average solution post-processing : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > [0.2.5] Online post-processing compute and storage : 0.0000E+00 | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
----- 4th level timers: focus on Computation level (rungekutta_timer)
time [s] | relative to | relative to | relative to [ min [s] mean [s] max [s] ]
| tot. slave [%] | computation [%]| upper level [%][ ]
> > > [0.2.1] Computation : 3.1889E+00 | 30.31% | 100.00% [ 3.1835E+00 3.1879E+00 3.1952E+00 ]
> > > > [0.2.1.1] Convective scheme : 3.8657E-01 | 3.67% | 12.12% | 12.12% [ 3.7177E-01 3.8115E-01 3.8908E-01 ]
> > > > [0.2.1.2] Diffusion operator : 9.9443E-01 | 9.45% | 31.18% | 31.18% [ 8.9088E-01 9.5569E-01 1.0029E+00 ]
> > > > [0.2.1.4] Time-step calculation : 8.3679E-02 | 0.80% | 2.62% | 2.62% [ 8.2311E-02 8.4391E-02 8.6241E-02 ]
> > > > [0.2.1.5] Transport calculation : 1.1653E-02 | 0.11% | 0.37% | 0.37% [ 1.0025E-02 1.1004E-02 1.1947E-02 ]
> > > > [0.2.1.6] Thermo calculation : 4.4300E-02 | 0.42% | 1.39% | 1.39% [ 2.9696E-02 4.0828E-02 4.5505E-02 ]
> > > > [0.2.1.7] Gradient calculation : 2.9280E-01 | 2.78% | 9.18% | 9.18% [ 2.8236E-01 2.9070E-01 2.9655E-01 ]
> > > > [0.2.1.8] Boundary : 2.6828E-02 | 0.26% | 0.84% | 0.84% [ 1.7385E-02 2.9182E-02 5.2640E-02 ]
> > > > [0.2.1.9] Turbulent viscosity model : 6.2927E-02 | 0.60% | 1.97% | 1.97% [ 6.4486E-02 6.4714E-02 6.6846E-02 ]
> > > > [0.2.1.10] Combustion (source term + TFLES + efcy + efcy I0) : 2.8822E-01 | 2.74% | 9.04% | 9.04% [ 2.8426E-01 2.8693E-01 2.8990E-01 ]
> > > > > [0.2.1.10.1] Chemical source terms calculation : 2.3295E-01 | 2.21% | 7.30% | 80.82% [ 2.3065E-01 2.3202E-01 2.3334E-01 ]
> > > > > [0.2.1.10.2] TFLES model calculation : 0.0000E+00 | 0.00% | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > > > [0.2.1.10.3] Efficiency function calculation : 5.5274E-02 | 0.53% | 1.73% | 19.18% [ 5.3612E-02 5.4909E-02 5.6559E-02 ]
> > > > > [0.2.1.10.4] Efficiency I0 function calculation : 0.0000E+00 | 0.00% | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
> > > > [0.2.1.11] Artificial viscosity : 3.5271E-01 | 3.35% | 11.06% | 11.06% [ 3.1419E-01 3.3589E-01 3.5271E-01 ]
> > > > [0.2.1.17] Source terms : 0.0000E+00 | 0.00% | 0.00% | 0.00% [ 0.0000E+00 0.0000E+00 0.0000E+00 ]
----> End of AVBP session
----> Found 3 warning messages for this computation, check your output file!
***** Memory usage (system): Max: 716.398 MB (rank:1) Min: 621.879 MB (rank:2) Ave: 667.792 MB Std: 38.369 MB
***** Maximum memory (mod_alloc) : 492612740 B ( 4.697921E+02 MB)
Info: 3/4 lprof instances finished
IPL WARN> unknown option 'close'
Your experiment path is /home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0
To display your profiling results:
##########################################################################################################################################
# LEVEL | REPORT | COMMAND #
##########################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/exter/camus/DATASET/SIMPLE/Run/test_avbp_1iter_m4/tools/lprof_npsu_run_0 #
##########################################################################################################################################