* Info: Detected 10 Lprof instances in otterfall.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node otterfall
* Info: Process launched (host otterfall, process 93356)
* Info: Process launched (host otterfall, process 93357)
* Info: Process launched (host otterfall, process 93359)
* Info: Process launched (host otterfall, process 93362)
* Info: Process launched (host otterfall, process 93364)
* Info: Process launched (host otterfall, process 93366)
* Info: Process launched (host otterfall, process 93368)
* Info: Process launched (host otterfall, process 93370)
* Info: Process launched (host otterfall, process 93372)
* Info: Process launched (host otterfall, process 93373)
oooo .o8
888 888
888 888oooo. .ooooo.
888 d88 88b d88 Y8
888 888 888 888
888 888 888 888 .o8
o888o `Y8bod8P `Y8bod8P
Lattice Boltzmann Kernel
PARAMETERS
-----------------------------
Problem: cavity
prc%np 1 1 10
umax 0.0100
Re 207.3600
Re_acc 11971.9362
nu 0.0185
omega 1.80000
Problem size 256 256 384
Max Tstep 100
Out Tstep 100
Ramp until -1
Beg Out at 1
Init until 1
Model: D3Q19
Relaxation is: bgk
----------------------------------------------
Rank 0 Domain size: 256 256 39 mpi-crd 0 0 0
Rank 1 Domain size: 256 256 39 mpi-crd 0 0 1
Rank 2 Domain size: 256 256 39 mpi-crd 0 0 2
Rank 3 Domain size: 256 256 39 mpi-crd 0 0 3
Rank 4 Domain size: 256 256 39 mpi-crd 0 0 4
Rank 5 Domain size: 256 256 39 mpi-crd 0 0 5
Rank 6 Domain size: 256 256 39 mpi-crd 0 0 6
Rank 7 Domain size: 256 256 39 mpi-crd 0 0 7
Rank 8 Domain size: 256 256 39 mpi-crd 0 0 8
Rank 9 Domain size: 256 256 33 mpi-crd 0 0 9
Memory allocated per thread [MB] 553.551
Additional Memory for master [MB] 0.000
Total Memory needed [MB] 5535.507
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Default case: Set complete domain to rho=1, u=0
Setting Boundary conditions resulted in
# NRBC: 0 # Periodic: 0
assigning zero field...
assigning zero field...
assigning zero field...
assigning zero field...
assigning zero field...
assigning zero field...
assigning zero field...
assigning zero field...
assigning zero field...
--------------------------------------
Starting main time loop
assigning zero field...
Done.
Done.
Done.
Done.
Done.
Done.
Done.
Done.
Done.
Done.
------------------------
Initial Relaxation done.
Here, a Norm calculation should be implemented to check, if pressure field has
converged
This routine must be called before the set_bnd of the current iteration.
Setting Zero-values for nrbc to current values...
Total density: 24645111.469818 at timestep 100
--------------------------------------
Statistics
Communication method: sendrecvR
Memory layout of densities: lijk
Max. Total wall time: 0.309E+02
Mean Total duration: 30.890 s (th0: 30.888 )
initialization duration .236E+01 s
Communication duration 3.321 s
buffer copy 0.848 s 25.533 % ( 0.48E+00 0.98E+00 )
buffer exchange 1.512 s 45.527 % ( 0.57E+00 0.59E+01 )
synchronization 0.000 s 0.000 % ( 0.00E+00 0.00E+00 )
buffer copy back 0.961 s 28.926 % ( 0.46E+00 0.12E+01 )
Data size 479.260800 MB min,****** max******
Data rate 841.862951MB/s mean 316.995894 MB/s
Communication duration 3.321 s 10.751 % ( 0.26E+01 0.69E+01 )
Set boundary duration 0.001 s 0.003 % ( 0.56E-03 0.00E+00 )
Calc fEq duration 0.000 s 0.000 %
Collision duration 27.426 s 88.786 % ( 0.24E+02 0.28E+02 )
Propagation duration 0.000 s 0.000 % ( 0.00E+00 0.00E+00 )
Bounceback duration 0.000 s 0.000 %
Calcmacr vals duration 0.000 s 0.000 %
Performance [MLUPs] 81.469 MLUPs
Timings of all processes
Rk, set_bnd, coll, comm, prop, bncb
0 0.00087948 27.98144064 2.76122951 0.00000000 0.00000000
1 0.00077178 27.68968850 3.06420675 0.00000000 0.00000000
2 0.00078727 27.71070738 3.04458997 0.00000000 0.00000000
3 0.00074971 28.16198132 2.59489664 0.00000000 0.00000000
4 0.00071285 27.70809839 3.04981463 0.00000000 0.00000000
5 0.00078570 27.83179802 2.92856415 0.00000000 0.00000000
6 0.00076867 27.76778723 2.99213912 0.00000000 0.00000000
7 0.00106527 27.83557321 2.91579479 0.00000000 0.00000000
8 0.00100256 27.77597321 2.97056013 0.00000000 0.00000000
9 0.00055643 23.79833029 6.88706611 0.00000000 0.00000000
... done!
* Info: Process finished (host otterfall, process 93356)
* Info: Process finished (host otterfall, process 93359)
* Info: Process finished (host otterfall, process 93373)
* Info: Process finished (host otterfall, process 93366)
* Info: Process finished (host otterfall, process 93372)
* Info: Process finished (host otterfall, process 93370)
* Info: Process finished (host otterfall, process 93362)
* Info: Process finished (host otterfall, process 93364)
* Info: Process finished (host otterfall, process 93357)
* Info: Process finished (host otterfall, process 93368)
Your experiment path is /home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0
To display your profiling results:
######################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP3/lbm/maqao_runs/lbc_ofast_intel_m10_x256-y256-z384/tools/lprof_npsu_run_0 #
######################################################################################################################################################