* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17331)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17333)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17334)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17336)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17335)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17337)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17339)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 17338)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 3.330935 seconds
wall MFLOPS = 0.000000
cpu clock time = 77.771853 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.377537 seconds
wall MFLOPS = 0.000000
cpu clock time = 5.999735 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 16.819085 seconds
wall MFLOPS = 0.000000
cpu clock time = 285.300567 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 8.951141e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 57.198178 seconds
wall MFLOPS = 0.000000
cpu clock time = 1356.181045 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 6.053777e+09
Figure of Merit (FOM_1): 4.764111e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17335)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17331)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17334)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17339)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17333)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17336)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17337)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 17338)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0
To display your profiling results:
####################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
####################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/defaults/orig/oneview_results_1720192256/tools/lprof_npsu_run_0 #
####################################################################################################################################################################################################