* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170255)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170258)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170257)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170259)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170262)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170261)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170260)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 170263)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 3.339041 seconds
wall MFLOPS = 0.000000
cpu clock time = 77.951675 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.377322 seconds
wall MFLOPS = 0.000000
cpu clock time = 5.932266 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 17.050122 seconds
wall MFLOPS = 0.000000
cpu clock time = 291.553719 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 8.829849e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 56.698744 seconds
wall MFLOPS = 0.000000
cpu clock time = 1344.512809 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 6.107102e+09
Figure of Merit (FOM_1): 4.801073e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170259)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170260)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170261)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170258)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170257)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170262)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170263)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 170255)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/compilers/icx_9/oneview_results_1720203326/tools/lprof_npsu_run_0 #
######################################################################################################################################################################################################