* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177758)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177760)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177766)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177761)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177763)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177762)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177764)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177765)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 10.860650 seconds
wall MFLOPS = 0.000000
cpu clock time = 10.804403 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.305132 seconds
wall MFLOPS = 0.000000
cpu clock time = 0.303895 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 157.127129 seconds
wall MFLOPS = 0.000000
cpu clock time = 156.474658 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 9.581414e+07
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 177.169969 seconds
wall MFLOPS = 0.000000
cpu clock time = 176.319647 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 1.954423e+09
Figure of Merit (FOM_1): 1.489771e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177766)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177763)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177761)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177764)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177762)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177765)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177760)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177758)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_0 #
######################################################################################################################################################################################################
* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177905)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177907)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177908)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177909)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177910)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177911)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177912)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 177913)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 6.293998 seconds
wall MFLOPS = 0.000000
cpu clock time = 12.444888 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.361796 seconds
wall MFLOPS = 0.000000
cpu clock time = 0.614393 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 82.738578 seconds
wall MFLOPS = 0.000000
cpu clock time = 159.765172 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 1.819587e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 86.948617 seconds
wall MFLOPS = 0.000000
cpu clock time = 173.046443 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 3.982410e+09
Figure of Merit (FOM_1): 3.032297e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177905)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177908)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177907)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177913)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177912)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177911)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177910)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 177909)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_1 #
######################################################################################################################################################################################################
* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178054)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178056)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178057)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178058)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178063)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178064)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178062)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178065)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 4.105480 seconds
wall MFLOPS = 0.000000
cpu clock time = 16.154236 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.377390 seconds
wall MFLOPS = 0.000000
cpu clock time = 1.176807 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 46.471704 seconds
wall MFLOPS = 0.000000
cpu clock time = 170.009982 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 3.239606e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 74.048271 seconds
wall MFLOPS = 0.000000
cpu clock time = 294.666770 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 4.676206e+09
Figure of Merit (FOM_1): 3.588145e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178064)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178054)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178057)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178058)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178063)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178062)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178065)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178056)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_2 #
######################################################################################################################################################################################################
* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178230)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178232)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178234)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178233)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178235)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178237)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178236)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178238)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 3.429383 seconds
wall MFLOPS = 0.000000
cpu clock time = 26.600155 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.361996 seconds
wall MFLOPS = 0.000000
cpu clock time = 2.104455 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 27.844350 seconds
wall MFLOPS = 0.000000
cpu clock time = 186.816055 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 5.406842e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 59.943951 seconds
wall MFLOPS = 0.000000
cpu clock time = 476.749605 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 5.776479e+09
Figure of Merit (FOM_1): 4.467531e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178236)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178230)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178233)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178232)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178234)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178238)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178235)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178237)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_3 #
######################################################################################################################################################################################################
* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178475)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178477)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178479)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178478)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178480)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178481)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178482)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178483)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 3.318594 seconds
wall MFLOPS = 0.000000
cpu clock time = 51.720730 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.369164 seconds
wall MFLOPS = 0.000000
cpu clock time = 4.032432 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 19.299746 seconds
wall MFLOPS = 0.000000
cpu clock time = 232.205505 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 7.800621e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 62.102265 seconds
wall MFLOPS = 0.000000
cpu clock time = 984.816264 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 5.575723e+09
Figure of Merit (FOM_1): 4.376807e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178475)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178480)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178482)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178479)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178477)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178483)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178481)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178478)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_4 #
######################################################################################################################################################################################################
* Info: Detected 8 Lprof instances in gpu04sas.benchmarkcenter.megware.com.
If this is incorrect, rerun with number-processes-per-node=X
* Info: Selecting the 'perf-high-ppn' engine for node gpu04sas.benchmarkcenter.megware.com
* Info: "ref-cycles" not supported on gpu04sas.benchmarkcenter.megware.com: fallback to "cpu-clock"
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178839)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178841)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178847)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178842)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178843)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178845)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178846)
* Info: Process launched (host gpu04sas.benchmarkcenter.megware.com, process 178844)Running with these driver parameters:
solver ID = 1
Laplacian_27pt:
(Nx, Ny, Nz) = (400, 3200, 400)
(Px, Py, Pz) = (1, 8, 1)
=============================================
Generate Matrix:
=============================================
Spatial Operator:
wall clock time = 3.329805 seconds
wall MFLOPS = 0.000000
cpu clock time = 77.628359 seconds
cpu MFLOPS = 0.000000
RHS vector has unit components
Initial guess is 0
=============================================
IJ Vector Setup:
=============================================
RHS and Initial Guess:
wall clock time = 0.382659 seconds
wall MFLOPS = 0.000000
cpu clock time = 5.961078 seconds
cpu MFLOPS = 0.000000
=============================================
Problem 1: AMG Setup Time:
=============================================
PCG Setup:
wall clock time = 17.069933 seconds
wall MFLOPS = 0.000000
cpu clock time = 292.032369 seconds
cpu MFLOPS = 0.000000
FOM_Setup: nnz_AP / Setup Phase Time: 8.819601e+08
=============================================
Problem 1: AMG-PCG Solve Time:
=============================================
PCG Solve:
wall clock time = 57.251992 seconds
wall MFLOPS = 0.000000
cpu clock time = 1358.214839 seconds
cpu MFLOPS = 0.000000
Iterations = 23
Final Relative Residual Norm = 9.722267e-09
FOM_Solve: nnz_AP * Iterations / Solve Phase Time: 6.048086e+09
Figure of Merit (FOM_1): 4.756555e+09
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178847)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178845)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178844)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178841)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178846)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178839)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178843)
* Info: Process finished (host gpu04sas.benchmarkcenter.megware.com, process 178842)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/172-019-1763/intel/AMG/run/oneview_runs/multicore/icx_9/oneview_results_1720211326/tools/lprof_npsu_run_5 #
######################################################################################################################################################################################################