* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 14336)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.04887 +- 0.000001. Correct Result: 234.048872
Configuration
Number of Threads: 1
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 592.08
Minimum kernel time: 0.00589174
Maximum kernel time: 0.00665916
Arithm. Mean kernel time: 0.00592068
Performance results
Total GFlops/s: 2.44789
Minimum GFlops/s: 2.17648
Maximum GFlops/s: 2.45997
Arithm. Mean GFlops/s: 2.44795
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 14336)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 14336)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 14336)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 14336)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_0 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 14591)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.59921 +- 0.000001. Correct Result: 233.599206
Configuration
Number of Threads: 2
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 320.312
Minimum kernel time: 0.00316651
Maximum kernel time: 0.00430069
Arithm. Mean kernel time: 0.00320301
Performance results
Total GFlops/s: 4.5248
Minimum GFlops/s: 3.37004
Maximum GFlops/s: 4.57712
Arithm. Mean GFlops/s: 4.52496
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 14591)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 14591)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 14591)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 14591)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_1 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 14834)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.94766 +- 0.000001. Correct Result: 233.947659
Configuration
Number of Threads: 4
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 208.684
Minimum kernel time: 0.00207006
Maximum kernel time: 0.00267596
Arithm. Mean kernel time: 0.00208676
Performance results
Total GFlops/s: 6.9452
Minimum GFlops/s: 5.41619
Maximum GFlops/s: 7.00147
Arithm. Mean GFlops/s: 6.94546
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 14834)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 14834)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 14834)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 14834)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_2 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15070)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.17492 +- 0.000001. Correct Result: 234.174919
Configuration
Number of Threads: 8
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 135.675
Minimum kernel time: 0.00134281
Maximum kernel time: 0.00191158
Arithm. Mean kernel time: 0.00135669
Performance results
Total GFlops/s: 10.6825
Minimum GFlops/s: 7.58196
Maximum GFlops/s: 10.7934
Arithm. Mean GFlops/s: 10.683
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15070)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15070)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15070)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15070)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_3 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15306)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.56038 +- 0.000001. Correct Result: 233.560378
Configuration
Number of Threads: 16
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 121.377
Minimum kernel time: 0.000676864
Maximum kernel time: 0.002936
Arithm. Mean kernel time: 0.00121371
Performance results
Total GFlops/s: 11.9409
Minimum GFlops/s: 4.93647
Maximum GFlops/s: 21.4127
Arithm. Mean GFlops/s: 11.9415
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15306)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15306)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15306)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15306)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_4 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15551)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.47951 +- 0.000001. Correct Result: 234.479514
Configuration
Number of Threads: 32
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 51.365
Minimum kernel time: 0.000371371
Maximum kernel time: 0.00409776
Arithm. Mean kernel time: 0.000513581
Performance results
Total GFlops/s: 28.2167
Minimum GFlops/s: 3.53693
Maximum GFlops/s: 39.027
Arithm. Mean GFlops/s: 28.2204
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15551)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15551)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15551)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15551)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_5 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 15806)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.56348 +- 0.000001. Correct Result: 233.563482
Configuration
Number of Threads: 52
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 35.5334
Minimum kernel time: 0.00020388
Maximum kernel time: 0.00993908
Arithm. Mean kernel time: 0.000355245
Performance results
Total GFlops/s: 40.7884
Minimum GFlops/s: 1.45823
Maximum GFlops/s: 71.0884
Arithm. Mean GFlops/s: 40.7986
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 15806)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 15806)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 15806)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 15806)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_6 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 16081)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.62510 +- 0.000001. Correct Result: 235.625101
Configuration
Number of Threads: 104
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 7.89645
Minimum kernel time: 6.4954e-05
Maximum kernel time: 0.00669264
Arithm. Mean kernel time: 7.88913e-05
Performance results
Total GFlops/s: 183.544
Minimum GFlops/s: 2.16559
Maximum GFlops/s: 223.135
Arithm. Mean GFlops/s: 183.715
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 16081)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 16081)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 16081)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 16081)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_7 #
##################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node ifcp01.benchmarkcenter.megware.com
* Info: Process launched (host ifcp01.benchmarkcenter.megware.com, process 16411)reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.85291 +- 0.000001. Correct Result: 234.852910
Configuration
Number of Threads: 208
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 9.13867
Minimum kernel time: 7.5579e-05
Maximum kernel time: 0.00740341
Arithm. Mean kernel time: 9.11083e-05
Performance results
Total GFlops/s: 158.595
Minimum GFlops/s: 1.95768
Maximum GFlops/s: 191.766
Arithm. Mean GFlops/s: 159.08
* Info: Process finished (host ifcp01.benchmarkcenter.megware.com, process 16411)
* Info: Dumping samples (host ifcp01.benchmarkcenter.megware.com, process 16411)
* Info: Dumping source info for callchain nodes (host ifcp01.benchmarkcenter.megware.com, process 16411)
* Info: Building/writing metadata (host ifcp01.benchmarkcenter.megware.com)
* Info: Finished collect step (host ifcp01.benchmarkcenter.megware.com, process 16411)
Your experiment path is /home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8
To display your profiling results:
##################################################################################################################################
# LEVEL | REPORT | COMMAND #
##################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/epi-spmxv-main/gcc_ov1_scala_ofast/tools/lprof_npsu_run_8 #
##################################################################################################################################