* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712613)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 235.14376 +- 0.000001. Correct Result: 235.143764
Configuration
Number of Threads: 1
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 1086.96
Minimum kernel time: 0.0106314
Maximum kernel time: 0.0116609
Arithm. Mean kernel time: 0.0108695
Performance results
Total GFlops/s: 1.33339
Minimum GFlops/s: 1.24292
Maximum GFlops/s: 1.36327
Arithm. Mean GFlops/s: 1.33341
* Info: Process finished (host skylake, process 712613)
* Info: Dumping samples (host skylake, process 712613)
* Info: Dumping source info for callchain nodes (host skylake, process 712613)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712613)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_0 #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712749)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.20245 +- 0.000001. Correct Result: 233.202453
Configuration
Number of Threads: 2
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 561.626
Minimum kernel time: 0.00550001
Maximum kernel time: 0.0079923
Arithm. Mean kernel time: 0.00561613
Performance results
Total GFlops/s: 2.58063
Minimum GFlops/s: 1.81343
Maximum GFlops/s: 2.63517
Arithm. Mean GFlops/s: 2.58069
* Info: Process finished (host skylake, process 712749)
* Info: Dumping samples (host skylake, process 712749)
* Info: Dumping source info for callchain nodes (host skylake, process 712749)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712749)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_1 #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712850)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.42015 +- 0.000001. Correct Result: 234.420152
Configuration
Number of Threads: 4
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 292.985
Minimum kernel time: 0.00278983
Maximum kernel time: 0.0221535
Arithm. Mean kernel time: 0.00292976
Performance results
Total GFlops/s: 4.94684
Minimum GFlops/s: 0.654231
Maximum GFlops/s: 5.19512
Arithm. Mean GFlops/s: 4.947
* Info: Process finished (host skylake, process 712850)
* Info: Dumping samples (host skylake, process 712850)
* Info: Dumping source info for callchain nodes (host skylake, process 712850)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712850)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_2 #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 712938)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.66557 +- 0.000001. Correct Result: 233.665565
Configuration
Number of Threads: 8
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 156.983
Minimum kernel time: 0.00147383
Maximum kernel time: 0.00450785
Arithm. Mean kernel time: 0.0015697
Performance results
Total GFlops/s: 9.23255
Minimum GFlops/s: 3.21517
Maximum GFlops/s: 9.8339
Arithm. Mean GFlops/s: 9.23327
* Info: Process finished (host skylake, process 712938)
* Info: Dumping samples (host skylake, process 712938)
* Info: Dumping source info for callchain nodes (host skylake, process 712938)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 712938)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_3 #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 713023)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.58561 +- 0.000001. Correct Result: 234.585612
Configuration
Number of Threads: 16
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 86.2404
Minimum kernel time: 0.00082339
Maximum kernel time: 0.00680713
Arithm. Mean kernel time: 0.000862338
Performance results
Total GFlops/s: 16.8059
Minimum GFlops/s: 2.12916
Maximum GFlops/s: 17.6022
Arithm. Mean GFlops/s: 16.8072
* Info: Process finished (host skylake, process 713023)
* Info: Dumping samples (host skylake, process 713023)
* Info: Dumping source info for callchain nodes (host skylake, process 713023)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713023)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_4 #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 713108)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 234.05586 +- 0.000001. Correct Result: 234.055865
Configuration
Number of Threads: 26
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 70.8301
Minimum kernel time: 0.000689473
Maximum kernel time: 0.0041108
Arithm. Mean kernel time: 0.000708186
Performance results
Total GFlops/s: 20.4623
Minimum GFlops/s: 3.52571
Maximum GFlops/s: 21.0211
Arithm. Mean GFlops/s: 20.4657
* Info: Process finished (host skylake, process 713108)
* Info: Dumping samples (host skylake, process 713108)
* Info: Dumping source info for callchain nodes (host skylake, process 713108)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713108)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_5 #
###############################################################################################################################################
* Info: Selecting the 'perf-low-ppn' engine for node skylake
* Info: Process launched (host skylake, process 713205)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS
Correctness check
Success, correct result: 233.71083 +- 0.000001. Correct Result: 233.710830
Configuration
Number of Threads: 52
Number of Repetitions: 100000
Input filename: input-matrix/mat_dim_493039.txt
Time measurements
Total experiment time: 33.2772
Minimum kernel time: 0.000286154
Maximum kernel time: 0.0131419
Arithm. Mean kernel time: 0.000332461
Performance results
Total GFlops/s: 43.5538
Minimum GFlops/s: 1.10285
Maximum GFlops/s: 50.6492
Arithm. Mean GFlops/s: 43.5946
* Info: Process finished (host skylake, process 713205)
* Info: Dumping samples (host skylake, process 713205)
* Info: Dumping source info for callchain nodes (host skylake, process 713205)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 713205)
Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6
To display your profiling results:
###############################################################################################################################################
# LEVEL | REPORT | COMMAND #
###############################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/gcc_scala/tools/lprof_npsu_run_6 #
###############################################################################################################################################
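
The seven runs above form a thread-scaling sweep (1, 2, 4, 8, 16, 26 and 52 threads) over the same input matrix. As a reading aid, the following minimal sketch derives speedup and parallel efficiency relative to the 1-thread run from the "Arithm. Mean kernel time" values reported in the log. The names `MEAN_KERNEL_TIME` and `scaling_table` are illustrative only and are not part of the benchmark or of MAQAO; the choice of the mean kernel time as the scaling metric is an assumption.

```python
# Mean kernel times in seconds, copied from the "Arithm. Mean kernel time"
# lines of the log above (key = "Number of Threads").
MEAN_KERNEL_TIME = {
    1: 0.0108695,
    2: 0.00561613,
    4: 0.00292976,
    8: 0.0015697,
    16: 0.000862338,
    26: 0.000708186,
    52: 0.000332461,
}


def scaling_table(times):
    """Return (threads, speedup, efficiency) rows relative to the 1-thread run.

    speedup    = T(1) / T(n)
    efficiency = speedup / n
    """
    baseline = times[1]
    rows = []
    for threads in sorted(times):
        speedup = baseline / times[threads]
        rows.append((threads, speedup, speedup / threads))
    return rows


if __name__ == "__main__":
    for threads, speedup, efficiency in scaling_table(MEAN_KERNEL_TIME):
        print(f"{threads:>3} threads: speedup {speedup:6.2f}, "
              f"efficiency {efficiency:5.2f}")
```

The same calculation could be based on "Total experiment time" instead, but that figure may also include work outside the timed kernel, so the per-kernel mean is used here.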