
Executable Output


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 706528)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.88004 +- 0.000001. Correct Result: 234.880041

Configuration              
Number of Threads:         1
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     1093.33
Minimum kernel time:       0.0108221
Maximum kernel time:       0.019305
Arithm. Mean kernel time:  0.0109333

Performance results        
Total GFlops/s:            1.32562
Minimum GFlops/s:          0.750764
Maximum GFlops/s:          1.33925
Arithm. Mean GFlops/s:     1.32563


* Info: Process finished (host skylake, process 706528)
* Info: Dumping samples (host skylake, process 706528)
* Info: Dumping source info for callchain nodes (host skylake, process 706528)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706528)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0

To display your profiling results:
######################################################################################################################################################
#    LEVEL    |     REPORT     |                                                       COMMAND                                                       #
######################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_0  #
######################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 706660)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.43286 +- 0.000001. Correct Result: 234.432862

Configuration              
Number of Threads:         2
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     627.395
Minimum kernel time:       0.00603509
Maximum kernel time:       0.0211191
Arithm. Mean kernel time:  0.00627379

Performance results        
Total GFlops/s:            2.31011
Minimum GFlops/s:          0.686274
Maximum GFlops/s:          2.40154
Arithm. Mean GFlops/s:     2.31017


* Info: Process finished (host skylake, process 706660)
* Info: Dumping samples (host skylake, process 706660)
* Info: Dumping source info for callchain nodes (host skylake, process 706660)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706660)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1

To display your profiling results:
######################################################################################################################################################
#    LEVEL    |     REPORT     |                                                       COMMAND                                                       #
######################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_1  #
######################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 706764)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.59205 +- 0.000001. Correct Result: 234.592049

Configuration              
Number of Threads:         4
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     304.998
Minimum kernel time:       0.00294995
Maximum kernel time:       0.0110881
Arithm. Mean kernel time:  0.00304989

Performance results        
Total GFlops/s:            4.752
Minimum GFlops/s:          1.30712
Maximum GFlops/s:          4.91313
Arithm. Mean GFlops/s:     4.75213


* Info: Process finished (host skylake, process 706764)
* Info: Dumping samples (host skylake, process 706764)
* Info: Dumping source info for callchain nodes (host skylake, process 706764)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706764)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2

To display your profiling results:
######################################################################################################################################################
#    LEVEL    |     REPORT     |                                                       COMMAND                                                       #
######################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_2  #
######################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 706863)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 234.21198 +- 0.000001. Correct Result: 234.211983

Configuration              
Number of Threads:         8
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     161.584
Minimum kernel time:       0.00153303
Maximum kernel time:       0.00894308
Arithm. Mean kernel time:  0.00161572

Performance results        
Total GFlops/s:            8.96961
Minimum GFlops/s:          1.62064
Maximum GFlops/s:          9.45414
Arithm. Mean GFlops/s:     8.97031


* Info: Process finished (host skylake, process 706863)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 706863)
* Info: Dumping source info for callchain nodes (host skylake, process 706863)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706863)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3

To display your profiling results:
######################################################################################################################################################
#    LEVEL    |     REPORT     |                                                       COMMAND                                                       #
######################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_3  #
######################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 706950)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 232.04590 +- 0.000001. Correct Result: 232.045902

Configuration              
Number of Threads:         16
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     79.459
Minimum kernel time:       0.000757933
Maximum kernel time:       0.00465798
Arithm. Mean kernel time:  0.000794509

Performance results        
Total GFlops/s:            18.2402
Minimum GFlops/s:          3.11154
Maximum GFlops/s:          19.1224
Arithm. Mean GFlops/s:     18.2421


* Info: Process finished (host skylake, process 706950)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 706950)
* Info: Dumping source info for callchain nodes (host skylake, process 706950)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 706950)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4

To display your profiling results:
######################################################################################################################################################
#    LEVEL    |     REPORT     |                                                       COMMAND                                                       #
######################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_4  #
######################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 707036)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.04515 +- 0.000001. Correct Result: 233.045148

Configuration              
Number of Threads:         26
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     53.072
Minimum kernel time:       0.000494003
Maximum kernel time:       0.00719094
Arithm. Mean kernel time:  0.000530647

Performance results        
Total GFlops/s:            27.3091
Minimum GFlops/s:          2.01552
Maximum GFlops/s:          29.3389
Arithm. Mean GFlops/s:     27.3129


* Info: Process finished (host skylake, process 707036)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 707036)
* Info: Dumping source info for callchain nodes (host skylake, process 707036)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 707036)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5

To display your profiling results:
######################################################################################################################################################
#    LEVEL    |     REPORT     |                                                       COMMAND                                                       #
######################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_5  #
######################################################################################################################################################


* Info: Selecting the 'perf-low-ppn' engine for node skylake

* Info: Process launched (host skylake, process 707132)
reading matrix in matlab format from input-matrix/mat_dim_493039.txt
Loaded Matrix and random RHS

Correctness check
Success, correct result: 233.89933 +- 0.000001. Correct Result: 233.899329

Configuration              
Number of Threads:         52
Number of Repetitions:     100000
Input filename:            input-matrix/mat_dim_493039.txt

Time measurements          
Total experiment time:     29.832
Minimum kernel time:       0.000257015
Maximum kernel time:       0.0319731
Arithm. Mean kernel time:  0.000298199

Performance results        
Total GFlops/s:            48.5837
Minimum GFlops/s:          0.453302
Maximum GFlops/s:          56.3916
Arithm. Mean GFlops/s:     48.6034


* Info: Process finished (host skylake, process 707132)
* Info: Callchains info will be incomplete
* Info: Try to recompile your application with -fno-omit-frame-pointer or to rerun with btm=stack
* Info: Dumping samples (host skylake, process 707132)
* Info: Dumping source info for callchain nodes (host skylake, process 707132)
* Info: Building/writing metadata (host skylake)
* Info: Finished collect step (host skylake, process 707132)

Your experiment path is /home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6

To display your profiling results:
######################################################################################################################################################
#    LEVEL    |     REPORT     |                                                       COMMAND                                                       #
######################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/kcamus/POP/POP2_miniapp/spmxv/epi-spmxv-main/icx_scala_prompt/tools/lprof_npsu_run_6  #
######################################################################################################################################################
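
The seven runs above use the same matrix and repetition count with 1, 2, 4, 8, 16, 26 and 52 threads, so they can be read as a strong-scaling series. The following is a minimal Python sketch, not part of the benchmark or of MAQAO, that derives speedup and parallel efficiency from the reported arithmetic mean kernel times. The names mean_kernel_time and scaling_table are illustrative, and the times are assumed to be in seconds, since the log does not state units.

```python
# Arithmetic mean kernel times copied from the runs above, keyed by thread count.
# Units are not stated in the log; seconds are assumed here.
mean_kernel_time = {
    1: 0.0109333,
    2: 0.00627379,
    4: 0.00304989,
    8: 0.00161572,
    16: 0.000794509,
    26: 0.000530647,
    52: 0.000298199,
}


def scaling_table(times):
    """Return (threads, speedup, efficiency) tuples relative to the 1-thread run."""
    baseline = times[1]
    rows = []
    for threads in sorted(times):
        speedup = baseline / times[threads]
        # Parallel efficiency: fraction of ideal linear speedup achieved.
        rows.append((threads, speedup, speedup / threads))
    return rows


if __name__ == "__main__":
    print(f"{'threads':>7} {'speedup':>8} {'efficiency':>10}")
    for threads, speedup, efficiency in scaling_table(mean_kernel_time):
        print(f"{threads:>7} {speedup:>8.2f} {efficiency:>10.2f}")
```

With these inputs the 52-thread run comes out at a speedup of roughly 36.7 over the single-thread run, which is about 70% parallel efficiency. The maximum kernel times are not used here; they vary widely between runs and would need a separate look.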
