Run o1 | Number processes: 1Number nodes: 1Run Command: <executable> -t 1 -c -f input-matrix/mat_dim_493039.txt -r 100000MPI Command: Dataset: Run Directory: /home/kcamus/epi-spmxv-main |
---|---|
Run o2 | Run Command: <executable> -t 2 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 2 |
Run o4 | Run Command: <executable> -t 4 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 4 |
Run o8 | Run Command: <executable> -t 8 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 8 |
Run o16 | Run Command: <executable> -t 16 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 16 |
Run o32 | Run Command: <executable> -t 32 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 32 |
Run o52 | Run Command: <executable> -t 52 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 52 |
Run o104 | Run Command: <executable> -t 104 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 104 |
Run o208 | Run Command: <executable> -t 208 -c -f input-matrix/mat_dim_493039.txt -r 100000OMP_NUM_THREADS: 208 |
Loop id | Source Location | Source Function | Level | Exclusive coverage o1 (%) | Exclusive coverage o2 (%) | Exclusive coverage o4 (%) | Exclusive coverage o8 (%) | Exclusive coverage o16 (%) | Exclusive coverage o32 (%) | Exclusive coverage o52 (%) | Exclusive coverage o104 (%) | Exclusive coverage o208 (%) | Inclusive coverage o1 (%) | Inclusive coverage o2 (%) | Inclusive coverage o4 (%) | Inclusive coverage o8 (%) | Inclusive coverage o16 (%) | Inclusive coverage o32 (%) | Inclusive coverage o52 (%) | Inclusive coverage o104 (%) | Inclusive coverage o208 (%) | Max Exclusive Time Over Threads o1 (s) | Max Exclusive Time Over Threads o2 (s) | Max Exclusive Time Over Threads o4 (s) | Max Exclusive Time Over Threads o8 (s) | Max Exclusive Time Over Threads o16 (s) | Max Exclusive Time Over Threads o32 (s) | Max Exclusive Time Over Threads o52 (s) | Max Exclusive Time Over Threads o104 (s) | Max Exclusive Time Over Threads o208 (s) | Max Inclusive Time Over Threads o1 (s) | Max Inclusive Time Over Threads o2 (s) | Max Inclusive Time Over Threads o4 (s) | Max Inclusive Time Over Threads o8 (s) | Max Inclusive Time Over Threads o16 (s) | Max Inclusive Time Over Threads o32 (s) | Max Inclusive Time Over Threads o52 (s) | Max Inclusive Time Over Threads o104 (s) | Max Inclusive Time Over Threads o208 (s) | Exclusive Time w.r.t. Wall Time o1 (s) | Exclusive Time w.r.t. Wall Time o2 (s) | Exclusive Time w.r.t. Wall Time o4 (s) | Exclusive Time w.r.t. Wall Time o8 (s) | Exclusive Time w.r.t. Wall Time o16 (s) | Exclusive Time w.r.t. Wall Time o32 (s) | Exclusive Time w.r.t. Wall Time o52 (s) | Exclusive Time w.r.t. Wall Time o104 (s) | Exclusive Time w.r.t. Wall Time o208 (s) | Inclusive Time w.r.t. Wall Time o1 (s) | Inclusive Time w.r.t. Wall Time o2 (s) | Inclusive Time w.r.t. Wall Time o4 (s) | Inclusive Time w.r.t. Wall Time o8 (s) | Inclusive Time w.r.t. Wall Time o16 (s) | Inclusive Time w.r.t. Wall Time o32 (s) | Inclusive Time w.r.t. Wall Time o52 (s) | Inclusive Time w.r.t. Wall Time o104 (s) | Inclusive Time w.r.t. Wall Time o208 (s) | Nb Threads o1 | Nb Threads o2 | Nb Threads o4 | Nb Threads o8 | Nb Threads o16 | Nb Threads o32 | Nb Threads o52 | Nb Threads o104 | Nb Threads o208 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing o1 | Speedup If Perfect Load Balancing o2 | Speedup If Perfect Load Balancing o4 | Speedup If Perfect Load Balancing o8 | Speedup If Perfect Load Balancing o16 | Speedup If Perfect Load Balancing o32 | Speedup If Perfect Load Balancing o52 | Speedup If Perfect Load Balancing o104 | Speedup If Perfect Load Balancing o208 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (o1) Efficiency | (o1) Potential Speed-Up (%) | (o2) Efficiency | (o2) Potential Speed-Up (%) | (o4) Efficiency | (o4) Potential Speed-Up (%) | (o8) Efficiency | (o8) Potential Speed-Up (%) | (o16) Efficiency | (o16) Potential Speed-Up (%) | (o32) Efficiency | (o32) Potential Speed-Up (%) | (o52) Efficiency | (o52) Potential Speed-Up (%) | (o104) Efficiency | (o104) Potential Speed-Up (%) | (o208) Efficiency | (o208) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5 | spmxv.exe - main.cpp:72-74 | spmxv(ooo_options*, ooo_input*) [clone .extracted] | Innermost | 63.01 | 64.19 | 67.18 | 70.14 | 69.34 | 54.09 | 56.38 | 35.15 | 32.49 | 63.01 | 64.19 | 67.18 | 70.14 | 69.34 | 54.09 | 56.38 | 35.15 | 32.49 | 372.99 | 206.73 | 141.29 | 97.17 | 55.43 | 32.24 | 17.85 | 2.88 | 2.55 | 372.99 | 206.73 | 141.29 | 97.17 | 55.43 | 32.24 | 17.85 | 2.88 | 2.55 | 372.99 | 206.98 | 141.40 | 97.07 | 55.57 | 28.71 | 16.20 | 2.57 | 2.17 | 372.99 | 206.98 | 141.40 | 97.07 | 55.57 | 28.71 | 16.20 | 2.57 | 2.17 | 1 | 2 | 4 | 8 | 16 | 32 | 52 | 104 | 208 | 100 | 37.5 | 1 | 1.52 | 3.2 | 1 | 1 | 1 | 1.01 | 1.01 | 1.15 | 1.16 | 1.38 | 1.49 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 0.9 | 6.35 | 0.66 | 22.88 | 0.48 | 36.45 | 0.42 | 40.25 | 0.41 | 32.13 | 0.44 | 31.41 | 1.39 | 0 | 0.83 | 5.62 |
4 | spmxv.exe - main.cpp:72-74 | spmxv(ooo_options*, ooo_input*) [clone .extracted] | Innermost | 21.21 | 20.68 | 20.18 | 19.66 | 20.10 | 22.75 | 20.09 | 16.49 | 13.91 | 21.21 | 20.68 | 20.18 | 19.66 | 20.10 | 22.75 | 20.09 | 16.49 | 13.91 | 125.54 | 67.31 | 43.13 | 27.40 | 16.24 | 15.51 | 6.66 | 1.23 | 0.92 | 125.54 | 67.31 | 43.13 | 27.40 | 16.24 | 15.51 | 6.66 | 1.23 | 0.92 | 125.54 | 66.68 | 42.47 | 27.21 | 16.11 | 12.08 | 5.77 | 1.21 | 0.93 | 125.54 | 66.68 | 42.47 | 27.21 | 16.11 | 12.08 | 5.77 | 1.21 | 0.93 | 1 | 2 | 4 | 8 | 16 | 32 | 52 | 104 | 208 | 0 | 12.5 | 1 | 2.91 | 8 | 1 | 1.01 | 1.02 | 1.02 | 1.02 | 1.32 | 1.21 | 1.26 | 1.26 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 0.94 | 1.21 | 0.74 | 5.27 | 0.58 | 8.33 | 0.49 | 10.31 | 0.32 | 15.36 | 0.42 | 11.69 | 1 | -0 | 0.65 | 4.87 |
3 | spmxv.exe - main.cpp:66-77 | spmxv(ooo_options*, ooo_input*) [clone .extracted] | InBetween | 15.49 | 14.59 | 12.11 | 8.90 | 8.31 | 7.43 | 9.47 | 19.13 | 21.10 | 99.70 | 99.46 | 99.46 | 98.71 | 97.75 | 84.27 | 85.94 | 70.78 | 67.50 | 91.69 | 47.38 | 25.54 | 12.74 | 6.89 | 4.78 | 3.01 | 1.36 | 1.38 | 590.22 | 320.52 | 208.53 | 136.28 | 77.55 | 51.35 | 26.90 | 5.20 | 4.59 | 91.69 | 47.04 | 25.49 | 12.32 | 6.66 | 3.94 | 2.72 | 1.40 | 1.41 | 590.22 | 320.70 | 209.37 | 136.60 | 78.35 | 44.73 | 24.69 | 5.18 | 4.51 | 1 | 2 | 4 | 8 | 16 | 32 | 52 | 104 | 208 | 30.65 | 15.87 | 1.91 | 1.92 | 11.25 | 1 | 1.01 | 1.01 | 1.04 | 1.05 | 1.24 | 1.16 | 1.2 | 1.24 | 0.5 | 0.5 | 1.5 | 3.5 | 0.5 | 1 | 0 | 0.97 | 0.37 | 0.9 | 1.22 | 0.93 | 0.62 | 0.86 | 1.16 | 0.73 | 2.03 | 0.65 | 3.33 | 0.63 | 7.09 | 0.31 | 14.5 |
78 | spmxv.exe - ooo_cmdline.h:83-97 [...] | void load_drops_matlab_matrix<double, int>(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, int*&, int*&, double*&, int&, int&, int&) | Single | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.01 | 0.03 | 0.02 | 0.04 | 0.03 | 0.02 | 0.03 | 0.05 | 0.02 | 0.01 | 0.03 | 0.02 | 0.04 | 0.03 | 0.02 | 0.03 | 0.05 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 10.82 | 2.84 | 1 | 34.31 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2.5 | 0 | 1 | 1 | 0 | 1.33 | -0 | 0.57 | 0 | 0.99 | 0 | 0.56 | 0 | 0.65 | 0 | 0.76 | 0 | 0.54 | 0 | 0.32 | 0 |
77 | spmxv.exe - ooo_cmdline.cpp:171-173 | print_error_check(double*, ooo_input*, ooo_options*) | Innermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 1 | 1 | 1 | 100 | 37.5 | 1 | 1.52 | 3.2 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0.99 | 0 | 0.48 | 0 | 0.81 | 0 | ||||||||||||||||||||||||||||
75 | spmxv.exe - ooo_cmdline.cpp:165-175 | print_error_check(double*, ooo_input*, ooo_options*) | Outermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 1 | 1 | 26.7 | 14.84 | 1.61 | 3.34 | 11.05 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1.25 | 0 | 0 | 2.5 | 1.5 | 1 | 0 | 0.98 | 0 | |||||||||||||||||||||||||||||||||||||||||||||||||
76 | spmxv.exe - ooo_cmdline.cpp:171-173 | print_error_check(double*, ooo_input*, ooo_options*) | Innermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 1 | 0 | 12.5 | 1 | 2.91 | 8 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 2 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | ||||||||||||||||||||||||||||||||||||||||||
2 | spmxv.exe - main.cpp:65-77 | spmxv(ooo_options*, ooo_input*) [clone .extracted] | Outermost | 0.00 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.04 | 0.05 | 0.00 | 99.49 | 99.50 | 98.72 | 97.77 | 84.29 | 85.96 | 70.81 | 67.55 | 0.10 | 0.08 | 0.04 | 0.03 | 0.04 | 0.02 | 0.02 | 0.02 | 320.61 | 208.59 | 136.29 | 77.56 | 51.06 | 26.91 | 5.07 | 4.60 | 0.09 | 0.06 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 320.79 | 209.43 | 136.62 | 78.37 | 44.74 | 24.70 | 5.18 | 4.51 | 2 | 4 | 8 | 13 | 26 | 35 | 37 | 85 | 0 | 7.69 | 1 | 1 | 14.29 | 0 | 1.08 | 1.25 | 1.78 | 1.75 | 2.6 | 2.86 | 2.36 | 3.04 | 1 | 1 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||
74 | spmxv.exe - ooo_cmdline.cpp:180-181 | print_error_check(double*, ooo_input*, ooo_options*) | Single | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 100 | 50 | 1 | 1.78 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
71 | spmxv.exe - ooo_cmdline.cpp:11-16 | print_performance_results(ooo_options*, double, double, timespan*, ooo_input*) | Single | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 100 | 40 | 1 | 1.27 | 2.8 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 | spmxv.exe - main.cpp:60-79 [...] | spmxv(ooo_options*, ooo_input*) | Single | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 1 | 0 | 8.55 | 5.29 | 1 | 13.82 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0 | ||||||||||||||||||||||||||||||||||||||||||
8 | spmxv.exe - main.cpp:43-54 | spmxv(ooo_options*, ooo_input*) [clone .extracted.9] | Innermost | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 9.77 | 1 | 1 | 15.19 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 |