Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
PrOMPT - Coverage per Parallel Efficiency |
PrOMPT - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼m1o1 | 180.83 | 0 | 0 | 0 | 70.95 | 0.04 | 0 | 0 | 0 | 8.42 | 0 | 0 | 20.59 | 0 |
▼Node skylake | 180.83 | 0 | 0 | 0 | 70.95 | 0.04 | 0 | 0 | 0 | 8.42 | 0 | 0 | 20.59 | 0 |
▼Process 2441632 | 180.83 | 0 | 0 | 0 | 70.95 | 0.04 | 0 | 0 | 0 | 8.42 | 0 | 0 | 20.59 | 0 |
○Thread 2441632 | 180.83 | 0 | 0 | 0 | 70.95 | 0.04 | 0 | 0 | 0 | 8.42 | 0 | 0 | 20.59 | 0 |
▼m1o2 | 99.33 | 0 | 0 | 7.25 | 65 | 0.04 | 0 | 0 | 0 | 7.89 | 0 | 0 | 19.82 | 0 |
▼Node skylake | 99.33 | 0 | 0 | 7.25 | 65 | 0.04 | 0 | 0 | 0 | 7.89 | 0 | 0 | 19.82 | 0 |
▼Process 2441702 | 99.33 | 0 | 0 | 7.25 | 65 | 0.04 | 0 | 0 | 0 | 7.89 | 0 | 0 | 19.82 | 0 |
○Thread 2441702 | 99.31 | 0 | 0 | 0.07 | 65.17 | 0.05 | 0 | 0 | 0 | 11.85 | 0 | 0 | 22.85 | 0.01 |
○Thread 2441755 | 99.33 | 0 | 0 | 14.42 | 64.83 | 0.04 | 0 | 0 | 0 | 3.93 | 0 | 0 | 16.78 | 0 |
▼m1o4 | 62.49 | 0 | 0 | 21.82 | 53.7 | 0.05 | 0 | 0 | 0 | 6.94 | 0 | 0 | 17.5 | 0 |
▼Node skylake | 62.49 | 0 | 0 | 21.82 | 53.7 | 0.05 | 0 | 0 | 0 | 6.94 | 0 | 0 | 17.5 | 0 |
▼Process 2441773 | 62.49 | 0 | 0 | 21.82 | 53.7 | 0.05 | 0 | 0 | 0 | 6.94 | 0 | 0 | 17.5 | 0 |
○Thread 2441773 | 62.49 | 0 | 0 | 6.96 | 52.26 | 0.09 | 0 | 0 | 0 | 16.24 | 0 | 0 | 24.44 | 0.01 |
○Thread 2441826 | 61.62 | 0 | 0 | 23.85 | 56.55 | 0.02 | 0 | 0 | 0 | 4.74 | 0 | 0 | 14.84 | 0 |
○Thread 2441827 | 60.84 | 0 | 0 | 28.39 | 52.98 | 0.04 | 0 | 0 | 0 | 3.17 | 0 | 0 | 15.41 | 0 |
○Thread 2441828 | 60.84 | 0 | 0 | 28.44 | 53.01 | 0.04 | 0 | 0 | 0 | 3.37 | 0 | 0 | 15.14 | 0 |
▼m1o8 | 40.71 | 0 | 0 | 34.46 | 42.48 | 0.07 | 0 | 0 | 0 | 6.53 | 0 | 0 | 16.46 | 0 |
▼Node skylake | 40.71 | 0 | 0 | 34.46 | 42.48 | 0.07 | 0 | 0 | 0 | 6.53 | 0 | 0 | 16.46 | 0 |
▼Process 2441839 | 40.71 | 0 | 0 | 34.46 | 42.48 | 0.07 | 0 | 0 | 0 | 6.53 | 0 | 0 | 16.46 | 0 |
○Thread 2441839 | 40.71 | 0 | 0 | 6.68 | 41.24 | 0.2 | 0 | 0 | 0 | 22.98 | 0 | 0 | 28.89 | 0.01 |
○Thread 2441892 | 40.25 | 0 | 0 | 35.79 | 44.24 | 0.06 | 0 | 0 | 0 | 4.91 | 0 | 0 | 15.01 | 0 |
○Thread 2441893 | 40.31 | 0 | 0 | 36 | 44.13 | 0.06 | 0 | 0 | 0 | 4.7 | 0 | 0 | 15.11 | 0 |
○Thread 2441894 | 39.5 | 0 | 0 | 39.99 | 41.37 | 0.09 | 0.01 | 0 | 0 | 3.81 | 0 | 0 | 14.73 | 0 |
○Thread 2441895 | 40.29 | 0 | 0 | 42.75 | 40.15 | 0.05 | 0 | 0 | 0 | 3.28 | 0 | 0 | 13.78 | 0 |
○Thread 2441896 | 40.08 | 0 | 0 | 37.09 | 43.79 | 0.05 | 0 | 0 | 0 | 4.34 | 0 | 0 | 14.72 | 0 |
○Thread 2441897 | 39.15 | 0 | 0 | 40.96 | 41.2 | 0.05 | 0 | 0 | 0 | 3.42 | 0 | 0 | 14.37 | 0 |
○Thread 2441898 | 40.11 | 0 | 0 | 37.02 | 43.7 | 0.02 | 0 | 0 | 0 | 4.45 | 0 | 0 | 14.8 | 0 |
▼m1o16 | 30.51 | 0 | 0 | 47.76 | 29 | 0.08 | 0 | 0 | 0 | 6.64 | 0 | 0 | 16.52 | 0 |
▼Node skylake | 30.51 | 0 | 0 | 47.76 | 29 | 0.08 | 0 | 0 | 0 | 6.64 | 0 | 0 | 16.52 | 0 |
▼Process 2441906 | 30.51 | 0 | 0 | 47.76 | 29 | 0.08 | 0 | 0 | 0 | 6.64 | 0 | 0 | 16.52 | 0 |
○Thread 2441906 | 30.51 | 0 | 0 | 4.59 | 28.88 | 0.29 | 0 | 0.02 | 0 | 30.97 | 0 | 0 | 35.25 | 0 |
○Thread 2441959 | 30.3 | 0 | 0 | 48.26 | 30.61 | 0 | 0 | 0 | 0 | 5.26 | 0 | 0 | 15.87 | 0 |
○Thread 2441960 | 29.99 | 0 | 0 | 50.62 | 28.49 | 0.08 | 0 | 0 | 0 | 5.02 | 0 | 0 | 15.79 | 0 |
○Thread 2441961 | 30.29 | 0 | 0 | 48.33 | 30.11 | 0.05 | 0 | 0 | 0 | 5.4 | 0 | 0 | 16.11 | 0 |
○Thread 2441962 | 30.05 | 0 | 0 | 52.71 | 27.82 | 0.1 | 0 | 0 | 0 | 4.59 | 0 | 0 | 14.78 | 0 |
○Thread 2441963 | 29.99 | 0 | 0 | 49.89 | 29.57 | 0.1 | 0 | 0 | 0 | 5.22 | 0 | 0 | 15.22 | 0 |
○Thread 2441964 | 30 | 0 | 0 | 49.85 | 29.77 | 0.05 | 0 | 0 | 0 | 5.23 | 0 | 0 | 15.1 | 0 |
○Thread 2441965 | 29.98 | 0 | 0 | 49.65 | 29.75 | 0.1 | 0 | 0 | 0 | 5.17 | 0 | 0 | 15.33 | 0 |
○Thread 2441966 | 30.06 | 0 | 0 | 52.79 | 27.74 | 0.1 | 0 | 0 | 0 | 4.64 | 0 | 0 | 14.72 | 0 |
○Thread 2441967 | 30 | 0 | 0 | 49.88 | 29.78 | 0.05 | 0 | 0 | 0 | 5.08 | 0 | 0 | 15.21 | 0 |
○Thread 2441968 | 30 | 0 | 0 | 49.73 | 29.6 | 0.08 | 0 | 0 | 0 | 5.18 | 0 | 0 | 15.4 | 0 |
○Thread 2441969 | 29.75 | 0 | 0 | 51.72 | 28.49 | 0.07 | 0 | 0 | 0 | 4.87 | 0 | 0 | 14.84 | 0 |
○Thread 2441970 | 30.06 | 0 | 0 | 50.17 | 29.41 | 0.07 | 0 | 0 | 0 | 5.11 | 0 | 0 | 15.25 | 0 |
○Thread 2441971 | 29.63 | 0 | 0 | 52.44 | 27.81 | 0.03 | 0 | 0 | 0 | 4.61 | 0 | 0 | 15.11 | 0 |
○Thread 2441972 | 29.64 | 0 | 0 | 52.18 | 28.01 | 0.08 | 0 | 0 | 0 | 4.69 | 0 | 0 | 15.03 | 0 |
○Thread 2441973 | 29.63 | 0 | 0 | 52.27 | 28.04 | 0.07 | 0 | 0 | 0 | 4.74 | 0 | 0 | 14.88 | 0 |
▼m1o26 | 27.31 | 0 | 0 | 53.33 | 23.01 | 0.1 | 0 | 0 | 0 | 6.86 | 0 | 0 | 16.7 | 0 |
▼Node skylake | 27.31 | 0 | 0 | 53.33 | 23.01 | 0.1 | 0 | 0 | 0 | 6.86 | 0 | 0 | 16.7 | 0 |
▼Process 2441983 | 27.31 | 0 | 0 | 53.33 | 23.01 | 0.1 | 0 | 0 | 0 | 6.86 | 0 | 0 | 16.7 | 0 |
○Thread 2441983 | 27.31 | 0 | 0 | 3.55 | 23.11 | 0.22 | 0 | 0 | 0 | 34.8 | 0 | 0 | 38.3 | 0.02 |
○Thread 2442036 | 26.99 | 0 | 0 | 54.15 | 24.03 | 0.11 | 0 | 0 | 0 | 5.84 | 0 | 0 | 15.88 | 0 |
○Thread 2442037 | 27.07 | 0 | 0 | 55.28 | 22.79 | 0.13 | 0 | 0 | 0 | 5.93 | 0 | 0 | 15.87 | 0 |
○Thread 2442038 | 27.05 | 0 | 0 | 55.25 | 22.53 | 0.11 | 0 | 0 | 0 | 5.79 | 0 | 0 | 16.32 | 0 |
○Thread 2442039 | 27.27 | 0 | 0 | 55.42 | 22.68 | 0.15 | 0 | 0 | 0 | 5.74 | 0 | 0 | 16.01 | 0 |
○Thread 2442040 | 27.03 | 0 | 0 | 53.86 | 24.18 | 0.09 | 0 | 0 | 0 | 6.03 | 0 | 0 | 15.84 | 0 |
○Thread 2442041 | 27.03 | 0 | 0 | 53.87 | 24.2 | 0.09 | 0 | 0 | 0 | 5.83 | 0 | 0 | 16.02 | 0 |
○Thread 2442042 | 27.06 | 0 | 0 | 53.5 | 24.41 | 0.06 | 0 | 0 | 0 | 5.99 | 0 | 0 | 16.04 | 0 |
○Thread 2442043 | 27.14 | 0 | 0 | 55.11 | 23 | 0.09 | 0 | 0 | 0 | 5.75 | 0 | 0 | 16.05 | 0 |
○Thread 2442044 | 27 | 0 | 0 | 56.23 | 21.73 | 0.07 | 0 | 0 | 0 | 6.04 | 0 | 0 | 15.93 | 0 |
○Thread 2442045 | 27.07 | 0 | 0 | 53.94 | 24.18 | 0.07 | 0 | 0 | 0 | 5.87 | 0 | 0 | 15.92 | 0 |
○Thread 2442046 | 27.04 | 0 | 0 | 53.55 | 24.37 | 0.09 | 0 | 0 | 0 | 5.97 | 0 | 0 | 16.01 | 0 |
○Thread 2442047 | 27.15 | 0 | 0 | 55.26 | 23.01 | 0.07 | 0 | 0 | 0 | 5.82 | 0 | 0 | 15.84 | 0 |
○Thread 2442048 | 27.1 | 0 | 0 | 53.85 | 24.21 | 0.07 | 0 | 0 | 0 | 6.05 | 0 | 0 | 15.81 | 0 |
○Thread 2442049 | 27.03 | 0 | 0 | 56.37 | 21.65 | 0.11 | 0 | 0 | 0 | 5.88 | 0 | 0 | 15.99 | 0 |
○Thread 2442050 | 27.03 | 0 | 0 | 56.1 | 21.48 | 0.09 | 0 | 0 | 0 | 5.88 | 0 | 0 | 16.44 | 0 |
○Thread 2442051 | 27.25 | 0 | 0 | 56.51 | 21.25 | 0.11 | 0 | 0 | 0 | 5.82 | 0 | 0 | 16.31 | 0 |
○Thread 2442052 | 27.03 | 0 | 0 | 56.26 | 21.61 | 0.11 | 0 | 0 | 0 | 5.92 | 0 | 0 | 16.1 | 0 |
○Thread 2442053 | 27.04 | 0 | 0 | 56.24 | 21.47 | 0.09 | 0 | 0 | 0 | 6.05 | 0 | 0 | 16.15 | 0 |
○Thread 2442054 | 27.1 | 0 | 0 | 53.96 | 24.03 | 0.04 | 0 | 0 | 0 | 5.91 | 0 | 0 | 16.07 | 0 |
○Thread 2442055 | 27.2 | 0 | 0 | 56.67 | 21.42 | 0.07 | 0 | 0 | 0 | 6.07 | 0 | 0 | 15.77 | 0 |
○Thread 2442056 | 27.06 | 0 | 0 | 53.55 | 24.39 | 0.09 | 0 | 0 | 0 | 6.01 | 0 | 0 | 15.96 | 0 |
○Thread 2442057 | 26.41 | 0 | 0 | 57.72 | 22.12 | 0.09 | 0 | 0 | 0 | 4.75 | 0 | 0 | 15.32 | 0 |
○Thread 2442058 | 26.28 | 0 | 0 | 56.96 | 23.52 | 0.1 | 0 | 0 | 0 | 4.79 | 0 | 0 | 14.63 | 0 |
○Thread 2442059 | 26.53 | 0 | 0 | 55.71 | 24.97 | 0.11 | 0 | 0 | 0 | 4.73 | 0 | 0 | 14.47 | 0 |
○Thread 2442060 | 26.35 | 0 | 0 | 58.38 | 22.09 | 0.06 | 0.02 | 0 | 0 | 4.73 | 0 | 0 | 14.73 | 0 |
▼m1o52 | 27.43 | 0 | 0 | 57.56 | 19.45 | 0.09 | 0 | 0 | 0 | 6.4 | 0 | 0 | 16.49 | 0 |
▼Node skylake | 27.43 | 0 | 0 | 57.56 | 19.45 | 0.09 | 0 | 0 | 0 | 6.4 | 0 | 0 | 16.49 | 0 |
▼Process 2442070 | 27.43 | 0 | 0 | 57.56 | 19.45 | 0.09 | 0 | 0 | 0 | 6.4 | 0 | 0 | 16.49 | 0 |
○Thread 2442070 | 27.4 | 0 | 0 | 4.84 | 22.5 | 0.22 | 0 | 0 | 0 | 34.98 | 0 | 0 | 37.45 | 0.02 |
○Thread 2442123 | 26.73 | 0 | 0 | 59.6 | 18.16 | 0.21 | 0 | 0 | 0 | 5.57 | 0 | 0 | 16.46 | 0 |
○Thread 2442124 | 26.82 | 0 | 0 | 59.43 | 18.33 | 0.13 | 0 | 0 | 0 | 5.65 | 0 | 0 | 16.46 | 0 |
○Thread 2442125 | 26.78 | 0 | 0 | 59.05 | 18.24 | 0.13 | 0 | 0 | 0 | 5.69 | 0 | 0 | 16.88 | 0 |
○Thread 2442126 | 27.3 | 0 | 0 | 56.74 | 21.15 | 0.02 | 0 | 0 | 0 | 6.26 | 0 | 0 | 15.82 | 0 |
○Thread 2442127 | 26.88 | 0 | 0 | 56.94 | 20.46 | 0.09 | 0 | 0 | 0 | 6.16 | 0 | 0 | 16.35 | 0 |
○Thread 2442128 | 26.77 | 0 | 0 | 57.49 | 19.59 | 0.07 | 0 | 0 | 0 | 5.81 | 0 | 0 | 17.03 | 0 |
○Thread 2442129 | 27.12 | 0 | 0 | 56.91 | 20.48 | 0.15 | 0 | 0 | 0 | 6.12 | 0 | 0 | 16.33 | 0 |
○Thread 2442130 | 27.22 | 0 | 0 | 58.98 | 18.88 | 0.15 | 0 | 0 | 0 | 5.84 | 0 | 0 | 16.15 | 0 |
○Thread 2442131 | 26.91 | 0 | 0 | 58.82 | 18.77 | 0.04 | 0 | 0 | 0 | 5.91 | 0 | 0 | 16.47 | 0 |
○Thread 2442132 | 27.11 | 0 | 0 | 57.06 | 20.32 | 0.11 | 0 | 0 | 0 | 6.12 | 0 | 0 | 16.38 | 0 |
○Thread 2442133 | 26.34 | 0 | 0 | 59.24 | 18.41 | 0.17 | 0 | 0 | 0 | 5.69 | 0 | 0 | 16.48 | 0 |
○Thread 2442134 | 27.27 | 0 | 0 | 57.66 | 20.28 | 0.18 | 0 | 0 | 0 | 6.09 | 0 | 0 | 15.79 | 0 |
○Thread 2442135 | 26.68 | 0 | 0 | 58.6 | 18.7 | 0.09 | 0 | 0 | 0 | 5.7 | 0 | 0 | 16.9 | 0 |
○Thread 2442136 | 26.82 | 0 | 0 | 58.96 | 18.89 | 0.04 | 0 | 0 | 0 | 5.93 | 0 | 0 | 16.18 | 0 |
○Thread 2442137 | 27.25 | 0 | 0 | 56.35 | 21.36 | 0.06 | 0 | 0 | 0 | 6.39 | 0 | 0 | 15.85 | 0 |
○Thread 2442138 | 27.26 | 0 | 0 | 59.99 | 18.58 | 0.02 | 0.02 | 0 | 0 | 5.72 | 0 | 0 | 15.67 | 0 |
○Thread 2442139 | 26.98 | 0 | 0 | 57.31 | 20.09 | 0.09 | 0 | 0 | 0 | 5.97 | 0 | 0 | 16.53 | 0 |
○Thread 2442140 | 26.98 | 0 | 0 | 56.81 | 20.47 | 0.09 | 0 | 0 | 0 | 6.15 | 0 | 0 | 16.47 | 0 |
○Thread 2442141 | 26.85 | 0 | 0 | 57.28 | 19.96 | 0.04 | 0 | 0 | 0 | 6.11 | 0 | 0 | 16.61 | 0 |
○Thread 2442142 | 26.95 | 0 | 0 | 58.79 | 18.78 | 0.13 | 0 | 0 | 0 | 5.92 | 0 | 0 | 16.39 | 0 |
○Thread 2442143 | 26.69 | 0 | 0 | 59.05 | 19 | 0.07 | 0 | 0 | 0 | 6.07 | 0 | 0 | 15.81 | 0 |
○Thread 2442144 | 26.76 | 0 | 0 | 58.81 | 19.04 | 0.02 | 0 | 0 | 0 | 6.02 | 0 | 0 | 16.11 | 0 |
○Thread 2442145 | 26.72 | 0 | 0 | 59.23 | 18.95 | 0.09 | 0 | 0 | 0 | 5.93 | 0 | 0 | 15.79 | 0 |
○Thread 2442146 | 26.83 | 0 | 0 | 58.83 | 19.01 | 0.06 | 0.02 | 0 | 0 | 5.89 | 0 | 0 | 16.2 | 0 |
○Thread 2442147 | 26.73 | 0 | 0 | 58.92 | 18.99 | 0.06 | 0 | 0 | 0 | 5.87 | 0 | 0 | 16.16 | 0 |
○Thread 2442148 | 26.92 | 0 | 0 | 59.43 | 18.97 | 0.11 | 0 | 0 | 0 | 5.85 | 0 | 0 | 15.64 | 0 |
○Thread 2442149 | 26.94 | 0 | 0 | 58.72 | 18.78 | 0.07 | 0 | 0 | 0 | 5.99 | 0 | 0 | 16.43 | 0 |
○Thread 2442150 | 27.22 | 0 | 0 | 57.82 | 19.86 | 0.11 | 0 | 0 | 0 | 6.23 | 0 | 0 | 15.98 | 0 |
○Thread 2442151 | 26.83 | 0 | 0 | 59.37 | 18.88 | 0.02 | 0 | 0 | 0 | 5.87 | 0 | 0 | 15.86 | 0 |
○Thread 2442152 | 26.92 | 0 | 0 | 59.22 | 18.73 | 0.17 | 0 | 0 | 0 | 5.81 | 0 | 0 | 16.07 | 0 |
○Thread 2442153 | 27.11 | 0 | 0 | 56.92 | 20.36 | 0.09 | 0 | 0 | 0 | 6.14 | 0 | 0 | 16.49 | 0 |
○Thread 2442154 | 27.43 | 0 | 0 | 59 | 18.92 | 0.13 | 0 | 0 | 0 | 6.02 | 0 | 0 | 15.93 | 0 |
○Thread 2442155 | 27.11 | 0 | 0 | 58.33 | 19.52 | 0.11 | 0 | 0 | 0 | 6.35 | 0 | 0 | 15.7 | 0 |
○Thread 2442156 | 26.83 | 0 | 0 | 58.56 | 19.43 | 0.04 | 0 | 0 | 0 | 6.26 | 0 | 0 | 15.71 | 0 |
○Thread 2442157 | 26.15 | 0 | 0 | 57.69 | 19.87 | 0.11 | 0 | 0 | 0 | 6.25 | 0 | 0 | 16.08 | 0 |
○Thread 2442158 | 26.36 | 0 | 0 | 57.09 | 19.76 | 0.08 | 0 | 0 | 0 | 6.34 | 0 | 0 | 16.73 | 0 |
○Thread 2442159 | 24.72 | 0 | 0 | 60.31 | 17.03 | 0.14 | 0 | 0 | 0 | 5.71 | 0 | 0 | 16.81 | 0 |
○Thread 2442160 | 24.37 | 0 | 0 | 56.32 | 20.93 | 0.14 | 0 | 0 | 0 | 6.16 | 0 | 0 | 16.45 | 0 |
○Thread 2442161 | 26.28 | 0 | 0 | 57.03 | 19.73 | 0.1 | 0 | 0 | 0 | 6.22 | 0 | 0 | 16.92 | 0 |
○Thread 2442162 | 27.21 | 0 | 0 | 60.99 | 16.98 | 0.04 | 0 | 0 | 0 | 5.71 | 0 | 0 | 16.28 | 0 |
○Thread 2442163 | 26.46 | 0 | 0 | 60.18 | 17.33 | 0.09 | 0 | 0 | 0 | 6.01 | 0 | 0 | 16.39 | 0 |
○Thread 2442164 | 26.32 | 0 | 0 | 57.98 | 19.15 | 0.02 | 0 | 0 | 0 | 6.15 | 0 | 0 | 16.7 | 0 |
○Thread 2442165 | 26.68 | 0 | 0 | 57.62 | 19.44 | 0.04 | 0 | 0 | 0 | 6.2 | 0 | 0 | 16.7 | 0 |
○Thread 2442166 | 27.07 | 0 | 0 | 56.94 | 20.72 | 0.07 | 0 | 0 | 0 | 6.39 | 0 | 0 | 15.87 | 0 |
○Thread 2442167 | 26.85 | 0 | 0 | 60.85 | 17.47 | 0.07 | 0 | 0 | 0 | 5.89 | 0 | 0 | 15.72 | 0 |
○Thread 2442168 | 26.67 | 0 | 0 | 58 | 19.4 | 0.07 | 0 | 0 | 0 | 6.17 | 0 | 0 | 16.35 | 0 |
○Thread 2442169 | 26.95 | 0 | 0 | 58.37 | 19.24 | 0.07 | 0 | 0 | 0 | 6.12 | 0 | 0 | 16.2 | 0 |
○Thread 2442170 | 26 | 0 | 0 | 60.92 | 21.38 | 0.08 | 0 | 0 | 0 | 3.75 | 0 | 0 | 13.87 | 0 |
○Thread 2442171 | 25.73 | 0 | 0 | 60.41 | 21.4 | 0.06 | 0 | 0 | 0 | 3.95 | 0 | 0 | 14.19 | 0 |
○Thread 2442172 | 25.23 | 0 | 0 | 61.61 | 20.51 | 0.08 | 0 | 0 | 0 | 3.69 | 0 | 0 | 14.11 | 0 |
○Thread 2442173 | 25.26 | 0 | 0 | 63.95 | 20.31 | 0.1 | 0 | 0 | 0 | 3.03 | 0 | 0 | 12.61 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 70.95 | 0.04 | 8.42 | 20.59 |
m1o2 | 2 | 7.25 | 65 | 0.04 | 7.89 | 19.82 |
m1o4 | 4 | 21.82 | 53.7 | 0.05 | 6.94 | 17.5 |
m1o8 | 8 | 34.46 | 42.48 | 0.07 | 6.53 | 16.46 |
m1o16 | 16 | 47.76 | 29 | 0.08 | 6.64 | 16.52 |
m1o26 | 26 | 53.33 | 23.01 | 0.1 | 6.86 | 16.7 |
m1o52 | 52 | 57.56 | 19.45 | 0.09 | 6.4 | 16.49 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|---|
m1o1 | 1 | 180.83 | 0 | 128.3 | 0.07 | 15.23 | 37.23 |
m1o2 | 2 | 99.33 | 7.2 | 64.56 | 0.04 | 7.84 | 19.69 |
m1o4 | 4 | 62.5 | 13.64 | 33.56 | 0.03 | 4.34 | 10.94 |
m1o8 | 8 | 40.71 | 14.03 | 17.29 | 0.03 | 2.66 | 6.7 |
m1o16 | 16 | 30.51 | 14.57 | 8.85 | 0.02 | 2.03 | 5.04 |
m1o26 | 26 | 27.31 | 14.56 | 6.28 | 0.03 | 1.87 | 4.56 |
m1o52 | 52 | 27.43 | 15.79 | 5.34 | 0.02 | 1.76 | 4.52 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
m1o1 | 1 | 1 |
m1o2 | 2 | 0.8 |
m1o4 | 4 | 0.55 |
m1o8 | 8 | 0.35 |
m1o16 | 16 | 0.2 |
m1o26 | 26 | 0.13 |
m1o52 | 52 | 0.07 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0.02 | 0 | 0.02 | 0.44 | 0 | 0 | 3.64 | 88.6 | 7.28 |
m1o4 | 4 | 0 | 0 | 0.75 | 0 | 0.01 | 0.05 | 0 | 2.68 | 17.93 | 56.73 | 21.85 |
m1o8 | 8 | 0 | 1.11 | 0.07 | 0 | 0 | 2.81 | 7.67 | 9.76 | 2.21 | 41.89 | 34.48 |
m1o16 | 16 | 1.5 | 0 | 0.06 | 11.56 | 6.63 | 0 | 1.51 | 2.22 | 0.71 | 27.95 | 47.86 |
m1o26 | 26 | 1.79 | 0.08 | 12.41 | 7.31 | 3.68 | 0 | 0.28 | 0.09 | 19.51 | 1.46 | 53.39 |
m1o52 | 52 | 2.03 | 21.56 | 2.64 | 0 | 0.22 | 14.88 | 0 | 0 | 0.07 | 0.97 | 57.63 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0 | 0 | 0 | 7.29 | 92.68 | 0.01 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21.82 | 78.14 | 0.04 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34.49 | 65.51 | 0 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 47.84 | 52.14 | 0.02 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 55.15 | 44.82 | 0.03 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.96 | 57.58 | 40.41 | 0.05 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0.34 | 0 | 0 | 3.53 | 59.33 | 36.8 |
m1o4 | 4 | 0 | 0 | 0.51 | 0 | 0 | 0 | 0 | 6.79 | 3.08 | 38.35 | 51.26 |
m1o8 | 8 | 0 | 0.61 | 0 | 0 | 2.83 | 3.33 | 2.49 | 0 | 23.78 | 1.45 | 65.51 |
m1o16 | 16 | 0.71 | 0 | 3.62 | 4.79 | 0 | 0 | 0.31 | 0 | 14.95 | 0 | 75.63 |
m1o26 | 26 | 0.8 | 3.5 | 4.78 | 0.29 | 0.02 | 0 | 0 | 10.61 | 0.07 | 0.48 | 79.45 |
m1o52 | 52 | 7.99 | 2.4 | 0.01 | 0 | 9.35 | 0 | 0.04 | 0 | 0 | 0.25 | 79.95 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 63.2 | 36.8 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.51 | 48.22 | 51.26 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34.49 | 65.51 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24.37 | 75.63 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.8 | 19.75 | 79.45 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.97 | 0 | 0 | 19.07 | 79.95 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | m1o1 | m1o2 | m1o4 | m1o8 | m1o16 | m1o26 | m1o52 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/trexio/_install/lib/libtrexio.so.0.0.0 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_avx512.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_core.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_vml_avx512.so.2 | |||||||
/opt/other/hdf5/gcc/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |