Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
PrOMPT - Coverage per Parallel Efficiency |
PrOMPT - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼m1o1 | 91.56 | 0 | 0 | 0 | 14.59 | 0.02 | 0 | 0.01 | 0 | 15.86 | 0 | 0 | 69.52 | 0.01 |
▼Node skylake | 91.56 | 0 | 0 | 0 | 14.59 | 0.02 | 0 | 0.01 | 0 | 15.86 | 0 | 0 | 69.52 | 0.01 |
▼Process 235516 | 91.56 | 0 | 0 | 0 | 14.59 | 0.02 | 0 | 0.01 | 0 | 15.86 | 0 | 0 | 69.52 | 0.01 |
○Thread 235516 | 91.56 | 0 | 0 | 0 | 14.59 | 0.02 | 0 | 0.01 | 0 | 15.86 | 0 | 0 | 69.52 | 0.01 |
▼m1o2 | 45.32 | 0 | 0 | 0.2 | 14.7 | 0.01 | 0 | 0.01 | 0 | 14.9 | 0 | 0 | 70.18 | 0 |
▼Node skylake | 45.32 | 0 | 0 | 0.2 | 14.7 | 0.01 | 0 | 0.01 | 0 | 14.9 | 0 | 0 | 70.18 | 0 |
▼Process 235577 | 45.32 | 0 | 0 | 0.2 | 14.7 | 0.01 | 0 | 0.01 | 0 | 14.9 | 0 | 0 | 70.18 | 0 |
○Thread 235577 | 45.28 | 0 | 0 | 0.06 | 14.81 | 0.02 | 0 | 0.01 | 0 | 14.92 | 0 | 0 | 70.18 | 0 |
○Thread 235630 | 45.32 | 0 | 0 | 0.35 | 14.59 | 0 | 0 | 0 | 0 | 14.88 | 0 | 0 | 70.18 | 0 |
▼m1o4 | 22.93 | 0 | 0 | 1.4 | 14.5 | 0.01 | 0 | 0.01 | 0 | 15.06 | 0 | 0 | 69.03 | 0 |
▼Node skylake | 22.93 | 0 | 0 | 1.4 | 14.5 | 0.01 | 0 | 0.01 | 0 | 15.06 | 0 | 0 | 69.03 | 0 |
▼Process 235639 | 22.93 | 0 | 0 | 1.4 | 14.5 | 0.01 | 0 | 0.01 | 0 | 15.06 | 0 | 0 | 69.03 | 0 |
○Thread 235639 | 22.89 | 0 | 0 | 0.66 | 14.2 | 0.02 | 0 | 0.02 | 0 | 15.88 | 0 | 0 | 69.23 | 0 |
○Thread 235692 | 22.88 | 0 | 0 | 1.77 | 14.55 | 0 | 0 | 0 | 0 | 15.14 | 0 | 0 | 68.53 | 0 |
○Thread 235693 | 22.93 | 0 | 0 | 1.4 | 14.58 | 0 | 0 | 0 | 0 | 14.74 | 0 | 0 | 69.28 | 0 |
○Thread 235694 | 22.93 | 0 | 0 | 1.77 | 14.65 | 0.02 | 0 | 0 | 0 | 14.5 | 0 | 0 | 69.06 | 0 |
▼m1o8 | 11.96 | 0 | 0 | 4.05 | 14.23 | 0.02 | 0 | 0.01 | 0 | 14.52 | 0 | 0 | 67.18 | 0 |
▼Node skylake | 11.96 | 0 | 0 | 4.05 | 14.23 | 0.02 | 0 | 0.01 | 0 | 14.52 | 0 | 0 | 67.18 | 0 |
▼Process 235704 | 11.96 | 0 | 0 | 4.05 | 14.23 | 0.02 | 0 | 0.01 | 0 | 14.52 | 0 | 0 | 67.18 | 0 |
○Thread 235704 | 11.91 | 0 | 0 | 3.44 | 14.2 | 0.04 | 0 | 0.04 | 0 | 14.91 | 0 | 0 | 67.37 | 0 |
○Thread 235757 | 11.95 | 0 | 0 | 3.01 | 14.23 | 0 | 0 | 0 | 0 | 14.86 | 0 | 0 | 67.89 | 0 |
○Thread 235758 | 11.95 | 0 | 0 | 3.68 | 13.94 | 0 | 0 | 0 | 0 | 14.73 | 0 | 0 | 67.64 | 0 |
○Thread 235759 | 11.95 | 0 | 0 | 3.85 | 15.61 | 0 | 0 | 0 | 0 | 15.32 | 0 | 0 | 65.22 | 0 |
○Thread 235760 | 11.95 | 0 | 0 | 4.56 | 14.35 | 0 | 0 | 0 | 0 | 14.77 | 0 | 0 | 66.32 | 0 |
○Thread 235761 | 11.96 | 0 | 0 | 4.73 | 13.51 | 0.04 | 0 | 0 | 0 | 14.05 | 0 | 0 | 67.67 | 0 |
○Thread 235762 | 11.95 | 0 | 0 | 3.56 | 13.94 | 0 | 0 | 0 | 0 | 14.65 | 0 | 0 | 67.85 | 0 |
○Thread 235763 | 11.93 | 0 | 0 | 5.58 | 14.05 | 0.04 | 0 | 0 | 0 | 12.87 | 0 | 0 | 67.46 | 0 |
▼m1o16 | 6.06 | 0 | 0 | 4.82 | 14.38 | 0.05 | 0 | 0.01 | 0 | 14.25 | 0 | 0 | 66.5 | 0 |
▼Node skylake | 6.06 | 0 | 0 | 4.82 | 14.38 | 0.05 | 0 | 0.01 | 0 | 14.25 | 0 | 0 | 66.5 | 0 |
▼Process 235768 | 6.06 | 0 | 0 | 4.82 | 14.38 | 0.05 | 0 | 0.01 | 0 | 14.25 | 0 | 0 | 66.5 | 0 |
○Thread 235768 | 6 | 0 | 0 | 2.17 | 15.17 | 0.25 | 0 | 0.08 | 0 | 16.17 | 0 | 0 | 66.17 | 0 |
○Thread 235821 | 6.03 | 0 | 0 | 7.22 | 12.95 | 0 | 0 | 0 | 0 | 12.12 | 0 | 0 | 67.72 | 0 |
○Thread 235822 | 6.03 | 0 | 0 | 4.81 | 12.68 | 0.08 | 0 | 0 | 0 | 13.5 | 0 | 0 | 68.93 | 0 |
○Thread 235823 | 6.04 | 0 | 0 | 4.47 | 13.65 | 0.08 | 0 | 0 | 0 | 14.81 | 0 | 0 | 67 | 0 |
○Thread 235824 | 6.04 | 0 | 0 | 4.55 | 13.58 | 0 | 0 | 0 | 0 | 15.31 | 0 | 0 | 66.56 | 0 |
○Thread 235825 | 6.04 | 0 | 0 | 4.3 | 13.91 | 0.08 | 0 | 0 | 0 | 15.4 | 0 | 0 | 66.31 | 0 |
○Thread 235826 | 6.04 | 0 | 0 | 4.97 | 15.15 | 0.17 | 0 | 0 | 0 | 13.91 | 0 | 0 | 65.81 | 0 |
○Thread 235827 | 6.04 | 0 | 0 | 4.8 | 15.23 | 0 | 0 | 0 | 0 | 14.74 | 0 | 0 | 65.23 | 0 |
○Thread 235828 | 6.04 | 0 | 0 | 5.38 | 14.57 | 0 | 0 | 0 | 0 | 12.5 | 0 | 0 | 67.55 | 0 |
○Thread 235829 | 6.05 | 0 | 0 | 5.04 | 15.04 | 0 | 0 | 0 | 0 | 14.46 | 0 | 0 | 65.45 | 0 |
○Thread 235830 | 6.05 | 0 | 0 | 5.05 | 13.4 | 0 | 0 | 0 | 0 | 14.56 | 0 | 0 | 67 | 0 |
○Thread 235831 | 6.05 | 0 | 0 | 5.21 | 14.46 | 0 | 0 | 0 | 0 | 13.31 | 0 | 0 | 67.02 | 0 |
○Thread 235832 | 6.05 | 0 | 0 | 5.79 | 12.48 | 0 | 0 | 0 | 0 | 13.47 | 0 | 0 | 68.26 | 0 |
○Thread 235833 | 6.05 | 0 | 0 | 5.45 | 15.21 | 0 | 0 | 0 | 0 | 15.04 | 0 | 0 | 64.3 | 0 |
○Thread 235834 | 6.05 | 0 | 0 | 4.05 | 16.2 | 0.08 | 0 | 0 | 0 | 13.72 | 0 | 0 | 65.95 | 0 |
○Thread 235835 | 6.06 | 0 | 0 | 3.8 | 16.35 | 0 | 0 | 0 | 0 | 15.03 | 0 | 0 | 64.82 | 0 |
▼m1o26 | 4.25 | 0 | 0 | 6.8 | 12.68 | 0.03 | 0 | 0 | 0 | 19.41 | 0 | 0 | 61.07 | 0 |
▼Node skylake | 4.25 | 0 | 0 | 6.8 | 12.68 | 0.03 | 0 | 0 | 0 | 19.41 | 0 | 0 | 61.07 | 0 |
▼Process 235842 | 4.25 | 0 | 0 | 6.8 | 12.68 | 0.03 | 0 | 0 | 0 | 19.41 | 0 | 0 | 61.07 | 0 |
○Thread 235842 | 4.2 | 0 | 0 | 2.86 | 13.21 | 0.24 | 0 | 0.12 | 0 | 22.86 | 0 | 0 | 60.6 | 0.12 |
○Thread 235895 | 4.22 | 0 | 0 | 10.43 | 12.68 | 0 | 0 | 0 | 0 | 17.42 | 0 | 0 | 59.48 | 0 |
○Thread 235896 | 4.21 | 0 | 0 | 15.18 | 12.81 | 0 | 0 | 0 | 0 | 10.32 | 0 | 0 | 61.68 | 0 |
○Thread 235897 | 4.23 | 0 | 0 | 6.26 | 11.69 | 0 | 0 | 0 | 0 | 19.13 | 0 | 0 | 62.93 | 0 |
○Thread 235898 | 4.23 | 0 | 0 | 6.26 | 13.81 | 0 | 0 | 0 | 0 | 20.31 | 0 | 0 | 59.62 | 0 |
○Thread 235899 | 4.23 | 0 | 0 | 7.56 | 12.75 | 0 | 0 | 0 | 0 | 21.37 | 0 | 0 | 58.32 | 0 |
○Thread 235900 | 4.23 | 0 | 0 | 7.2 | 10.51 | 0.12 | 0 | 0 | 0 | 19.83 | 0 | 0 | 62.34 | 0 |
○Thread 235901 | 4.23 | 0 | 0 | 6.49 | 9.92 | 0 | 0 | 0 | 0 | 19.83 | 0 | 0 | 63.75 | 0 |
○Thread 235902 | 4.24 | 0 | 0 | 6.25 | 13.56 | 0 | 0 | 0 | 0 | 18.75 | 0 | 0 | 61.44 | 0 |
○Thread 235903 | 4.24 | 0 | 0 | 6.96 | 11.56 | 0 | 0 | 0 | 0 | 19.34 | 0 | 0 | 62.15 | 0 |
○Thread 235904 | 4.23 | 0 | 0 | 5.08 | 12.51 | 0 | 0 | 0 | 0 | 19.01 | 0 | 0 | 63.4 | 0 |
○Thread 235905 | 4.24 | 0 | 0 | 6.72 | 13.21 | 0 | 0 | 0 | 0 | 20.64 | 0 | 0 | 59.43 | 0 |
○Thread 235906 | 4.24 | 0 | 0 | 6.95 | 12.6 | 0.12 | 0 | 0 | 0 | 17.55 | 0 | 0 | 62.78 | 0 |
○Thread 235907 | 4.25 | 0 | 0 | 5.18 | 11.88 | 0 | 0 | 0 | 0 | 21.53 | 0 | 0 | 61.41 | 0 |
○Thread 235908 | 4.25 | 0 | 0 | 4.12 | 16.71 | 0 | 0 | 0 | 0 | 20.59 | 0 | 0 | 58.59 | 0 |
○Thread 235909 | 4.23 | 0 | 0 | 5.9 | 12.75 | 0 | 0 | 0 | 0 | 20.78 | 0 | 0 | 60.57 | 0 |
○Thread 235910 | 4.24 | 0 | 0 | 7.07 | 10.84 | 0 | 0 | 0 | 0 | 19.79 | 0 | 0 | 62.31 | 0 |
○Thread 235911 | 4.23 | 0 | 0 | 8.15 | 12.4 | 0 | 0 | 0 | 0 | 18.18 | 0 | 0 | 61.28 | 0 |
○Thread 235912 | 4.23 | 0 | 0 | 6.85 | 11.45 | 0 | 0 | 0 | 0 | 19.72 | 0 | 0 | 61.98 | 0 |
○Thread 235913 | 4.24 | 0 | 0 | 6.84 | 13.56 | 0 | 0 | 0 | 0 | 20.64 | 0 | 0 | 58.96 | 0 |
○Thread 235914 | 4.23 | 0 | 0 | 4.84 | 13.22 | 0 | 0 | 0 | 0 | 21.02 | 0 | 0 | 60.92 | 0 |
○Thread 235915 | 4.24 | 0 | 0 | 4.6 | 16.27 | 0 | 0 | 0 | 0 | 20.4 | 0 | 0 | 58.73 | 0 |
○Thread 235916 | 4.24 | 0 | 0 | 8.02 | 12.38 | 0 | 0 | 0 | 0 | 17.81 | 0 | 0 | 61.79 | 0 |
○Thread 235917 | 4.24 | 0 | 0 | 8.25 | 12.38 | 0.24 | 0 | 0 | 0 | 17.33 | 0 | 0 | 61.79 | 0 |
○Thread 235918 | 4.24 | 0 | 0 | 8.02 | 11.79 | 0 | 0 | 0 | 0 | 15.92 | 0 | 0 | 64.27 | 0 |
○Thread 235919 | 4.24 | 0 | 0 | 4.83 | 13.31 | 0 | 0 | 0 | 0 | 24.5 | 0 | 0 | 57.36 | 0 |
▼m1o52 | 3.89 | 0 | 0 | 8.01 | 6.99 | 0.07 | 0 | 0 | 0 | 41.35 | 0 | 0 | 43.57 | 0 |
▼Node skylake | 3.89 | 0 | 0 | 8.01 | 6.99 | 0.07 | 0 | 0 | 0 | 41.35 | 0 | 0 | 43.57 | 0 |
▼Process 235926 | 3.89 | 0 | 0 | 8.01 | 6.99 | 0.07 | 0 | 0 | 0 | 41.35 | 0 | 0 | 43.57 | 0 |
○Thread 235926 | 3.83 | 0 | 0 | 3.13 | 6 | 0.26 | 0 | 0 | 0 | 46.94 | 0 | 0 | 43.55 | 0.13 |
○Thread 235979 | 3.82 | 0 | 0 | 11.39 | 5.89 | 0 | 0 | 0 | 0 | 39.79 | 0 | 0 | 42.93 | 0 |
○Thread 235980 | 3.82 | 0 | 0 | 4.84 | 7.84 | 0 | 0 | 0 | 0 | 43.53 | 0 | 0 | 43.79 | 0 |
○Thread 235981 | 3.81 | 0 | 0 | 12.45 | 7.47 | 0.13 | 0 | 0 | 0 | 36.7 | 0 | 0 | 43.25 | 0 |
○Thread 235982 | 3.81 | 0 | 0 | 5.37 | 8.78 | 0.13 | 0 | 0 | 0 | 40.63 | 0 | 0 | 45.09 | 0 |
○Thread 235983 | 3.81 | 0 | 0 | 5.24 | 7.21 | 0.13 | 0 | 0 | 0 | 44.69 | 0 | 0 | 42.73 | 0 |
○Thread 235984 | 3.87 | 0 | 0 | 15.63 | 7.62 | 0 | 0 | 0 | 0 | 36.3 | 0 | 0 | 40.44 | 0 |
○Thread 235985 | 3.87 | 0 | 0 | 10.47 | 5.3 | 0.13 | 0 | 0 | 0 | 42.25 | 0 | 0 | 41.86 | 0 |
○Thread 235986 | 3.81 | 0 | 0 | 5.11 | 9.17 | 0 | 0 | 0 | 0 | 40.37 | 0 | 0 | 45.35 | 0 |
○Thread 235987 | 3.81 | 0 | 0 | 5.51 | 5.91 | 0.26 | 0 | 0 | 0 | 44.23 | 0 | 0 | 44.09 | 0 |
○Thread 235988 | 3.8 | 0 | 0 | 5.26 | 8.15 | 0.13 | 0 | 0 | 0 | 43.76 | 0 | 0 | 42.71 | 0 |
○Thread 235989 | 3.79 | 0 | 0 | 3.69 | 7.65 | 0.13 | 0 | 0 | 0 | 43.67 | 0 | 0 | 44.85 | 0 |
○Thread 235990 | 3.8 | 0 | 0 | 4.73 | 5.65 | 0 | 0 | 0 | 0 | 42.31 | 0 | 0 | 47.31 | 0 |
○Thread 235991 | 3.8 | 0 | 0 | 4.99 | 7.88 | 0 | 0 | 0 | 0 | 44.42 | 0 | 0 | 42.71 | 0 |
○Thread 235992 | 3.8 | 0 | 0 | 4.6 | 8.15 | 0 | 0 | 0 | 0 | 45.73 | 0 | 0 | 41.52 | 0 |
○Thread 235993 | 3.81 | 0 | 0 | 9.71 | 9.06 | 0 | 0 | 0 | 0 | 39.76 | 0 | 0 | 41.47 | 0 |
○Thread 235994 | 3.89 | 0 | 0 | 7.84 | 8.1 | 0 | 0 | 0 | 0 | 40.36 | 0 | 0 | 43.7 | 0 |
○Thread 235995 | 3.81 | 0 | 0 | 5.38 | 5.77 | 0.13 | 0 | 0 | 0 | 44.49 | 0 | 0 | 44.23 | 0 |
○Thread 235996 | 3.79 | 0 | 0 | 11.35 | 7.65 | 0 | 0 | 0 | 0 | 36.94 | 0 | 0 | 44.06 | 0 |
○Thread 235997 | 3.77 | 0 | 0 | 9.15 | 6.1 | 0 | 0 | 0 | 0 | 41.25 | 0 | 0 | 43.5 | 0 |
○Thread 235998 | 3.8 | 0 | 0 | 5.13 | 8.03 | 0 | 0 | 0 | 0 | 44.21 | 0 | 0 | 42.63 | 0 |
○Thread 235999 | 3.79 | 0 | 0 | 9.88 | 6.46 | 0 | 0 | 0 | 0 | 38.6 | 0 | 0 | 45.06 | 0 |
○Thread 236000 | 3.79 | 0 | 0 | 5.8 | 7.51 | 0.13 | 0 | 0 | 0 | 43.08 | 0 | 0 | 43.48 | 0 |
○Thread 236001 | 3.79 | 0 | 0 | 9.49 | 6.98 | 0 | 0 | 0 | 0 | 38.21 | 0 | 0 | 45.32 | 0 |
○Thread 236002 | 3.78 | 0 | 0 | 5.95 | 9.26 | 0 | 0 | 0 | 0 | 42.46 | 0 | 0 | 42.33 | 0 |
○Thread 236003 | 3.8 | 0 | 0 | 8.29 | 7.37 | 0.26 | 0 | 0 | 0 | 40.92 | 0 | 0 | 43.16 | 0 |
○Thread 236004 | 3.82 | 0 | 0 | 5.76 | 7.98 | 0.13 | 0 | 0 | 0 | 46.07 | 0 | 0 | 40.05 | 0 |
○Thread 236005 | 3.8 | 0 | 0 | 4.6 | 5.39 | 0.13 | 0 | 0 | 0 | 42.31 | 0 | 0 | 47.57 | 0 |
○Thread 236006 | 3.8 | 0 | 0 | 10.78 | 6.31 | 0 | 0 | 0 | 0 | 38.63 | 0 | 0 | 44.28 | 0 |
○Thread 236007 | 3.8 | 0 | 0 | 9.72 | 5.65 | 0.13 | 0 | 0 | 0 | 43.23 | 0 | 0 | 41.26 | 0 |
○Thread 236008 | 3.8 | 0 | 0 | 9.72 | 4.99 | 0 | 0 | 0 | 0 | 39.55 | 0 | 0 | 45.73 | 0 |
○Thread 236009 | 3.82 | 0 | 0 | 8.77 | 6.02 | 0.26 | 0 | 0 | 0 | 38.87 | 0 | 0 | 46.07 | 0 |
○Thread 236010 | 3.81 | 0 | 0 | 10.89 | 6.82 | 0.26 | 0 | 0 | 0 | 38.58 | 0 | 0 | 43.44 | 0 |
○Thread 236011 | 3.79 | 0 | 0 | 4.61 | 8.04 | 0.26 | 0 | 0 | 0 | 43.21 | 0 | 0 | 43.87 | 0 |
○Thread 236012 | 3.78 | 0 | 0 | 9.11 | 7.79 | 0 | 0 | 0 | 0 | 38.84 | 0 | 0 | 44.25 | 0 |
○Thread 236013 | 3.79 | 0 | 0 | 10.14 | 7.11 | 0 | 0 | 0 | 0 | 39.39 | 0 | 0 | 43.35 | 0 |
○Thread 236014 | 3.8 | 0 | 0 | 4.08 | 6.32 | 0 | 0 | 0 | 0 | 46.58 | 0 | 0 | 43.03 | 0 |
○Thread 236015 | 3.88 | 0 | 0 | 11.08 | 7.6 | 0.13 | 0 | 0 | 0 | 39.82 | 0 | 0 | 41.37 | 0 |
○Thread 236016 | 3.86 | 0 | 0 | 10.87 | 4.92 | 0 | 0 | 0 | 0 | 39.07 | 0 | 0 | 45.15 | 0 |
○Thread 236017 | 3.8 | 0 | 0 | 9.07 | 7.23 | 0.13 | 0 | 0 | 0 | 38.9 | 0 | 0 | 44.68 | 0 |
○Thread 236018 | 3.89 | 0 | 0 | 7.2 | 6.56 | 0 | 0 | 0 | 0 | 42.16 | 0 | 0 | 44.09 | 0 |
○Thread 236019 | 3.8 | 0 | 0 | 8.02 | 6.96 | 0 | 0 | 0 | 0 | 39.42 | 0 | 0 | 45.6 | 0 |
○Thread 236020 | 3.88 | 0 | 0 | 12.24 | 5.8 | 0.13 | 0 | 0 | 0 | 36.6 | 0 | 0 | 45.23 | 0 |
○Thread 236021 | 3.8 | 0 | 0 | 9.33 | 5.65 | 0.13 | 0 | 0 | 0 | 42.18 | 0 | 0 | 42.71 | 0 |
○Thread 236022 | 3.79 | 0 | 0 | 6.59 | 5.53 | 0 | 0 | 0 | 0 | 45.98 | 0 | 0 | 41.9 | 0 |
○Thread 236023 | 3.87 | 0 | 0 | 11.23 | 6.97 | 0 | 0 | 0 | 0 | 39.35 | 0 | 0 | 42.45 | 0 |
○Thread 236024 | 3.81 | 0 | 0 | 10.24 | 6.69 | 0.26 | 0 | 0 | 0 | 37.8 | 0 | 0 | 45.01 | 0 |
○Thread 236025 | 3.83 | 0 | 0 | 11.88 | 7.44 | 0 | 0 | 0 | 0 | 36.55 | 0 | 0 | 44.13 | 0 |
○Thread 236026 | 3.89 | 0 | 0 | 7.2 | 6.81 | 0 | 0 | 0 | 0 | 44.47 | 0 | 0 | 41.52 | 0 |
○Thread 236027 | 3.88 | 0 | 0 | 9.4 | 6.31 | 0 | 0 | 0 | 0 | 42.21 | 0 | 0 | 42.08 | 0 |
○Thread 236028 | 3.89 | 0 | 0 | 7.84 | 7.97 | 0 | 0 | 0 | 0 | 43.96 | 0 | 0 | 40.23 | 0 |
○Thread 236029 | 3.81 | 0 | 0 | 5.38 | 7.48 | 0 | 0 | 0 | 0 | 41.21 | 0 | 0 | 45.93 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 14.59 | 0.02 | 15.86 | 69.52 |
m1o2 | 2 | 0.2 | 14.7 | 0.01 | 14.9 | 70.18 |
m1o4 | 4 | 1.4 | 14.5 | 0.01 | 15.06 | 69.03 |
m1o8 | 8 | 4.05 | 14.23 | 0.02 | 14.52 | 67.18 |
m1o16 | 16 | 4.82 | 14.38 | 0.05 | 14.25 | 66.5 |
m1o26 | 26 | 6.8 | 12.68 | 0.03 | 19.41 | 61.07 |
m1o52 | 52 | 8.01 | 6.99 | 0.07 | 41.35 | 43.57 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|---|
m1o1 | 1 | 91.57 | 0 | 13.36 | 0.02 | 14.52 | 63.65 |
m1o2 | 2 | 45.32 | 0.09 | 6.66 | 0 | 6.75 | 31.81 |
m1o4 | 4 | 22.93 | 0.32 | 3.32 | 0 | 3.45 | 15.83 |
m1o8 | 8 | 11.96 | 0.48 | 1.7 | 0 | 1.74 | 8.03 |
m1o16 | 16 | 6.06 | 0.29 | 0.87 | 0 | 0.86 | 4.03 |
m1o26 | 26 | 4.25 | 0.29 | 0.54 | 0 | 0.82 | 2.6 |
m1o52 | 52 | 3.89 | 0.31 | 0.27 | 0 | 1.61 | 1.69 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
m1o1 | 1 | 1 |
m1o2 | 2 | 1.01 |
m1o4 | 4 | 0.99 |
m1o8 | 8 | 0.95 |
m1o16 | 16 | 0.92 |
m1o26 | 26 | 0.8 |
m1o52 | 52 | 0.43 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.08 | 99.69 | 0.23 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.08 | 98.49 | 1.43 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.05 | 0.07 | 95.79 | 4.09 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.25 | 93.74 | 5.01 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 19.29 | 0 | 0 | 73.75 | 6.96 |
m1o52 | 52 | 0 | 41.29 | 0 | 0 | 0 | 0 | 29.4 | 0 | 14.16 | 6.99 | 8.16 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.27 | 99.77 | 0 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.46 | 98.57 | 0 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.14 | 95.91 | 0 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5.05 | 94.99 | 0 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6.91 | 93.04 | 0.05 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8.15 | 91.84 | 0.01 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 99.48 | 0.52 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 98.96 | 1.04 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 97.99 | 2.01 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 65.26 | 31.01 | 3.73 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 65.28 | 29.13 | 5.59 |
m1o52 | 52 | 0 | 0 | 0 | 75.19 | 0 | 0 | 0 | 18.74 | 0 | 0 | 6.07 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 99.48 | 0.52 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 98.96 | 1.04 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 97.99 | 2.01 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.27 | 3.73 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 94.41 | 5.59 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 93.93 | 6.07 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | m1o1 | m1o2 | m1o4 | m1o8 | m1o16 | m1o26 | m1o52 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build23feb/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/qmckl_bench/build23feb/libtrexio/__install/lib/libtrexio.so.0.0.0 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libirng.so | |||||||
/home/kcamus/intel/oneapi/compiler/2022.2.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/home/kcamus/intel/oneapi/mkl/2022.2.0/lib/intel64/libmkl_core.so.2 | |||||||
/home/kcamus/intel/oneapi/mkl/2022.2.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/home/kcamus/intel/oneapi/mkl/2022.2.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/other/hdf5/icc_2017.4/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |