Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_0 | 110.94 | 0 | 0 | 0 | 11.8 | 0.04 | 0 | 0 | 0 | 27.52 | 0 | 0 | 60.64 | 0 |
▼Node skylake | 110.94 | 0 | 0 | 0 | 11.8 | 0.04 | 0 | 0 | 0 | 27.52 | 0 | 0 | 60.64 | 0 |
▼Process 2418553 | 110.94 | 0 | 0 | 0 | 11.8 | 0.04 | 0 | 0 | 0 | 27.52 | 0 | 0 | 60.64 | 0 |
○Thread 2418553 | 110.94 | 0 | 0 | 0 | 11.8 | 0.04 | 0 | 0 | 0 | 27.52 | 0 | 0 | 60.64 | 0 |
▼run_1 | 56 | 0 | 0 | 0.46 | 11.45 | 0.04 | 0 | 0 | 0 | 27.75 | 0 | 0 | 60.3 | 0 |
▼Node skylake | 56 | 0 | 0 | 0.46 | 11.45 | 0.04 | 0 | 0 | 0 | 27.75 | 0 | 0 | 60.3 | 0 |
▼Process 2418617 | 56 | 0 | 0 | 0.46 | 11.45 | 0.04 | 0 | 0 | 0 | 27.75 | 0 | 0 | 60.3 | 0 |
○Thread 2418617 | 56 | 0 | 0 | 0.2 | 11.35 | 0.05 | 0 | 0 | 0 | 28.33 | 0 | 0 | 60.07 | 0 |
○Thread 2418670 | 55.73 | 0 | 0 | 0.73 | 11.55 | 0.03 | 0 | 0 | 0 | 27.17 | 0 | 0 | 60.53 | 0 |
▼run_2 | 34.33 | 0 | 0 | 14.13 | 9.18 | 0.03 | 0 | 0 | 0 | 26.64 | 0 | 0 | 50.01 | 0 |
▼Node skylake | 34.33 | 0 | 0 | 14.13 | 9.18 | 0.03 | 0 | 0 | 0 | 26.64 | 0 | 0 | 50.01 | 0 |
▼Process 2418678 | 34.33 | 0 | 0 | 14.13 | 9.18 | 0.03 | 0 | 0 | 0 | 26.64 | 0 | 0 | 50.01 | 0 |
○Thread 2418678 | 34.33 | 0 | 0 | 17.55 | 8.78 | 0.04 | 0 | 0 | 0 | 24.02 | 0 | 0 | 49.59 | 0.01 |
○Thread 2418731 | 34.13 | 0 | 0 | 2.8 | 9.08 | 0.04 | 0 | 0 | 0 | 36.54 | 0 | 0 | 51.54 | 0 |
○Thread 2418732 | 34.07 | 0 | 0 | 18.85 | 9.58 | 0.03 | 0 | 0 | 0 | 21.99 | 0 | 0 | 49.55 | 0 |
○Thread 2418733 | 33.91 | 0 | 0 | 17.33 | 9.29 | 0.01 | 0 | 0 | 0 | 24.02 | 0 | 0 | 49.34 | 0 |
▼run_3 | 14.88 | 0 | 0 | 4.56 | 10.56 | 0.04 | 0 | 0 | 0 | 27.14 | 0 | 0 | 57.7 | 0 |
▼Node skylake | 14.88 | 0 | 0 | 4.56 | 10.56 | 0.04 | 0 | 0 | 0 | 27.14 | 0 | 0 | 57.7 | 0 |
▼Process 2418740 | 14.88 | 0 | 0 | 4.56 | 10.56 | 0.04 | 0 | 0 | 0 | 27.14 | 0 | 0 | 57.7 | 0 |
○Thread 2418740 | 14.88 | 0 | 0 | 2.62 | 11.42 | 0.07 | 0 | 0 | 0 | 28.23 | 0 | 0 | 57.66 | 0 |
○Thread 2418793 | 14.62 | 0 | 0 | 3.42 | 11.73 | 0.03 | 0 | 0 | 0 | 25.44 | 0 | 0 | 59.37 | 0 |
○Thread 2418794 | 14.62 | 0 | 0 | 4.48 | 10.88 | 0.07 | 0 | 0 | 0 | 27.26 | 0 | 0 | 57.32 | 0 |
○Thread 2418795 | 14.63 | 0 | 0 | 4.24 | 9.74 | 0.03 | 0 | 0 | 0 | 26.52 | 0 | 0 | 59.47 | 0 |
○Thread 2418796 | 14.72 | 0 | 0 | 5.64 | 10.47 | 0.03 | 0 | 0 | 0 | 27.35 | 0 | 0 | 56.51 | 0 |
○Thread 2418797 | 14.72 | 0 | 0 | 5.4 | 9.04 | 0 | 0 | 0 | 0 | 27.01 | 0 | 0 | 58.55 | 0 |
○Thread 2418798 | 14.72 | 0 | 0 | 5.64 | 10.16 | 0 | 0 | 0 | 0 | 27.79 | 0 | 0 | 56.4 | 0 |
○Thread 2418799 | 14.72 | 0 | 0 | 5.06 | 11.04 | 0.07 | 0 | 0 | 0 | 27.45 | 0 | 0 | 56.37 | 0 |
▼run_4 | 8.73 | 0 | 0 | 14.08 | 9.28 | 0.07 | 0 | 0 | 0 | 25.42 | 0 | 0 | 51.15 | 0 |
▼Node skylake | 8.73 | 0 | 0 | 14.08 | 9.28 | 0.07 | 0 | 0 | 0 | 25.42 | 0 | 0 | 51.15 | 0 |
▼Process 2418809 | 8.73 | 0 | 0 | 14.08 | 9.28 | 0.07 | 0 | 0 | 0 | 25.42 | 0 | 0 | 51.15 | 0 |
○Thread 2418809 | 8.73 | 0 | 0 | 10.99 | 8.7 | 0.23 | 0 | 0 | 0 | 30.4 | 0 | 0 | 49.69 | 0 |
○Thread 2418862 | 8.44 | 0 | 0 | 13.97 | 9.77 | 0.06 | 0 | 0 | 0 | 25.64 | 0 | 0 | 50.56 | 0 |
○Thread 2418863 | 8.44 | 0 | 0 | 14.33 | 9.77 | 0.18 | 0 | 0 | 0 | 25.1 | 0 | 0 | 50.62 | 0 |
○Thread 2418864 | 8.44 | 0 | 0 | 15.45 | 8.82 | 0.06 | 0 | 0 | 0 | 24.33 | 0 | 0 | 51.33 | 0 |
○Thread 2418865 | 8.44 | 0 | 0 | 16.23 | 9.66 | 0 | 0 | 0 | 0 | 24.76 | 0 | 0 | 49.35 | 0 |
○Thread 2418866 | 8.44 | 0 | 0 | 15.94 | 9.42 | 0 | 0 | 0 | 0 | 23.22 | 0 | 0 | 51.42 | 0 |
○Thread 2418867 | 8.46 | 0 | 0 | 10.17 | 9.4 | 0.06 | 0 | 0 | 0 | 27.36 | 0 | 0 | 53.01 | 0 |
○Thread 2418868 | 8.46 | 0 | 0 | 9.27 | 8.86 | 0.06 | 0 | 0 | 0 | 28.88 | 0 | 0 | 52.92 | 0 |
○Thread 2418869 | 8.49 | 0 | 0 | 15.67 | 8.42 | 0.12 | 0 | 0 | 0 | 23.67 | 0 | 0 | 52.12 | 0 |
○Thread 2418870 | 8.46 | 0 | 0 | 9.63 | 9.87 | 0.06 | 0 | 0 | 0 | 29.85 | 0 | 0 | 50.59 | 0 |
○Thread 2418871 | 8.49 | 0 | 0 | 15.67 | 9.13 | 0.06 | 0 | 0 | 0 | 24.5 | 0 | 0 | 50.65 | 0 |
○Thread 2418872 | 8.49 | 0 | 0 | 15.49 | 9.54 | 0.06 | 0 | 0 | 0 | 24.85 | 0 | 0 | 50.06 | 0 |
○Thread 2418873 | 8.49 | 0 | 0 | 16.02 | 8.66 | 0.06 | 0 | 0 | 0 | 25.32 | 0 | 0 | 49.94 | 0 |
○Thread 2418874 | 8.49 | 0 | 0 | 15.49 | 9.07 | 0.06 | 0 | 0 | 0 | 22.97 | 0 | 0 | 52.41 | 0 |
○Thread 2418875 | 8.49 | 0 | 0 | 15.49 | 9.6 | 0 | 0 | 0 | 0 | 22.38 | 0 | 0 | 52.53 | 0 |
○Thread 2418876 | 8.49 | 0 | 0 | 15.55 | 9.84 | 0.12 | 0 | 0 | 0 | 23.26 | 0 | 0 | 51.24 | 0 |
▼run_5 | 6.22 | 0 | 0 | 21.43 | 8.24 | 0.08 | 0 | 0 | 0 | 24.06 | 0 | 0 | 46.19 | 0 |
▼Node skylake | 6.22 | 0 | 0 | 21.43 | 8.24 | 0.08 | 0 | 0 | 0 | 24.06 | 0 | 0 | 46.19 | 0 |
▼Process 2418882 | 6.22 | 0 | 0 | 21.43 | 8.24 | 0.08 | 0 | 0 | 0 | 24.06 | 0 | 0 | 46.19 | 0 |
○Thread 2418882 | 6.22 | 0 | 0 | 16.09 | 6.84 | 0.56 | 0 | 0 | 0 | 32.42 | 0 | 0 | 44.09 | 0 |
○Thread 2418935 | 5.96 | 0 | 0 | 22.25 | 7.72 | 0.08 | 0 | 0 | 0 | 24.35 | 0 | 0 | 45.59 | 0 |
○Thread 2418936 | 5.96 | 0 | 0 | 22.99 | 9.31 | 0.08 | 0 | 0 | 0 | 25.42 | 0 | 0 | 42.2 | 0 |
○Thread 2418937 | 5.96 | 0 | 0 | 23.41 | 8.64 | 0 | 0 | 0 | 0 | 22.15 | 0 | 0 | 45.81 | 0 |
○Thread 2418938 | 5.96 | 0 | 0 | 24.35 | 7.98 | 0 | 0 | 0 | 0 | 20.82 | 0 | 0 | 46.85 | 0 |
○Thread 2418939 | 5.98 | 0 | 0 | 14.97 | 9.03 | 0 | 0 | 0 | 0 | 27.51 | 0 | 0 | 48.49 | 0 |
○Thread 2418940 | 5.98 | 0 | 0 | 15.47 | 9.36 | 0.08 | 0 | 0 | 0 | 29.18 | 0 | 0 | 45.9 | 0 |
○Thread 2418941 | 5.97 | 0 | 0 | 17.59 | 7.12 | 0.17 | 0 | 0 | 0 | 28.31 | 0 | 0 | 46.82 | 0 |
○Thread 2418942 | 5.97 | 0 | 0 | 18.69 | 7.8 | 0 | 0 | 0 | 0 | 25.9 | 0 | 0 | 47.61 | 0 |
○Thread 2418943 | 5.97 | 0 | 0 | 19.7 | 7.96 | 0 | 0 | 0 | 0 | 26.15 | 0 | 0 | 46.19 | 0 |
○Thread 2418944 | 5.99 | 0 | 0 | 18.88 | 8.1 | 0 | 0 | 0 | 0 | 26.4 | 0 | 0 | 46.62 | 0 |
○Thread 2418945 | 6 | 0 | 0 | 18.75 | 8.75 | 0.08 | 0 | 0 | 0 | 24.25 | 0 | 0 | 48.17 | 0 |
○Thread 2418946 | 6 | 0 | 0 | 19.6 | 8.67 | 0.08 | 0 | 0 | 0 | 23.27 | 0 | 0 | 48.37 | 0 |
○Thread 2418947 | 5.99 | 0 | 0 | 21.7 | 8.43 | 0 | 0 | 0 | 0 | 24.62 | 0 | 0 | 45.24 | 0 |
○Thread 2418948 | 5.99 | 0 | 0 | 23.39 | 7.77 | 0 | 0 | 0 | 0 | 22.56 | 0 | 0 | 46.28 | 0 |
○Thread 2418949 | 5.99 | 0 | 0 | 23.56 | 8.6 | 0 | 0 | 0 | 0 | 21.3 | 0 | 0 | 46.53 | 0 |
○Thread 2418950 | 5.99 | 0 | 0 | 23.62 | 8.85 | 0.25 | 0 | 0 | 0 | 22.45 | 0 | 0 | 44.82 | 0 |
○Thread 2418951 | 5.99 | 0 | 0 | 23.62 | 9.35 | 0.33 | 0 | 0 | 0 | 19.53 | 0 | 0 | 47.16 | 0 |
○Thread 2418952 | 5.99 | 0 | 0 | 23.64 | 8.19 | 0 | 0 | 0 | 0 | 22.89 | 0 | 0 | 45.28 | 0 |
○Thread 2418953 | 5.99 | 0 | 0 | 23.54 | 9.02 | 0.08 | 0 | 0 | 0 | 20.03 | 0 | 0 | 47.33 | 0 |
○Thread 2418954 | 5.99 | 0 | 0 | 23.39 | 7.6 | 0 | 0 | 0 | 0 | 22.56 | 0 | 0 | 46.45 | 0 |
○Thread 2418955 | 5.99 | 0 | 0 | 23.56 | 8.27 | 0 | 0 | 0 | 0 | 23.56 | 0 | 0 | 44.61 | 0 |
○Thread 2418956 | 5.99 | 0 | 0 | 23.64 | 7.18 | 0.17 | 0 | 0 | 0 | 20.8 | 0 | 0 | 48.2 | 0 |
○Thread 2418957 | 5.99 | 0 | 0 | 23.81 | 7.44 | 0.08 | 0 | 0 | 0 | 22.56 | 0 | 0 | 46.12 | 0 |
○Thread 2418958 | 5.99 | 0 | 0 | 23.71 | 8.18 | 0.08 | 0 | 0 | 0 | 24.04 | 0 | 0 | 43.99 | 0 |
○Thread 2418959 | 5.99 | 0 | 0 | 23.37 | 8.18 | 0 | 0 | 0 | 0 | 22.29 | 0 | 0 | 46.16 | 0 |
▼run_6 | 7.32 | 0 | 0 | 8.99 | 4.6 | 0.09 | 0 | 0 | 0 | 40.22 | 0 | 0 | 46.1 | 0 |
▼Node skylake | 7.32 | 0 | 0 | 8.99 | 4.6 | 0.09 | 0 | 0 | 0 | 40.22 | 0 | 0 | 46.1 | 0 |
▼Process 2418964 | 7.32 | 0 | 0 | 8.99 | 4.6 | 0.09 | 0 | 0 | 0 | 40.22 | 0 | 0 | 46.1 | 0 |
○Thread 2418964 | 7.32 | 0 | 0 | 3.42 | 4.1 | 0.27 | 0.07 | 0 | 0 | 38 | 0 | 0 | 54.14 | 0 |
○Thread 2419017 | 6.77 | 0 | 0 | 8.42 | 4.36 | 0.22 | 0 | 0 | 0 | 50.96 | 0 | 0 | 36.04 | 0 |
○Thread 2419018 | 7.07 | 0 | 0 | 9.62 | 3.89 | 0.07 | 0 | 0 | 0 | 51.49 | 0 | 0 | 34.94 | 0 |
○Thread 2419019 | 6.83 | 0 | 0 | 8.35 | 4.9 | 0 | 0 | 0 | 0 | 27.67 | 0 | 0 | 59.08 | 0 |
○Thread 2419020 | 7.07 | 0 | 0 | 10.69 | 3.68 | 0.14 | 0 | 0 | 0 | 52.51 | 0 | 0 | 32.98 | 0 |
○Thread 2419021 | 7.08 | 0 | 0 | 9.4 | 5.23 | 0.14 | 0 | 0 | 0 | 26.71 | 0 | 0 | 58.52 | 0 |
○Thread 2419022 | 6.94 | 0 | 0 | 9.51 | 4.25 | 0.14 | 0 | 0 | 0 | 50.22 | 0 | 0 | 35.88 | 0 |
○Thread 2419023 | 7.07 | 0 | 0 | 9.2 | 5.52 | 0 | 0 | 0 | 0 | 28.52 | 0 | 0 | 56.76 | 0 |
○Thread 2419024 | 7.08 | 0 | 0 | 9.46 | 5.44 | 0.14 | 0 | 0 | 0 | 26.98 | 0 | 0 | 57.98 | 0 |
○Thread 2419025 | 7.08 | 0 | 0 | 9.32 | 4.52 | 0.07 | 0 | 0 | 0 | 28.6 | 0 | 0 | 57.49 | 0 |
○Thread 2419026 | 7.07 | 0 | 0 | 9.34 | 5.38 | 0 | 0 | 0 | 0 | 26.75 | 0 | 0 | 58.53 | 0 |
○Thread 2419027 | 7.1 | 0 | 0 | 10.42 | 4.58 | 0.07 | 0 | 0 | 0 | 51.41 | 0 | 0 | 33.52 | 0 |
○Thread 2419028 | 7.07 | 0 | 0 | 9.06 | 5.31 | 0.07 | 0 | 0 | 0 | 27.32 | 0 | 0 | 58.24 | 0 |
○Thread 2419029 | 7.11 | 0 | 0 | 10.77 | 4.08 | 0.07 | 0 | 0 | 0 | 52.15 | 0 | 0 | 32.93 | 0 |
○Thread 2419030 | 7.11 | 0 | 0 | 10.48 | 3.38 | 0.14 | 0 | 0 | 0 | 52.18 | 0 | 0 | 33.83 | 0 |
○Thread 2419031 | 6.96 | 0 | 0 | 7.55 | 5.68 | 0.22 | 0 | 0 | 0 | 30.48 | 0 | 0 | 56.07 | 0 |
○Thread 2419032 | 6.97 | 0 | 0 | 8.04 | 5.03 | 0.14 | 0 | 0 | 0 | 29.15 | 0 | 0 | 57.65 | 0 |
○Thread 2419033 | 6.9 | 0 | 0 | 7.47 | 5.58 | 0.15 | 0 | 0 | 0 | 26.47 | 0 | 0 | 60.33 | 0 |
○Thread 2419034 | 6.93 | 0 | 0 | 7.58 | 5.78 | 0.07 | 0 | 0 | 0 | 26.43 | 0 | 0 | 60.14 | 0 |
○Thread 2419035 | 6.94 | 0 | 0 | 7.71 | 5.33 | 0.07 | 0 | 0 | 0 | 27.74 | 0 | 0 | 59.15 | 0 |
○Thread 2419036 | 6.97 | 0 | 0 | 8.4 | 4.88 | 0 | 0 | 0 | 0 | 29.43 | 0 | 0 | 57.29 | 0 |
○Thread 2419037 | 6.98 | 0 | 0 | 9.17 | 4.58 | 0 | 0 | 0 | 0 | 53.65 | 0 | 0 | 32.59 | 0 |
○Thread 2419038 | 6.98 | 0 | 0 | 8.95 | 4.08 | 0.07 | 0 | 0 | 0 | 52.01 | 0 | 0 | 34.89 | 0 |
○Thread 2419039 | 6.99 | 0 | 0 | 8.37 | 3.43 | 0.21 | 0 | 0 | 0 | 52.72 | 0 | 0 | 35.26 | 0 |
○Thread 2419040 | 6.96 | 0 | 0 | 7.9 | 5.39 | 0.14 | 0 | 0 | 0 | 27.87 | 0 | 0 | 58.69 | 0 |
○Thread 2419041 | 6.98 | 0 | 0 | 8.03 | 4.16 | 0 | 0 | 0 | 0 | 51.76 | 0 | 0 | 36.06 | 0 |
○Thread 2419042 | 7.06 | 0 | 0 | 8.72 | 5.1 | 0.07 | 0 | 0 | 0 | 27.99 | 0 | 0 | 58.11 | 0 |
○Thread 2419043 | 7.08 | 0 | 0 | 9.32 | 5.16 | 0.14 | 0 | 0 | 0 | 28.25 | 0 | 0 | 57.13 | 0 |
○Thread 2419044 | 7.12 | 0 | 0 | 10.81 | 3.93 | 0 | 0 | 0 | 0 | 52.46 | 0 | 0 | 32.79 | 0 |
○Thread 2419045 | 7.1 | 0 | 0 | 10.92 | 3.59 | 0.14 | 0 | 0 | 0 | 50.07 | 0 | 0 | 35.28 | 0 |
○Thread 2419046 | 7.07 | 0 | 0 | 9.26 | 4.88 | 0.07 | 0 | 0 | 0 | 27.72 | 0 | 0 | 58.06 | 0 |
○Thread 2419047 | 7.1 | 0 | 0 | 10.56 | 4.23 | 0.21 | 0 | 0 | 0 | 50.56 | 0 | 0 | 34.44 | 0 |
○Thread 2419048 | 6.99 | 0 | 0 | 8.51 | 5.58 | 0.14 | 0 | 0 | 0 | 28.61 | 0 | 0 | 57.15 | 0 |
○Thread 2419049 | 7.04 | 0 | 0 | 8.81 | 2.91 | 0.07 | 0 | 0 | 0 | 52.06 | 0 | 0 | 36.15 | 0 |
○Thread 2419050 | 6.95 | 0 | 0 | 7.63 | 5.9 | 0 | 0 | 0 | 0 | 28.8 | 0 | 0 | 57.67 | 0 |
○Thread 2419051 | 7.12 | 0 | 0 | 9.06 | 4.63 | 0.07 | 0 | 0 | 0 | 52.11 | 0 | 0 | 34.13 | 0 |
○Thread 2419052 | 7.12 | 0 | 0 | 9.63 | 3.87 | 0.07 | 0 | 0 | 0 | 52.21 | 0 | 0 | 34.22 | 0 |
○Thread 2419053 | 6.95 | 0 | 0 | 7.55 | 6.19 | 0.14 | 0 | 0 | 0 | 29.78 | 0 | 0 | 56.33 | 0 |
○Thread 2419054 | 6.95 | 0 | 0 | 7.34 | 4.75 | 0 | 0 | 0 | 0 | 27.43 | 0 | 0 | 60.48 | 0 |
○Thread 2419055 | 6.98 | 0 | 0 | 8.31 | 4.01 | 0 | 0 | 0 | 0 | 54.23 | 0 | 0 | 33.45 | 0 |
○Thread 2419056 | 7.11 | 0 | 0 | 10.83 | 4.08 | 0 | 0.07 | 0 | 0 | 51.83 | 0 | 0 | 33.19 | 0 |
○Thread 2419057 | 7.12 | 0 | 0 | 10.88 | 3.65 | 0.07 | 0 | 0 | 0 | 52.46 | 0 | 0 | 32.94 | 0 |
○Thread 2419058 | 7.1 | 0 | 0 | 11.35 | 3.95 | 0.14 | 0 | 0 | 0 | 48.2 | 0 | 0 | 36.36 | 0 |
○Thread 2419059 | 6.98 | 0 | 0 | 8.96 | 4.37 | 0.22 | 0 | 0 | 0 | 51.68 | 0 | 0 | 34.77 | 0 |
○Thread 2419060 | 7.1 | 0 | 0 | 10.14 | 3.17 | 0 | 0 | 0 | 0 | 52.25 | 0 | 0 | 34.44 | 0 |
○Thread 2419061 | 7.06 | 0 | 0 | 8.78 | 3.12 | 0.07 | 0 | 0 | 0 | 54.39 | 0 | 0 | 33.64 | 0 |
○Thread 2419062 | 7.06 | 0 | 0 | 8.92 | 4.96 | 0 | 0 | 0 | 0 | 28.61 | 0 | 0 | 57.51 | 0 |
○Thread 2419063 | 6.98 | 0 | 0 | 8.31 | 4.44 | 0.14 | 0 | 0 | 0 | 52.51 | 0 | 0 | 34.6 | 0 |
○Thread 2419064 | 7.09 | 0 | 0 | 9.6 | 5.22 | 0 | 0 | 0 | 0 | 27.59 | 0 | 0 | 57.59 | 0 |
○Thread 2419065 | 6.97 | 0 | 0 | 7.9 | 4.81 | 0.22 | 0 | 0 | 0 | 28.21 | 0 | 0 | 58.87 | 0 |
○Thread 2419066 | 6.94 | 0 | 0 | 7.71 | 5.91 | 0 | 0 | 0 | 0 | 28.46 | 0 | 0 | 57.92 | 0 |
○Thread 2419067 | 7.11 | 0 | 0 | 10.62 | 4.36 | 0.14 | 0 | 0 | 0 | 51.97 | 0 | 0 | 32.91 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
run_0 | 1 | 0 | 11.8 | 0.04 | 27.52 | 60.64 |
run_1 | 2 | 0.46 | 11.45 | 0.04 | 27.75 | 60.3 |
run_2 | 4 | 14.13 | 9.18 | 0.03 | 26.64 | 50.01 |
run_3 | 8 | 4.56 | 10.56 | 0.04 | 27.14 | 57.7 |
run_4 | 16 | 14.08 | 9.28 | 0.07 | 25.42 | 51.15 |
run_5 | 26 | 21.43 | 8.24 | 0.08 | 24.06 | 46.19 |
run_6 | 52 | 8.99 | 4.6 | 0.09 | 40.22 | 46.1 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|---|
run_0 | 1 | 110.94 | 0 | 13.09 | 0.04 | 30.53 | 67.27 |
run_1 | 2 | 56 | 0.26 | 6.41 | 0.02 | 15.54 | 33.77 |
run_2 | 4 | 34.33 | 4.85 | 3.15 | 0.01 | 9.15 | 17.17 |
run_3 | 8 | 14.88 | 0.68 | 1.57 | 0.01 | 4.04 | 8.59 |
run_4 | 16 | 8.73 | 1.23 | 0.81 | 0.01 | 2.22 | 4.47 |
run_5 | 26 | 6.22 | 1.33 | 0.51 | 0 | 1.5 | 2.87 |
run_6 | 52 | 7.32 | 0.66 | 0.34 | 0.01 | 2.94 | 3.37 |
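The per-category times in this table appear to be the coverage percentages from "Detailed Coverage per Category" multiplied by the total run time. The short Python sketch below checks that relationship on three of the runs. The helper name `category_time` is illustrative only and is not part of the profiler.

```python
# Values copied from the coverage and time tables above.
# Each entry: total time (s), then coverage (%) per category.
runs = {
    "run_0": (110.94, {"OMP": 0.0, "Math": 11.8, "Memory": 27.52, "libqmckl.so.0.0.0": 60.64}),
    "run_2": (34.33, {"OMP": 14.13, "Math": 9.18, "Memory": 26.64, "libqmckl.so.0.0.0": 50.01}),
    "run_6": (7.32, {"OMP": 8.99, "Math": 4.6, "Memory": 40.22, "libqmckl.so.0.0.0": 46.1}),
}


def category_time(total_s, coverage_pct):
    """Hypothetical helper: seconds spent in one category."""
    return total_s * coverage_pct / 100.0


for name, (total_s, coverage) in runs.items():
    # Round to two decimals to match the precision used in the report.
    times = {cat: round(category_time(total_s, pct), 2) for cat, pct in coverage.items()}
    print(name, times)
```

For run_0, for instance, 11.8 % of 110.94 s gives the 13.09 s listed in the Math column.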
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_0 | 1 | 1 |
run_1 | 2 | 0.98 |
run_2 | 4 | 0.78 |
run_3 | 8 | 0.83 |
run_4 | 16 | 0.7 |
run_5 | 26 | 0.58 |
run_6 | 52 | 0.25 |
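For comparison, the textbook definitions of speedup, S(n) = T(1) / T(n), and parallel efficiency, E(n) = S(n) / n, can be applied to the total times listed under "Detailed Time per Category". This is a sketch under that assumption only: the report does not state how its Efficiency column is derived, and treating the Total Time column as elapsed time is itself an assumption. The values computed this way come out somewhat higher than the reported ones (about 0.29 versus 0.25 at 52 threads), so the tool presumably uses a different time basis.

```python
# Total time (s) per thread count, copied from the time-per-category table.
total_time = {1: 110.94, 2: 56.0, 4: 34.33, 8: 14.88, 16: 8.73, 26: 6.22, 52: 7.32}


def speedup(times, n):
    """Classical speedup relative to the single-thread run."""
    return times[1] / times[n]


def efficiency(times, n):
    """Classical parallel efficiency: speedup divided by thread count."""
    return speedup(times, n) / n


for n in sorted(total_time):
    print(f"{n:>2} threads: speedup {speedup(total_time, n):6.2f}, "
          f"efficiency {efficiency(total_time, n):.2f}")
```

Under these definitions the speedup at 52 threads is lower than at 26 threads, because run_6 (7.32 s) takes longer than run_5 (6.22 s).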
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0.04 | 0 | 0 | 0.03 | 0 | 99.44 | 0.49 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.07 | 26.19 | 59.6 | 14.14 |
run_3 | 8 | 0 | 0 | 0 | 0.06 | 0 | 0 | 0 | 0 | 0 | 95.34 | 4.6 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24.99 | 60.84 | 14.17 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23.7 | 28.78 | 25.99 | 21.53 |
run_6 | 52 | 0.09 | 40.03 | 38.01 | 0 | 0 | 0 | 0 | 4.13 | 8.08 | 0.61 | 9.05 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.5 | 99.47 | 0.03 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14.12 | 85.86 | 0.02 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.59 | 95.4 | 0.01 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14.16 | 85.83 | 0.01 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21.52 | 78.47 | 0.01 |
run_6 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9.02 | 90.95 | 0.03 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.98 | 3.02 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 94.17 | 5.83 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 65.31 | 0 | 25.12 | 9.58 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 58.61 | 23.56 | 17.83 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 57.75 | 0 | 0 | 19.56 | 22.69 |
run_6 | 52 | 0 | 0 | 0 | 59.22 | 0 | 0 | 0 | 14.59 | 0 | 0 | 26.19 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.98 | 3.02 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 94.17 | 5.83 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 90.42 | 9.58 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 82.17 | 17.83 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 77.31 | 22.69 |
run_6 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 73.81 | 26.19 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_0 | run_1 | run_2 | run_3 | run_4 | run_5 | run_6 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/trexio/_install/lib/libtrexio.so.0.0.0 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_core.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/other/hdf5/gcc/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 | |||||||