Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
PrOMPT - Coverage per Parallel Efficiency |
PrOMPT - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼m1o1 | 107.64 | 0 | 0 | 0 | 11.66 | 0.01 | 0 | 0 | 0 | 27.44 | 0 | 0 | 60.88 | 0 |
▼Node skylake | 107.64 | 0 | 0 | 0 | 11.66 | 0.01 | 0 | 0 | 0 | 27.44 | 0 | 0 | 60.88 | 0 |
▼Process 2521341 | 107.64 | 0 | 0 | 0 | 11.66 | 0.01 | 0 | 0 | 0 | 27.44 | 0 | 0 | 60.88 | 0 |
○Thread 2521341 | 107.64 | 0 | 0 | 0 | 11.66 | 0.01 | 0 | 0 | 0 | 27.44 | 0 | 0 | 60.88 | 0 |
▼m1o2 | 54.01 | 0 | 0 | 0.07 | 12.07 | 0.03 | 0 | 0 | 0 | 27.35 | 0 | 0 | 60.48 | 0 |
▼Node skylake | 54.01 | 0 | 0 | 0.07 | 12.07 | 0.03 | 0 | 0 | 0 | 27.35 | 0 | 0 | 60.48 | 0 |
▼Process 2521407 | 54.01 | 0 | 0 | 0.07 | 12.07 | 0.03 | 0 | 0 | 0 | 27.35 | 0 | 0 | 60.48 | 0 |
○Thread 2521407 | 54 | 0 | 0 | 0.13 | 12.45 | 0.03 | 0 | 0 | 0 | 27.14 | 0 | 0 | 60.25 | 0 |
○Thread 2521461 | 54.01 | 0 | 0 | 0.01 | 11.69 | 0.03 | 0 | 0 | 0 | 27.56 | 0 | 0 | 60.71 | 0 |
▼m1o4 | 27.44 | 0 | 0 | 1.17 | 11.36 | 0.02 | 0 | 0 | 0 | 27.64 | 0 | 0 | 59.81 | 0 |
▼Node skylake | 27.44 | 0 | 0 | 1.17 | 11.36 | 0.02 | 0 | 0 | 0 | 27.64 | 0 | 0 | 59.81 | 0 |
▼Process 2521471 | 27.44 | 0 | 0 | 1.17 | 11.36 | 0.02 | 0 | 0 | 0 | 27.64 | 0 | 0 | 59.81 | 0 |
○Thread 2521471 | 27.44 | 0 | 0 | 1.48 | 11.57 | 0 | 0 | 0 | 0 | 27.12 | 0 | 0 | 59.83 | 0 |
○Thread 2521525 | 27.27 | 0 | 0 | 0.24 | 11.2 | 0.04 | 0 | 0 | 0 | 29.23 | 0 | 0 | 59.29 | 0 |
○Thread 2521526 | 27.44 | 0 | 0 | 1.44 | 11.08 | 0.04 | 0 | 0 | 0 | 27.51 | 0 | 0 | 59.93 | 0 |
○Thread 2521527 | 27.44 | 0 | 0 | 1.51 | 11.59 | 0 | 0 | 0 | 0 | 26.71 | 0 | 0 | 60.19 | 0 |
▼m1o8 | 14.32 | 0 | 0 | 3.44 | 11.15 | 0.02 | 0 | 0 | 0 | 27.29 | 0 | 0 | 58.1 | 0 |
▼Node skylake | 14.32 | 0 | 0 | 3.44 | 11.15 | 0.02 | 0 | 0 | 0 | 27.29 | 0 | 0 | 58.1 | 0 |
▼Process 2521533 | 14.32 | 0 | 0 | 3.44 | 11.15 | 0.02 | 0 | 0 | 0 | 27.29 | 0 | 0 | 58.1 | 0 |
○Thread 2521533 | 14.31 | 0 | 0 | 4.72 | 11.18 | 0.03 | 0 | 0 | 0 | 26.59 | 0 | 0 | 57.48 | 0 |
○Thread 2521587 | 14.24 | 0 | 0 | 3.09 | 10.88 | 0 | 0 | 0 | 0 | 27.95 | 0 | 0 | 58.08 | 0 |
○Thread 2521588 | 14.24 | 0 | 0 | 2.88 | 10.18 | 0 | 0 | 0 | 0 | 27.67 | 0 | 0 | 59.27 | 0 |
○Thread 2521589 | 14.23 | 0 | 0 | 3.76 | 10.92 | 0.04 | 0 | 0 | 0 | 26.17 | 0 | 0 | 59.11 | 0 |
○Thread 2521590 | 14.24 | 0 | 0 | 0.81 | 11.44 | 0 | 0 | 0 | 0 | 29.24 | 0 | 0 | 58.51 | 0 |
○Thread 2521591 | 14.32 | 0 | 0 | 4.26 | 11.59 | 0 | 0 | 0 | 0 | 25.69 | 0 | 0 | 58.46 | 0 |
○Thread 2521592 | 14.32 | 0 | 0 | 3.7 | 12.18 | 0.07 | 0 | 0 | 0 | 27.71 | 0 | 0 | 56.34 | 0 |
○Thread 2521593 | 14.32 | 0 | 0 | 4.26 | 10.82 | 0.03 | 0 | 0 | 0 | 27.33 | 0 | 0 | 57.56 | 0 |
▼m1o16 | 8.01 | 0 | 0 | 11.51 | 9.78 | 0.07 | 0 | 0 | 0 | 25.51 | 0 | 0 | 53.13 | 0 |
▼Node skylake | 8.01 | 0 | 0 | 11.51 | 9.78 | 0.07 | 0 | 0 | 0 | 25.51 | 0 | 0 | 53.13 | 0 |
▼Process 2521602 | 8.01 | 0 | 0 | 11.51 | 9.78 | 0.07 | 0 | 0 | 0 | 25.51 | 0 | 0 | 53.13 | 0 |
○Thread 2521602 | 8 | 0 | 0 | 13.56 | 10.63 | 0 | 0 | 0 | 0 | 23.25 | 0 | 0 | 52.56 | 0 |
○Thread 2521656 | 7.96 | 0 | 0 | 9.92 | 9.36 | 0.13 | 0 | 0 | 0 | 27.26 | 0 | 0 | 53.33 | 0 |
○Thread 2521657 | 7.96 | 0 | 0 | 10.92 | 9.17 | 0 | 0 | 0 | 0 | 27.5 | 0 | 0 | 52.42 | 0 |
○Thread 2521658 | 7.96 | 0 | 0 | 11.81 | 10.3 | 0 | 0 | 0 | 0 | 22.42 | 0 | 0 | 55.46 | 0 |
○Thread 2521659 | 7.95 | 0 | 0 | 13.14 | 10.31 | 0 | 0 | 0 | 0 | 24.14 | 0 | 0 | 52.42 | 0 |
○Thread 2521660 | 7.95 | 0 | 0 | 13.46 | 8.93 | 0.19 | 0 | 0 | 0 | 22.64 | 0 | 0 | 54.78 | 0 |
○Thread 2521661 | 8.01 | 0 | 0 | 9.23 | 10.29 | 0.19 | 0 | 0 | 0 | 28.63 | 0 | 0 | 51.65 | 0 |
○Thread 2521662 | 7.97 | 0 | 0 | 7.34 | 9.66 | 0 | 0 | 0 | 0 | 29.97 | 0 | 0 | 53.04 | 0 |
○Thread 2521663 | 8.01 | 0 | 0 | 9.99 | 9.61 | 0.12 | 0 | 0 | 0 | 28.4 | 0 | 0 | 51.87 | 0 |
○Thread 2521664 | 8 | 0 | 0 | 13.38 | 9.5 | 0 | 0 | 0 | 0 | 24 | 0 | 0 | 53.12 | 0 |
○Thread 2521665 | 7.96 | 0 | 0 | 9.29 | 10.3 | 0.13 | 0 | 0 | 0 | 28.06 | 0 | 0 | 52.23 | 0 |
○Thread 2521666 | 8 | 0 | 0 | 13.19 | 10.19 | 0.13 | 0 | 0 | 0 | 24.31 | 0 | 0 | 52.19 | 0 |
○Thread 2521667 | 8 | 0 | 0 | 13.63 | 9.69 | 0.06 | 0 | 0 | 0 | 22.5 | 0 | 0 | 54.12 | 0 |
○Thread 2521668 | 7.97 | 0 | 0 | 8.78 | 10.98 | 0.13 | 0 | 0 | 0 | 27.54 | 0 | 0 | 52.57 | 0 |
○Thread 2521669 | 8 | 0 | 0 | 13.38 | 8.94 | 0.06 | 0 | 0 | 0 | 25.13 | 0 | 0 | 52.5 | 0 |
○Thread 2521670 | 8 | 0 | 0 | 13.13 | 8.63 | 0 | 0 | 0 | 0 | 22.38 | 0 | 0 | 55.87 | 0 |
▼m1o26 | 5.6 | 0 | 0 | 15.54 | 8.67 | 0.07 | 0 | 0 | 0 | 26.82 | 0 | 0 | 48.9 | 0 |
▼Node skylake | 5.6 | 0 | 0 | 15.54 | 8.67 | 0.07 | 0 | 0 | 0 | 26.82 | 0 | 0 | 48.9 | 0 |
▼Process 2521675 | 5.6 | 0 | 0 | 15.54 | 8.67 | 0.07 | 0 | 0 | 0 | 26.82 | 0 | 0 | 48.9 | 0 |
○Thread 2521675 | 5.55 | 0 | 0 | 8.82 | 8.91 | 0.09 | 0 | 0 | 0 | 35.91 | 0 | 0 | 46.26 | 0 |
○Thread 2521729 | 5.56 | 0 | 0 | 18.79 | 10.34 | 0.09 | 0 | 0 | 0 | 24.73 | 0 | 0 | 46.04 | 0 |
○Thread 2521730 | 5.55 | 0 | 0 | 20.09 | 8.65 | 0.09 | 0 | 0 | 0 | 24.14 | 0 | 0 | 47.03 | 0 |
○Thread 2521731 | 5.56 | 0 | 0 | 17.72 | 8 | 0.09 | 0 | 0 | 0 | 24.19 | 0 | 0 | 50 | 0 |
○Thread 2521732 | 5.58 | 0 | 0 | 6.36 | 7.71 | 0 | 0 | 0 | 0 | 40.59 | 0 | 0 | 45.34 | 0 |
○Thread 2521733 | 5.56 | 0 | 0 | 11.41 | 8.72 | 0 | 0 | 0 | 0 | 31.18 | 0 | 0 | 48.7 | 0 |
○Thread 2521734 | 5.59 | 0 | 0 | 7.6 | 11.45 | 0.27 | 0 | 0 | 0 | 30.86 | 0 | 0 | 49.82 | 0 |
○Thread 2521735 | 5.58 | 0 | 0 | 6.71 | 9.58 | 0 | 0 | 0 | 0 | 36.71 | 0 | 0 | 47 | 0 |
○Thread 2521736 | 5.58 | 0 | 0 | 7.62 | 7.26 | 0 | 0 | 0 | 0 | 33.42 | 0 | 0 | 51.7 | 0 |
○Thread 2521737 | 5.58 | 0 | 0 | 13.08 | 9.05 | 0.09 | 0 | 0 | 0 | 28.49 | 0 | 0 | 49.28 | 0 |
○Thread 2521738 | 5.57 | 0 | 0 | 14 | 8.53 | 0.09 | 0 | 0 | 0 | 26.93 | 0 | 0 | 50.45 | 0 |
○Thread 2521739 | 5.56 | 0 | 0 | 16.35 | 8.09 | 0.18 | 0 | 0 | 0 | 25.88 | 0 | 0 | 49.51 | 0 |
○Thread 2521740 | 5.59 | 0 | 0 | 18.34 | 9.21 | 0.09 | 0 | 0 | 0 | 24.42 | 0 | 0 | 47.94 | 0 |
○Thread 2521741 | 5.59 | 0 | 0 | 18.87 | 8.86 | 0.09 | 0 | 0 | 0 | 22.45 | 0 | 0 | 49.73 | 0 |
○Thread 2521742 | 5.59 | 0 | 0 | 18.69 | 7.96 | 0 | 0 | 0 | 0 | 21.2 | 0 | 0 | 52.15 | 0 |
○Thread 2521743 | 5.59 | 0 | 0 | 19.05 | 7.42 | 0.27 | 0 | 0 | 0 | 23.7 | 0 | 0 | 49.55 | 0 |
○Thread 2521744 | 5.59 | 0 | 0 | 18.87 | 7.78 | 0 | 0 | 0 | 0 | 24.6 | 0 | 0 | 48.75 | 0 |
○Thread 2521745 | 5.6 | 0 | 0 | 9.47 | 9.83 | 0 | 0 | 0 | 0 | 31.72 | 0 | 0 | 48.97 | 0 |
○Thread 2521746 | 5.59 | 0 | 0 | 19.14 | 9.12 | 0.09 | 0 | 0 | 0 | 23.61 | 0 | 0 | 48.03 | 0 |
○Thread 2521747 | 5.59 | 0 | 0 | 19.68 | 8.5 | 0 | 0 | 0 | 0 | 21.56 | 0 | 0 | 50.27 | 0 |
○Thread 2521748 | 5.59 | 0 | 0 | 18.16 | 8.86 | 0.09 | 0 | 0 | 0 | 21.91 | 0 | 0 | 50.98 | 0 |
○Thread 2521749 | 5.59 | 0 | 0 | 18.52 | 9.12 | 0.09 | 0 | 0 | 0 | 23.17 | 0 | 0 | 49.11 | 0 |
○Thread 2521750 | 5.58 | 0 | 0 | 19.43 | 7.79 | 0 | 0 | 0 | 0 | 24.26 | 0 | 0 | 48.52 | 0 |
○Thread 2521751 | 5.59 | 0 | 0 | 19.41 | 7.87 | 0 | 0 | 0 | 0 | 22.63 | 0 | 0 | 50.09 | 0 |
○Thread 2521752 | 5.59 | 0 | 0 | 19.14 | 7.69 | 0.09 | 0 | 0 | 0 | 24.42 | 0 | 0 | 48.66 | 0 |
○Thread 2521753 | 5.59 | 0 | 0 | 18.6 | 9.03 | 0 | 0 | 0 | 0 | 24.78 | 0 | 0 | 47.59 | 0 |
▼m1o52 | 4.03 | 0 | 0 | 4.15 | 6.96 | 0.09 | 0 | 0 | 0 | 38.13 | 0 | 0 | 50.66 | 0 |
▼Node skylake | 4.03 | 0 | 0 | 4.15 | 6.96 | 0.09 | 0 | 0 | 0 | 38.13 | 0 | 0 | 50.66 | 0 |
▼Process 2521758 | 4.03 | 0 | 0 | 4.15 | 6.96 | 0.09 | 0 | 0 | 0 | 38.13 | 0 | 0 | 50.66 | 0 |
○Thread 2521758 | 4 | 0 | 0 | 3 | 7.24 | 0 | 0 | 0 | 0 | 34.33 | 0 | 0 | 55.43 | 0 |
○Thread 2521812 | 4.01 | 0 | 0 | 3.86 | 6.97 | 0 | 0 | 0 | 0 | 33.37 | 0 | 0 | 55.79 | 0 |
○Thread 2521813 | 4.01 | 0 | 0 | 3.49 | 7.11 | 0 | 0 | 0 | 0 | 31.3 | 0 | 0 | 58.1 | 0 |
○Thread 2521814 | 4.01 | 0 | 0 | 3.49 | 6.11 | 0 | 0 | 0 | 0 | 31.92 | 0 | 0 | 58.48 | 0 |
○Thread 2521815 | 4.02 | 0 | 0 | 3.36 | 8.58 | 0.12 | 0 | 0 | 0 | 33.33 | 0 | 0 | 54.6 | 0 |
○Thread 2521816 | 4.01 | 0 | 0 | 3.11 | 7.85 | 0.12 | 0 | 0 | 0 | 33.13 | 0 | 0 | 55.79 | 0 |
○Thread 2521817 | 3.95 | 0 | 0 | 9.23 | 7.46 | 0.51 | 0 | 0 | 0 | 40.58 | 0 | 0 | 42.23 | 0 |
○Thread 2521818 | 4.02 | 0 | 0 | 3.6 | 6.34 | 0 | 0 | 0 | 0 | 31.8 | 0 | 0 | 58.26 | 0 |
○Thread 2521819 | 4 | 0 | 0 | 3.87 | 7.37 | 0 | 0 | 0 | 0 | 30.96 | 0 | 0 | 57.8 | 0 |
○Thread 2521820 | 4 | 0 | 0 | 3.37 | 7.49 | 0.12 | 0 | 0 | 0 | 32.96 | 0 | 0 | 56.05 | 0 |
○Thread 2521821 | 4 | 0 | 0 | 3.25 | 5.62 | 0 | 0 | 0 | 0 | 31.09 | 0 | 0 | 60.05 | 0 |
○Thread 2521822 | 4.01 | 0 | 0 | 3.24 | 8.73 | 0 | 0 | 0 | 0 | 31.67 | 0 | 0 | 56.36 | 0 |
○Thread 2521823 | 4.02 | 0 | 0 | 2.74 | 6.59 | 0 | 0 | 0 | 0 | 32.46 | 0 | 0 | 58.21 | 0 |
○Thread 2521824 | 3.97 | 0 | 0 | 7.92 | 6.04 | 0.38 | 0 | 0 | 0 | 40 | 0 | 0 | 45.66 | 0 |
○Thread 2521825 | 4.01 | 0 | 0 | 2.87 | 5.74 | 0 | 0 | 0 | 0 | 33.29 | 0 | 0 | 58.1 | 0 |
○Thread 2521826 | 3.98 | 0 | 0 | 7.53 | 6.65 | 0 | 0 | 0 | 0 | 43.29 | 0 | 0 | 42.53 | 0 |
○Thread 2521827 | 3.98 | 0 | 0 | 7.54 | 4.65 | 0.13 | 0 | 0 | 0 | 43.47 | 0 | 0 | 44.22 | 0 |
○Thread 2521828 | 4.02 | 0 | 0 | 2.86 | 7.09 | 0 | 0 | 0 | 0 | 28.61 | 0 | 0 | 61.44 | 0 |
○Thread 2521829 | 3.99 | 0 | 0 | 6.26 | 9.26 | 0.25 | 0 | 0 | 0 | 44.18 | 0 | 0 | 40.05 | 0 |
○Thread 2521830 | 3.98 | 0 | 0 | 7.41 | 7.54 | 0 | 0 | 0 | 0 | 41.58 | 0 | 0 | 43.47 | 0 |
○Thread 2521831 | 3.98 | 0 | 0 | 7.29 | 7.79 | 0 | 0 | 0 | 0 | 40.33 | 0 | 0 | 44.6 | 0 |
○Thread 2521832 | 3.98 | 0 | 0 | 6.16 | 6.66 | 0.13 | 0 | 0 | 0 | 41.08 | 0 | 0 | 45.98 | 0 |
○Thread 2521833 | 3.98 | 0 | 0 | 5.4 | 7.28 | 0 | 0 | 0 | 0 | 47.18 | 0 | 0 | 40.15 | 0 |
○Thread 2521834 | 4.01 | 0 | 0 | 2.62 | 6.35 | 0 | 0 | 0 | 0 | 30.39 | 0 | 0 | 60.65 | 0 |
○Thread 2521835 | 4.02 | 0 | 0 | 3.6 | 7.58 | 0.12 | 0 | 0 | 0 | 32.55 | 0 | 0 | 56.15 | 0 |
○Thread 2521836 | 4 | 0 | 0 | 3.37 | 8.11 | 0 | 0 | 0 | 0 | 30.71 | 0 | 0 | 57.8 | 0 |
○Thread 2521837 | 4 | 0 | 0 | 5.38 | 6.5 | 0.38 | 0 | 0 | 0 | 43.13 | 0 | 0 | 44.63 | 0 |
○Thread 2521838 | 3.98 | 0 | 0 | 6.4 | 6.02 | 0 | 0 | 0 | 0 | 42.28 | 0 | 0 | 45.29 | 0 |
○Thread 2521839 | 4.01 | 0 | 0 | 3.36 | 6.85 | 0 | 0 | 0 | 0 | 34.62 | 0 | 0 | 55.17 | 0 |
○Thread 2521840 | 3.99 | 0 | 0 | 5.14 | 6.27 | 0 | 0 | 0 | 0 | 44.49 | 0 | 0 | 44.11 | 0 |
○Thread 2521841 | 3.99 | 0 | 0 | 4.14 | 5.89 | 0.13 | 0 | 0 | 0 | 45.61 | 0 | 0 | 44.24 | 0 |
○Thread 2521842 | 4.01 | 0 | 0 | 4.36 | 7.48 | 0 | 0 | 0 | 0 | 47.76 | 0 | 0 | 40.4 | 0 |
○Thread 2521843 | 4.02 | 0 | 0 | 3.35 | 8.32 | 0.12 | 0 | 0 | 0 | 30.06 | 0 | 0 | 58.14 | 0 |
○Thread 2521844 | 4.01 | 0 | 0 | 4.23 | 6.97 | 0 | 0 | 0 | 0 | 43.96 | 0 | 0 | 44.83 | 0 |
○Thread 2521845 | 4.01 | 0 | 0 | 4.49 | 6.73 | 0.12 | 0 | 0 | 0 | 44.51 | 0 | 0 | 44.14 | 0 |
○Thread 2521846 | 4.02 | 0 | 0 | 4.1 | 6.97 | 0.25 | 0 | 0 | 0 | 45.9 | 0 | 0 | 42.79 | 0 |
○Thread 2521847 | 4 | 0 | 0 | 3 | 8.99 | 0.12 | 0 | 0 | 0 | 28.71 | 0 | 0 | 59.18 | 0 |
○Thread 2521848 | 4.01 | 0 | 0 | 3.12 | 5.74 | 0.12 | 0 | 0 | 0 | 49.13 | 0 | 0 | 41.9 | 0 |
○Thread 2521849 | 4.02 | 0 | 0 | 2.86 | 8.07 | 0 | 0 | 0 | 0 | 29.57 | 0 | 0 | 59.5 | 0 |
○Thread 2521850 | 4 | 0 | 0 | 3.25 | 6.49 | 0.5 | 0 | 0 | 0 | 31.34 | 0 | 0 | 58.43 | 0 |
○Thread 2521851 | 3.98 | 0 | 0 | 3.64 | 5.14 | 0 | 0 | 0 | 0 | 48.81 | 0 | 0 | 42.41 | 0 |
○Thread 2521852 | 4 | 0 | 0 | 3.5 | 7.63 | 0 | 0 | 0 | 0 | 47.88 | 0 | 0 | 41 | 0 |
○Thread 2521853 | 3.99 | 0 | 0 | 4.38 | 6.76 | 0.13 | 0 | 0 | 0 | 43.55 | 0 | 0 | 45.18 | 0 |
○Thread 2521854 | 4.01 | 0 | 0 | 2.24 | 6.23 | 0.12 | 0 | 0 | 0 | 45.45 | 0 | 0 | 45.95 | 0 |
○Thread 2521855 | 4 | 0 | 0 | 3.13 | 6.38 | 0 | 0 | 0 | 0 | 48 | 0 | 0 | 42.5 | 0 |
○Thread 2521856 | 3.98 | 0 | 0 | 2.76 | 7.53 | 0.13 | 0 | 0 | 0 | 44.42 | 0 | 0 | 45.17 | 0 |
○Thread 2521857 | 4.01 | 0 | 0 | 3.24 | 6.48 | 0.12 | 0 | 0 | 0 | 47.2 | 0 | 0 | 42.96 | 0 |
○Thread 2521858 | 4.02 | 0 | 0 | 2.24 | 3.85 | 0.12 | 0 | 0 | 0 | 50.06 | 0 | 0 | 43.73 | 0 |
○Thread 2521859 | 4 | 0 | 0 | 3.62 | 8.24 | 0.25 | 0 | 0 | 0 | 29.84 | 0 | 0 | 58.05 | 0 |
○Thread 2521860 | 4.03 | 0 | 0 | 3.47 | 6.58 | 0.12 | 0 | 0 | 0 | 29.9 | 0 | 0 | 59.93 | 0 |
○Thread 2521861 | 4.01 | 0 | 0 | 3.36 | 7.35 | 0.12 | 0 | 0 | 0 | 30.64 | 0 | 0 | 58.53 | 0 |
○Thread 2521862 | 4.01 | 0 | 0 | 2.99 | 8.35 | 0.12 | 0 | 0 | 0 | 31.05 | 0 | 0 | 57.48 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 11.66 | 0.01 | 27.44 | 60.88 |
m1o2 | 2 | 0.07 | 12.07 | 0.03 | 27.35 | 60.48 |
m1o4 | 4 | 1.17 | 11.36 | 0.02 | 27.64 | 59.81 |
m1o8 | 8 | 3.44 | 11.15 | 0.02 | 27.29 | 58.1 |
m1o16 | 16 | 11.51 | 9.78 | 0.07 | 25.51 | 53.13 |
m1o26 | 26 | 15.54 | 8.67 | 0.07 | 26.82 | 48.9 |
m1o52 | 52 | 4.15 | 6.96 | 0.09 | 38.13 | 50.66 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|---|
m1o1 | 1 | 107.63 | 0 | 12.55 | 0.01 | 29.54 | 65.53 |
m1o2 | 2 | 54.01 | 0.04 | 6.52 | 0.02 | 14.77 | 32.67 |
m1o4 | 4 | 27.44 | 0.32 | 3.12 | 0.01 | 7.58 | 16.41 |
m1o8 | 8 | 14.32 | 0.49 | 1.6 | 0 | 3.91 | 8.32 |
m1o16 | 16 | 8.01 | 0.92 | 0.78 | 0.01 | 2.04 | 4.26 |
m1o26 | 26 | 5.6 | 0.87 | 0.49 | 0 | 1.5 | 2.74 |
m1o52 | 52 | 4.03 | 0.17 | 0.28 | 0 | 1.54 | 2.04 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
m1o1 | 1 | 1 |
m1o2 | 2 | 0.98 |
m1o4 | 4 | 0.94 |
m1o8 | 8 | 0.85 |
m1o16 | 16 | 0.7 |
m1o26 | 26 | 0.56 |
m1o52 | 52 | 0.35 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 | 99.9 | 0.07 |
m1o4 | 4 | 0 | 0 | 0.02 | 0 | 0 | 0 | 0 | 0.03 | 0 | 98.77 | 1.18 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 96.53 | 3.47 |
m1o16 | 16 | 0 | 0.06 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 88.35 | 11.59 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26.79 | 0 | 57.56 | 15.65 |
m1o52 | 52 | 0 | 0 | 0 | 38.1 | 0 | 35.86 | 0 | 14.8 | 6.1 | 0.86 | 4.28 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0.06 | 99.9 | 0.01 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1.19 | 98.8 | 0.01 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3.47 | 96.53 | 0 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11.58 | 88.41 | 0.01 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15.62 | 84.35 | 0.03 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.25 | 95.72 | 0.03 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 95.87 | 4.13 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 93.97 | 6.03 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 67.77 | 0 | 23.15 | 9.08 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 60.66 | 0 | 22.29 | 17.05 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 58.55 | 0 | 0 | 18.85 | 22.6 |
m1o52 | 52 | 0 | 0 | 67.45 | 0 | 0 | 0 | 12.47 | 0 | 0 | 0 | 20.09 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 95.87 | 4.13 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 93.97 | 6.03 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 90.92 | 9.08 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 82.95 | 17.05 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 77.4 | 22.6 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 79.91 | 20.09 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | m1o1 | m1o2 | m1o4 | m1o8 | m1o16 | m1o26 | m1o52 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/trexio/_install/lib/libtrexio.so.0.0.0 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_core.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/other/hdf5/gcc/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |