Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
PrOMPT - Coverage per Parallel Efficiency |
PrOMPT - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼m1o1 | 178.63 | 0 | 0 | 0 | 71.78 | 0.06 | 0 | 0 | 0 | 5.9 | 0 | 0 | 22.26 | 0 |
▼Node skylake | 178.63 | 0 | 0 | 0 | 71.78 | 0.06 | 0 | 0 | 0 | 5.9 | 0 | 0 | 22.26 | 0 |
▼Process 2455209 | 178.63 | 0 | 0 | 0 | 71.78 | 0.06 | 0 | 0 | 0 | 5.9 | 0 | 0 | 22.26 | 0 |
○Thread 2455209 | 178.63 | 0 | 0 | 0 | 71.78 | 0.06 | 0 | 0 | 0 | 5.9 | 0 | 0 | 22.26 | 0 |
▼m1o2 | 98.09 | 0 | 0 | 7.38 | 65.71 | 0.06 | 0 | 0 | 0 | 5.41 | 0 | 0 | 21.44 | 0 |
▼Node skylake | 98.09 | 0 | 0 | 7.38 | 65.71 | 0.06 | 0 | 0 | 0 | 5.41 | 0 | 0 | 21.44 | 0 |
▼Process 2455286 | 98.09 | 0 | 0 | 7.38 | 65.71 | 0.06 | 0 | 0 | 0 | 5.41 | 0 | 0 | 21.44 | 0 |
○Thread 2455286 | 97.96 | 0 | 0 | 0.04 | 65.92 | 0.1 | 0 | 0 | 0 | 9.46 | 0 | 0 | 24.47 | 0.01 |
○Thread 2455339 | 98.09 | 0 | 0 | 14.71 | 65.5 | 0.02 | 0 | 0 | 0 | 1.37 | 0 | 0 | 18.41 | 0 |
▼m1o4 | 61.71 | 0 | 0 | 21.83 | 54.27 | 0.09 | 0 | 0 | 0 | 4.6 | 0 | 0 | 19.22 | 0 |
▼Node skylake | 61.71 | 0 | 0 | 21.83 | 54.27 | 0.09 | 0 | 0 | 0 | 4.6 | 0 | 0 | 19.22 | 0 |
▼Process 2455355 | 61.71 | 0 | 0 | 21.83 | 54.27 | 0.09 | 0 | 0 | 0 | 4.6 | 0 | 0 | 19.22 | 0 |
○Thread 2455355 | 61.71 | 0 | 0 | 6.8 | 53 | 0.17 | 0 | 0 | 0 | 13.75 | 0 | 0 | 26.28 | 0.01 |
○Thread 2455408 | 60.63 | 0 | 0 | 23.72 | 57.18 | 0.08 | 0 | 0 | 0 | 2.15 | 0 | 0 | 16.86 | 0 |
○Thread 2455409 | 60.29 | 0 | 0 | 28.48 | 53.58 | 0.07 | 0 | 0 | 0 | 1.14 | 0 | 0 | 16.74 | 0 |
○Thread 2455410 | 59.95 | 0 | 0 | 28.71 | 53.31 | 0.04 | 0 | 0 | 0 | 1.13 | 0 | 0 | 16.81 | 0 |
▼m1o8 | 39.56 | 0 | 0 | 35.4 | 43.13 | 0.1 | 0 | 0 | 0 | 3.78 | 0 | 0 | 17.6 | 0 |
▼Node skylake | 39.56 | 0 | 0 | 35.4 | 43.13 | 0.1 | 0 | 0 | 0 | 3.78 | 0 | 0 | 17.6 | 0 |
▼Process 2455422 | 39.56 | 0 | 0 | 35.4 | 43.13 | 0.1 | 0 | 0 | 0 | 3.78 | 0 | 0 | 17.6 | 0 |
○Thread 2455422 | 39.56 | 0 | 0 | 5.98 | 42.33 | 0.14 | 0 | 0 | 0 | 21.15 | 0 | 0 | 30.41 | 0 |
○Thread 2455475 | 38.9 | 0 | 0 | 40.28 | 42.63 | 0.06 | 0 | 0 | 0 | 1.22 | 0 | 0 | 15.8 | 0 |
○Thread 2455476 | 39.56 | 0 | 0 | 36.73 | 44.94 | 0.08 | 0 | 0 | 0 | 1.59 | 0 | 0 | 16.66 | 0 |
○Thread 2455477 | 39.55 | 0 | 0 | 37.89 | 44.06 | 0.11 | 0 | 0 | 0 | 1.31 | 0 | 0 | 16.62 | 0 |
○Thread 2455478 | 39.33 | 0 | 0 | 42.24 | 41.65 | 0.08 | 0 | 0 | 0 | 1.17 | 0 | 0 | 14.86 | 0 |
○Thread 2455479 | 39.38 | 0 | 0 | 39.35 | 43.59 | 0.05 | 0 | 0 | 0 | 1.27 | 0 | 0 | 15.74 | 0 |
○Thread 2455480 | 38.9 | 0 | 0 | 41.74 | 42.01 | 0.09 | 0 | 0 | 0 | 1.17 | 0 | 0 | 15 | 0 |
○Thread 2455481 | 39.4 | 0 | 0 | 39.24 | 43.83 | 0.15 | 0 | 0 | 0 | 1.21 | 0 | 0 | 15.57 | 0 |
▼m1o16 | 29.6 | 0 | 0 | 49.8 | 30.02 | 0.13 | 0 | 0 | 0 | 2.8 | 0 | 0 | 17.26 | 0 |
▼Node skylake | 29.6 | 0 | 0 | 49.8 | 30.02 | 0.13 | 0 | 0 | 0 | 2.8 | 0 | 0 | 17.26 | 0 |
▼Process 2455492 | 29.6 | 0 | 0 | 49.8 | 30.02 | 0.13 | 0 | 0 | 0 | 2.8 | 0 | 0 | 17.26 | 0 |
○Thread 2455492 | 29.6 | 0 | 0 | 4.63 | 29.92 | 0.27 | 0 | 0 | 0 | 28.12 | 0 | 0 | 37.06 | 0 |
○Thread 2455545 | 29.17 | 0 | 0 | 49.68 | 31.85 | 0.09 | 0 | 0 | 0 | 1.49 | 0 | 0 | 16.89 | 0 |
○Thread 2455546 | 29.08 | 0 | 0 | 53.45 | 29.29 | 0.14 | 0 | 0 | 0 | 0.65 | 0 | 0 | 16.47 | 0 |
○Thread 2455547 | 29.1 | 0 | 0 | 53.69 | 29.2 | 0.15 | 0 | 0 | 0 | 0.74 | 0 | 0 | 16.22 | 0 |
○Thread 2455548 | 29.05 | 0 | 0 | 52.19 | 30.83 | 0.1 | 0.02 | 0 | 0 | 1.05 | 0 | 0 | 15.8 | 0 |
○Thread 2455549 | 28.86 | 0 | 0 | 55.48 | 28.5 | 0.05 | 0 | 0 | 0 | 0.59 | 0 | 0 | 15.37 | 0 |
○Thread 2455550 | 28.86 | 0 | 0 | 55.39 | 28.48 | 0.07 | 0 | 0 | 0 | 0.64 | 0 | 0 | 15.42 | 0 |
○Thread 2455551 | 28.86 | 0 | 0 | 55.52 | 28.44 | 0.09 | 0 | 0 | 0 | 0.69 | 0 | 0 | 15.27 | 0 |
○Thread 2455552 | 29.11 | 0 | 0 | 56.21 | 27.74 | 0.09 | 0 | 0 | 0 | 0.62 | 0 | 0 | 15.34 | 0 |
○Thread 2455553 | 29.05 | 0 | 0 | 51.38 | 31.12 | 0.07 | 0 | 0 | 0 | 1.57 | 0 | 0 | 15.87 | 0 |
○Thread 2455554 | 29.04 | 0 | 0 | 51.34 | 30.99 | 0.17 | 0 | 0 | 0 | 1.38 | 0 | 0 | 16.12 | 0 |
○Thread 2455555 | 29.07 | 0 | 0 | 51.4 | 31.07 | 0.09 | 0 | 0 | 0 | 1.53 | 0 | 0 | 15.91 | 0 |
○Thread 2455556 | 29.08 | 0 | 0 | 51.35 | 30.9 | 0.17 | 0 | 0 | 0 | 1.26 | 0 | 0 | 16.32 | 0 |
○Thread 2455557 | 29.05 | 0 | 0 | 51.41 | 31.03 | 0.05 | 0 | 0 | 0 | 1.41 | 0 | 0 | 16.09 | 0 |
○Thread 2455558 | 29.06 | 0 | 0 | 51.12 | 31.11 | 0.22 | 0 | 0 | 0 | 1.43 | 0 | 0 | 16.12 | 0 |
○Thread 2455559 | 28.84 | 0 | 0 | 53.57 | 29.75 | 0.19 | 0 | 0 | 0 | 1.09 | 0 | 0 | 15.4 | 0 |
▼m1o26 | 25.67 | 0 | 0 | 55.86 | 23.32 | 0.17 | 0 | 0 | 0 | 2.47 | 0 | 0 | 18.18 | 0 |
▼Node skylake | 25.67 | 0 | 0 | 55.86 | 23.32 | 0.17 | 0 | 0 | 0 | 2.47 | 0 | 0 | 18.18 | 0 |
▼Process 2455568 | 25.67 | 0 | 0 | 55.86 | 23.32 | 0.17 | 0 | 0 | 0 | 2.47 | 0 | 0 | 18.18 | 0 |
○Thread 2455568 | 25.67 | 0 | 0 | 1.95 | 24.54 | 0.31 | 0 | 0 | 0 | 31.92 | 0 | 0 | 41.29 | 0 |
○Thread 2455621 | 25.52 | 0 | 0 | 58.02 | 23.08 | 0.24 | 0 | 0 | 0 | 0.86 | 0 | 0 | 17.79 | 0 |
○Thread 2455622 | 25.52 | 0 | 0 | 58.26 | 23.28 | 0.14 | 0 | 0 | 0 | 0.88 | 0 | 0 | 17.44 | 0 |
○Thread 2455623 | 25.52 | 0 | 0 | 58.37 | 23.08 | 0.16 | 0 | 0 | 0 | 0.84 | 0 | 0 | 17.55 | 0 |
○Thread 2455624 | 25.57 | 0 | 0 | 58.27 | 23.02 | 0.2 | 0 | 0 | 0 | 0.86 | 0 | 0 | 17.66 | 0 |
○Thread 2455625 | 25.48 | 0 | 0 | 58.11 | 23.15 | 0.12 | 0 | 0 | 0 | 1 | 0 | 0 | 17.62 | 0 |
○Thread 2455626 | 25.46 | 0 | 0 | 59.13 | 22.17 | 0.27 | 0 | 0 | 0 | 0.96 | 0 | 0 | 17.46 | 0 |
○Thread 2455627 | 25.57 | 0 | 0 | 57.06 | 23.8 | 0.1 | 0 | 0 | 0 | 1.7 | 0 | 0 | 17.34 | 0 |
○Thread 2455628 | 25.58 | 0 | 0 | 56.96 | 23.94 | 0.12 | 0 | 0 | 0 | 1.66 | 0 | 0 | 17.32 | 0 |
○Thread 2455629 | 25.45 | 0 | 0 | 58.89 | 22.31 | 0.12 | 0 | 0 | 0 | 0.94 | 0 | 0 | 17.74 | 0 |
○Thread 2455630 | 25.55 | 0 | 0 | 57.08 | 23.72 | 0.18 | 0 | 0 | 0 | 1.45 | 0 | 0 | 17.58 | 0 |
○Thread 2455631 | 25.44 | 0 | 0 | 58.79 | 22.8 | 0.26 | 0 | 0 | 0 | 1.04 | 0 | 0 | 17.12 | 0 |
○Thread 2455632 | 25.55 | 0 | 0 | 58.01 | 22.95 | 0.22 | 0 | 0 | 0 | 1.08 | 0 | 0 | 17.75 | 0 |
○Thread 2455633 | 25.43 | 0 | 0 | 58.81 | 22.34 | 0.2 | 0 | 0 | 0 | 0.9 | 0 | 0 | 17.75 | 0 |
○Thread 2455634 | 25.57 | 0 | 0 | 56.98 | 23.82 | 0.22 | 0 | 0 | 0 | 1.62 | 0 | 0 | 17.36 | 0 |
○Thread 2455635 | 25.56 | 0 | 0 | 57.19 | 23.46 | 0.14 | 0 | 0 | 0 | 1.76 | 0 | 0 | 17.45 | 0 |
○Thread 2455636 | 25.59 | 0 | 0 | 57.19 | 23.7 | 0.08 | 0 | 0 | 0 | 1.66 | 0 | 0 | 17.37 | 0 |
○Thread 2455637 | 25.57 | 0 | 0 | 56.95 | 23.8 | 0.18 | 0 | 0 | 0 | 1.56 | 0 | 0 | 17.5 | 0 |
○Thread 2455638 | 25.55 | 0 | 0 | 57.32 | 23.25 | 0.16 | 0 | 0 | 0 | 1.6 | 0 | 0 | 17.67 | 0 |
○Thread 2455639 | 25.56 | 0 | 0 | 56.87 | 23.71 | 0.12 | 0 | 0 | 0 | 1.68 | 0 | 0 | 17.63 | 0 |
○Thread 2455640 | 25.57 | 0 | 0 | 56.87 | 23.82 | 0.16 | 0 | 0 | 0 | 1.72 | 0 | 0 | 17.43 | 0 |
○Thread 2455641 | 25.44 | 0 | 0 | 58.29 | 22.74 | 0.12 | 0 | 0 | 0 | 1.1 | 0 | 0 | 17.75 | 0 |
○Thread 2455642 | 25.17 | 0 | 0 | 60.85 | 22.36 | 0.18 | 0 | 0 | 0 | 0.71 | 0 | 0 | 15.89 | 0 |
○Thread 2455643 | 25.28 | 0 | 0 | 58.81 | 23.88 | 0.24 | 0 | 0 | 0 | 1.52 | 0 | 0 | 15.55 | 0 |
○Thread 2455644 | 25.29 | 0 | 0 | 59.07 | 23.63 | 0.04 | 0 | 0 | 0 | 1.48 | 0 | 0 | 15.78 | 0 |
○Thread 2455645 | 25.27 | 0 | 0 | 58.8 | 23.93 | 0.12 | 0 | 0 | 0 | 1.37 | 0 | 0 | 15.79 | 0 |
▼m1o52 | 25.68 | 0 | 0 | 59.45 | 20.29 | 0.21 | 0 | 0 | 0 | 2.36 | 0 | 0 | 17.68 | 0 |
▼Node skylake | 25.68 | 0 | 0 | 59.45 | 20.29 | 0.21 | 0 | 0 | 0 | 2.36 | 0 | 0 | 17.68 | 0 |
▼Process 2455654 | 25.68 | 0 | 0 | 59.45 | 20.29 | 0.21 | 0 | 0 | 0 | 2.36 | 0 | 0 | 17.68 | 0 |
○Thread 2455654 | 25.68 | 0 | 0 | 1.75 | 23.81 | 0.45 | 0 | 0 | 0 | 32.55 | 0 | 0 | 41.41 | 0.02 |
○Thread 2455707 | 25.26 | 0 | 0 | 61.71 | 19.18 | 0.14 | 0 | 0 | 0 | 1.8 | 0 | 0 | 17.16 | 0 |
○Thread 2455708 | 25.61 | 0 | 0 | 57.56 | 22.84 | 0.12 | 0 | 0 | 0 | 1.76 | 0 | 0 | 17.73 | 0 |
○Thread 2455709 | 25.61 | 0 | 0 | 57.71 | 23.04 | 0.2 | 0 | 0 | 0 | 1.64 | 0 | 0 | 17.42 | 0 |
○Thread 2455710 | 25.62 | 0 | 0 | 57.77 | 22.6 | 0.2 | 0 | 0 | 0 | 1.81 | 0 | 0 | 17.62 | 0 |
○Thread 2455711 | 25.59 | 0 | 0 | 57.51 | 22.95 | 0.35 | 0 | 0 | 0 | 1.88 | 0 | 0 | 17.31 | 0 |
○Thread 2455712 | 25.04 | 0 | 0 | 61.98 | 18.59 | 0.22 | 0 | 0 | 0 | 1.8 | 0 | 0 | 17.41 | 0 |
○Thread 2455713 | 25.6 | 0 | 0 | 57.61 | 22.87 | 0.16 | 0 | 0 | 0 | 1.86 | 0 | 0 | 17.52 | 0 |
○Thread 2455714 | 25.61 | 0 | 0 | 57.54 | 22.8 | 0.18 | 0 | 0 | 0 | 1.66 | 0 | 0 | 17.83 | 0 |
○Thread 2455715 | 25.05 | 0 | 0 | 61.56 | 18.64 | 0.22 | 0 | 0 | 0 | 1.78 | 0 | 0 | 17.8 | 0 |
○Thread 2455716 | 25.18 | 0 | 0 | 61.62 | 18.36 | 0.14 | 0 | 0 | 0 | 1.85 | 0 | 0 | 18.03 | 0 |
○Thread 2455717 | 25.6 | 0 | 0 | 57.48 | 22.89 | 0.25 | 0 | 0 | 0 | 1.8 | 0 | 0 | 17.58 | 0 |
○Thread 2455718 | 25.51 | 0 | 0 | 61.9 | 18.33 | 0.33 | 0.02 | 0 | 0 | 1.61 | 0 | 0 | 17.82 | 0 |
○Thread 2455719 | 25.23 | 0 | 0 | 62.17 | 18.63 | 0.18 | 0 | 0 | 0 | 1.88 | 0 | 0 | 17.14 | 0 |
○Thread 2455720 | 25.53 | 0 | 0 | 57.38 | 22.93 | 0.22 | 0 | 0 | 0 | 1.84 | 0 | 0 | 17.63 | 0 |
○Thread 2455721 | 25.49 | 0 | 0 | 57.41 | 22.91 | 0.2 | 0 | 0 | 0 | 1.84 | 0 | 0 | 17.63 | 0 |
○Thread 2455722 | 25.49 | 0 | 0 | 57.34 | 22.99 | 0.18 | 0 | 0 | 0 | 1.82 | 0 | 0 | 17.67 | 0 |
○Thread 2455723 | 25.48 | 0 | 0 | 57.24 | 23 | 0.24 | 0 | 0 | 0 | 1.81 | 0 | 0 | 17.72 | 0 |
○Thread 2455724 | 25.48 | 0 | 0 | 57.32 | 23.06 | 0.2 | 0 | 0 | 0 | 1.77 | 0 | 0 | 17.66 | 0 |
○Thread 2455725 | 25.27 | 0 | 0 | 58.38 | 21.42 | 0.3 | 0 | 0 | 0 | 1.82 | 0 | 0 | 18.08 | 0 |
○Thread 2455726 | 25.62 | 0 | 0 | 57.66 | 22.89 | 0.27 | 0 | 0 | 0 | 1.89 | 0 | 0 | 17.29 | 0 |
○Thread 2455727 | 25.09 | 0 | 0 | 62.39 | 18.72 | 0.14 | 0 | 0 | 0 | 1.83 | 0 | 0 | 16.92 | 0 |
○Thread 2455728 | 25.1 | 0 | 0 | 61.57 | 19.02 | 0.2 | 0 | 0 | 0 | 1.87 | 0 | 0 | 17.33 | 0 |
○Thread 2455729 | 25.47 | 0 | 0 | 57.41 | 22.92 | 0.2 | 0 | 0 | 0 | 1.84 | 0 | 0 | 17.63 | 0 |
○Thread 2455730 | 25.38 | 0 | 0 | 61.74 | 18.5 | 0.3 | 0 | 0 | 0 | 1.79 | 0 | 0 | 17.67 | 0 |
○Thread 2455731 | 25.59 | 0 | 0 | 57.76 | 22.86 | 0.2 | 0 | 0 | 0 | 1.82 | 0 | 0 | 17.37 | 0 |
○Thread 2455732 | 25.23 | 0 | 0 | 61.75 | 18.71 | 0.16 | 0 | 0 | 0 | 1.98 | 0 | 0 | 17.4 | 0 |
○Thread 2455733 | 25.22 | 0 | 0 | 62.75 | 18.54 | 0.16 | 0 | 0 | 0 | 1.92 | 0 | 0 | 16.63 | 0 |
○Thread 2455734 | 25.63 | 0 | 0 | 57.63 | 22.81 | 0.23 | 0 | 0 | 0 | 1.81 | 0 | 0 | 17.52 | 0 |
○Thread 2455735 | 25.48 | 0 | 0 | 58.89 | 21.59 | 0.29 | 0 | 0 | 0 | 1.82 | 0 | 0 | 17.41 | 0 |
○Thread 2455736 | 25.58 | 0 | 0 | 57.43 | 23.01 | 0.22 | 0 | 0 | 0 | 1.86 | 0 | 0 | 17.49 | 0 |
○Thread 2455737 | 25.57 | 0 | 0 | 57.5 | 22.76 | 0.31 | 0 | 0 | 0 | 1.88 | 0 | 0 | 17.56 | 0 |
○Thread 2455738 | 25.55 | 0 | 0 | 59.56 | 21.25 | 0.22 | 0 | 0 | 0 | 1.8 | 0 | 0 | 17.18 | 0 |
○Thread 2455739 | 25.05 | 0 | 0 | 63.4 | 17.02 | 0.28 | 0 | 0 | 0 | 1.76 | 0 | 0 | 17.54 | 0 |
○Thread 2455740 | 25.14 | 0 | 0 | 64.18 | 17.04 | 0.14 | 0 | 0 | 0 | 1.83 | 0 | 0 | 16.81 | 0 |
○Thread 2455741 | 25.03 | 0 | 0 | 62.96 | 16.94 | 0.2 | 0 | 0 | 0 | 1.84 | 0 | 0 | 18.06 | 0 |
○Thread 2455742 | 25.41 | 0 | 0 | 58.5 | 21.76 | 0.16 | 0 | 0 | 0 | 1.71 | 0 | 0 | 17.87 | 0 |
○Thread 2455743 | 25.4 | 0 | 0 | 58.31 | 21.91 | 0.3 | 0 | 0 | 0 | 1.85 | 0 | 0 | 17.64 | 0 |
○Thread 2455744 | 25.13 | 0 | 0 | 63.35 | 17.01 | 0.26 | 0 | 0 | 0 | 1.69 | 0 | 0 | 17.69 | 0 |
○Thread 2455745 | 25.17 | 0 | 0 | 63.59 | 17.02 | 0.12 | 0 | 0 | 0 | 1.91 | 0 | 0 | 17.36 | 0 |
○Thread 2455746 | 25.16 | 0 | 0 | 63.14 | 17.01 | 0.22 | 0 | 0 | 0 | 1.91 | 0 | 0 | 17.72 | 0 |
○Thread 2455747 | 25.13 | 0 | 0 | 63.87 | 17.29 | 0.3 | 0 | 0 | 0 | 1.81 | 0 | 0 | 16.73 | 0 |
○Thread 2455748 | 25.02 | 0 | 0 | 63.52 | 17.22 | 0.1 | 0 | 0 | 0 | 1.86 | 0 | 0 | 17.3 | 0 |
○Thread 2455749 | 25 | 0 | 0 | 63.89 | 17.12 | 0.24 | 0 | 0 | 0 | 1.82 | 0 | 0 | 16.94 | 0 |
○Thread 2455750 | 25.57 | 0 | 0 | 58.53 | 21.8 | 0.14 | 0 | 0 | 0 | 1.76 | 0 | 0 | 17.77 | 0 |
○Thread 2455751 | 25.15 | 0 | 0 | 63.44 | 17.04 | 0.1 | 0 | 0 | 0 | 1.93 | 0 | 0 | 17.49 | 0 |
○Thread 2455752 | 25.54 | 0 | 0 | 58.77 | 21.67 | 0.22 | 0 | 0 | 0 | 1.82 | 0 | 0 | 17.52 | 0 |
○Thread 2455753 | 25.15 | 0 | 0 | 63.94 | 17.14 | 0.26 | 0 | 0 | 0 | 1.93 | 0 | 0 | 16.74 | 0 |
○Thread 2455754 | 24.97 | 0 | 0 | 68.61 | 16.98 | 0.14 | 0 | 0 | 0 | 1.02 | 0 | 0 | 13.25 | 0 |
○Thread 2455755 | 24.67 | 0 | 0 | 67.44 | 17.2 | 0.12 | 0 | 0 | 0 | 1.11 | 0 | 0 | 14.12 | 0 |
○Thread 2455756 | 25.05 | 0 | 0 | 61.8 | 22.16 | 0.18 | 0 | 0 | 0 | 1 | 0 | 0 | 14.87 | 0 |
○Thread 2455757 | 24.55 | 0 | 0 | 67.73 | 17.19 | 0.22 | 0 | 0 | 0 | 0.96 | 0 | 0 | 13.91 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 71.78 | 0.06 | 5.9 | 22.26 |
m1o2 | 2 | 7.38 | 65.71 | 0.06 | 5.41 | 21.44 |
m1o4 | 4 | 21.83 | 54.27 | 0.09 | 4.6 | 19.22 |
m1o8 | 8 | 35.4 | 43.13 | 0.1 | 3.78 | 17.6 |
m1o16 | 16 | 49.8 | 30.02 | 0.13 | 2.8 | 17.26 |
m1o26 | 26 | 55.86 | 23.32 | 0.17 | 2.47 | 18.18 |
m1o52 | 52 | 59.45 | 20.29 | 0.21 | 2.36 | 17.68 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|---|
m1o1 | 1 | 178.63 | 0 | 128.22 | 0.11 | 10.54 | 39.76 |
m1o2 | 2 | 98.09 | 7.24 | 64.45 | 0.06 | 5.31 | 21.03 |
m1o4 | 4 | 61.72 | 13.47 | 33.49 | 0.06 | 2.84 | 11.86 |
m1o8 | 8 | 39.56 | 14 | 17.06 | 0.04 | 1.5 | 6.96 |
m1o16 | 16 | 29.6 | 14.74 | 8.89 | 0.04 | 0.83 | 5.11 |
m1o26 | 26 | 25.67 | 14.34 | 5.99 | 0.04 | 0.63 | 4.67 |
m1o52 | 52 | 25.68 | 15.27 | 5.21 | 0.05 | 0.61 | 4.54 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
m1o1 | 1 | 1 |
m1o2 | 2 | 0.81 |
m1o4 | 4 | 0.55 |
m1o8 | 8 | 0.35 |
m1o16 | 16 | 0.2 |
m1o26 | 26 | 0.13 |
m1o52 | 52 | 0.07 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0.04 | 0.45 | 0 | 0.06 | 2.95 | 89.11 | 7.39 |
m1o4 | 4 | 0 | 0 | 0.76 | 0 | 0 | 0.06 | 0 | 2.76 | 14.46 | 60.07 | 21.89 |
m1o8 | 8 | 0 | 1.15 | 0 | 0.03 | 0 | 2.88 | 4.68 | 7.06 | 5.05 | 43.69 | 35.46 |
m1o16 | 16 | 1.58 | 0.05 | 0.06 | 2.8 | 9.3 | 0 | 1.57 | 5.13 | 0.83 | 28.82 | 49.86 |
m1o26 | 26 | 2 | 0.07 | 9.34 | 4.23 | 1.72 | 2.6 | 2.46 | 0 | 19.61 | 2.04 | 55.93 |
m1o52 | 52 | 2.35 | 15.63 | 3.15 | 2.36 | 0.25 | 15.65 | 0 | 0 | 0.07 | 1.05 | 59.49 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7.44 | 92.57 | 0 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21.87 | 78.11 | 0.02 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35.41 | 64.54 | 0.05 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 49.85 | 50.14 | 0.01 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57.81 | 42.15 | 0.04 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2.15 | 59.6 | 38.24 | 0.01 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0.35 | 0 | 0 | 3.38 | 59.19 | 37.07 |
m1o4 | 4 | 0 | 0 | 0.5 | 0 | 0 | 0 | 6.4 | 0 | 3.13 | 38.56 | 51.41 |
m1o8 | 8 | 0 | 0.67 | 0 | 0 | 2.74 | 2.51 | 2.58 | 0 | 24.15 | 1.29 | 66.06 |
m1o16 | 16 | 0.77 | 0 | 0 | 4.73 | 2.05 | 0 | 0.31 | 0 | 15.28 | 0 | 76.87 |
m1o26 | 26 | 0.88 | 0 | 6.62 | 0.33 | 0.01 | 0 | 0 | 10.4 | 0.07 | 0.49 | 81.2 |
m1o52 | 52 | 4.3 | 4.26 | 0.01 | 0 | 9.8 | 0 | 0.04 | 0 | 0 | 0.25 | 81.33 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
m1o1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
m1o2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 62.93 | 37.07 |
m1o4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 48.59 | 51.41 |
m1o8 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.67 | 33.28 | 66.06 |
m1o16 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.77 | 22.36 | 76.87 |
m1o26 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.88 | 17.92 | 81.2 |
m1o52 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.99 | 0 | 17.67 | 81.33 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | m1o1 | m1o2 | m1o4 | m1o8 | m1o16 | m1o26 | m1o52 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/trexio/_install/lib/libtrexio.so.0.0.0 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_avx512.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_core.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_vml_avx512.so.2 | |||||||
/opt/other/hdf5/gcc/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |