Detailed Application Categorization |
Detailed Function Times |
Scalability - Coverage per Category |
Scalability - Time per Category |
Scalability - Efficiency |
Function Based Profile |
Scalability - Coverage per Parallel Efficiency |
Scalability - Coverage per Parallel Speedup |
PrOMPT - Coverage per Parallel Efficiency |
PrOMPT - Coverage per Parallel Speedup |
Libraries |
Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | libqmckl.so (%) | libqmckl.so.0 (%) | libqmckl.so.0.0.0 (%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_0 | 190.08 | 0 | 0 | 0 | 67.54 | 0.02 | 0 | 0 | 0 | 8.35 | 0 | 0 | 24.08 | 0 |
▼Node skylake | 190.08 | 0 | 0 | 0 | 67.54 | 0.02 | 0 | 0 | 0 | 8.35 | 0 | 0 | 24.08 | 0 |
▼Process 2419925 | 190.08 | 0 | 0 | 0 | 67.54 | 0.02 | 0 | 0 | 0 | 8.35 | 0 | 0 | 24.08 | 0 |
○Thread 2419925 | 190.08 | 0 | 0 | 0 | 67.54 | 0.02 | 0 | 0 | 0 | 8.35 | 0 | 0 | 24.08 | 0 |
▼run_1 | 112.28 | 0 | 0 | 7.07 | 62.46 | 0.07 | 0 | 0 | 0 | 7.83 | 0 | 0 | 22.57 | 0 |
▼Node skylake | 112.28 | 0 | 0 | 7.07 | 62.46 | 0.07 | 0 | 0 | 0 | 7.83 | 0 | 0 | 22.57 | 0 |
▼Process 2419995 | 112.28 | 0 | 0 | 7.07 | 62.46 | 0.07 | 0 | 0 | 0 | 7.83 | 0 | 0 | 22.57 | 0 |
○Thread 2419995 | 112.28 | 0 | 0 | 0.03 | 57.62 | 0.1 | 0 | 0 | 0 | 14.23 | 0 | 0 | 28.01 | 0 |
○Thread 2420048 | 94.31 | 0 | 0 | 15.44 | 68.23 | 0.03 | 0 | 0 | 0 | 0.21 | 0 | 0 | 16.09 | 0 |
▼run_2 | 75.18 | 0 | 0 | 20.67 | 53.65 | 0.06 | 0 | 0 | 0 | 6.26 | 0 | 0 | 19.35 | 0 |
▼Node skylake | 75.18 | 0 | 0 | 20.67 | 53.65 | 0.06 | 0 | 0 | 0 | 6.26 | 0 | 0 | 19.35 | 0 |
▼Process 2420062 | 75.18 | 0 | 0 | 20.67 | 53.65 | 0.06 | 0 | 0 | 0 | 6.26 | 0 | 0 | 19.35 | 0 |
○Thread 2420062 | 75.18 | 0 | 0 | 4.94 | 43.39 | 0.12 | 0 | 0 | 0 | 20.18 | 0 | 0 | 31.36 | 0.01 |
○Thread 2420115 | 57.99 | 0 | 0 | 24.87 | 60.46 | 0.03 | 0 | 0 | 0 | 0.15 | 0 | 0 | 14.5 | 0 |
○Thread 2420116 | 56.68 | 0 | 0 | 28.97 | 57.04 | 0.04 | 0 | 0 | 0 | 0.16 | 0 | 0 | 13.79 | 0 |
○Thread 2420117 | 56.75 | 0 | 0 | 28.93 | 56.92 | 0.05 | 0 | 0 | 0 | 0.18 | 0 | 0 | 13.92 | 0 |
▼run_3 | 54.27 | 0 | 0 | 34.63 | 43.56 | 0.08 | 0 | 0 | 0 | 5 | 0 | 0 | 16.72 | 0 |
▼Node skylake | 54.27 | 0 | 0 | 34.63 | 43.56 | 0.08 | 0 | 0 | 0 | 5 | 0 | 0 | 16.72 | 0 |
▼Process 2420129 | 54.27 | 0 | 0 | 34.63 | 43.56 | 0.08 | 0 | 0 | 0 | 5 | 0 | 0 | 16.72 | 0 |
○Thread 2420129 | 54.27 | 0 | 0 | 3.76 | 30.76 | 0.16 | 0 | 0 | 0 | 27.76 | 0 | 0 | 37.56 | 0 |
○Thread 2420182 | 36.94 | 0 | 0 | 41.69 | 45.22 | 0.08 | 0 | 0 | 0 | 0.2 | 0 | 0 | 12.8 | 0 |
○Thread 2420183 | 37.11 | 0 | 0 | 39.05 | 47.82 | 0.07 | 0 | 0 | 0 | 0.18 | 0 | 0 | 12.88 | 0 |
○Thread 2420184 | 37.08 | 0 | 0 | 39.27 | 47.77 | 0.04 | 0 | 0 | 0 | 0.22 | 0 | 0 | 12.7 | 0 |
○Thread 2420185 | 36.75 | 0 | 0 | 40.45 | 47.05 | 0.07 | 0 | 0 | 0 | 0.23 | 0 | 0 | 12.2 | 0 |
○Thread 2420186 | 36.28 | 0 | 0 | 43.49 | 44.36 | 0.08 | 0 | 0 | 0 | 0.19 | 0 | 0 | 11.87 | 0 |
○Thread 2420187 | 36.27 | 0 | 0 | 43.51 | 44.45 | 0.06 | 0 | 0 | 0 | 0.17 | 0 | 0 | 11.82 | 0 |
○Thread 2420188 | 36.51 | 0 | 0 | 40.71 | 47.11 | 0.01 | 0 | 0 | 0 | 0.21 | 0 | 0 | 11.96 | 0 |
▼run_4 | 45.08 | 0 | 0 | 50.92 | 30.96 | 0.08 | 0 | 0 | 0 | 3.76 | 0 | 0 | 14.27 | 0 |
▼Node skylake | 45.08 | 0 | 0 | 50.92 | 30.96 | 0.08 | 0 | 0 | 0 | 3.76 | 0 | 0 | 14.27 | 0 |
▼Process 2420200 | 45.08 | 0 | 0 | 50.92 | 30.96 | 0.08 | 0 | 0 | 0 | 3.76 | 0 | 0 | 14.27 | 0 |
○Thread 2420200 | 45.08 | 0 | 0 | 2.41 | 19.47 | 0.16 | 0 | 0 | 0 | 35.45 | 0 | 0 | 42.51 | 0.01 |
○Thread 2420253 | 26.87 | 0 | 0 | 56.27 | 31.88 | 0.09 | 0 | 0 | 0 | 0.06 | 0 | 0 | 11.7 | 0 |
○Thread 2420254 | 26.89 | 0 | 0 | 56.2 | 31.73 | 0.06 | 0 | 0 | 0 | 0.3 | 0 | 0 | 11.72 | 0 |
○Thread 2420255 | 26.92 | 0 | 0 | 54.02 | 34 | 0.09 | 0 | 0 | 0 | 0.32 | 0 | 0 | 11.57 | 0 |
○Thread 2420256 | 26.76 | 0 | 0 | 55.46 | 33.13 | 0.07 | 0 | 0 | 0 | 0.19 | 0 | 0 | 11.15 | 0 |
○Thread 2420257 | 26.68 | 0 | 0 | 55.61 | 33.18 | 0.09 | 0 | 0 | 0 | 0.26 | 0 | 0 | 10.85 | 0 |
○Thread 2420258 | 26.68 | 0 | 0 | 55.28 | 33.3 | 0.07 | 0 | 0 | 0 | 0.15 | 0 | 0 | 11.19 | 0 |
○Thread 2420259 | 26.72 | 0 | 0 | 55.95 | 32.77 | 0.07 | 0 | 0 | 0 | 0.15 | 0 | 0 | 11.06 | 0 |
○Thread 2420260 | 26.64 | 0 | 0 | 57.95 | 30.97 | 0.04 | 0 | 0 | 0 | 0.24 | 0 | 0 | 10.79 | 0 |
○Thread 2420261 | 26.62 | 0 | 0 | 57.79 | 31.09 | 0.06 | 0 | 0 | 0 | 0.13 | 0 | 0 | 10.93 | 0 |
○Thread 2420262 | 26.6 | 0 | 0 | 57.82 | 31.07 | 0.08 | 0 | 0 | 0 | 0.15 | 0 | 0 | 10.88 | 0 |
○Thread 2420263 | 26.59 | 0 | 0 | 57.71 | 31.21 | 0.08 | 0 | 0 | 0 | 0.13 | 0 | 0 | 10.87 | 0 |
○Thread 2420264 | 26.85 | 0 | 0 | 57.69 | 30.91 | 0.13 | 0.02 | 0 | 0 | 0.22 | 0 | 0 | 11.02 | 0 |
○Thread 2420265 | 26.74 | 0 | 0 | 55.5 | 33.25 | 0.04 | 0 | 0 | 0 | 0.22 | 0 | 0 | 10.99 | 0 |
○Thread 2420266 | 26.73 | 0 | 0 | 55.27 | 33.43 | 0.07 | 0 | 0 | 0 | 0.17 | 0 | 0 | 11.05 | 0 |
○Thread 2420267 | 26.54 | 0 | 0 | 57.17 | 31.9 | 0.08 | 0 | 0 | 0 | 0.21 | 0 | 0 | 10.65 | 0 |
▼run_5 | 40.59 | 0 | 0 | 58.86 | 24.4 | 0.08 | 0 | 0 | 0 | 2.71 | 0 | 0 | 13.95 | 0 |
▼Node skylake | 40.59 | 0 | 0 | 58.86 | 24.4 | 0.08 | 0 | 0 | 0 | 2.71 | 0 | 0 | 13.95 | 0 |
▼Process 2420276 | 40.59 | 0 | 0 | 58.86 | 24.4 | 0.08 | 0 | 0 | 0 | 2.71 | 0 | 0 | 13.95 | 0 |
○Thread 2420276 | 40.59 | 0 | 0 | 1.42 | 14.93 | 0.12 | 0 | 0 | 0 | 37.64 | 0 | 0 | 45.89 | 0 |
○Thread 2420329 | 23.54 | 0 | 0 | 62.79 | 25.04 | 0.06 | 0 | 0 | 0 | 0.21 | 0 | 0 | 11.89 | 0 |
○Thread 2420330 | 23.56 | 0 | 0 | 62.54 | 24.98 | 0.11 | 0.02 | 0 | 0 | 0.23 | 0 | 0 | 12.12 | 0 |
○Thread 2420331 | 23.56 | 0 | 0 | 62.52 | 25.11 | 0.08 | 0 | 0 | 0 | 0.21 | 0 | 0 | 12.08 | 0 |
○Thread 2420332 | 23.67 | 0 | 0 | 62.74 | 24.88 | 0.04 | 0 | 0 | 0 | 0.25 | 0 | 0 | 12.08 | 0 |
○Thread 2420333 | 23.54 | 0 | 0 | 62.65 | 25.01 | 0.08 | 0 | 0 | 0 | 0.38 | 0 | 0 | 11.88 | 0 |
○Thread 2420334 | 23.54 | 0 | 0 | 62.25 | 25.71 | 0.13 | 0 | 0 | 0 | 0.36 | 0 | 0 | 11.56 | 0 |
○Thread 2420335 | 23.46 | 0 | 0 | 63.04 | 24.6 | 0.09 | 0 | 0 | 0 | 0.43 | 0 | 0 | 11.85 | 0 |
○Thread 2420336 | 23.57 | 0 | 0 | 62.63 | 25.56 | 0.08 | 0 | 0 | 0 | 0.25 | 0 | 0 | 11.47 | 0 |
○Thread 2420337 | 23.47 | 0 | 0 | 63.61 | 23.97 | 0.04 | 0 | 0 | 0 | 0.19 | 0 | 0 | 12.19 | 0 |
○Thread 2420338 | 23.5 | 0 | 0 | 62.97 | 24.68 | 0.11 | 0 | 0 | 0 | 0.32 | 0 | 0 | 11.93 | 0 |
○Thread 2420339 | 23.54 | 0 | 0 | 62.92 | 24.78 | 0.11 | 0 | 0 | 0 | 0.32 | 0 | 0 | 11.87 | 0 |
○Thread 2420340 | 23.6 | 0 | 0 | 62.1 | 25.66 | 0.11 | 0 | 0 | 0 | 0.25 | 0 | 0 | 11.89 | 0 |
○Thread 2420341 | 23.47 | 0 | 0 | 63.4 | 24.14 | 0.02 | 0 | 0 | 0 | 0.32 | 0 | 0 | 12.12 | 0 |
○Thread 2420342 | 23.47 | 0 | 0 | 63.41 | 24.27 | 0.09 | 0 | 0 | 0 | 0.36 | 0 | 0 | 11.87 | 0 |
○Thread 2420343 | 23.56 | 0 | 0 | 62.33 | 25.45 | 0.04 | 0 | 0 | 0 | 0.3 | 0 | 0 | 11.88 | 0 |
○Thread 2420344 | 23.68 | 0 | 0 | 63.03 | 24.54 | 0 | 0 | 0 | 0 | 0.36 | 0 | 0 | 12.08 | 0 |
○Thread 2420345 | 23.5 | 0 | 0 | 63.02 | 24.49 | 0.09 | 0 | 0 | 0 | 0.45 | 0 | 0 | 11.96 | 0 |
○Thread 2420346 | 23.58 | 0 | 0 | 61.95 | 25.55 | 0.06 | 0 | 0 | 0 | 0.36 | 0 | 0 | 12.08 | 0 |
○Thread 2420347 | 23.57 | 0 | 0 | 62.39 | 25.41 | 0.17 | 0 | 0 | 0 | 0.25 | 0 | 0 | 11.77 | 0 |
○Thread 2420348 | 23.58 | 0 | 0 | 62.56 | 25.36 | 0.02 | 0 | 0 | 0 | 0.36 | 0 | 0 | 11.7 | 0 |
○Thread 2420349 | 23.6 | 0 | 0 | 62.3 | 25.71 | 0.06 | 0.02 | 0 | 0 | 0.25 | 0 | 0 | 11.65 | 0 |
○Thread 2420350 | 23.26 | 0 | 0 | 64.45 | 24.31 | 0.13 | 0 | 0 | 0 | 0.37 | 0 | 0 | 10.75 | 0 |
○Thread 2420351 | 23.36 | 0 | 0 | 63.08 | 25.47 | 0.11 | 0 | 0 | 0 | 0.36 | 0 | 0 | 10.98 | 0 |
○Thread 2420352 | 23.43 | 0 | 0 | 62.85 | 25.74 | 0.06 | 0 | 0 | 0 | 0.19 | 0 | 0 | 11.16 | 0 |
○Thread 2420353 | 23.38 | 0 | 0 | 63.07 | 25.88 | 0.04 | 0 | 0 | 0 | 0.24 | 0 | 0 | 10.78 | 0 |
▼run_6 | 40.95 | 0 | 0 | 63.06 | 22.23 | 0.08 | 0 | 0 | 0 | 1.54 | 0 | 0 | 13.08 | 0 |
▼Node skylake | 40.95 | 0 | 0 | 63.06 | 22.23 | 0.08 | 0 | 0 | 0 | 1.54 | 0 | 0 | 13.08 | 0 |
▼Process 2420363 | 40.95 | 0 | 0 | 63.06 | 22.23 | 0.08 | 0 | 0 | 0 | 1.54 | 0 | 0 | 13.08 | 0 |
○Thread 2420363 | 40.95 | 0 | 0 | 1.32 | 15.12 | 0.17 | 0 | 0 | 0 | 37.71 | 0 | 0 | 45.67 | 0.01 |
○Thread 2420416 | 23.49 | 0 | 0 | 64.28 | 24.12 | 0.06 | 0 | 0 | 0 | 0.26 | 0 | 0 | 11.28 | 0 |
○Thread 2420417 | 22.93 | 0 | 0 | 67.02 | 20.87 | 0.09 | 0 | 0 | 0 | 0.13 | 0 | 0 | 11.89 | 0 |
○Thread 2420418 | 23 | 0 | 0 | 66.73 | 21.72 | 0.13 | 0 | 0 | 0 | 0.24 | 0 | 0 | 11.18 | 0 |
○Thread 2420419 | 23.21 | 0 | 0 | 66.67 | 20.55 | 0.11 | 0 | 0 | 0 | 0.22 | 0 | 0 | 12.45 | 0 |
○Thread 2420420 | 22.81 | 0 | 0 | 67.84 | 20.74 | 0.13 | 0 | 0 | 0 | 0.15 | 0 | 0 | 11.14 | 0 |
○Thread 2420421 | 22.83 | 0 | 0 | 67.88 | 20.56 | 0.04 | 0 | 0 | 0 | 0.09 | 0 | 0 | 11.43 | 0 |
○Thread 2420422 | 23.63 | 0 | 0 | 62.47 | 25.15 | 0.08 | 0 | 0 | 0 | 0.32 | 0 | 0 | 11.97 | 0 |
○Thread 2420423 | 23.32 | 0 | 0 | 64.34 | 23.38 | 0.09 | 0 | 0 | 0 | 0.34 | 0 | 0 | 11.86 | 0 |
○Thread 2420424 | 23.19 | 0 | 0 | 62.95 | 24.56 | 0.04 | 0 | 0 | 0 | 0.41 | 0 | 0 | 12.03 | 0 |
○Thread 2420425 | 22.94 | 0 | 0 | 66.96 | 20.29 | 0.02 | 0 | 0 | 0 | 0.26 | 0 | 0 | 12.47 | 0 |
○Thread 2420426 | 22.91 | 0 | 0 | 66.96 | 20.6 | 0.04 | 0 | 0 | 0 | 0.15 | 0 | 0 | 12.24 | 0 |
○Thread 2420427 | 23.23 | 0 | 0 | 66.85 | 20.4 | 0.09 | 0 | 0 | 0 | 0.22 | 0 | 0 | 12.44 | 0 |
○Thread 2420428 | 23.4 | 0 | 0 | 62.05 | 25.3 | 0.06 | 0 | 0 | 0 | 0.34 | 0 | 0 | 12.24 | 0 |
○Thread 2420429 | 23.63 | 0 | 0 | 62.49 | 24.96 | 0.13 | 0 | 0 | 0 | 0.36 | 0 | 0 | 12.06 | 0 |
○Thread 2420430 | 22.92 | 0 | 0 | 66.81 | 20.62 | 0.07 | 0 | 0 | 0 | 0.22 | 0 | 0 | 12.28 | 0 |
○Thread 2420431 | 23.56 | 0 | 0 | 67.64 | 20.14 | 0.04 | 0 | 0 | 0 | 0.19 | 0 | 0 | 11.99 | 0 |
○Thread 2420432 | 22.89 | 0 | 0 | 67.32 | 20.47 | 0.11 | 0 | 0 | 0 | 0.15 | 0 | 0 | 11.95 | 0 |
○Thread 2420433 | 22.82 | 0 | 0 | 67.2 | 20.73 | 0.04 | 0 | 0 | 0 | 0.24 | 0 | 0 | 11.79 | 0 |
○Thread 2420434 | 22.82 | 0 | 0 | 66.86 | 20.75 | 0.07 | 0 | 0 | 0 | 0.26 | 0 | 0 | 12.05 | 0 |
○Thread 2420435 | 23.56 | 0 | 0 | 62.31 | 25.34 | 0.17 | 0 | 0 | 0 | 0.36 | 0 | 0 | 11.82 | 0 |
○Thread 2420436 | 23.19 | 0 | 0 | 62.26 | 25.06 | 0.13 | 0 | 0 | 0 | 0.41 | 0 | 0 | 12.14 | 0 |
○Thread 2420437 | 22.79 | 0 | 0 | 61.65 | 26.02 | 0.04 | 0 | 0 | 0 | 0.31 | 0 | 0 | 11.98 | 0 |
○Thread 2420438 | 22.94 | 0 | 0 | 66.63 | 20.71 | 0.09 | 0 | 0 | 0 | 0.07 | 0 | 0 | 12.51 | 0 |
○Thread 2420439 | 23.35 | 0 | 0 | 62.03 | 25.5 | 0.09 | 0 | 0 | 0 | 0.41 | 0 | 0 | 11.97 | 0 |
○Thread 2420440 | 23.3 | 0 | 0 | 61.95 | 25.56 | 0.11 | 0 | 0 | 0 | 0.43 | 0 | 0 | 11.95 | 0 |
○Thread 2420441 | 23.35 | 0 | 0 | 62.11 | 25.57 | 0.06 | 0 | 0 | 0 | 0.41 | 0 | 0 | 11.84 | 0 |
○Thread 2420442 | 23.44 | 0 | 0 | 62.16 | 25.28 | 0.06 | 0 | 0 | 0 | 0.34 | 0 | 0 | 12.16 | 0 |
○Thread 2420443 | 23.67 | 0 | 0 | 62.51 | 25.11 | 0.11 | 0 | 0 | 0 | 0.42 | 0 | 0 | 11.85 | 0 |
○Thread 2420444 | 23.48 | 0 | 0 | 62.35 | 25.26 | 0.06 | 0 | 0 | 0 | 0.45 | 0 | 0 | 11.88 | 0 |
○Thread 2420445 | 23.09 | 0 | 0 | 68.1 | 20.4 | 0.09 | 0 | 0 | 0 | 0.32 | 0 | 0 | 11.09 | 0 |
○Thread 2420446 | 22.99 | 0 | 0 | 66.68 | 20.73 | 0.13 | 0 | 0 | 0 | 0.17 | 0 | 0 | 12.29 | 0 |
○Thread 2420447 | 23.68 | 0 | 0 | 63.94 | 23.75 | 0.04 | 0 | 0 | 0 | 0.19 | 0 | 0 | 12.08 | 0 |
○Thread 2420448 | 22.81 | 0 | 0 | 67.32 | 19.62 | 0.04 | 0 | 0 | 0 | 0.28 | 0 | 0 | 12.74 | 0 |
○Thread 2420449 | 22.76 | 0 | 0 | 68.23 | 19.02 | 0.04 | 0 | 0 | 0 | 0.18 | 0 | 0 | 12.52 | 0 |
○Thread 2420450 | 23.14 | 0 | 0 | 67.86 | 19.41 | 0.09 | 0 | 0 | 0 | 0.32 | 0 | 0 | 12.32 | 0 |
○Thread 2420451 | 23.5 | 0 | 0 | 68.78 | 18.13 | 0.09 | 0.02 | 0 | 0 | 0.26 | 0 | 0 | 12.73 | 0 |
○Thread 2420452 | 22.79 | 0 | 0 | 67.07 | 19.64 | 0.02 | 0 | 0 | 0 | 0.31 | 0 | 0 | 12.97 | 0 |
○Thread 2420453 | 23.05 | 0 | 0 | 62.95 | 24.53 | 0.07 | 0 | 0 | 0 | 0.41 | 0 | 0 | 12.04 | 0 |
○Thread 2420454 | 22.86 | 0 | 0 | 67.95 | 18.7 | 0.11 | 0 | 0 | 0 | 0.15 | 0 | 0 | 13.08 | 0 |
○Thread 2420455 | 23.27 | 0 | 0 | 62.98 | 24.19 | 0.09 | 0 | 0 | 0 | 0.41 | 0 | 0 | 12.33 | 0 |
○Thread 2420456 | 23.23 | 0 | 0 | 63.43 | 23.81 | 0.13 | 0 | 0 | 0 | 0.37 | 0 | 0 | 12.27 | 0 |
○Thread 2420457 | 23.25 | 0 | 0 | 63.2 | 24.28 | 0.06 | 0.02 | 0 | 0 | 0.45 | 0 | 0 | 11.98 | 0 |
○Thread 2420458 | 23.3 | 0 | 0 | 63.35 | 23.86 | 0.09 | 0 | 0 | 0 | 0.45 | 0 | 0 | 12.25 | 0 |
○Thread 2420459 | 23.35 | 0 | 0 | 67.83 | 19.38 | 0.06 | 0 | 0 | 0 | 0.21 | 0 | 0 | 12.51 | 0 |
○Thread 2420460 | 23.03 | 0 | 0 | 64.63 | 22.69 | 0.07 | 0 | 0 | 0 | 0.48 | 0 | 0 | 12.14 | 0 |
○Thread 2420461 | 23.09 | 0 | 0 | 63.77 | 23.11 | 0.13 | 0 | 0 | 0 | 0.39 | 0 | 0 | 12.6 | 0 |
○Thread 2420462 | 23.44 | 0 | 0 | 63.61 | 24.1 | 0.13 | 0 | 0 | 0 | 0.45 | 0 | 0 | 11.71 | 0 |
○Thread 2420463 | 23.17 | 0 | 0 | 65.53 | 24.22 | 0 | 0 | 0 | 0 | 0.15 | 0 | 0 | 10.1 | 0 |
○Thread 2420464 | 22.9 | 0 | 0 | 66.35 | 23.41 | 0.07 | 0 | 0 | 0 | 0.13 | 0 | 0 | 10.04 | 0 |
○Thread 2420465 | 22.84 | 0 | 0 | 64.89 | 24.56 | 0.04 | 0 | 0 | 0 | 0.18 | 0 | 0 | 10.33 | 0 |
○Thread 2420466 | 22.42 | 0 | 0 | 70.58 | 18.76 | 0.07 | 0 | 0 | 0 | 0.13 | 0 | 0 | 10.46 | 0 |
Detailed Function Times
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | OMP (%) | Math (%) | System (%) | Memory (%) | libqmckl.so.0.0.0 (%) |
---|---|---|---|---|---|---|
run_0 | 1 | 0 | 67.54 | 0.02 | 8.35 | 24.08 |
run_1 | 2 | 7.07 | 62.46 | 0.07 | 7.83 | 22.57 |
run_2 | 4 | 20.67 | 53.65 | 0.06 | 6.26 | 19.35 |
run_3 | 8 | 34.63 | 43.56 | 0.08 | 5 | 16.72 |
run_4 | 16 | 50.92 | 30.96 | 0.08 | 3.76 | 14.27 |
run_5 | 26 | 58.86 | 24.4 | 0.08 | 2.71 | 13.95 |
run_6 | 52 | 63.06 | 22.23 | 0.08 | 1.54 | 13.08 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | OMP (s) | Math (s) | System (s) | Memory (s) | libqmckl.so.0.0.0 (s) |
---|---|---|---|---|---|---|---|
run_0 | 1 | 190.06 | 0 | 128.38 | 0.04 | 15.87 | 45.77 |
run_1 | 2 | 112.28 | 7.94 | 70.13 | 0.08 | 8.79 | 25.34 |
run_2 | 4 | 75.17 | 15.54 | 40.33 | 0.05 | 4.71 | 14.55 |
run_3 | 8 | 54.26 | 18.79 | 23.64 | 0.04 | 2.71 | 9.07 |
run_4 | 16 | 45.08 | 22.95 | 13.96 | 0.04 | 1.7 | 6.43 |
run_5 | 26 | 40.59 | 23.89 | 9.9 | 0.03 | 1.1 | 5.66 |
run_6 | 52 | 40.95 | 25.82 | 9.1 | 0.03 | 0.63 | 5.36 |
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_0 | 1 | 1 |
run_1 | 2 | 0.76 |
run_2 | 4 | 0.51 |
run_3 | 8 | 0.31 |
run_4 | 16 | 0.17 |
run_5 | 26 | 0.11 |
run_6 | 52 | 0.06 |
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0.07 | 0 | 0 | 0 | 0 | 15.3 | 77.52 | 7.11 |
run_2 | 4 | 0 | 0 | 0.09 | 0 | 0 | 0 | 4.26 | 9.14 | 65.81 | 0.02 | 20.68 |
run_3 | 8 | 0 | 0.08 | 0 | 0.01 | 4.71 | 3.2 | 44.4 | 12.92 | 0 | 0 | 34.68 |
run_4 | 16 | 0 | 0.07 | 5.43 | 0 | 7.02 | 31.38 | 5.1 | 0 | 0 | 0 | 51 |
run_5 | 26 | 0.06 | 6.77 | 1.69 | 5.46 | 0 | 25.47 | 1.63 | 0 | 0 | 0 | 58.92 |
run_6 | 52 | 9.65 | 6.28 | 0.24 | 17.14 | 1.54 | 1.88 | 0.18 | 0 | 0 | 0 | 63.09 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0.07 | 0 | 0 | 7.07 | 92.82 | 0.04 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.08 | 20.69 | 79.23 | 0 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34.66 | 65.31 | 0.03 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50.97 | 49 | 0.03 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 58.87 | 41.08 | 0.05 |
run_6 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 63.07 | 36.91 | 0.02 |
PrOMPT - Coverage per Parallel Efficiency at OpenMP Region Level
Detailed Coverage per Parallel Efficiency
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54.32 | 45.68 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3.17 | 2.78 | 33.76 | 60.29 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 4.28 | 0 | 20.05 | 1.28 | 74.39 |
run_4 | 16 | 0 | 0 | 0 | 2.06 | 0 | 1.42 | 0.24 | 0.01 | 12.45 | 0 | 83.82 |
run_5 | 26 | 0 | 0 | 2.36 | 0.27 | 1.11 | 0 | 0 | 8.36 | 0 | 0.42 | 87.48 |
run_6 | 52 | 2.77 | 0.25 | 1.11 | 0 | 7.77 | 0.04 | 0 | 0 | 0.04 | 0.18 | 87.84 |
PrOMPT - Coverage per Parallel Speedup at OpenMP Region Level
Detailed Coverage per Parallel Speedup
Colums Filter
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured regions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54.32 | 45.68 |
run_2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 39.71 | 60.29 |
run_3 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25.61 | 74.39 |
run_4 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16.18 | 83.82 |
run_5 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12.52 | 87.48 |
run_6 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12.16 | 87.84 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_0 | run_1 | run_2 | run_3 | run_4 | run_5 | run_6 |
---|---|---|---|---|---|---|---|
/home/kcamus/comparative/qmckl/qmckl_bench/build/libqmckl/__install/lib/libqmckl.so.0.0.0 | |||||||
/home/kcamus/comparative/qmckl/trexio/_install/lib/libtrexio.so.0.0.0 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifcoremt.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libifport.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libimf.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libintlc.so.5 | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libiomp5.so | |||||||
/opt/intel/oneapi.old/compiler/2023.0.0/linux/compiler/lib/intel64_lin/libsvml.so | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_avx512.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_core.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_intel_lp64.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_sequential.so.2 | |||||||
/opt/intel/oneapi.old/mkl/2023.0.0/lib/intel64/libmkl_vml_avx512.so.2 | |||||||
/opt/other/hdf5/gcc/seq/1.10.2/lib/libhdf5.so.101.1.0 | |||||||
/usr/lib/ld-linux-x86-64.so.2 | |||||||
/usr/lib/libc.so.6 | |||||||
/usr/lib/libdl.so.2 | |||||||
/usr/lib/libgcc_s.so.1 | |||||||
/usr/lib/libm.so.6 | |||||||
/usr/lib/libpthread.so.0 | |||||||
/usr/lib/librt.so.1 | |||||||
/usr/lib/libz.so.1.3 |