Detailed Application Categorization
ID | Time(s) | Binary(%) | MPI(%) | OMP(%) | TBB(%) | Math(%) | System(%) | Pthread(%) | IO(%) | String(%) | Memory(%) | Others(%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
▼run_1_thread | 101.79 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 101.79 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 1508694 | 101.79 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 1508694) | 101.79 | 99.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_2_threads | 55.90 | 95.59 | 0.00 | 4.40 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 55.90 | 95.59 | 0.00 | 4.40 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 1508738 | 55.90 | 95.59 | 0.00 | 4.40 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 1508738) | 55.90 | 99.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 1508760) | 53.18 | 90.97 | 0.00 | 9.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_4_threads | 33.18 | 88.42 | 0.00 | 11.53 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 33.18 | 88.42 | 0.00 | 11.53 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 1508778 | 33.18 | 88.42 | 0.00 | 11.53 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 1508778) | 33.18 | 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 1508800) | 30.26 | 84.19 | 0.00 | 15.69 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 1508801) | 30.26 | 84.17 | 0.00 | 15.79 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 1508802) | 30.26 | 84.19 | 0.00 | 15.76 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_8_threads | 21.28 | 76.48 | 0.00 | 23.42 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 21.28 | 76.48 | 0.00 | 23.42 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 1508823 | 21.28 | 76.48 | 0.00 | 23.42 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 1508823) | 21.28 | 97.65 | 0.00 | 2.30 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 1508845) | 17.88 | 73.01 | 0.00 | 26.83 | 0.00 | 0.00 | 0.17 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 1508846) | 17.82 | 72.94 | 0.00 | 27.00 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 1508847) | 17.90 | 73.08 | 0.00 | 26.84 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 1508848) | 18.21 | 72.08 | 0.00 | 27.79 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 1508849) | 17.81 | 72.90 | 0.00 | 27.01 | 0.00 | 0.00 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 1508850) | 17.81 | 72.96 | 0.00 | 27.01 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 1508851) | 18.07 | 73.31 | 0.00 | 26.56 | 0.00 | 0.00 | 0.14 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼run_10_threads | 19.07 | 71.80 | 0.00 | 28.10 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Node otterfall | 19.07 | 71.80 | 0.00 | 28.10 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
▼Process 1508868 | 19.07 | 71.80 | 0.00 | 28.10 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 0 (TID 1508868) | 19.07 | 95.94 | 0.00 | 4.01 | 0.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 1 (TID 1508890) | 15.38 | 68.66 | 0.00 | 31.21 | 0.00 | 0.00 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 2 (TID 1508891) | 15.21 | 68.28 | 0.00 | 31.69 | 0.00 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 3 (TID 1508892) | 15.64 | 69.31 | 0.00 | 30.59 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 4 (TID 1508893) | 15.97 | 66.34 | 0.00 | 33.66 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 5 (TID 1508894) | 15.55 | 68.97 | 0.00 | 30.93 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 6 (TID 1508895) | 15.20 | 68.39 | 0.00 | 31.48 | 0.00 | 0.00 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 7 (TID 1508896) | 15.75 | 69.52 | 0.00 | 30.29 | 0.00 | 0.00 | 0.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 8 (TID 1508897) | 15.42 | 68.71 | 0.00 | 31.19 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
○OMP # 9 (TID 1508898) | 15.21 | 68.31 | 0.00 | 31.53 | 0.00 | 0.00 | 0.16 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
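The run-level percentages in the table above appear to be time-weighted averages of the per-thread percentages. The sketch below checks that reading against the run_2_threads and run_4_threads rows. The weighting rule is inferred from the numbers, not stated by the report, and the helper name `weighted_coverage` is hypothetical.

```python
def weighted_coverage(threads):
    """Time-weighted mean of per-thread coverage.

    threads: list of (time_s, coverage_percent) tuples, one per thread.
    Assumption: each thread is weighted by its own measured time.
    """
    total_time = sum(t for t, _ in threads)
    return sum(t * c for t, c in threads) / total_time


# Per-thread (Time(s), Binary(%)) values copied from the table above.
run_2 = [(55.90, 99.98), (53.18, 90.97)]
run_4 = [(33.18, 100.00), (30.26, 84.19), (30.26, 84.17), (30.26, 84.19)]

print(round(weighted_coverage(run_2), 2))
print(round(weighted_coverage(run_4), 2))
```

Under this assumption both results round to the run-level Binary(%) values reported for those runs (95.59 and 88.42).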
Scalability - Coverage per Category
Detailed Coverage per Category
Run | Number of threads | Binary (%) | OMP (%) | System (%) |
---|---|---|---|---|
run_1_thread | 1 | 99.99 | 0.00 | 0.01 |
run_2_threads | 2 | 95.59 | 4.40 | 0.01 |
run_4_threads | 4 | 88.42 | 11.53 | 0.05 |
run_8_threads | 8 | 76.48 | 23.42 | 0.09 |
run_10_threads | 10 | 71.80 | 28.10 | 0.10 |
Scalability - Time per Category
Detailed Time per Category
Run | Number of threads | Total Time (s) | Binary (s) | OMP (s) | System (s) |
---|---|---|---|---|---|
run_1_thread | 1 | 101.79 | 101.78 | 0.00 | 0.01 |
run_2_threads | 2 | 55.90 | 53.44 | 2.46 | 0.01 |
run_4_threads | 4 | 33.18 | 29.34 | 3.83 | 0.02 |
run_8_threads | 8 | 21.28 | 16.28 | 4.98 | 0.02 |
run_10_threads | 10 | 19.07 | 13.69 | 5.36 | 0.02 |
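The per-category seconds in the table above appear to be the total run time multiplied by the category's coverage share. The following sketch recomputes them from the coverage percentages. That relationship is an assumption drawn from the reported values, and `time_per_category` is a hypothetical helper name.

```python
# (total_time_s, binary_pct, omp_pct, system_pct) per run, copied from the report tables.
runs = {
    "run_1_thread": (101.79, 99.99, 0.00, 0.01),
    "run_2_threads": (55.90, 95.59, 4.40, 0.01),
    "run_4_threads": (33.18, 88.42, 11.53, 0.05),
    "run_8_threads": (21.28, 76.48, 23.42, 0.09),
    "run_10_threads": (19.07, 71.80, 28.10, 0.10),
}


def time_per_category(total_s, coverage_pct):
    """Seconds attributed to a category, assuming time = total * coverage / 100."""
    return total_s * coverage_pct / 100.0


for name, (total, binary, omp, system) in runs.items():
    print(
        name,
        round(time_per_category(total, binary), 2),
        round(time_per_category(total, omp), 2),
        round(time_per_category(total, system), 2),
    )
```

Read this way, the OMP column grows from 0 s to roughly 5.4 s as threads are added while the Binary column shrinks, which matches the falling efficiency reported for the same runs.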
Scalability - Efficiency
Detailed Efficiency
Run | Number of observed threads | Efficiency (ideal is 1) |
---|---|---|
run_1_thread | 1 | 1 |
run_2_threads | 2 | 0.91 |
run_4_threads | 4 | 0.76 |
run_8_threads | 8 | 0.6 |
run_10_threads | 10 | 0.53 |
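The efficiency figures above are close to the usual strong-scaling definition, efficiency = T(1) / (N × T(N)), applied to the total run times. The sketch below recomputes them. The report does not state which formula it uses, so this definition is an assumption, and the function names are illustrative.

```python
def speedup(t1, tn):
    """Speedup of an N-thread run relative to the 1-thread run."""
    return t1 / tn


def efficiency(t1, tn, n):
    """Parallel efficiency, assuming the definition T(1) / (N * T(N))."""
    return speedup(t1, tn) / n


# Total Time (s) per thread count, copied from the report.
times = {1: 101.79, 2: 55.90, 4: 33.18, 8: 21.28, 10: 19.07}
# Efficiency values as reported in the table above.
reported = {1: 1.0, 2: 0.91, 4: 0.76, 8: 0.6, 10: 0.53}

t1 = times[1]
for n, tn in times.items():
    print(n, round(speedup(t1, tn), 2), round(efficiency(t1, tn, n), 3), reported[n])
```

The recomputed values agree with the reported ones to within 0.01. The 4-thread value comes out slightly above the reported 0.76, which may reflect truncation or unrounded timings inside the tool.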
Function Based Profile
Scalability - Coverage per Parallel Efficiency at Function Level
Detailed Coverage per Parallel Efficiency
Run | Number of observed threads | Coverage (%) 0% to 10% eff. | Coverage (%) 10% to 20% eff. | Coverage (%) 20% to 30% eff. | Coverage (%) 30% to 40% eff. | Coverage (%) 40% to 50% eff. | Coverage (%) 50% to 60% eff. | Coverage (%) 60% to 70% eff. | Coverage (%) 70% to 80% eff. | Coverage (%) 80% to 90% eff. | Coverage (%) 90% to 100% eff. | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 95.59 | 4.4 |
run_4_threads | 4 | 0 | 0 | 0.02 | 0 | 0 | 0 | 0 | 0 | 88.42 | 0 | 11.57 |
run_8_threads | 8 | 0.05 | 0 | 0 | 0 | 0 | 0 | 0 | 71.17 | 5.32 | 0 | 23.46 |
run_10_threads | 10 | 0.04 | 0 | 0 | 0 | 0 | 0 | 0 | 71.8 | 0 | 0 | 28.16 |
Scalability - Coverage per Parallel Speedup at Function Level
Detailed Coverage per Parallel Speedup
Run | Number of observed threads | Coverage (%) 0% to 10% speedup | Coverage (%) 10% to 20% speedup | Coverage (%) 20% to 30% speedup | Coverage (%) 30% to 40% speedup | Coverage (%) 40% to 50% speedup | Coverage (%) 50% to 60% speedup | Coverage (%) 60% to 70% speedup | Coverage (%) 70% to 80% speedup | Coverage (%) 80% to 90% speedup | Coverage (%) 90% to 100% speedup | Coverage (%) > 100% speedup | Coverage (%) new or unmeasured functions |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
run_1_thread | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
run_2_threads | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4.41 | 95.59 | 0 |
run_4_threads | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11.58 | 88.42 | 0 |
run_8_threads | 8 | 0 | 0 | 0 | 0 | 0.05 | 0 | 0 | 0 | 0 | 23.46 | 76.49 | 0 |
run_10_threads | 10 | 0 | 0 | 0 | 0 | 0 | 0.04 | 0 | 0 | 0 | 28.16 | 71.8 | 0 |
Libraries
- green cell: the library has been found during the run profiling
- red cell: the library does not appear in the run profiling
Library | run_1_thread | run_2_threads | run_4_threads | run_8_threads | run_10_threads |
---|---|---|---|---|---|
/opt/intel/oneapi/compiler/2024.2/lib/libarcher.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libimf.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libintlc.so.5 | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libiomp5.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libirng.so | |||||
/opt/intel/oneapi/compiler/2024.2/lib/libsvml.so | |||||
/usr/lib/ld-linux-x86-64.so.2 | |||||
/usr/lib/libc.so.6 | |||||
/usr/lib/libdl.so.2 | |||||
/usr/lib/libgcc_s.so.1 | |||||
/usr/lib/libm.so.6 | |||||
/usr/lib/libpthread.so.0 | |||||
/usr/lib/librt.so.1 | |||||
/usr/lib/libstdc++.so.6.0.34 |