Run 1x1 | Number processes: 1; Number nodes: 1; Number processes per node: 1; Run Command: <executable>; MPI Command: mpirun -np <number_processes>; Dataset: (empty); Run Directory: /home/eoseret/qaas_runs_CPU_9468/171-147-5968/intel/HACCmk/run/oneview_runs/compilers/icx_5/oneview_run_1711479105; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread; OMP_NUM_THREADS: 1 |
---|---|
Run 1x2 | OMP_NUM_THREADS: 2; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x4 | OMP_NUM_THREADS: 4; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x8 | OMP_NUM_THREADS: 8; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x16 | OMP_NUM_THREADS: 16; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x32 | OMP_NUM_THREADS: 32; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x48 | OMP_NUM_THREADS: 48; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x96 | OMP_NUM_THREADS: 96; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
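
The report does not define its Efficiency and Potential Speed-Up (%) columns. As a reading aid, the Python sketch below uses two assumed formulas that reproduce the reported `Step10_orig` values to within rounding: efficiency as T1 / (n × Tn) on wall time, and potential speed-up as coverage × (1 − efficiency). The function names are hypothetical and are not part of the profiling tool; the input numbers are copied from the `Step10_orig` row of the results table.

```python
# Sketch only: the two formulas are assumptions inferred from the reported
# numbers, not a documented definition from the profiling tool.

def parallel_efficiency(t1: float, tn: float, n_threads: int) -> float:
    """Strong-scaling efficiency: single-thread time over n times the n-thread time."""
    return t1 / (n_threads * tn)


def potential_speedup_pct(coverage_pct: float, efficiency: float) -> float:
    """Share of run time (in %) that would be recovered at perfect efficiency (assumed formula)."""
    return coverage_pct * (1.0 - efficiency)


# "Time w.r.t. Wall Time" (s) and "Coverage" (%) for Step10_orig, keyed by thread count.
WALL_TIME_S = {1: 1083.87, 2: 542.78, 4: 271.75, 8: 135.87,
               16: 67.85, 32: 33.97, 48: 25.13, 96: 15.24}
COVERAGE_PCT = {1: 99.98, 2: 99.77, 4: 99.57, 8: 99.02,
                16: 98.36, 32: 96.78, 48: 91.32, 96: 91.21}

if __name__ == "__main__":
    t1 = WALL_TIME_S[1]
    for n, tn in WALL_TIME_S.items():
        eff = parallel_efficiency(t1, tn, n)
        gain = potential_speedup_pct(COVERAGE_PCT[n], eff)
        print(f"1x{n}: efficiency={eff:.2f}, potential speed-up={gain:.2f}%")
```

Under these assumptions, the drop in efficiency at 48 and 96 threads coincides with the growing coverage of the `__kmp_wait_template` row, which is where the OpenMP runtime waits.
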
Name | Module | Coverage 1x1 (%) | Coverage 1x2 (%) | Coverage 1x4 (%) | Coverage 1x8 (%) | Coverage 1x16 (%) | Coverage 1x32 (%) | Coverage 1x48 (%) | Coverage 1x96 (%) | Max Time Over Threads 1x1 (s) | Max Time Over Threads 1x2 (s) | Max Time Over Threads 1x4 (s) | Max Time Over Threads 1x8 (s) | Max Time Over Threads 1x16 (s) | Max Time Over Threads 1x32 (s) | Max Time Over Threads 1x48 (s) | Max Time Over Threads 1x96 (s) | Time w.r.t. Wall Time 1x1 (s) | Time w.r.t. Wall Time 1x2 (s) | Time w.r.t. Wall Time 1x4 (s) | Time w.r.t. Wall Time 1x8 (s) | Time w.r.t. Wall Time 1x16 (s) | Time w.r.t. Wall Time 1x32 (s) | Time w.r.t. Wall Time 1x48 (s) | Time w.r.t. Wall Time 1x96 (s) | Nb Threads 1x1 | Nb Threads 1x2 | Nb Threads 1x4 | Nb Threads 1x8 | Nb Threads 1x16 | Nb Threads 1x32 | Nb Threads 1x48 | Nb Threads 1x96 | Deviation (coverage) 1x1 | Deviation (coverage) 1x2 | Deviation (coverage) 1x4 | Deviation (coverage) 1x8 | Deviation (coverage) 1x16 | Deviation (coverage) 1x32 | Deviation (coverage) 1x48 | Deviation (coverage) 1x96 | Deviation (walltime) 1x1 | Deviation (walltime) 1x2 | Deviation (walltime) 1x4 | Deviation (walltime) 1x8 | Deviation (walltime) 1x16 | Deviation (walltime) 1x32 | Deviation (walltime) 1x48 | Deviation (walltime) 1x96 | Categories 1x1 | Categories 1x2 | Categories 1x4 | Categories 1x8 | Categories 1x16 | Categories 1x32 | Categories 1x48 | Categories 1x96 | GFLOPS 1x1 | GFLOPS 1x2 | GFLOPS 1x4 | GFLOPS 1x8 | GFLOPS 1x16 | GFLOPS 1x32 | GFLOPS 1x48 | GFLOPS 1x96 | Compilation Options | (1x1) Efficiency | (1x1) Potential Speed-Up (%) | (1x2) Efficiency | (1x2) Potential Speed-Up (%) | (1x4) Efficiency | (1x4) Potential Speed-Up (%) | (1x8) Efficiency | (1x8) Potential Speed-Up (%) | (1x16) Efficiency | (1x16) Potential Speed-Up (%) | (1x32) Efficiency | (1x32) Potential Speed-Up (%) | (1x48) Efficiency | (1x48) Potential Speed-Up (%) | (1x96) Efficiency | (1x96) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►Step10_orig | exec | 99.98 | 99.77 | 99.57 | 99.02 | 98.36 | 96.78 | 91.32 | 91.21 | 1083.87 | 542.9 | 271.64 | 135.94 | 68.02 | 34.18 | 26.66 | 15.78 | 1083.87 | 542.78 | 271.75 | 135.87 | 67.85 | 33.97 | 25.13 | 15.24 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.03 | 0.06 | 0.11 | 0.22 | 0.45 | 5.21 | 1.42 | 0.00 | 0.28 | 0.07 | 0.13 | 0.15 | 0.17 | 1.47 | 0.24 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 21.45 | 42.83 | 85.54 | 171.09 | 342.60 | 684.28 | 925.01 | 1525.20 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -I /home/eoseret/qaas_runs_CPU_9468/171-147-5968/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eoseret/qaas_runs_CPU_9468/171-147-5968/intel/HACCmk/build/icx_5 -fargu... | 1 | 0 | 1 | 0.16 | 1 | 0.29 | 1 | 0.28 | 1 | 0.16 | 1 | 0.28 | 0.9 | 9.26 | 0.74 | 23.64 |
○Loop 5 - Step10_orig.c:19-35 - exec | | 99.95 | 99.75 | 99.54 | 99 | 98.34 | 96.76 | 91.3 | 91.19 | 1083.62 | 542.76 | 271.58 | 135.9 | 68.01 | 34.17 | 26.64 | 15.78 | 1083.62 | 542.65 | 271.68 | 135.84 | 67.84 | 33.97 | 25.12 | 15.23 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 0.03 | 0.06 | 0.11 | 0.22 | 0.45 | 5.21 | 1.42 | 0.00 | 0.25 | 0.06 | 0.13 | 0.15 | 0.17 | 1.47 | 0.24 | | | | | | | | | 21.45 | 42.83 | 85.55 | 171.10 | 342.62 | 684.22 | 925.26 | 1526.07 | | 1 | 0 | 1 | 0.15 | 1 | 0.28 | 1 | 0.28 | 1 | 0.16 | 1 | 0.3 | 0.9 | 9.25 | 0.74 | 23.6 |
►__intel_avx_rep_memset | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.1 | 0.12 | 0.08 | 0.15 | 0.14 | 0.12 | 0.1 | 0.09 | 0.1 | 0.06 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.83 | 0 | 1.25 | -0 | 0.63 | 0 | 0.63 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○Loop 7 - - exec | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
○unknown_kernel_region | kernel | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.05 | 0.03 | 0.01 | 0.03 | 0.02 | 0.01 | 0.03 | 0.02 | 0.05 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 1 | 2 | 4 | 8 | 16 | 29 | 40 | 76 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 98.21; OMP (%): 1.79 | System (%): 100.00 | 2.96 | 7.70 | 17.20 | 12.40 | 14.60 | 0.00 | 14.60 | 0.00 | | 1 | 0 | 1.25 | -0 | 1.25 | -0 | 0.63 | 0 | 0.31 | 0.01 | 1 | 0 | 0.1 | 0.02 | 1 | 0 |
►main | exec | 0.01 | 0.08 | 0.06 | 0.09 | 0.06 | 0.07 | 0.06 | 0.05 | 0.07 | 0.85 | 0.68 | 0.93 | 0.64 | 0.77 | 0.76 | 0.76 | 0.08 | 0.42 | 0.17 | 0.12 | 0.04 | 0.02 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 25.95 | 5.21 | 11.06 | 16.87 | 49.80 | 107.20 | 95.00 | 201.21 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -I /home/eoseret/qaas_runs_CPU_9468/171-147-5968/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eoseret/qaas_runs_CPU_9468/171-147-5968/intel/HACCmk/build/icx_5 -fargu... | 1 | 0 | 0.1 | 0.07 | 0.12 | 0.05 | 0.08 | 0.08 | 0.13 | 0.05 | 0.13 | 0.06 | 0.08 | 0.05 | 0.08 | 0.05 |
►Loop 2 - main.c:77-169 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
○Loop 1 - main.c:111-116 - exec | | 0.01 | 0.08 | 0.06 | 0.09 | 0.06 | 0.07 | 0.06 | 0.05 | 0.07 | 0.85 | 0.68 | 0.93 | 0.64 | 0.77 | 0.76 | 0.76 | 0.07 | 0.42 | 0.17 | 0.12 | 0.04 | 0.02 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 29.66 | 5.21 | 11.06 | 16.87 | 49.80 | 107.00 | 95.00 | 201.21 | | 1 | 0 | 0.08 | 0.07 | 0.1 | 0.05 | 0.07 | 0.08 | 0.11 | 0.05 | 0.11 | 0.06 | 0.07 | 0.06 | 0.07 | 0.05 |
○Loop 3 - main.c:111-116 - exec | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
○Loop 0 - main.c:111-116 - exec | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
○_ZN17_INTERNAL021345c126__kmp_hyper_barrier_gatherE12barrier_typeP8kmp_infoiiPFvPvS3_ES3_..0 | libiomp5.so | 0 | 0.03 | 0.01 | 0.03 | 0.02 | 0.04 | 0.04 | 0.06 | 0 | 0.37 | 0.15 | 0.15 | 0.13 | 0.17 | 0.06 | 0.13 | 0 | 0.19 | 0.04 | 0.04 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 2 | 2 | 3 | 4 | 11 | 21 | 27 | 0.00 | 0.05 | 0.04 | 0.06 | 0.07 | 0.14 | 0.07 | 0.17 | 0.00 | 0.26 | 0.10 | 0.08 | 0.05 | 0.05 | 0.02 | 0.03 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○_ZN17_INTERNAL021345c119__kmp_wait_templateI11kmp_flag_64ILb0ELb1EELb1ELb0ELb1EEEbP8kmp_infoPT_Pv | libiomp5.so | 0 | 0.1 | 0.33 | 0.82 | 1.5 | 3.02 | 8.4 | 8.47 | 0 | 1.05 | 1.33 | 1.47 | 1.3 | 1.52 | 3.98 | 1.89 | 0 | 0.53 | 0.91 | 1.12 | 1.03 | 1.06 | 2.31 | 1.41 | 0 | 1 | 3 | 7 | 15 | 31 | 47 | 95 | 0.00 | 0.00 | 0.04 | 0.11 | 0.17 | 0.45 | 5.10 | 1.35 | 0.00 | 0.00 | 0.12 | 0.16 | 0.11 | 0.16 | 1.40 | 0.22 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |