Run | Configuration |
---|---|
Run 1x1 | Number processes: 1; Number nodes: 1; Number processes per node: 1; Run Command: `<executable>`; MPI Command: `mpirun -np <number_processes> /usr/bin/numactl -m 8-15`; Dataset: (empty); Run Directory: /home/eoseret/qaas_runs_CPU_9468/171-112-4218/intel/HACCmk/run/oneview_runs/compilers/icx_5/oneview_run_1711127387; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread; OMP_NUM_THREADS: 1 |
Run 1x2 | OMP_NUM_THREADS: 2; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x4 | OMP_NUM_THREADS: 4; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x8 | OMP_NUM_THREADS: 8; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x16 | OMP_NUM_THREADS: 16; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x32 | OMP_NUM_THREADS: 32; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x48 | OMP_NUM_THREADS: 48; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 1x96 | OMP_NUM_THREADS: 96; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |

Name | Module | Coverage 1x1 (%) | Coverage 1x2 (%) | Coverage 1x4 (%) | Coverage 1x8 (%) | Coverage 1x16 (%) | Coverage 1x32 (%) | Coverage 1x48 (%) | Coverage 1x96 (%) | Max Time Over Threads 1x1 (s) | Max Time Over Threads 1x2 (s) | Max Time Over Threads 1x4 (s) | Max Time Over Threads 1x8 (s) | Max Time Over Threads 1x16 (s) | Max Time Over Threads 1x32 (s) | Max Time Over Threads 1x48 (s) | Max Time Over Threads 1x96 (s) | Time w.r.t. Wall Time 1x1 (s) | Time w.r.t. Wall Time 1x2 (s) | Time w.r.t. Wall Time 1x4 (s) | Time w.r.t. Wall Time 1x8 (s) | Time w.r.t. Wall Time 1x16 (s) | Time w.r.t. Wall Time 1x32 (s) | Time w.r.t. Wall Time 1x48 (s) | Time w.r.t. Wall Time 1x96 (s) | Nb Threads 1x1 | Nb Threads 1x2 | Nb Threads 1x4 | Nb Threads 1x8 | Nb Threads 1x16 | Nb Threads 1x32 | Nb Threads 1x48 | Nb Threads 1x96 | Deviation (coverage) 1x1 | Deviation (coverage) 1x2 | Deviation (coverage) 1x4 | Deviation (coverage) 1x8 | Deviation (coverage) 1x16 | Deviation (coverage) 1x32 | Deviation (coverage) 1x48 | Deviation (coverage) 1x96 | Deviation (walltime) 1x1 | Deviation (walltime) 1x2 | Deviation (walltime) 1x4 | Deviation (walltime) 1x8 | Deviation (walltime) 1x16 | Deviation (walltime) 1x32 | Deviation (walltime) 1x48 | Deviation (walltime) 1x96 | Categories 1x1 | Categories 1x2 | Categories 1x4 | Categories 1x8 | Categories 1x16 | Categories 1x32 | Categories 1x48 | Categories 1x96 | GFLOPS 1x1 | GFLOPS 1x2 | GFLOPS 1x4 | GFLOPS 1x8 | GFLOPS 1x16 | GFLOPS 1x32 | GFLOPS 1x48 | GFLOPS 1x96 | Compilation Options | (1x1) Efficiency | (1x1) Potential Speed-Up (%) | (1x2) Efficiency | (1x2) Potential Speed-Up (%) | (1x4) Efficiency | (1x4) Potential Speed-Up (%) | (1x8) Efficiency | (1x8) Potential Speed-Up (%) | (1x16) Efficiency | (1x16) Potential Speed-Up (%) | (1x32) Efficiency | (1x32) Potential Speed-Up (%) | (1x48) Efficiency | (1x48) Potential Speed-Up (%) | (1x96) Efficiency | (1x96) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►Step10_orig | exec | 99.97 | 98.94 | 99.46 | 99.22 | 98.34 | 96.56 | 90.05 | 89.62 | 1083.17 | 552.92 | 271.47 | 135.74 | 67.96 | 34.24 | 27.05 | 16.45 | 1083.17 | 548.19 | 271.34 | 135.79 | 67.77 | 34.05 | 25.13 | 15.53 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 1.21 | 0.08 | 0.08 | 0.19 | 0.41 | 5.77 | 2.02 | 0.00 | 7.33 | 0.17 | 0.07 | 0.14 | 0.16 | 1.64 | 0.36 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 21.46 | 42.40 | 85.67 | 171.19 | 343.00 | 682.67 | 925.01 | 1496.72 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -I /home/eoseret/qaas_runs_CPU_9468/171-112-4218/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eoseret/qaas_runs_CPU_9468/171-112-4218/intel/HACCmk/build/icx_5 -fargu... | 1 | 0 | 0.99 | 1.19 | 1 | 0.2 | 1 | 0.29 | 1 | 0.1 | 0.99 | 0.57 | 0.9 | 9.19 | 0.73 | 24.51 |
○Loop 5 - Step10_orig.c:19-35 - exec | | 99.95 | 98.89 | 99.43 | 99.18 | 98.31 | 96.54 | 90.03 | 89.6 | 1082.94 | 552.83 | 271.38 | 135.7 | 67.93 | 34.24 | 27.05 | 16.45 | 1082.93 | 547.89 | 271.25 | 135.74 | 67.75 | 34.04 | 25.12 | 15.53 | 1 | 2 | 4 | 8 | 16 | 32 | 48 | 96 | 0.00 | 1.26 | 0.08 | 0.09 | 0.18 | 0.41 | 5.77 | 2.02 | 0.00 | 7.62 | 0.17 | 0.08 | 0.14 | 0.16 | 1.64 | 0.36 | | | | | | | | | 21.46 | 42.42 | 85.69 | 171.23 | 343.07 | 682.81 | 925.26 | 1496.59 | | 1 | 0 | 0.99 | 1.16 | 1 | 0.19 | 1 | 0.27 | 1 | 0.1 | 0.99 | 0.56 | 0.9 | 9.17 | 0.73 | 24.52 |
►__intel_avx_rep_memset | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.1 | 0.12 | 0.15 | 0.11 | 0.12 | 0.11 | 0.1 | 0.14 | 0.1 | 0.06 | 0.04 | 0.01 | 0.01 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.83 | 0 | 0.63 | 0 | 1.25 | -0 | 0.63 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○Loop 7 - - exec | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
►main | exec | 0.01 | 0.05 | 0.07 | 0.06 | 0.07 | 0.06 | 0.06 | 0.04 | 0.07 | 0.56 | 0.8 | 0.64 | 0.75 | 0.73 | 0.8 | 0.65 | 0.07 | 0.28 | 0.2 | 0.08 | 0.05 | 0.02 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 29.66 | 7.81 | 9.40 | 25.35 | 39.84 | 107.20 | 95.00 | 201.21 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.0.0 (2024.0.0.20231017) --intel -I /home/eoseret/qaas_runs_CPU_9468/171-112-4218/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eoseret/qaas_runs_CPU_9468/171-112-4218/intel/HACCmk/build/icx_5 -fargu... | 1 | 0 | 0.13 | 0.04 | 0.09 | 0.06 | 0.11 | 0.05 | 0.09 | 0.06 | 0.11 | 0.05 | 0.07 | 0.06 | 0.07 | 0.04 |
►Loop 2 - main.c:77-169 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.40 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
○Loop 1 - main.c:111-116 - exec | | 0.01 | 0.05 | 0.07 | 0.06 | 0.07 | 0.06 | 0.06 | 0.04 | 0.07 | 0.55 | 0.8 | 0.64 | 0.75 | 0.73 | 0.8 | 0.65 | 0.07 | 0.28 | 0.2 | 0.08 | 0.05 | 0.02 | 0.02 | 0.01 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 29.66 | 7.80 | 9.40 | 25.35 | 39.84 | 107.00 | 95.00 | 201.21 | | 1 | 0 | 0.13 | 0.04 | 0.09 | 0.06 | 0.11 | 0.05 | 0.09 | 0.06 | 0.11 | 0.05 | 0.07 | 0.06 | 0.07 | 0.04 |
○Loop 3 - main.c:111-116 - exec | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
○Loop 0 - main.c:111-116 - exec | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | | |
○_ZN15kmp_flag_nativeIyL9flag_type1ELb1EE13notdone_checkEv | libiomp5.so | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.04 | 0.15 | 0.14 | 0 | 0.13 | 0.03 | 0.04 | 0.04 | 0.05 | 0.09 | 0.05 | 0 | 0.08 | 0.02 | 0.02 | 0.01 | 0.02 | 0.04 | 0.02 | 0 | 2 | 3 | 7 | 15 | 31 | 47 | 94 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.03 | 0.09 | 0.06 | 0.00 | 0.07 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.01 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○_ZN17_INTERNAL021345c126__kmp_hyper_barrier_gatherE12barrier_typeP8kmp_infoiiPFvPvS3_ES3_..0 | libiomp5.so | 0 | 0.87 | 0.03 | 0.02 | 0.03 | 0.04 | 0.04 | 0.19 | 0 | 9.65 | 0.32 | 0.12 | 0.13 | 0.12 | 0.07 | 0.43 | 0 | 4.83 | 0.08 | 0.03 | 0.02 | 0.01 | 0.01 | 0.03 | 0 | 2 | 3 | 2 | 5 | 10 | 15 | 37 | 0.00 | 1.23 | 0.07 | 0.01 | 0.08 | 0.10 | 0.07 | 0.74 | 0.00 | 6.82 | 0.18 | 0.01 | 0.05 | 0.04 | 0.02 | 0.13 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○_ZN17_INTERNAL021345c119__kmp_wait_templateI11kmp_flag_64ILb0ELb1EELb1ELb0ELb1EEEbP8kmp_infoPT_Pv | libiomp5.so | 0 | 0.08 | 0.41 | 0.64 | 1.5 | 3.23 | 9.62 | 9.88 | 0 | 0.85 | 1.67 | 1.18 | 1.28 | 1.47 | 4.53 | 2.26 | 0 | 0.42 | 1.11 | 0.88 | 1.03 | 1.14 | 2.69 | 1.71 | 0 | 1 | 3 | 7 | 15 | 31 | 47 | 95 | 0.00 | 0.00 | 0.08 | 0.10 | 0.17 | 0.38 | 5.64 | 1.89 | 0.00 | 0.00 | 0.23 | 0.13 | 0.12 | 0.13 | 1.57 | 0.33 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
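
The Efficiency values reported for Step10_orig appear consistent with the usual strong-scaling definition, efficiency = T(1) / (n × T(n)), applied to the "Time w.r.t. Wall Time" columns. The sketch below recomputes them from the figures in the table. The formula is an assumption inferred from the numbers, not something the report states, and the names `wall_time` and `parallel_efficiency` are illustrative only.

```python
# Wall time (s) of Step10_orig per thread count, copied from the
# "Time w.r.t. Wall Time" columns of the table above.
wall_time = {
    1: 1083.17, 2: 548.19, 4: 271.34, 8: 135.79,
    16: 67.77, 32: 34.05, 48: 25.13, 96: 15.53,
}


def parallel_efficiency(times):
    """Assumed definition: T(1) / (n * T(n)) for each thread count n."""
    base = times[1]
    return {n: base / (n * t) for n, t in times.items()}


efficiency = parallel_efficiency(wall_time)

for n, eff in efficiency.items():
    speedup = wall_time[1] / wall_time[n]
    print(f"1x{n}: speedup {speedup:.2f}, efficiency {eff:.2f}")
```

Under this assumption the recomputed efficiencies round to the values in the Efficiency columns: close to 1 up to 32 threads, then 0.90 at 48 threads and 0.73 at 96 threads. The drop coincides with the rise in coverage of the `__kmp_wait_template` routine in libiomp5.so (9.62% at 48 threads, 9.88% at 96 threads).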