Run | Parameters |
---|---|
Run 2x1 | Number processes: 2<br>Number nodes: 1<br>Number processes per node: 2<br>Run Command: `<executable> -x 100 -y 100 -z 100 --xproc=2 --yproc=1 --zproc=1`<br>MPI Command: `mpirun -np <number_processes>`<br>Dataset:<br>Run Directory: /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_run_1711727524<br>I_MPI_PIN_DOMAIN: auto:scatter<br>OMP_PLACES: threads<br>OMP_PROC_BIND: spread<br>OMP_NUM_THREADS: 1 |
Run 2x2 | OMP_NUM_THREADS: 2<br>I_MPI_PIN_DOMAIN: auto:scatter<br>OMP_PLACES: threads<br>OMP_PROC_BIND: spread |
Run 2x4 | OMP_NUM_THREADS: 4<br>I_MPI_PIN_DOMAIN: auto:scatter<br>OMP_PLACES: threads<br>OMP_PROC_BIND: spread |
Run 2x8 | OMP_NUM_THREADS: 8<br>I_MPI_PIN_DOMAIN: auto:scatter<br>OMP_PLACES: threads<br>OMP_PROC_BIND: spread |
Run 2x16 | OMP_NUM_THREADS: 16<br>I_MPI_PIN_DOMAIN: auto:scatter<br>OMP_PLACES: threads<br>OMP_PROC_BIND: spread |
Run 2x32 | OMP_NUM_THREADS: 32<br>I_MPI_PIN_DOMAIN: auto:scatter<br>OMP_PLACES: threads<br>OMP_PROC_BIND: spread |
Run 2x56 | OMP_NUM_THREADS: 56<br>I_MPI_PIN_DOMAIN: auto:scatter<br>OMP_PLACES: threads<br>OMP_PROC_BIND: spread |

Name | Module | Coverage 2x1 (%) | Coverage 2x2 (%) | Coverage 2x4 (%) | Coverage 2x8 (%) | Coverage 2x16 (%) | Coverage 2x32 (%) | Coverage 2x56 (%) | Max Time Over Threads 2x1 (s) | Max Time Over Threads 2x2 (s) | Max Time Over Threads 2x4 (s) | Max Time Over Threads 2x8 (s) | Max Time Over Threads 2x16 (s) | Max Time Over Threads 2x32 (s) | Max Time Over Threads 2x56 (s) | Time w.r.t. Wall Time 2x1 (s) | Time w.r.t. Wall Time 2x2 (s) | Time w.r.t. Wall Time 2x4 (s) | Time w.r.t. Wall Time 2x8 (s) | Time w.r.t. Wall Time 2x16 (s) | Time w.r.t. Wall Time 2x32 (s) | Time w.r.t. Wall Time 2x56 (s) | Nb Threads 2x1 | Nb Threads 2x2 | Nb Threads 2x4 | Nb Threads 2x8 | Nb Threads 2x16 | Nb Threads 2x32 | Nb Threads 2x56 | Deviation (coverage) 2x1 | Deviation (coverage) 2x2 | Deviation (coverage) 2x4 | Deviation (coverage) 2x8 | Deviation (coverage) 2x16 | Deviation (coverage) 2x32 | Deviation (coverage) 2x56 | Deviation (walltime) 2x1 | Deviation (walltime) 2x2 | Deviation (walltime) 2x4 | Deviation (walltime) 2x8 | Deviation (walltime) 2x16 | Deviation (walltime) 2x32 | Deviation (walltime) 2x56 | Categories 2x1 | Categories 2x2 | Categories 2x4 | Categories 2x8 | Categories 2x16 | Categories 2x32 | Categories 2x56 | GFLOPS 2x1 | GFLOPS 2x2 | GFLOPS 2x4 | GFLOPS 2x8 | GFLOPS 2x16 | GFLOPS 2x32 | GFLOPS 2x56 | Compilation Options | (2x1) Efficiency | (2x1) Potential Speed-Up (%) | (2x2) Efficiency | (2x2) Potential Speed-Up (%) | (2x4) Efficiency | (2x4) Potential Speed-Up (%) | (2x8) Efficiency | (2x8) Potential Speed-Up (%) | (2x16) Efficiency | (2x16) Potential Speed-Up (%) | (2x32) Efficiency | (2x32) Potential Speed-Up (%) | (2x56) Efficiency | (2x56) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►ljForce._omp_fn.1 | exec | 95.5 | 94.42 | 93.45 | 91.35 | 86.23 | 75.21 | 62.95 | 481.58 | 240.91 | 121.04 | 60.91 | 30.78 | 16.61 | 10.17 | 481.31 | 243.14 | 124.91 | 64.45 | 34.38 | 19.34 | 12.17 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.07 | 1.06 | 1.97 | 2.14 | 2.55 | 4.19 | 4.08 | 1.00 | 0.12 | 0.17 | 0.39 | 0.32 | 1.12 | 0.80 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 4.80 | 9.49 | 18.48 | 35.81 | 67.13 | 119.34 | 189.64 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.99 | 0.97 | 0.96 | 3.43 | 0.93 | 6.08 | 0.87 | 10.78 | 0.78 | 16.72 | 0.71 | 18.49 |
►Loop 21 - ljForce.c:175-216 - exec [...] | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.17 | 0.11 | 0.05 | 0.04 | 0.02 | 0.02 | 0.02 | 0.17 | 0.08 | 0.03 | 0.02 | 0.01 | 0.01 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.03 | 0.03 | 0.00 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 5.71 | 11.55 | 30.87 | 47.25 | 91.10 | 96.15 | 0.00 | 1 | 0 | 1.06 | -0 | 1.42 | 0 | 1.06 | -0 | 1.06 | -0 | 0.53 | 0.01 | 1 | 0 | |||||||||
►Loop 23 - ljForce.c:178-216 - exec [...] | 0.36 | 0.39 | 0.41 | 0.39 | 0.36 | 0.32 | 0.32 | 1.87 | 1.03 | 0.6 | 0.29 | 0.16 | 0.11 | 0.09 | 1.83 | 1 | 0.55 | 0.28 | 0.14 | 0.08 | 0.06 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.04 | 0.04 | 0.07 | 0.09 | 0.11 | 0.06 | 0.04 | 0.04 | 0.03 | 0.02 | 0.02 | 0.01 | 7.27 | 14.28 | 25.92 | 51.99 | 102.35 | 182.67 | 245.49 | 1 | 0 | 0.92 | 0.03 | 0.83 | 0.07 | 0.82 | 0.07 | 0.82 | 0.07 | 0.71 | 0.09 | 0.54 | 0.15 | |||||||||
►Loop 24 - ljForce.c:187-216 - exec [...] | 7.05 | 7.14 | 7.15 | 6.98 | 6.52 | 5.69 | 4.81 | 35.68 | 18.44 | 9.61 | 4.96 | 2.56 | 1.51 | 0.87 | 35.54 | 18.39 | 9.56 | 4.93 | 2.6 | 1.46 | 0.93 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.06 | 0.11 | 0.21 | 0.33 | 0.49 | 0.57 | 0.57 | 0.25 | 0.29 | 0.26 | 0.20 | 0.16 | 0.12 | 0.09 | 4.97 | 9.55 | 18.42 | 35.73 | 67.36 | 120.25 | 188.40 | 1 | 0 | 0.97 | 0.24 | 0.93 | 0.5 | 0.9 | 0.69 | 0.85 | 0.95 | 0.76 | 1.36 | 0.68 | 1.53 | |||||||||
○Loop 25 - ljForce.c:191-216 - exec [...] | 88.05 | 86.86 | 85.86 | 83.95 | 79.32 | 69.18 | 57.8 | 444.29 | 221.82 | 111.32 | 56.21 | 28.34 | 15.43 | 9.26 | 443.77 | 223.68 | 114.77 | 59.23 | 31.63 | 17.79 | 11.18 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.14 | 1.01 | 1.89 | 2.05 | 2.31 | 3.97 | 3.88 | 1.30 | 0.22 | 0.39 | 0.52 | 0.29 | 1.06 | 0.76 | 4.77 | 9.46 | 18.44 | 35.73 | 66.95 | 118.99 | 189.35 | 1 | 0 | 0.99 | 0.7 | 0.97 | 2.86 | 0.94 | 5.33 | 0.88 | 9.77 | 0.78 | 15.25 | 0.71 | 16.83 | |||||||||
○Loop 22 - ljForce.c:172-172 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►ljForce._omp_fn.0 | exec | 0.81 | 1.21 | 1.42 | 1.76 | 2.59 | 5.06 | 7.12 | 4.1 | 3.61 | 1.97 | 1.26 | 0.99 | 1.16 | 1.2 | 4.07 | 3.11 | 1.9 | 1.24 | 1.03 | 1.3 | 1.38 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.25 | 0.15 | 0.24 | 0.28 | 0.39 | 1.44 | 0.05 | 0.60 | 0.17 | 0.14 | 0.08 | 0.06 | 0.19 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.65 | 0.42 | 0.54 | 0.66 | 0.41 | 1.04 | 0.25 | 1.95 | 0.1 | 4.56 | 0.05 | 6.75 |
○Loop 16 - ljForce.c:161-161 - exec [...] | 0.81 | 1.21 | 1.42 | 1.76 | 2.59 | 5.06 | 7.12 | 4.1 | 3.61 | 1.97 | 1.26 | 0.99 | 1.16 | 1.2 | 4.07 | 3.11 | 1.9 | 1.24 | 1.03 | 1.3 | 1.38 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.25 | 0.15 | 0.24 | 0.28 | 0.39 | 1.44 | 0.05 | 0.60 | 0.16 | 0.14 | 0.08 | 0.06 | 0.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.65 | 0.42 | 0.54 | 0.66 | 0.41 | 1.04 | 0.25 | 1.95 | 0.1 | 4.56 | 0.05 | 6.75 | |||||||||
►advanceVelocity._omp_fn.0 | exec | 0.66 | 0.73 | 0.88 | 1.33 | 2.49 | 3.7 | 5.04 | 3.36 | 1.92 | 1.22 | 0.94 | 0.92 | 0.95 | 0.96 | 3.33 | 1.88 | 1.18 | 0.94 | 0.99 | 0.95 | 0.97 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.04 | 0.06 | 0.08 | 0.10 | 0.56 | 1.38 | 0.05 | 0.08 | 0.06 | 0.03 | 0.02 | 0.11 | 0.19 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.44 | 2.55 | 4.07 | 5.10 | 4.86 | 5.03 | 4.96 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.89 | 0.08 | 0.71 | 0.26 | 0.44 | 0.74 | 0.21 | 1.97 | 0.11 | 3.29 | 0.06 | 4.73 |
►Loop 17 - timestep.c:74-78 - exec | 0.32 | 0.37 | 0.42 | 0.68 | 1.08 | 1.6 | 2.24 | 1.67 | 0.98 | 0.61 | 0.53 | 0.48 | 0.62 | 0.6 | 1.62 | 0.95 | 0.56 | 0.48 | 0.43 | 0.41 | 0.43 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.02 | 0.05 | 0.09 | 0.13 | 0.55 | 0.96 | 0.06 | 0.03 | 0.05 | 0.05 | 0.04 | 0.11 | 0.13 | 1.27 | 2.22 | 3.70 | 4.17 | 4.37 | 4.31 | 3.84 | 1 | 0 | 0.85 | 0.05 | 0.72 | 0.12 | 0.42 | 0.39 | 0.24 | 0.83 | 0.12 | 1.4 | 0.07 | 2.09 | |||||||||
○Loop 18 - timestep.c:74-78 - exec | 0.34 | 0.36 | 0.46 | 0.65 | 1.41 | 2.11 | 2.79 | 1.71 | 0.96 | 0.66 | 0.5 | 0.57 | 0.58 | 0.65 | 1.7 | 0.92 | 0.62 | 0.46 | 0.56 | 0.54 | 0.54 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.02 | 0.04 | 0.07 | 0.11 | 0.31 | 0.71 | 0.01 | 0.05 | 0.04 | 0.04 | 0.04 | 0.07 | 0.10 | 1.61 | 2.92 | 4.40 | 6.08 | 5.23 | 5.58 | 5.85 | 1 | 0 | 0.92 | 0.03 | 0.69 | 0.14 | 0.46 | 0.35 | 0.19 | 1.14 | 0.1 | 1.9 | 0.06 | 2.63 | |||||||||
○unknown_function | Unknown module | 0.65 | 0.63 | 0.64 | 0.61 | 0.56 | 0.52 | 0.51 | 3.27 | 1.67 | 0.91 | 0.45 | 0.26 | 0.15 | 0.13 | 3.27 | 1.62 | 0.85 | 0.43 | 0.22 | 0.13 | 0.1 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.02 | 0.05 | 0.05 | 0.09 | 0.10 | 0.17 | 0.01 | 0.07 | 0.06 | 0.03 | 0.03 | 0.02 | 0.02 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.01 | 0 | 0.96 | 0.02 | 0.95 | 0.03 | 0.93 | 0.04 | 0.79 | 0.11 | 0.58 | 0.21 | |
►sortAtomsInCell | exec | 0.62 | 0.73 | 0.79 | 0.97 | 1.61 | 3.22 | 4.53 | 3.21 | 2.01 | 1.1 | 0.78 | 0.66 | 0.81 | 0.82 | 3.14 | 1.87 | 1.06 | 0.68 | 0.64 | 0.83 | 0.88 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.07 | 0.05 | 0.10 | 0.14 | 0.41 | 0.98 | 0.10 | 0.17 | 0.05 | 0.06 | 0.04 | 0.07 | 0.13 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.84 | 0.12 | 0.74 | 0.2 | 0.58 | 0.41 | 0.31 | 1.12 | 0.12 | 2.84 | 0.06 | 4.24 |
○Loop 61 - haloExchange.c:621-630 - exec | 0.29 | 0.3 | 0.3 | 0.39 | 0.57 | 1.72 | 2.38 | 1.5 | 0.87 | 0.44 | 0.36 | 0.28 | 0.43 | 0.44 | 1.45 | 0.77 | 0.41 | 0.27 | 0.23 | 0.44 | 0.46 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.03 | 0.03 | 0.06 | 0.08 | 0.24 | 0.47 | 0.06 | 0.07 | 0.05 | 0.04 | 0.03 | 0.04 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.94 | 0.02 | 0.88 | 0.03 | 0.67 | 0.13 | 0.39 | 0.35 | 0.1 | 1.54 | 0.06 | 2.25 | |||||||||
○Loop 60 - haloExchange.c:633-642 - exec | 0.08 | 0.08 | 0.07 | 0.07 | 0.06 | 0.06 | 0.06 | 0.41 | 0.25 | 0.13 | 0.07 | 0.05 | 0.04 | 0.04 | 0.41 | 0.21 | 0.1 | 0.05 | 0.03 | 0.02 | 0.01 | 2 | 4 | 8 | 16 | 32 | 60 | 87 | 0.00 | 0.02 | 0.02 | 0.02 | 0.03 | 0.04 | 0.04 | 0.01 | 0.04 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.98 | 0 | 1.02 | -0 | 1.02 | -0 | 0.85 | 0.01 | 0.64 | 0.02 | 0.73 | 0.02 | |||||||||
○getBoxFromCoord | exec | 0.5 | 0.55 | 0.6 | 0.51 | 0.49 | 0.53 | 0.38 | 2.57 | 2.96 | 3.13 | 2.83 | 2.75 | 3.4 | 3.42 | 2.53 | 1.42 | 0.8 | 0.36 | 0.2 | 0.14 | 0.07 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.01 | 0.08 | 0.04 | 0.24 | 0.02 | 0.05 | 3.46 | 0.06 | 0.20 | 0.05 | 0.17 | 0.00 | 0.00 | 0.68 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.53 | 1.02 | 1.88 | 3.76 | 6.80 | 11.37 | 20.66 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.89 | 0.06 | 0.79 | 0.13 | 0.88 | 0.06 | 0.79 | 0.1 | 0.56 | 0.23 | 0.65 | 0.13 |
►advancePosition._omp_fn.0 | exec | 0.34 | 0.38 | 0.45 | 0.7 | 1.35 | 2.15 | 2.98 | 1.71 | 1.02 | 0.63 | 0.49 | 0.5 | 0.53 | 0.54 | 1.7 | 0.98 | 0.6 | 0.49 | 0.54 | 0.55 | 0.58 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.02 | 0.03 | 0.04 | 0.06 | 0.27 | 0.74 | 0.01 | 0.05 | 0.03 | 0.02 | 0.01 | 0.05 | 0.10 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.35 | 4.08 | 6.67 | 8.17 | 7.38 | 7.29 | 6.91 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.87 | 0.05 | 0.71 | 0.13 | 0.43 | 0.4 | 0.2 | 1.08 | 0.1 | 1.94 | 0.05 | 2.82 |
►Loop 19 - timestep.c:88-94 - exec | 0.09 | 0.12 | 0.16 | 0.26 | 0.46 | 0.69 | 1.05 | 0.47 | 0.34 | 0.24 | 0.22 | 0.21 | 0.31 | 0.29 | 0.47 | 0.31 | 0.22 | 0.18 | 0.18 | 0.18 | 0.2 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.01 | 0.02 | 0.05 | 0.07 | 0.29 | 0.50 | 0.01 | 0.03 | 0.02 | 0.03 | 0.02 | 0.06 | 0.07 | 2.41 | 3.90 | 5.41 | 6.33 | 5.96 | 5.36 | 4.72 | 1 | 0 | 0.76 | 0.03 | 0.53 | 0.07 | 0.33 | 0.18 | 0.16 | 0.38 | 0.08 | 0.63 | 0.04 | 1.01 | |||||||||
○Loop 20 - timestep.c:88-94 - exec | 0.25 | 0.26 | 0.29 | 0.44 | 0.88 | 1.46 | 1.93 | 1.24 | 0.71 | 0.44 | 0.34 | 0.37 | 0.38 | 0.39 | 1.24 | 0.67 | 0.38 | 0.31 | 0.35 | 0.37 | 0.37 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.02 | 0.04 | 0.04 | 0.08 | 0.19 | 0.41 | 0.01 | 0.04 | 0.04 | 0.03 | 0.03 | 0.04 | 0.06 | 2.31 | 4.16 | 7.39 | 9.23 | 8.32 | 8.22 | 8.28 | 1 | 0 | 0.93 | 0.02 | 0.82 | 0.05 | 0.5 | 0.22 | 0.22 | 0.69 | 0.1 | 1.31 | 0.06 | 1.81 | |||||||||
○sortAtomsById | exec | 0.17 | 0.16 | 0.14 | 0.14 | 0.13 | 0.12 | 0.11 | 0.85 | 0.43 | 0.25 | 0.13 | 0.07 | 0.06 | 0.04 | 0.85 | 0.4 | 0.19 | 0.1 | 0.05 | 0.03 | 0.02 | 2 | 4 | 8 | 16 | 32 | 63 | 105 | 0.00 | 0.01 | 0.02 | 0.03 | 0.03 | 0.06 | 0.07 | 0.00 | 0.03 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 1.06 | 0 | 1.12 | 0 | 1.06 | 0 | 1.06 | 0 | 0.89 | 0.01 | 0.76 | 0.03 |
○putAtomInBox | exec | 0.13 | 0.19 | 0.22 | 0.13 | 0.14 | 0.22 | 0.13 | 0.69 | 1.05 | 1.12 | 0.72 | 0.78 | 1.52 | 1.39 | 0.64 | 0.48 | 0.29 | 0.09 | 0.05 | 0.06 | 0.02 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.02 | 0.05 | 0.01 | 0.05 | 0.04 | 0.41 | 2.99 | 0.08 | 0.13 | 0.02 | 0.04 | 0.01 | 0.11 | 0.58 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.26 | 0.36 | 0.64 | 1.99 | 3.36 | 3.07 | 9.53 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.67 | 0.06 | 0.55 | 0.1 | 0.89 | 0.01 | 0.8 | 0.03 | 0.33 | 0.15 | 0.57 | 0.06 |
○getBoxFromTuple | exec | 0.12 | 0.12 | 0.12 | 0.11 | 0.1 | 0.09 | 0.08 | 0.62 | 0.61 | 0.66 | 0.68 | 0.59 | 0.59 | 0.65 | 0.61 | 0.3 | 0.16 | 0.08 | 0.04 | 0.02 | 0.02 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.01 | 0.03 | 0.19 | 0.01 | 0.02 | 0.09 | 0.00 | 0.02 | 0.04 | 0.13 | 0.00 | 0.00 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.41 | 2.53 | 4.42 | 10.58 | 20.46 | 30.95 | 37.63 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 1.02 | -0 | 0.95 | 0.01 | 0.95 | 0.01 | 0.95 | 0 | 0.95 | 0 | 0.54 | 0.04 |
○unknown_kernel_region | kernel | 0.08 | 0.07 | 0.08 | 0.09 | 0.08 | 0.07 | 0.09 | 0.42 | 0.36 | 0.38 | 0.44 | 0.38 | 0.32 | 0.38 | 0.42 | 0.18 | 0.1 | 0.07 | 0.03 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 31 | 50 | 91 | 0.00 | 0.07 | 0.13 | 0.20 | 0.22 | 0.24 | 0.26 | 0.01 | 0.19 | 0.17 | 0.14 | 0.09 | 0.06 | 0.05 | System (%): 51.81 MPI (%): 48.19 | System (%): 51.08 MPI (%): 48.92 | System (%): 52.23 MPI (%): 47.77 | System (%): 51.27 MPI (%): 48.22 OMP (%): 0.51 | System (%): 62.98 MPI (%): 35.91 OMP (%): 1.10 | System (%): 54.75 MPI (%): 41.90 OMP (%): 3.35 | OMP (%): 7.80 System (%): 66.67 MPI (%): 25.53 | 0.06 | 0.20 | 0.35 | 0.47 | 1.30 | 1.70 | 1.88 | 1 | 0 | 1.17 | 0 | 1.05 | -0 | 0.75 | 0.02 | 0.88 | 0.01 | 0.66 | 0.02 | 0.38 | 0.06 | |
►loadAtomsBuffer | exec | 0.07 | 0.09 | 0.09 | 0.08 | 0.07 | 0.07 | 0.06 | 0.39 | 0.51 | 0.45 | 0.48 | 0.42 | 0.45 | 0.5 | 0.37 | 0.24 | 0.12 | 0.06 | 0.03 | 0.02 | 0.01 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.01 | 0.02 | 0.00 | 0.07 | 0.02 | 0.05 | 0.10 | 0.03 | 0.04 | 0.00 | 0.05 | 0.01 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.43 | 0.61 | 1.27 | 2.33 | 5.05 | 8.23 | 13.30 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.77 | 0.02 | 0.77 | 0.02 | 0.77 | 0.02 | 0.77 | 0.02 | 0.58 | 0.03 | 0.66 | 0.02 |
►Loop 1 - haloExchange.c:376-390 - exec | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.03 | 0.03 | 0.03 | 0.01 | 0.02 | 0.03 | 0.03 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.00 | 0.00 | 0.01 | 0.04 | 0.06 | 0.02 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.25 | 0.55 | 0.85 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.5 | 0 | 0.75 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 2 - haloExchange.c:380-390 - exec | 0.07 | 0.09 | 0.08 | 0.08 | 0.07 | 0.06 | 0.06 | 0.36 | 0.49 | 0.43 | 0.47 | 0.41 | 0.42 | 0.47 | 0.34 | 0.23 | 0.11 | 0.06 | 0.03 | 0.02 | 0.01 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.01 | 0.02 | 0.01 | 0.07 | 0.02 | 0.01 | 0.12 | 0.03 | 0.05 | 0.01 | 0.05 | 0.01 | 0.00 | 0.02 | 0.45 | 0.61 | 1.31 | 2.21 | 4.78 | 7.78 | 12.80 | 1 | 0 | 0.74 | 0.02 | 0.77 | 0.02 | 0.71 | 0.02 | 0.71 | 0.02 | 0.53 | 0.03 | 0.61 | 0.02 | |||||||||
►gasdev | exec | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.28 | 0.14 | 0.08 | 0.04 | 0.03 | 0.02 | 0.01 | 0.25 | 0.13 | 0.06 | 0.03 | 0.02 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.04 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.22 | 2.33 | 4.89 | 10.52 | 15.63 | 28.80 | 29.25 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.96 | 0 | 1.04 | -0 | 1.04 | -0 | 0.78 | 0.01 | 0.78 | 0.01 | 0.45 | 0.02 |
○Loop 92 - random.c:26-48 - exec [...] | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.12 | 0.07 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.12 | 0.06 | 0.02 | 0.01 | 0.01 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 52 | 66 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.28 | 0.51 | 1.88 | 3.45 | 4.40 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1.5 | 0 | 1.5 | 0 | 0.75 | 0.01 | 1 | 0 | 1 | 0 | |||||||||
►updateLinkCells | exec | 0.05 | 0.05 | 0.04 | 0.05 | 0.05 | 0.04 | 0.03 | 0.25 | 0.27 | 0.22 | 0.26 | 0.31 | 0.32 | 0.26 | 0.24 | 0.13 | 0.05 | 0.03 | 0.02 | 0.01 | 0.01 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.00 | 0.01 | 0.02 | 0.13 | 0.35 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.05 | 0.09 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.28 | 2.27 | 5.39 | 10.03 | 16.40 | 26.25 | 29.20 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.92 | 0 | 1.2 | 0 | 1 | 0 | 0.75 | 0.01 | 0.75 | 0.01 | 0.43 | 0.02 |
►Loop 82 - linkCells.c:291-301 - exec | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.02 | 0.02 | 0.01 | 0.03 | 0.03 | 0.05 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.07 | 0.02 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.02 | 0.00 | 0.65 | 1.95 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 83 - linkCells.c:295-301 - exec | 0.04 | 0.05 | 0.04 | 0.04 | 0.04 | 0.03 | 0.03 | 0.22 | 0.24 | 0.21 | 0.25 | 0.28 | 0.27 | 0.24 | 0.21 | 0.12 | 0.05 | 0.03 | 0.02 | 0.01 | 0.01 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.00 | 0.01 | 0.03 | 0.13 | 0.28 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.05 | 0.07 | 0.00 | 1.40 | 2.30 | 5.05 | 9.38 | 15.48 | 24.10 | 27.65 | 1 | 0 | 0.88 | 0.01 | 1.05 | -0 | 0.88 | 0.01 | 0.66 | 0.01 | 0.66 | 0.01 | 0.38 | 0.02 | |||||||||
○Loop 84 - linkCells.c:384-385 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○f64xsubf128 | libm-2.28.so | 0.04 | 0.03 | 0.04 | 0.05 | 0.03 | 0.03 | 0.03 | 0.21 | 0.1 | 0.06 | 0.05 | 0.02 | 0.02 | 0.01 | 0.19 | 0.09 | 0.06 | 0.03 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 1.14 | 2.29 | 3.68 | 6.50 | 20.25 | 21.25 | 20.50 | 1 | 0 | 1.06 | -0 | 0.79 | 0.01 | 0.79 | 0.01 | 1.19 | 0 | 0.59 | 0.01 | 0.34 | 0.02 | |
►kineticEnergy._omp_fn.0 | exec | 0.03 | 0.03 | 0.03 | 0.04 | 0.07 | 0.15 | 0.25 | 0.14 | 0.09 | 0.05 | 0.05 | 0.04 | 0.05 | 0.06 | 0.14 | 0.08 | 0.05 | 0.03 | 0.03 | 0.04 | 0.05 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.04 | 0.08 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.97 | 5.20 | 8.32 | 13.82 | 13.95 | 10.34 | 8.31 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.88 | 0 | 0.7 | 0.01 | 0.58 | 0.02 | 0.29 | 0.05 | 0.11 | 0.13 | 0.05 | 0.24 |
►Loop 27 - timestep.c:110-116 - exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.05 | 0.08 | 0.04 | 0.03 | 0.02 | 0.02 | 0.02 | 0.04 | 0.04 | 0.03 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 32 | 63 | 108 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.03 | 0.05 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 1.93 | 1.73 | 5.85 | 5.70 | 5.50 | 4.95 | 2.93 | 1 | 0 | 0.5 | 0.01 | 0.75 | 0 | 0.38 | 0.01 | 0.19 | 0.02 | 0.09 | 0.05 | 0.03 | 0.08 | |||||||||
○Loop 28 - timestep.c:110-116 - exec | 0.02 | 0.02 | 0.02 | 0.03 | 0.05 | 0.1 | 0.17 | 0.1 | 0.06 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.11 | 0.06 | 0.03 | 0.02 | 0.02 | 0.02 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.04 | 0.07 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 3.25 | 6.07 | 11.92 | 17.88 | 18.18 | 18.20 | 11.90 | 1 | 0 | 0.92 | 0 | 0.92 | 0 | 0.69 | 0.01 | 0.34 | 0.03 | 0.17 | 0.08 | 0.07 | 0.16 | |||||||||
○Loop 26 - timestep.c:107-107 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►setTemperature._omp_fn.0 | exec | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0.11 | 0.06 | 0.04 | 0.02 | 0.02 | 0.01 | 0.01 | 0.1 | 0.05 | 0.02 | 0.01 | 0.01 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 54 | 74 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.37 | 0.90 | 2.10 | 3.90 | 3.45 | 0.00 | 0.00 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 1 | 0 | 1.25 | 0 | 1.25 | -0 | 0.63 | 0.01 | 1 | 0 | 1 | 0 |
►Loop 93 - initAtoms.c:154-162 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 94 - initAtoms.c:154-162 - exec [...] | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0.11 | 0.06 | 0.04 | 0.02 | 0.02 | 0.01 | 0.01 | 0.1 | 0.05 | 0.02 | 0.01 | 0.01 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 54 | 74 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.37 | 0.90 | 2.10 | 3.90 | 3.45 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1.25 | 0 | 1.25 | -0 | 0.63 | 0.01 | 1 | 0 | 1 | 0 | |||||||||
►randomDisplacements._omp_fn.0 | exec | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 0.11 | 0.05 | 0.03 | 0.02 | 0.01 | 0.01 | 0 | 0.11 | 0.06 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 59 | 112 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.44 | 0.80 | 1.60 | 2.43 | 4.85 | 1.60 | 4.15 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.92 | 0 | 0.92 | 0 | 0.69 | 0.01 | 0.69 | 0.01 | 0.34 | 0.01 | 0.2 | 0.02 |
►Loop 14 - initAtoms.c:197-202 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 15 - initAtoms.c:197-202 - exec [...] | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 0.11 | 0.05 | 0.03 | 0.02 | 0.01 | 0.01 | 0 | 0.11 | 0.06 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 59 | 111 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.44 | 0.80 | 1.60 | 2.43 | 4.85 | 1.60 | 4.15 | 1 | 0 | 0.92 | 0 | 0.92 | 0 | 0.69 | 0.01 | 0.69 | 0.01 | 0.34 | 0.01 | 0.2 | 0.02 | |||||||||
○MPL_gpu_cuda_finalize | libmpi.so.12.0.0 | 0.02 | 0.02 | 0.03 | 0.01 | 0.01 | 0.01 | 0.04 | 0.18 | 0.14 | 0.19 | 0.05 | 0.06 | 0.12 | 0.62 | 0.11 | 0.05 | 0.04 | 0.01 | 0 | 0 | 0.01 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 0.02 | 0.03 | 0.03 | 0.02 | 0.03 | 0.23 | 0.00 | 0.10 | 0.07 | 0.05 | 0.01 | 0.01 | 0.06 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.1 | -0 | 0.69 | 0.01 | 1.38 | -0 | 1 | 0 | 1 | 0 | 0.2 | 0.03 | |
○__pthread_mutex_unlock_usercnt | libpthread-2.28.so | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.01 | 0.06 | 0.1 | 0.04 | 0.01 | 0.03 | 0.03 | 0.21 | 0.04 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 0.01 | 0.02 | 0.00 | 0.01 | 0.04 | 0.00 | 0.00 | 0.03 | 0.06 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.67 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○I_MPI_memcpy_multipage_avx512 | libmpi.so.12.0.0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.07 | 0.07 | 0.08 | 0.07 | 0.08 | 0.08 | 0.08 | 0.05 | 0.03 | 0.02 | 0.01 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.00 | 0.01 | 0.02 | 0.04 | 0.14 | 0.15 | 0.02 | 0.00 | 0.02 | 0.01 | 0.02 | 0.04 | 0.03 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.83 | 0 | 0.63 | 0 | 0.63 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○MPIR_Progress_hook_exec_on_vci | libmpi.so.12.0.0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.01 | 0.06 | 0.05 | 0.05 | 0.02 | 0.01 | 0.02 | 0.2 | 0.04 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.01 | 0.01 | 0.01 | 0.02 | 0.00 | 0.03 | 0.72 | 0.04 | 0.03 | 0.01 | 0.01 | 0.00 | 0.01 | 0.14 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○I_MPI_memcpy_stream_nontemporal_avx512 | libmpi.so.12.0.0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.04 | 0.07 | 0.08 | 0.03 | 0.05 | 0.1 | 0.08 | 0.04 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.01 | 0.03 | 0.01 | 0.02 | 0.21 | 0.15 | 0.01 | 0.01 | 0.05 | 0.00 | 0.01 | 0.05 | 0.03 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.67 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○I_MPI_memcpy_xeon_sse | libmpi.so.12.0.0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.04 | 0.06 | 0.04 | 0.04 | 0.04 | 0.03 | 0.02 | 0.04 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.01 | 0.01 | 0.00 | 0.03 | 0.01 | 0.00 | 0.01 | 0.03 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 | 1 | 0 | 1 | 0 | 1 | 0 | |
○I_MPI_memcpy_nontemporal_avx512 | libmpi.so.12.0.0 | 0.01 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0.04 | 0 | 0.02 | 0.04 | 0.03 | 0.05 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 2 | 1 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | NA | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | NA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
►createFccLattice | exec | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.07 | 0.07 | 0.09 | 0.09 | 0.09 | 0.07 | 0.06 | 0.05 | 0.03 | 0.02 | 0.01 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.01 | 0.01 | 0.01 | 0.01 | 0.14 | 0.08 | 0.02 | 0.03 | 0.02 | 0.02 | 0.01 | 0.06 | 0.02 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.19 | 0.28 | 0.43 | 0.85 | 0.00 | 0.00 | 0.00 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 0.83 | 0 | 0.63 | 0 | 0.63 | 0.01 | 1 | 0 | 1 | 0 | 1 | 0 |
►Loop 86 - initAtoms.c:88-100 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 87 - initAtoms.c:89-100 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 88 - initAtoms.c:90-100 - exec | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.07 | 0.07 | 0.09 | 0.09 | 0.08 | 0.07 | 0.06 | 0.05 | 0.03 | 0.02 | 0.01 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.01 | 0.01 | 0.01 | 0.01 | 0.13 | 0.08 | 0.02 | 0.03 | 0.02 | 0.02 | 0.01 | 0.05 | 0.02 | 0.00 | 0.19 | 0.28 | 0.43 | 0.85 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.83 | 0 | 0.63 | 0 | 0.63 | 0.01 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○copyAtom.isra.0 | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.07 | 0.05 | 0.09 | 0.04 | 0.06 | 0.04 | 0.05 | 0.05 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.01 | 0.03 | 0.00 | 0.06 | 0.04 | 0.09 | 0.02 | 0.03 | 0.04 | 0.00 | 0.02 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU GIMPLE 13.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fsave-optimization-record -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops... | 1 | 0 | 2.5 | 0 | 0.63 | 0 | 0.63 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 0 | 0.23 | 0.31 | 0.89 | 1.76 | 5.06 | 9.84 | 0 | 1.07 | 0.87 | 1 | 0.81 | 1.72 | 2.92 | 0 | 0.58 | 0.42 | 0.63 | 0.7 | 1.3 | 1.9 | 0 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.20 | 0.21 | 0.44 | 0.58 | 2.25 | 5.08 | 0.00 | 0.51 | 0.28 | 0.29 | 0.20 | 0.42 | 0.69 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○gomp_barrier_wait_end | libgomp.so.1.0.0 | 0 | 0.15 | 0.43 | 0.99 | 2.01 | 3.52 | 5.57 | 0 | 0.77 | 0.78 | 0.78 | 0.82 | 0.79 | 0.95 | 0 | 0.38 | 0.58 | 0.7 | 0.8 | 0.9 | 1.08 | 0 | 2 | 6 | 16 | 32 | 64 | 112 | 0.00 | 0.01 | 0.02 | 0.38 | 0.53 | 0.66 | 0.87 | 0.00 | 0.04 | 0.02 | 0.25 | 0.18 | 0.13 | 0.11 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
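
The Efficiency and Potential Speed-Up (%) columns can be cross-checked against the timing columns. The sketch below is an assumption inferred from the numbers in the table, not a formula taken from the profiler's documentation: efficiency is taken as the 2x1 wall time divided by the run's wall time scaled by its thread-count ratio, and potential speed-up as the run's coverage multiplied by (1 - efficiency). The input values are copied from the `ljForce._omp_fn.1` row; the function and variable names are illustrative only.

```python
# Cross-check of the Efficiency / Potential Speed-Up columns.
# ASSUMPTION: both formulas are inferred from the table values, not from
# the profiler's documentation.

# Values copied from the ljForce._omp_fn.1 row of the table above.
RUNS = ["2x1", "2x2", "2x4", "2x8", "2x16", "2x32", "2x56"]
THREADS_PER_PROCESS = [1, 2, 4, 8, 16, 32, 56]   # OMP_NUM_THREADS per run
COVERAGE_PCT = [95.5, 94.42, 93.45, 91.35, 86.23, 75.21, 62.95]
WALL_TIME_S = [481.31, 243.14, 124.91, 64.45, 34.38, 19.34, 12.17]


def efficiency(ref_time, ref_threads, time, threads):
    """Parallel efficiency of a run relative to the reference (2x1) run."""
    return (ref_time * ref_threads) / (time * threads)


def potential_speedup_pct(coverage_pct, eff):
    """Share of run time (in %) that ideal scaling of this entry would save."""
    return coverage_pct * (1.0 - eff)


def scalability_rows(threads, coverage_pct, wall_time_s):
    """Return (efficiency, potential speed-up %) per run, rounded to 2 digits."""
    ref_time, ref_threads = wall_time_s[0], threads[0]
    rows = []
    for n, cov, t in zip(threads, coverage_pct, wall_time_s):
        eff = efficiency(ref_time, ref_threads, t, n)
        rows.append((round(eff, 2), round(potential_speedup_pct(cov, eff), 2)))
    return rows


if __name__ == "__main__":
    rows = scalability_rows(THREADS_PER_PROCESS, COVERAGE_PCT, WALL_TIME_S)
    for run, (eff, gain) in zip(RUNS, rows):
        print(f"{run:>5}: efficiency={eff:.2f} potential speed-up={gain:.2f}%")
```

Under these assumptions the computed efficiencies agree with the reported values for this row (1, 0.99, 0.96, 0.93, 0.87, 0.78, 0.71). Note that the scaling factor is the run's thread count per process, not the Nb Threads column: rows such as `getBoxFromCoord`, which always run on 2 threads, are still scaled by the run's `OMP_NUM_THREADS`.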