Run 2x1 | Number processes: 2; Number nodes: 1; Number processes per node: 2; Run Command: <executable>; MPI Command: mpirun -np <number_processes> /usr/bin/numactl -m 8-15; Dataset: ; Run Directory: /home/eoseret/qaas_runs_CPU_9468/171-112-9712/intel/CloverLeafCXX/run/oneview_runs/compilers/gcc_6/oneview_run_1711150496; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread; OMP_NUM_THREADS: 1 |
---|---|
Run 2x2 | OMP_NUM_THREADS: 2; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x4 | OMP_NUM_THREADS: 4; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x8 | OMP_NUM_THREADS: 8; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x16 | OMP_NUM_THREADS: 16; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x32 | OMP_NUM_THREADS: 32; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x48 | OMP_NUM_THREADS: 48; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
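
The run labels above appear to follow a "ranks x threads per rank" pattern: every run uses 2 MPI processes, and the second number matches OMP_NUM_THREADS. The following is a minimal sketch of that reading, not part of the report itself; the helper name `run_config` is hypothetical, and the environment values are copied from the table.

```python
def run_config(label):
    """Split a run label such as '2x16' into ranks, threads per rank,
    total threads, and the OpenMP/MPI environment listed for the runs.
    Assumes the label means <MPI ranks>x<OpenMP threads per rank>."""
    ranks, threads = (int(part) for part in label.split("x"))
    env = {
        "OMP_NUM_THREADS": str(threads),
        "I_MPI_PIN_DOMAIN": "auto:scatter",
        "OMP_PLACES": "threads",
        "OMP_PROC_BIND": "spread",
    }
    return ranks, threads, ranks * threads, env


for label in ["2x1", "2x2", "2x4", "2x8", "2x16", "2x32", "2x48"]:
    ranks, threads, total, env = run_config(label)
    print(label, "->", ranks, "MPI ranks,", total, "threads in total")
```

Under this assumption the totals (2, 4, 8, 16, 32, 64, 96) agree with the largest values in the "Nb Threads" columns of the function-level rows.
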
Name | Module | Coverage 2x1 (%) | Coverage 2x2 (%) | Coverage 2x4 (%) | Coverage 2x8 (%) | Coverage 2x16 (%) | Coverage 2x32 (%) | Coverage 2x48 (%) | Max Time Over Threads 2x1 (s) | Max Time Over Threads 2x2 (s) | Max Time Over Threads 2x4 (s) | Max Time Over Threads 2x8 (s) | Max Time Over Threads 2x16 (s) | Max Time Over Threads 2x32 (s) | Max Time Over Threads 2x48 (s) | Time w.r.t. Wall Time 2x1 (s) | Time w.r.t. Wall Time 2x2 (s) | Time w.r.t. Wall Time 2x4 (s) | Time w.r.t. Wall Time 2x8 (s) | Time w.r.t. Wall Time 2x16 (s) | Time w.r.t. Wall Time 2x32 (s) | Time w.r.t. Wall Time 2x48 (s) | Nb Threads 2x1 | Nb Threads 2x2 | Nb Threads 2x4 | Nb Threads 2x8 | Nb Threads 2x16 | Nb Threads 2x32 | Nb Threads 2x48 | Deviation (coverage) 2x1 | Deviation (coverage) 2x2 | Deviation (coverage) 2x4 | Deviation (coverage) 2x8 | Deviation (coverage) 2x16 | Deviation (coverage) 2x32 | Deviation (coverage) 2x48 | Deviation (walltime) 2x1 | Deviation (walltime) 2x2 | Deviation (walltime) 2x4 | Deviation (walltime) 2x8 | Deviation (walltime) 2x16 | Deviation (walltime) 2x32 | Deviation (walltime) 2x48 | Categories 2x1 | Categories 2x2 | Categories 2x4 | Categories 2x8 | Categories 2x16 | Categories 2x32 | Categories 2x48 | GFLOPS 2x1 | GFLOPS 2x2 | GFLOPS 2x4 | GFLOPS 2x8 | GFLOPS 2x16 | GFLOPS 2x32 | GFLOPS 2x48 | Compilation Options | (2x1) Efficiency | (2x1) Potential Speed-Up (%) | (2x2) Efficiency | (2x2) Potential Speed-Up (%) | (2x4) Efficiency | (2x4) Potential Speed-Up (%) | (2x8) Efficiency | (2x8) Potential Speed-Up (%) | (2x16) Efficiency | (2x16) Potential Speed-Up (%) | (2x32) Efficiency | (2x32) Potential Speed-Up (%) | (2x48) Efficiency | (2x48) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►_Z16viscosity_kerneliiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_._omp_fn.0 | exec | 7.32 | 7.48 | 7.41 | 7.32 | 6.93 | 7.02 | 7.12 | 71.18 | 37.62 | 18.96 | 9.56 | 4.78 | 2.94 | 2.5 | 71.17 | 37.38 | 18.88 | 9.5 | 4.73 | 2.87 | 2.39 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.13 | 0.08 | 0.05 | 0.04 | 0.14 | 0.19 | 0.17 | 0.60 | 0.29 | 0.09 | 0.04 | 0.06 | 0.07 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 12.12 | 23.07 | 45.68 | 90.79 | 182.35 | 300.51 | 360.88 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.95 | 0.36 | 0.94 | 0.43 | 0.94 | 0.47 | 0.94 | 0.41 | 0.77 | 1.58 | 0.62 | 2.7 |
►Loop 588 - viscosity.cpp:38-64 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 60 | 62 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 6.90 | 7.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 589 - viscosity.cpp:39-64 - exec [...] | 7.32 | 7.48 | 7.41 | 7.32 | 6.93 | 7.02 | 7.12 | 71.17 | 37.62 | 18.96 | 9.55 | 4.78 | 2.94 | 2.5 | 71.16 | 37.37 | 18.88 | 9.49 | 4.72 | 2.87 | 2.39 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.13 | 0.08 | 0.05 | 0.04 | 0.14 | 0.19 | 0.17 | 0.61 | 0.29 | 0.09 | 0.04 | 0.06 | 0.07 | 12.12 | 23.08 | 45.68 | 90.88 | 182.71 | 300.47 | 360.86 | 1 | 0 | 0.95 | 0.36 | 0.94 | 0.43 | 0.94 | 0.46 | 0.94 | 0.4 | 0.77 | 1.58 | 0.62 | 2.7 | |||||||||
►_Z10PdV_kernelbiiiidRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_._omp_fn.1 | exec | 6.43 | 6.6 | 6.52 | 6.55 | 6.46 | 5.96 | 6.03 | 62.5 | 33.06 | 16.58 | 8.65 | 4.54 | 2.52 | 2.13 | 62.51 | 33.01 | 16.61 | 8.5 | 4.4 | 2.43 | 2.02 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.08 | 0.05 | 0.08 | 0.11 | 0.10 | 0.13 | 0.11 | 0.74 | 0.14 | 0.12 | 0.08 | 0.04 | 0.04 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 9.52 | 18.05 | 35.87 | 70.09 | 135.39 | 245.15 | 294.85 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.95 | 0.35 | 0.94 | 0.39 | 0.92 | 0.53 | 0.89 | 0.72 | 0.8 | 1.17 | 0.64 | 2.14 |
►Loop 274 - PdV.cpp:71-83 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 31 | 50 | 85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 25.20 | 46.40 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 276 - PdV.cpp:72-83 - exec [...] | 6.42 | 6.6 | 6.51 | 6.55 | 6.46 | 5.96 | 6.02 | 62.49 | 33.06 | 16.58 | 8.64 | 4.53 | 2.51 | 2.13 | 62.49 | 33 | 16.61 | 8.49 | 4.4 | 2.43 | 2.02 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.08 | 0.05 | 0.08 | 0.11 | 0.10 | 0.13 | 0.13 | 0.74 | 0.14 | 0.12 | 0.08 | 0.04 | 0.04 | 9.51 | 18.04 | 35.84 | 70.14 | 135.32 | 245.01 | 294.47 | 1 | 0 | 0.95 | 0.35 | 0.94 | 0.39 | 0.92 | 0.52 | 0.89 | 0.73 | 0.8 | 1.17 | 0.64 | 2.14 | |||||||||
○Loop 275 - PdV.cpp:72-83 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z16flux_calc_kerneliiiidRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_._omp_fn.0 | exec | 6.26 | 4.74 | 4.63 | 4.45 | 4.56 | 4.82 | 4.59 | 60.86 | 23.91 | 12.11 | 6.07 | 3.28 | 2.12 | 1.66 | 60.85 | 23.7 | 11.81 | 5.77 | 3.11 | 1.97 | 1.54 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.11 | 0.15 | 0.11 | 0.10 | 0.14 | 0.16 | 0.15 | 0.57 | 0.35 | 0.15 | 0.07 | 0.06 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 3.37 | 8.64 | 17.45 | 35.71 | 66.27 | 104.62 | 133.91 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.28 | 0 | 1.29 | 0 | 1.32 | 0 | 1.22 | 0 | 0.97 | 0.17 | 0.82 | 0.81 |
►Loop 232 - flux_calc.cpp:38-40 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.06 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.05 | 0.04 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 61 | 88 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 13.78 | 49.45 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 2 | -0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 233 - flux_calc.cpp:39-40 - exec | 6.25 | 4.74 | 4.63 | 4.45 | 4.56 | 4.81 | 4.59 | 60.83 | 23.89 | 12.1 | 6.07 | 3.27 | 2.12 | 1.66 | 60.8 | 23.69 | 11.81 | 5.77 | 3.11 | 1.97 | 1.54 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.11 | 0.15 | 0.12 | 0.10 | 0.14 | 0.16 | 0.17 | 0.57 | 0.36 | 0.15 | 0.07 | 0.06 | 0.06 | 3.37 | 8.62 | 17.41 | 35.61 | 66.09 | 104.33 | 133.48 | 1 | 0 | 1.28 | 0 | 1.29 | 0 | 1.32 | 0 | 1.22 | 0 | 0.96 | 0.17 | 0.82 | 0.81 | |||||||||
►_Z14calc_dt_kerneliiiidddddRN6clover8Buffer2DIdEES2_RNS_8Buffer1DIdEES5_S5_S5_S2_S2_S2_S2_S2_S2_S2_S2_RdRiS6_S6_S7_S7_S7_._omp_fn.0 | exec | 5.66 | 6.25 | 6.33 | 6.26 | 5.76 | 5.28 | 5.19 | 55.02 | 31.79 | 16.35 | 8.2 | 4 | 2.22 | 1.84 | 55.02 | 31.26 | 16.14 | 8.12 | 3.92 | 2.16 | 1.74 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.18 | 0.09 | 0.08 | 0.11 | 0.12 | 0.16 | 0.12 | 0.88 | 0.30 | 0.13 | 0.08 | 0.05 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 14.18 | 24.98 | 48.47 | 96.32 | 199.52 | 361.93 | 449.25 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.88 | 0.75 | 0.85 | 0.94 | 0.85 | 0.96 | 0.88 | 0.71 | 0.8 | 1.08 | 0.66 | 1.77 |
○Loop 217 - calc_dt.cpp:49-49 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 218 - calc_dt.cpp:49-75 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 7 | 15 | 9 | 44 | 40 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 220 - calc_dt.cpp:52-75 - exec [...] | 5.66 | 6.25 | 6.33 | 6.26 | 5.76 | 5.28 | 5.18 | 55.01 | 31.79 | 16.35 | 8.2 | 4 | 2.22 | 1.84 | 55.01 | 31.26 | 16.14 | 8.12 | 3.92 | 2.16 | 1.74 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.18 | 0.09 | 0.08 | 0.11 | 0.12 | 0.16 | 0.11 | 0.88 | 0.30 | 0.13 | 0.08 | 0.05 | 0.06 | 14.18 | 24.98 | 48.47 | 96.30 | 199.51 | 361.83 | 449.21 | 1 | 0 | 0.88 | 0.75 | 0.85 | 0.94 | 0.85 | 0.96 | 0.88 | 0.71 | 0.8 | 1.08 | 0.66 | 1.77 | |||||||||
○Loop 219 - calc_dt.cpp:54-75 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z10PdV_kernelbiiiidRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_._omp_fn.0 | exec | 5.33 | 5.31 | 5.26 | 5.2 | 5.15 | 4.87 | 4.96 | 51.8 | 26.62 | 13.46 | 6.82 | 3.62 | 2.07 | 1.73 | 51.83 | 26.55 | 13.41 | 6.75 | 3.51 | 1.99 | 1.67 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.02 | 0.04 | 0.06 | 0.05 | 0.08 | 0.10 | 0.09 | 0.06 | 0.51 | 0.14 | 0.08 | 0.06 | 0.04 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 9.90 | 19.29 | 38.20 | 75.89 | 145.95 | 257.59 | 306.83 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.98 | 0.13 | 0.97 | 0.18 | 0.96 | 0.21 | 0.92 | 0.4 | 0.81 | 0.91 | 0.65 | 1.75 |
►Loop 271 - PdV.cpp:50-63 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 31 | 59 | 58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 22.40 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 273 - PdV.cpp:51-63 - exec [...] | 5.33 | 5.31 | 5.26 | 5.2 | 5.15 | 4.87 | 4.95 | 51.78 | 26.62 | 13.45 | 6.82 | 3.62 | 2.07 | 1.72 | 51.82 | 26.55 | 13.41 | 6.75 | 3.51 | 1.99 | 1.66 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.02 | 0.04 | 0.05 | 0.05 | 0.08 | 0.09 | 0.09 | 0.06 | 0.52 | 0.14 | 0.08 | 0.06 | 0.04 | 0.03 | 9.89 | 19.28 | 38.17 | 75.84 | 145.86 | 257.38 | 308.55 | 1 | 0 | 0.98 | 0.13 | 0.97 | 0.18 | 0.96 | 0.21 | 0.92 | 0.4 | 0.81 | 0.91 | 0.65 | 1.73 | |||||||||
○Loop 272 - PdV.cpp:55-63 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z17accelerate_kerneliiiidRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_._omp_fn.0 | exec | 5.15 | 5.28 | 5.28 | 5.34 | 5.26 | 4.85 | 4.95 | 50.2 | 26.56 | 13.62 | 7.1 | 3.7 | 2.06 | 1.73 | 50.11 | 26.38 | 13.47 | 6.92 | 3.59 | 1.98 | 1.66 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.09 | 0.06 | 0.08 | 0.11 | 0.08 | 0.10 | 0.24 | 0.82 | 0.25 | 0.14 | 0.08 | 0.03 | 0.04 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 14.75 | 28.02 | 54.78 | 106.59 | 205.47 | 372.54 | 444.20 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.95 | 0.27 | 0.93 | 0.37 | 0.91 | 0.51 | 0.87 | 0.67 | 0.79 | 1.01 | 0.63 | 1.84 |
►Loop 155 - accelerate.cpp:42-53 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 63 | 92 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 42.45 | 39.95 | 37.30 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 157 - accelerate.cpp:43-53 - exec | 5.15 | 5.27 | 5.28 | 5.33 | 5.26 | 4.84 | 4.94 | 50.19 | 26.56 | 13.61 | 7.1 | 3.7 | 2.06 | 1.73 | 50.07 | 26.35 | 13.46 | 6.92 | 3.58 | 1.98 | 1.66 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.02 | 0.09 | 0.06 | 0.08 | 0.11 | 0.08 | 0.10 | 0.28 | 0.82 | 0.25 | 0.13 | 0.08 | 0.03 | 0.04 | 14.75 | 28.03 | 54.76 | 106.49 | 205.82 | 372.16 | 443.73 | 1 | 0 | 0.95 | 0.26 | 0.93 | 0.37 | 0.9 | 0.51 | 0.87 | 0.66 | 0.79 | 1.02 | 0.63 | 1.84 | |||||||||
○Loop 156 - accelerate.cpp:43-53 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.04 | 0.03 | 0.02 | 0.01 | 0.02 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 52 | 78 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 20.85 | 16.28 | 35.90 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.5 | 0 | 0.5 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
►_Z16ideal_gas_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_._omp_fn.0 | exec | 4.6 | 4.15 | 4 | 3.9 | 4.01 | 4.35 | 4.18 | 44.74 | 20.67 | 10.36 | 5.22 | 2.82 | 1.84 | 1.49 | 44.79 | 20.73 | 10.2 | 5.06 | 2.73 | 1.78 | 1.4 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.11 | 0.10 | 0.06 | 0.07 | 0.07 | 0.09 | 0.04 | 0.35 | 0.21 | 0.08 | 0.05 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 7.80 | 16.85 | 34.24 | 69.11 | 128.10 | 196.46 | 249.61 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.08 | 0 | 1.1 | 0 | 1.11 | 0 | 1.03 | 0 | 0.79 | 0.93 | 0.67 | 1.39 |
►Loop 242 - ideal_gas.cpp:39-45 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 21 | 40 | 61 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 19.60 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 243 - ideal_gas.cpp:40-45 - exec | 4.6 | 4.15 | 4 | 3.9 | 4 | 4.34 | 4.16 | 44.73 | 20.67 | 10.36 | 5.22 | 2.81 | 1.84 | 1.49 | 44.78 | 20.72 | 10.2 | 5.06 | 2.73 | 1.77 | 1.4 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.11 | 0.10 | 0.06 | 0.07 | 0.07 | 0.09 | 0.04 | 0.35 | 0.21 | 0.08 | 0.04 | 0.03 | 0.03 | 7.80 | 16.85 | 34.22 | 69.07 | 128.07 | 197.49 | 249.45 | 1 | 0 | 1.08 | 0 | 1.1 | 0 | 1.11 | 0 | 1.03 | 0 | 0.79 | 0.91 | 0.67 | 1.39 | |||||||||
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.10 | exec | 4.35 | 5.3 | 5.38 | 5.43 | 5.04 | 4.4 | 4.5 | 42.25 | 27.92 | 14.44 | 7.22 | 3.57 | 1.85 | 1.58 | 42.28 | 26.52 | 13.7 | 7.04 | 3.43 | 1.8 | 1.51 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.28 | 0.25 | 0.18 | 0.12 | 0.07 | 0.10 | 0.04 | 1.64 | 0.71 | 0.25 | 0.09 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 18.51 | 29.49 | 57.09 | 111.10 | 228.09 | 434.75 | 518.64 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.8 | 1.08 | 0.77 | 1.23 | 0.75 | 1.35 | 0.77 | 1.16 | 0.73 | 1.17 | 0.58 | 1.87 |
►Loop 201 - advec_mom.cpp:182-211 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 30 | 52 | 73 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 34.45 | 39.30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 203 - advec_mom.cpp:186-211 - exec [...] | 4.35 | 5.3 | 5.37 | 5.43 | 5.04 | 4.39 | 4.48 | 42.25 | 27.91 | 14.44 | 7.22 | 3.57 | 1.85 | 1.58 | 42.28 | 26.51 | 13.7 | 7.04 | 3.43 | 1.79 | 1.51 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.28 | 0.25 | 0.18 | 0.12 | 0.07 | 0.10 | 0.05 | 1.64 | 0.71 | 0.25 | 0.09 | 0.03 | 0.03 | 18.50 | 29.48 | 57.05 | 111.03 | 227.97 | 436.97 | 518.37 | 1 | 0 | 0.8 | 1.07 | 0.77 | 1.23 | 0.75 | 1.35 | 0.77 | 1.16 | 0.74 | 1.15 | 0.58 | 1.87 | |||||||||
○Loop 202 - advec_mom.cpp:186-211 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 13 | 17 | 17 | 26 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 12.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.6 | exec | 4.2 | 5.07 | 5.19 | 5.24 | 4.84 | 4.16 | 4.29 | 40.82 | 26.31 | 13.85 | 6.99 | 3.41 | 1.77 | 1.53 | 40.81 | 25.36 | 13.24 | 6.79 | 3.3 | 1.7 | 1.44 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.02 | 0.19 | 0.22 | 0.16 | 0.09 | 0.06 | 0.11 | 0.10 | 1.20 | 0.64 | 0.23 | 0.07 | 0.03 | 0.04 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 20.12 | 32.36 | 61.92 | 120.74 | 248.44 | 482.36 | 570.16 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.8 | 0.99 | 0.77 | 1.19 | 0.75 | 1.3 | 0.77 | 1.1 | 0.75 | 1.04 | 0.59 | 1.76 |
►Loop 198 - advec_mom.cpp:110-139 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0.02 | 0.01 | 0.01 | 0.03 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 29 | 57 | 82 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 38.35 | 36.80 | 0.00 | 39.80 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.5 | 0 | 1 | 0 | 0.13 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 200 - advec_mom.cpp:114-139 - exec | 4.19 | 5.07 | 5.19 | 5.23 | 4.84 | 4.15 | 4.28 | 40.78 | 26.3 | 13.84 | 6.99 | 3.4 | 1.76 | 1.53 | 40.78 | 25.35 | 13.24 | 6.78 | 3.3 | 1.69 | 1.44 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.02 | 0.19 | 0.22 | 0.17 | 0.09 | 0.07 | 0.11 | 0.08 | 1.20 | 0.64 | 0.23 | 0.07 | 0.03 | 0.04 | 20.12 | 32.34 | 61.87 | 120.81 | 248.25 | 484.76 | 569.71 | 1 | 0 | 0.8 | 0.99 | 0.77 | 1.19 | 0.75 | 1.3 | 0.77 | 1.1 | 0.75 | 1.02 | 0.59 | 1.75 | |||||||||
○Loop 199 - advec_mom.cpp:114-139 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.03 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 28 | 55 | 46 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 13.08 | 29.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.11 | exec | 3.66 | 3.68 | 3.62 | 3.58 | 3.6 | 3.79 | 3.64 | 35.6 | 18.29 | 9.23 | 4.75 | 2.53 | 1.62 | 1.37 | 35.61 | 18.38 | 9.23 | 4.64 | 2.45 | 1.55 | 1.22 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.03 | 0.04 | 0.05 | 0.07 | 0.07 | 0.15 | 0.05 | 0.23 | 0.10 | 0.07 | 0.05 | 0.03 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 4.61 | 8.94 | 17.50 | 34.75 | 65.90 | 104.24 | 132.87 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.97 | 0.12 | 0.96 | 0.13 | 0.96 | 0.15 | 0.91 | 0.33 | 0.72 | 1.07 | 0.61 | 1.43 |
►Loop 196 - advec_mom.cpp:220-221 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.03 | 0 | 0.01 | 0.02 | 0.01 | 0.01 | 0.02 | 0.03 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 2 | 4 | 8 | 15 | 28 | 46 | 62 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 5.37 | 0.00 | 8.45 | 14.40 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 197 - advec_mom.cpp:221-221 - exec [...] | 3.66 | 3.68 | 3.62 | 3.58 | 3.59 | 3.78 | 3.63 | 35.57 | 18.28 | 9.22 | 4.75 | 2.53 | 1.62 | 1.37 | 35.58 | 18.38 | 9.23 | 4.64 | 2.45 | 1.54 | 1.22 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.03 | 0.04 | 0.05 | 0.07 | 0.07 | 0.15 | 0.05 | 0.23 | 0.10 | 0.07 | 0.05 | 0.03 | 0.05 | 4.61 | 8.93 | 17.49 | 34.72 | 65.84 | 104.80 | 132.75 | 1 | 0 | 0.97 | 0.12 | 0.96 | 0.13 | 0.96 | 0.15 | 0.91 | 0.33 | 0.72 | 1.05 | 0.61 | 1.42 | |||||||||
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.7 | exec | 3.63 | 3.64 | 3.61 | 3.56 | 3.58 | 3.76 | 3.59 | 35.3 | 18.12 | 9.29 | 4.75 | 2.52 | 1.61 | 1.33 | 35.3 | 18.21 | 9.21 | 4.62 | 2.44 | 1.53 | 1.21 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.03 | 0.06 | 0.05 | 0.05 | 0.09 | 0.14 | 0.07 | 0.23 | 0.12 | 0.07 | 0.04 | 0.04 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 4.65 | 9.01 | 17.87 | 35.62 | 67.25 | 107.46 | 135.71 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.97 | 0.11 | 0.96 | 0.15 | 0.96 | 0.16 | 0.9 | 0.34 | 0.72 | 1.05 | 0.61 | 1.41 |
►Loop 190 - advec_mom.cpp:148-149 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 21 | 38 | 52 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 14.05 | 10.45 | 12.45 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 191 - advec_mom.cpp:149-149 - exec [...] | 3.63 | 3.64 | 3.61 | 3.56 | 3.57 | 3.75 | 3.58 | 35.28 | 18.11 | 9.29 | 4.75 | 2.52 | 1.6 | 1.33 | 35.29 | 18.2 | 9.2 | 4.62 | 2.43 | 1.53 | 1.2 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.03 | 0.06 | 0.05 | 0.05 | 0.09 | 0.14 | 0.07 | 0.24 | 0.12 | 0.07 | 0.04 | 0.04 | 0.05 | 4.65 | 9.01 | 17.87 | 35.59 | 67.48 | 107.37 | 136.76 | 1 | 0 | 0.97 | 0.11 | 0.96 | 0.15 | 0.95 | 0.16 | 0.91 | 0.33 | 0.72 | 1.05 | 0.61 | 1.39 | |||||||||
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.6 | exec | 3.49 | 3.78 | 3.85 | 3.88 | 3.63 | 3.28 | 3.33 | 33.93 | 19.69 | 10.31 | 5.16 | 2.56 | 1.39 | 1.19 | 33.94 | 18.87 | 9.81 | 5.03 | 2.47 | 1.34 | 1.12 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.15 | 0.17 | 0.10 | 0.08 | 0.05 | 0.09 | 0.06 | 0.89 | 0.47 | 0.14 | 0.06 | 0.02 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 21.17 | 38.07 | 73.22 | 142.80 | 290.78 | 536.08 | 641.50 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.9 | 0.38 | 0.86 | 0.52 | 0.84 | 0.61 | 0.86 | 0.51 | 0.79 | 0.68 | 0.63 | 1.23 |
►Loop 175 - advec_cell.cpp:159-202 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 28 | 49 | 69 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 22.40 | 30.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 177 - advec_cell.cpp:163-202 - exec [...] | 3.49 | 3.77 | 3.85 | 3.88 | 3.63 | 3.28 | 3.32 | 33.92 | 19.69 | 10.31 | 5.16 | 2.56 | 1.39 | 1.19 | 33.93 | 18.86 | 9.81 | 5.03 | 2.47 | 1.34 | 1.11 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.15 | 0.17 | 0.10 | 0.08 | 0.05 | 0.09 | 0.07 | 0.89 | 0.47 | 0.14 | 0.06 | 0.02 | 0.03 | 21.17 | 38.07 | 73.20 | 142.74 | 290.68 | 535.88 | 647.00 | 1 | 0 | 0.9 | 0.38 | 0.86 | 0.52 | 0.84 | 0.61 | 0.86 | 0.51 | 0.79 | 0.68 | 0.64 | 1.21 | |||||||||
○Loop 176 - advec_cell.cpp:163-202 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.9 | exec | 3.14 | 2.68 | 2.59 | 2.52 | 2.67 | 2.98 | 2.98 | 30.51 | 13.74 | 7.1 | 3.58 | 1.91 | 1.29 | 1.06 | 30.54 | 13.4 | 6.61 | 3.26 | 1.82 | 1.22 | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.11 | 0.13 | 0.09 | 0.06 | 0.07 | 0.10 | 0.01 | 0.48 | 0.31 | 0.11 | 0.04 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 6.72 | 15.38 | 31.17 | 63.19 | 113.04 | 168.59 | 205.32 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.14 | 0 | 1.16 | 0 | 1.17 | 0 | 1.05 | 0 | 0.78 | 0.65 | 0.64 | 1.08 |
►Loop 192 - advec_mom.cpp:169-172 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 61 | 79 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 28.73 | 48.25 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 193 - advec_mom.cpp:170-172 - exec [...] | 3.14 | 2.68 | 2.59 | 2.51 | 2.66 | 2.97 | 2.97 | 30.48 | 13.74 | 7.1 | 3.58 | 1.91 | 1.29 | 1.06 | 30.53 | 13.4 | 6.61 | 3.26 | 1.82 | 1.22 | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.11 | 0.13 | 0.09 | 0.06 | 0.07 | 0.10 | 0.01 | 0.48 | 0.30 | 0.11 | 0.04 | 0.03 | 0.03 | 6.71 | 15.34 | 31.10 | 63.04 | 112.77 | 168.18 | 204.80 | 1 | 0 | 1.14 | 0 | 1.15 | 0 | 1.17 | 0 | 1.05 | 0 | 0.78 | 0.65 | 0.64 | 1.08 | |||||||||
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.0 | exec | 3.13 | 2.64 | 2.56 | 2.52 | 2.64 | 2.93 | 2.96 | 30.42 | 13.22 | 6.73 | 3.39 | 1.89 | 1.27 | 1.04 | 30.41 | 13.19 | 6.52 | 3.26 | 1.8 | 1.2 | 0.99 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.06 | 0.08 | 0.06 | 0.06 | 0.08 | 0.08 | 0.07 | 0.29 | 0.19 | 0.07 | 0.04 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.73 | 6.30 | 12.53 | 25.14 | 45.65 | 68.59 | 83.23 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.15 | 0 | 1.17 | 0 | 1.17 | 0 | 1.06 | 0 | 0.79 | 0.61 | 0.64 | 1.07 |
►Loop 178 - advec_mom.cpp:44-48 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.05 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.05 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 15 | 31 | 54 | 60 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 5.40 | 23.20 | 17.20 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 2.5 | -0 | 1.25 | -0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 179 - advec_mom.cpp:47-48 - exec | 3.12 | 2.64 | 2.55 | 2.51 | 2.63 | 2.92 | 2.95 | 30.38 | 13.2 | 6.73 | 3.39 | 1.88 | 1.26 | 1.04 | 30.36 | 13.18 | 6.51 | 3.26 | 1.8 | 1.19 | 0.99 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.06 | 0.08 | 0.06 | 0.06 | 0.07 | 0.08 | 0.08 | 0.29 | 0.19 | 0.07 | 0.04 | 0.03 | 0.03 | 2.73 | 6.29 | 12.52 | 25.09 | 45.50 | 68.94 | 82.98 | 1 | 0 | 1.15 | 0 | 1.17 | 0 | 1.16 | 0 | 1.05 | 0 | 0.8 | 0.59 | 0.64 | 1.07 | |||||||||
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.5 | exec | 3.13 | 2.66 | 2.56 | 2.52 | 2.66 | 2.97 | 3 | 30.45 | 13.23 | 6.82 | 3.44 | 1.91 | 1.28 | 1.08 | 30.48 | 13.28 | 6.54 | 3.27 | 1.82 | 1.21 | 1.01 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.07 | 0.10 | 0.07 | 0.06 | 0.08 | 0.09 | 0.02 | 0.23 | 0.21 | 0.09 | 0.04 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 6.74 | 15.47 | 31.51 | 63.03 | 113.20 | 170.19 | 203.27 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.15 | 0 | 1.17 | 0 | 1.17 | 0 | 1.05 | 0 | 0.79 | 0.63 | 0.63 | 1.11 |
►Loop 186 - advec_mom.cpp:97-100 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.03 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 63 | 85 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 22.65 | 70.40 | 62.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.5 | -0 | 0.75 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 187 - advec_mom.cpp:98-100 - exec [...] | 3.13 | 2.65 | 2.56 | 2.52 | 2.66 | 2.96 | 2.99 | 30.43 | 13.22 | 6.82 | 3.44 | 1.9 | 1.28 | 1.07 | 30.46 | 13.27 | 6.53 | 3.27 | 1.81 | 1.21 | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.07 | 0.10 | 0.07 | 0.06 | 0.08 | 0.09 | 0.02 | 0.23 | 0.21 | 0.09 | 0.04 | 0.03 | 0.03 | 6.72 | 15.43 | 31.47 | 62.85 | 113.52 | 169.69 | 204.82 | 1 | 0 | 1.15 | 0 | 1.17 | 0 | 1.16 | 0 | 1.05 | 0 | 0.79 | 0.63 | 0.63 | 1.09 | |||||||||
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.7 | exec | 3.07 | 3.02 | 2.99 | 2.95 | 2.97 | 3.1 | 2.79 | 29.86 | 15.07 | 7.68 | 3.89 | 2.1 | 1.29 | 0.99 | 29.82 | 15.08 | 7.62 | 3.83 | 2.02 | 1.26 | 0.94 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.02 | 0.02 | 0.02 | 0.04 | 0.05 | 0.06 | 0.13 | 0.26 | 0.09 | 0.03 | 0.03 | 0.02 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 6.89 | 13.62 | 27.04 | 53.79 | 101.93 | 163.12 | 218.80 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.99 | 0.03 | 0.98 | 0.06 | 0.97 | 0.08 | 0.92 | 0.23 | 0.74 | 0.81 | 0.66 | 0.95 |
►Loop 170 - advec_cell.cpp:210-216 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 14 | 22 | 32 | 42 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 10.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 171 - advec_cell.cpp:211-216 - exec | | 3.07 | 3.02 | 2.99 | 2.95 | 2.96 | 3.09 | 2.78 | 29.86 | 15.06 | 7.67 | 3.89 | 2.1 | 1.29 | 0.99 | 29.82 | 15.08 | 7.62 | 3.83 | 2.02 | 1.26 | 0.93 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.02 | 0.02 | 0.02 | 0.04 | 0.05 | 0.06 | 0.13 | 0.26 | 0.09 | 0.03 | 0.03 | 0.02 | 0.02 | | | | | | | | 6.88 | 13.61 | 27.03 | 53.77 | 101.86 | 163.02 | 221.07 | | 1 | 0 | 0.99 | 0.03 | 0.98 | 0.06 | 0.97 | 0.08 | 0.92 | 0.23 | 0.74 | 0.8 | 0.67 | 0.92 |
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.1 | exec | 3.05 | 2.58 | 2.51 | 2.43 | 2.58 | 2.86 | 2.89 | 29.65 | 13.02 | 6.7 | 3.31 | 1.85 | 1.24 | 1.04 | 29.64 | 12.91 | 6.4 | 3.15 | 1.76 | 1.17 | 0.97 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.07 | 0.10 | 0.06 | 0.06 | 0.08 | 0.09 | 0.08 | 0.33 | 0.22 | 0.08 | 0.04 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.74 | 6.29 | 12.69 | 25.81 | 46.21 | 69.42 | 83.14 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.15 | 0 | 1.16 | 0 | 1.18 | 0 | 1.05 | 0 | 0.79 | 0.6 | 0.64 | 1.05 |
►Loop 180 - advec_mom.cpp:53-57 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.04 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0.03 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 29 | 46 | 56 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 8.13 | 12.10 | 19.60 | 18.20 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.75 | 0 | 0.75 | 0 | 0.38 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○Loop 181 - advec_mom.cpp:56-57 - exec | | 3.04 | 2.58 | 2.51 | 2.42 | 2.57 | 2.85 | 2.88 | 29.64 | 13 | 6.68 | 3.31 | 1.84 | 1.24 | 1.04 | 29.6 | 12.89 | 6.39 | 3.14 | 1.75 | 1.17 | 0.97 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.07 | 0.10 | 0.06 | 0.06 | 0.08 | 0.09 | 0.11 | 0.33 | 0.22 | 0.08 | 0.04 | 0.03 | 0.03 | | | | | | | | 2.74 | 6.28 | 12.68 | 25.84 | 46.35 | 69.22 | 82.91 | | 1 | 0 | 1.15 | 0 | 1.16 | 0 | 1.18 | 0 | 1.06 | 0 | 0.79 | 0.6 | 0.64 | 1.05 |
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.2 | exec | 2.99 | 3.49 | 3.62 | 3.68 | 3.48 | 3.07 | 3 | 29.11 | 18.75 | 9.87 | 5 | 2.45 | 1.3 | 1.07 | 29.09 | 17.46 | 9.23 | 4.77 | 2.37 | 1.26 | 1.01 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.21 | 0.19 | 0.19 | 0.08 | 0.06 | 0.08 | 0.09 | 1.15 | 0.54 | 0.26 | 0.06 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 24.69 | 41.14 | 77.83 | 150.60 | 303.08 | 570.18 | 710.65 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.83 | 0.58 | 0.79 | 0.77 | 0.76 | 0.87 | 0.77 | 0.81 | 0.72 | 0.86 | 0.6 | 1.2 |
►Loop 172 - advec_cell.cpp:67-110 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 31 | 51 | 48 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 13.50 | 9.95 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 174 - advec_cell.cpp:71-110 - exec [...] | | 2.99 | 3.49 | 3.62 | 3.68 | 3.47 | 3.07 | 2.99 | 29.08 | 18.74 | 9.87 | 4.99 | 2.44 | 1.3 | 1.07 | 29.06 | 17.44 | 9.23 | 4.77 | 2.37 | 1.25 | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.21 | 0.19 | 0.19 | 0.08 | 0.06 | 0.08 | 0.10 | 1.16 | 0.54 | 0.26 | 0.06 | 0.03 | 0.03 | | | | | | | | 24.71 | 41.18 | 77.80 | 150.53 | 302.95 | 574.50 | 717.60 | | 1 | 0 | 0.83 | 0.58 | 0.79 | 0.77 | 0.76 | 0.88 | 0.77 | 0.81 | 0.73 | 0.84 | 0.61 | 1.18 |
○Loop 173 - advec_cell.cpp:71-110 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 19 | 31 | 35 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 7.28 | 13.70 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.3 | exec | 2.98 | 2.98 | 2.96 | 2.92 | 2.89 | 3.02 | 2.72 | 28.95 | 14.9 | 7.6 | 3.84 | 2.01 | 1.27 | 0.96 | 28.95 | 14.91 | 7.55 | 3.79 | 1.97 | 1.23 | 0.91 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.02 | 0.03 | 0.03 | 0.04 | 0.05 | 0.06 | 0.06 | 0.30 | 0.09 | 0.04 | 0.03 | 0.02 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 7.09 | 13.72 | 27.29 | 54.36 | 104.61 | 167.54 | 226.61 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.97 | 0.09 | 0.96 | 0.12 | 0.95 | 0.13 | 0.92 | 0.24 | 0.74 | 0.8 | 0.66 | 0.92 |
►Loop 163 - advec_cell.cpp:119-125 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 7 | 13 | 23 | 32 | 53 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 8.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 164 - advec_cell.cpp:120-125 - exec | | 2.98 | 2.98 | 2.96 | 2.92 | 2.89 | 3.01 | 2.71 | 28.94 | 14.9 | 7.6 | 3.83 | 2.01 | 1.27 | 0.96 | 28.95 | 14.9 | 7.55 | 3.78 | 1.97 | 1.23 | 0.91 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.02 | 0.03 | 0.03 | 0.04 | 0.05 | 0.06 | 0.05 | 0.30 | 0.09 | 0.03 | 0.03 | 0.02 | 0.02 | | | | | | | | 7.09 | 13.73 | 27.28 | 54.48 | 104.55 | 167.46 | 226.44 | | 1 | 0 | 0.97 | 0.09 | 0.96 | 0.12 | 0.96 | 0.12 | 0.92 | 0.24 | 0.74 | 0.8 | 0.66 | 0.91 |
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.3 | exec | 2.4 | 2.07 | 2.03 | 2 | 2.09 | 2.33 | 2.21 | 23.32 | 10.49 | 5.36 | 2.75 | 1.52 | 1 | 0.79 | 23.32 | 10.37 | 5.17 | 2.6 | 1.43 | 0.95 | 0.74 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.04 | 0.07 | 0.05 | 0.05 | 0.06 | 0.09 | 0.06 | 0.25 | 0.16 | 0.07 | 0.04 | 0.02 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.78 | 3.94 | 7.90 | 15.77 | 28.62 | 43.46 | 55.25 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.12 | 0 | 1.13 | 0 | 1.12 | 0 | 1.02 | 0 | 0.77 | 0.54 | 0.66 | 0.76 |
►Loop 184 - advec_mom.cpp:73-75 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.03 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.02 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 14 | 25 | 36 | 58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 5.20 | 7.50 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.75 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○Loop 185 - advec_mom.cpp:74-75 - exec | | 2.39 | 2.07 | 2.03 | 2 | 2.09 | 2.32 | 2.21 | 23.3 | 10.46 | 5.35 | 2.75 | 1.52 | 1 | 0.79 | 23.29 | 10.35 | 5.17 | 2.59 | 1.42 | 0.95 | 0.74 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.04 | 0.07 | 0.06 | 0.05 | 0.06 | 0.09 | 0.07 | 0.25 | 0.16 | 0.07 | 0.03 | 0.02 | 0.03 | | | | | | | | 1.78 | 3.93 | 7.88 | 15.76 | 28.72 | 43.33 | 55.00 | | 1 | 0 | 1.13 | 0 | 1.13 | 0 | 1.12 | 0 | 1.03 | 0 | 0.77 | 0.54 | 0.66 | 0.76 |
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.2 | exec | 2.33 | 2.07 | 2 | 1.95 | 2.06 | 2.27 | 2.17 | 22.7 | 10.39 | 5.24 | 2.63 | 1.48 | 0.96 | 0.79 | 22.7 | 10.37 | 5.09 | 2.53 | 1.4 | 0.93 | 0.73 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.04 | 0.06 | 0.04 | 0.05 | 0.05 | 0.09 | 0.05 | 0.22 | 0.13 | 0.05 | 0.03 | 0.02 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.79 | 3.98 | 8.10 | 16.23 | 29.46 | 43.70 | 55.22 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.09 | 0 | 1.11 | 0 | 1.12 | 0 | 1.01 | 0 | 0.76 | 0.54 | 0.65 | 0.76 |
►Loop 182 - advec_mom.cpp:62-66 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 14 | 24 | 47 | 50 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 5.47 | 17.60 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 183 - advec_mom.cpp:65-66 - exec | | 2.33 | 2.07 | 1.99 | 1.95 | 2.06 | 2.26 | 2.17 | 22.67 | 10.37 | 5.24 | 2.63 | 1.48 | 0.95 | 0.78 | 22.67 | 10.36 | 5.08 | 2.53 | 1.4 | 0.92 | 0.73 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.04 | 0.06 | 0.04 | 0.05 | 0.05 | 0.09 | 0.05 | 0.22 | 0.13 | 0.06 | 0.03 | 0.02 | 0.03 | | | | | | | | 1.78 | 3.97 | 8.10 | 16.17 | 29.32 | 43.98 | 55.02 | | 1 | 0 | 1.09 | 0 | 1.12 | 0 | 1.12 | 0 | 1.01 | 0 | 0.77 | 0.52 | 0.65 | 0.77 |
►_Z18reset_field_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_._omp_fn.1 | exec | 1.97 | 1.93 | 1.9 | 1.87 | 1.91 | 2.02 | 1.97 | 19.15 | 9.56 | 4.93 | 2.5 | 1.37 | 0.87 | 0.73 | 19.16 | 9.63 | 4.84 | 2.43 | 1.3 | 0.83 | 0.66 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.02 | 0.04 | 0.03 | 0.05 | 0.05 | 0.09 | 0.02 | 0.09 | 0.09 | 0.04 | 0.03 | 0.02 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.99 | 0.01 | 0.99 | 0.02 | 0.99 | 0.03 | 0.92 | 0.15 | 0.72 | 0.56 | 0.6 | 0.78 |
►Loop 279 - reset_field.cpp:46-48 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 3 | 4 | 7 | 11 | 22 | 24 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 280 - reset_field.cpp:47-48 - exec | | 1.97 | 1.93 | 1.9 | 1.87 | 1.91 | 2.02 | 1.97 | 19.13 | 9.56 | 4.93 | 2.49 | 1.37 | 0.87 | 0.73 | 19.15 | 9.63 | 4.84 | 2.42 | 1.3 | 0.82 | 0.66 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.03 | 0.04 | 0.03 | 0.05 | 0.05 | 0.09 | 0.02 | 0.09 | 0.09 | 0.04 | 0.03 | 0.02 | 0.03 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.99 | 0.01 | 0.99 | 0.02 | 0.99 | 0.02 | 0.92 | 0.15 | 0.73 | 0.55 | 0.6 | 0.78 |
►_Z13revert_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_._omp_fn.0 | exec | 1.75 | 1.68 | 1.66 | 1.64 | 1.66 | 1.69 | 1.63 | 16.98 | 8.36 | 4.3 | 2.19 | 1.2 | 0.76 | 0.62 | 17 | 8.37 | 4.24 | 2.12 | 1.13 | 0.69 | 0.55 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.01 | 0.04 | 0.03 | 0.06 | 0.06 | 0.07 | 0.01 | 0.11 | 0.08 | 0.05 | 0.04 | 0.03 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.02 | 0 | 1 | -0 | 1 | -0 | 0.94 | 0.1 | 0.77 | 0.39 | 0.64 | 0.58 |
►Loop 284 - revert.cpp:36-38 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 3 | 4 | 4 | 4 | 16 | 21 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 285 - revert.cpp:37-38 - exec | | 1.75 | 1.67 | 1.66 | 1.63 | 1.66 | 1.69 | 1.63 | 16.95 | 8.36 | 4.3 | 2.18 | 1.2 | 0.76 | 0.61 | 16.98 | 8.37 | 4.23 | 2.12 | 1.13 | 0.69 | 0.55 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.04 | 0.03 | 0.06 | 0.06 | 0.07 | 0.01 | 0.11 | 0.09 | 0.05 | 0.04 | 0.03 | 0.02 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1.01 | 0 | 1 | 0 | 1 | -0 | 0.94 | 0.1 | 0.77 | 0.39 | 0.64 | 0.58 |
►_Z18reset_field_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_._omp_fn.0 | exec | 1.74 | 1.67 | 1.66 | 1.63 | 1.67 | 1.76 | 1.69 | 16.92 | 8.34 | 4.31 | 2.18 | 1.21 | 0.78 | 0.66 | 16.93 | 8.36 | 4.22 | 2.12 | 1.14 | 0.72 | 0.57 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.02 | 0.04 | 0.03 | 0.05 | 0.07 | 0.09 | 0.03 | 0.12 | 0.10 | 0.04 | 0.04 | 0.03 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.01 | 0 | 1 | -0 | 1 | 0 | 0.93 | 0.12 | 0.73 | 0.47 | 0.62 | 0.64 |
►Loop 281 - reset_field.cpp:36-38 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 3 | 3 | 4 | 8 | 18 | 21 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 282 - reset_field.cpp:37-38 - exec | | 1.74 | 1.67 | 1.65 | 1.63 | 1.67 | 1.75 | 1.68 | 16.9 | 8.34 | 4.31 | 2.18 | 1.2 | 0.78 | 0.65 | 16.91 | 8.36 | 4.22 | 2.12 | 1.14 | 0.72 | 0.57 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.02 | 0.04 | 0.03 | 0.05 | 0.07 | 0.08 | 0.02 | 0.12 | 0.10 | 0.04 | 0.04 | 0.03 | 0.03 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1.01 | 0 | 1 | -0 | 1 | 0 | 0.93 | 0.12 | 0.73 | 0.47 | 0.62 | 0.64 |
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.0 | exec | 1.62 | 1.37 | 1.33 | 1.29 | 1.31 | 1.48 | 1.47 | 15.82 | 7.02 | 3.57 | 1.76 | 0.95 | 0.64 | 0.54 | 15.77 | 6.85 | 3.39 | 1.67 | 0.9 | 0.61 | 0.49 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.06 | 0.08 | 0.04 | 0.04 | 0.04 | 0.05 | 0.10 | 0.30 | 0.18 | 0.06 | 0.02 | 0.02 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 3.95 | 9.15 | 18.28 | 37.11 | 68.84 | 101.60 | 126.98 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.15 | 0 | 1.16 | 0 | 1.18 | 0 | 1.1 | 0 | 0.81 | 0.28 | 0.67 | 0.48 |
►Loop 159 - advec_cell.cpp:44-48 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.04 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 13 | 23 | 36 | 48 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 8.60 | 15.20 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 160 - advec_cell.cpp:47-48 - exec | | 1.62 | 1.37 | 1.33 | 1.29 | 1.31 | 1.48 | 1.47 | 15.78 | 7.01 | 3.56 | 1.76 | 0.94 | 0.64 | 0.54 | 15.75 | 6.85 | 3.38 | 1.67 | 0.89 | 0.61 | 0.49 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.06 | 0.08 | 0.05 | 0.04 | 0.04 | 0.05 | 0.08 | 0.30 | 0.18 | 0.06 | 0.02 | 0.02 | 0.02 | | | | | | | | 3.95 | 9.13 | 18.32 | 37.04 | 69.42 | 101.40 | 126.63 | | 1 | 0 | 1.15 | 0 | 1.16 | 0 | 1.18 | 0 | 1.11 | 0 | 0.81 | 0.29 | 0.67 | 0.49 |
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.4 | exec | 1.57 | 1.33 | 1.29 | 1.24 | 1.28 | 1.44 | 1.42 | 15.28 | 6.74 | 3.51 | 1.74 | 0.93 | 0.63 | 0.54 | 15.24 | 6.63 | 3.29 | 1.61 | 0.87 | 0.59 | 0.48 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.05 | 0.08 | 0.05 | 0.04 | 0.04 | 0.06 | 0.09 | 0.27 | 0.18 | 0.06 | 0.03 | 0.02 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 4.00 | 9.24 | 18.41 | 37.62 | 69.65 | 102.47 | 126.53 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.15 | 0 | 1.16 | 0 | 1.18 | 0 | 1.09 | 0 | 0.81 | 0.28 | 0.66 | 0.48 |
►Loop 165 - advec_cell.cpp:136-140 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 14 | 23 | 37 | 35 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 8.40 | 16.00 | 16.20 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 166 - advec_cell.cpp:139-140 - exec | | 1.56 | 1.33 | 1.29 | 1.24 | 1.27 | 1.44 | 1.42 | 15.26 | 6.73 | 3.5 | 1.73 | 0.93 | 0.63 | 0.54 | 15.22 | 6.63 | 3.29 | 1.61 | 0.87 | 0.59 | 0.48 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.01 | 0.05 | 0.08 | 0.05 | 0.04 | 0.04 | 0.06 | 0.09 | 0.27 | 0.18 | 0.06 | 0.03 | 0.02 | 0.02 | | | | | | | | 3.99 | 9.22 | 18.36 | 37.53 | 69.46 | 102.23 | 126.25 | | 1 | 0 | 1.15 | 0 | 1.16 | 0 | 1.18 | 0 | 1.09 | 0 | 0.81 | 0.28 | 0.66 | 0.48 |
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.5 | exec | 1.23 | 1.08 | 1.04 | 1.03 | 1.07 | 1.19 | 1.16 | 11.92 | 5.41 | 2.71 | 1.39 | 0.77 | 0.52 | 0.43 | 11.93 | 5.42 | 2.65 | 1.33 | 0.73 | 0.49 | 0.39 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.01 | 0.09 | 0.07 | 0.03 | 0.02 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.74 | 3.83 | 7.98 | 16.00 | 29.15 | 43.45 | 54.16 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.1 | 0 | 1.13 | 0 | 1.12 | 0 | 1.02 | 0 | 0.76 | 0.28 | 0.64 | 0.42 |
►Loop 169 - advec_cell.cpp:148-150 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 2 | 2 | 9 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
►Loop 167 - advec_cell.cpp:148-150 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 1 | 11 | 20 | 36 | 44 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 9.40 | 6.80 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 168 - advec_cell.cpp:149-150 - exec | | 1.22 | 1.08 | 1.04 | 1.03 | 1.07 | 1.19 | 1.15 | 11.91 | 5.39 | 2.71 | 1.39 | 0.77 | 0.52 | 0.43 | 11.91 | 5.41 | 2.65 | 1.33 | 0.73 | 0.49 | 0.39 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.02 | 0.09 | 0.07 | 0.03 | 0.02 | 0.01 | 0.02 | | | | | | | | 1.74 | 3.83 | 7.98 | 15.94 | 28.98 | 43.16 | 53.82 | | 1 | 0 | 1.1 | 0 | 1.12 | 0 | 1.12 | 0 | 1.02 | 0 | 0.76 | 0.29 | 0.64 | 0.42 |
►_Z17advec_cell_kerneliiiiiiRN6clover8Buffer1DIdEES2_RNS_8Buffer2DIdEES5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_S5_._omp_fn.1 | exec | 1.22 | 1.06 | 1.01 | 0.98 | 1.04 | 1.17 | 1.15 | 11.92 | 5.32 | 2.67 | 1.33 | 0.74 | 0.52 | 0.42 | 11.9 | 5.28 | 2.58 | 1.27 | 0.71 | 0.48 | 0.39 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.04 | 0.04 | 0.03 | 0.02 | 0.03 | 0.05 | 0.06 | 0.15 | 0.08 | 0.03 | 0.02 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.71 | 3.91 | 8.00 | 16.18 | 28.70 | 42.13 | 52.19 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.13 | 0 | 1.15 | 0 | 1.17 | 0 | 1.05 | 0 | 0.77 | 0.26 | 0.64 | 0.42 |
►Loop 161 - advec_cell.cpp:56-58 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 12 | 16 | 29 | 35 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 11.20 | 5.20 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 162 - advec_cell.cpp:57-58 - exec | | 1.22 | 1.06 | 1.01 | 0.98 | 1.04 | 1.16 | 1.14 | 11.91 | 5.32 | 2.67 | 1.33 | 0.74 | 0.51 | 0.42 | 11.89 | 5.28 | 2.58 | 1.27 | 0.71 | 0.48 | 0.38 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.04 | 0.04 | 0.03 | 0.02 | 0.03 | 0.06 | 0.06 | 0.15 | 0.08 | 0.03 | 0.02 | 0.01 | 0.02 | | | | | | | | 1.70 | 3.90 | 7.95 | 16.14 | 28.60 | 41.98 | 53.37 | | 1 | 0 | 1.13 | 0 | 1.15 | 0 | 1.17 | 0 | 1.05 | 0 | 0.77 | 0.26 | 0.65 | 0.4 |
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.4 | exec | 0.8 | 0.86 | 0.83 | 0.83 | 0.81 | 0.81 | 0.76 | 7.79 | 4.32 | 2.26 | 1.13 | 0.61 | 0.38 | 0.31 | 7.77 | 4.28 | 2.13 | 1.07 | 0.55 | 0.33 | 0.25 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.03 | 0.03 | 0.03 | 0.05 | 0.08 | 0.04 | 0.08 | 0.09 | 0.04 | 0.02 | 0.02 | 0.03 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 10.57 | 19.20 | 38.58 | 76.72 | 149.69 | 249.73 | 331.03 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.91 | 0.08 | 0.91 | 0.07 | 0.91 | 0.08 | 0.88 | 0.09 | 0.74 | 0.21 | 0.65 | 0.27 |
►Loop 188 - advec_mom.cpp:87-88 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.03 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 15 | 29 | 44 | 63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 11.50 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 189 - advec_mom.cpp:88-88 - exec | | 0.8 | 0.85 | 0.83 | 0.83 | 0.81 | 0.81 | 0.75 | 7.77 | 4.31 | 2.26 | 1.13 | 0.61 | 0.38 | 0.31 | 7.75 | 4.27 | 2.13 | 1.07 | 0.55 | 0.33 | 0.25 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.03 | 0.03 | 0.04 | 0.05 | 0.08 | 0.05 | 0.08 | 0.10 | 0.04 | 0.02 | 0.02 | 0.03 | | | | | | | | 10.57 | 19.20 | 38.51 | 76.56 | 149.38 | 249.12 | 330.07 | | 1 | 0 | 0.91 | 0.08 | 0.91 | 0.08 | 0.91 | 0.08 | 0.88 | 0.1 | 0.73 | 0.22 | 0.65 | 0.27 |
►_Z16advec_mom_kerneliiiiRN6clover8Buffer2DIdEES2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_S2_RNS_8Buffer1DIdEES5_iii._omp_fn.8 | exec | 0.79 | 0.84 | 0.82 | 0.8 | 0.79 | 0.79 | 0.75 | 7.65 | 4.18 | 2.22 | 1.14 | 0.61 | 0.36 | 0.3 | 7.64 | 4.18 | 2.08 | 1.04 | 0.54 | 0.32 | 0.25 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.04 | 0.04 | 0.04 | 0.05 | 0.06 | 0.02 | 0.07 | 0.11 | 0.05 | 0.03 | 0.02 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 10.75 | 19.65 | 40.16 | 80.32 | 154.82 | 260.82 | 334.87 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.91 | 0.07 | 0.92 | 0.07 | 0.92 | 0.07 | 0.88 | 0.09 | 0.75 | 0.2 | 0.64 | 0.27 |
►Loop 194 - advec_mom.cpp:159-160 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 15 | 23 | 45 | 51 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 10.08 | 20.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 195 - advec_mom.cpp:160-160 - exec | | 0.78 | 0.83 | 0.82 | 0.8 | 0.79 | 0.79 | 0.74 | 7.63 | 4.18 | 2.22 | 1.13 | 0.61 | 0.36 | 0.29 | 7.63 | 4.17 | 2.08 | 1.04 | 0.54 | 0.32 | 0.25 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.04 | 0.04 | 0.04 | 0.05 | 0.06 | 0.02 | 0.07 | 0.11 | 0.05 | 0.03 | 0.02 | 0.02 | | | | | | | | 10.74 | 19.65 | 40.05 | 80.21 | 154.54 | 260.12 | 334.02 | | 1 | 0 | 0.91 | 0.07 | 0.92 | 0.07 | 0.92 | 0.07 | 0.88 | 0.09 | 0.75 | 0.2 | 0.64 | 0.27 |
►_Z13field_summaryR16global_variablesR9parallel_._omp_fn.0 | exec | 0.63 | 0.66 | 0.67 | 0.68 | 0.62 | 0.63 | 0.65 | 6.16 | 3.36 | 1.77 | 0.89 | 0.43 | 0.28 | 0.23 | 6.15 | 3.31 | 1.7 | 0.88 | 0.42 | 0.26 | 0.22 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.03 | 0.09 | 0.07 | 0.02 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 11.89 | 22.10 | 43.02 | 83.09 | 174.18 | 281.52 | 332.26 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.93 | 0.05 | 0.9 | 0.06 | 0.87 | 0.09 | 0.92 | 0.05 | 0.74 | 0.16 | 0.58 | 0.27 |
○Loop 229 - context.h:69-69 - exec [...] | | 0.63 | 0.66 | 0.67 | 0.68 | 0.62 | 0.63 | 0.65 | 6.16 | 3.36 | 1.77 | 0.89 | 0.43 | 0.28 | 0.23 | 6.15 | 3.31 | 1.7 | 0.88 | 0.42 | 0.26 | 0.22 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.03 | 0.09 | 0.07 | 0.02 | 0.01 | 0.01 | 0.01 | | | | | | | | 11.89 | 22.10 | 43.02 | 83.09 | 174.18 | 281.52 | 332.26 | | 1 | 0 | 0.93 | 0.05 | 0.9 | 0.06 | 0.87 | 0.09 | 0.92 | 0.05 | 0.74 | 0.16 | 0.58 | 0.27 |
○__memset_avx512_unaligned_erms | libc.so.6 | 0.09 | 0.08 | 0.08 | 0.08 | 0.07 | 0.08 | 0.1 | 0.86 | 0.4 | 0.21 | 0.11 | 0.05 | 0.05 | 0.04 | 0.85 | 0.4 | 0.2 | 0.1 | 0.05 | 0.03 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1.06 | 0 | 1.06 | 0 | 1.06 | 0 | 1.06 | -0 | 0.89 | 0.01 | 0.59 | 0.04 |
○impi_pause | libmpi.so.12.0.0 | 0.07 | 0.59 | 0.15 | 0.08 | 0.05 | 0.02 | 0.03 | 0.83 | 7.37 | 2.37 | 0.9 | 0.7 | 0.53 | 0.73 | 0.64 | 2.93 | 0.39 | 0.1 | 0.03 | 0.01 | 0.01 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.03 | 0.45 | 0.46 | 0.10 | 0.43 | 0.84 | 1.27 | 0.28 | 2.28 | 1.17 | 0.12 | 0.29 | 0.34 | 0.42 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.11 | 0.53 | 0.41 | 0.09 | 0.8 | 0.02 | 1.33 | 0 | 2 | 0 | 1.33 | 0 |
►_Z14generate_chunkiR16global_variables._omp_fn.0 | exec | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.34 | 0.22 | 0.11 | 0.06 | 0.03 | 0.02 | 0.01 | 0.34 | 0.2 | 0.11 | 0.06 | 0.03 | 0.02 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.03 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.85 | 0.01 | 0.77 | 0.01 | 0.71 | 0.01 | 0.71 | 0.01 | 0.53 | 0.02 | 0.71 | 0.01 |
►Loop 235 - generate_chunk.cpp:74-80 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 1 | 0 | 1 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 236 - generate_chunk.cpp:77-80 - exec | | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.34 | 0.22 | 0.11 | 0.06 | 0.03 | 0.02 | 0.01 | 0.34 | 0.2 | 0.11 | 0.05 | 0.03 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.03 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.85 | 0.01 | 0.77 | 0.01 | 0.85 | 0.01 | 0.71 | 0.01 | 1.06 | -0 | 0.71 | 0.01 |
►_Z16initialise_chunkiR16global_variables._omp_fn.4 | exec | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.31 | 0.17 | 0.09 | 0.05 | 0.03 | 0.02 | 0.01 | 0.31 | 0.14 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.11 | -0 | 0.97 | 0 | 0.97 | 0 | 0.97 | 0 | 0.97 | 0 | 0.65 | 0.01 |
►Loop 248 - initialise_chunk.cpp:77-82 - exec [...] | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | | | | | | | | | |
○Loop 249 - initialise_chunk.cpp:80-82 - exec | | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.31 | 0.17 | 0.09 | 0.05 | 0.03 | 0.02 | 0.01 | 0.31 | 0.14 | 0.08 | 0.04 | 0.02 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | | | | | | | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 1.11 | -0 | 0.97 | 0 | 0.97 | 0 | 0.97 | 0 | 0.97 | 0 | 0.65 | 0.01 |
○unknown_kernel_region | kernel | 0.03 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03 | 0.04 | 0.28 | 0.14 | 0.09 | 0.05 | 0.04 | 0.03 | 0.04 | 0.27 | 0.11 | 0.06 | 0.02 | 0.02 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 62 | 92 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | System (%): 98.17 OMP (%): 1.83 | System (%): 100.00 | System (%): 95.88 OMP (%): 4.12 | System (%): 97.33 OMP (%): 2.67 | System (%): 96.39 OMP (%): 3.61 | System (%): 96.43 OMP (%): 2.98 MPI (%): 0.60 | System (%): 95.35 OMP (%): 4.65 | 0.32 | 0.53 | 0.79 | 2.75 | 2.40 | 5.00 | 6.90 | | 1 | 0 | 1.23 | -0 | 1.13 | -0 | 1.69 | 0 | 0.84 | 0 | 0.84 | 0 | 0.56 | 0.02 |
○MPL_gpu_cuda_init | libmpi.so.12.0.0 | 0.02 | 0.13 | 0.02 | 0.01 | 0 | 0 | 0 | 0.29 | 1.47 | 0.27 | 0.08 | 0.07 | 0.09 | 0.07 | 0.17 | 0.67 | 0.05 | 0.01 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.02 | 0.04 | 0.04 | 0.01 | 0.07 | 0.15 | 0.07 | 0.17 | 0.22 | 0.09 | 0.01 | 0.05 | 0.06 | 0.02 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | | 1 | 0 | 0.13 | 0.11 | 0.85 | 0 | 2.13 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
►_Z24clover_pack_message_leftR16global_variablesiiiiRN6clover8Buffer2DIdEERNS1_8Buffer1DIdEEiiiiiii._omp_fn.0 | exec | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.45 | 0.21 | 0.13 | 0.07 | 0.06 | 0.04 | 0.04 | 0.23 | 0.1 | 0.05 | 0.02 | 0.02 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 16 | 31 | 46 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.00 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.15 | -0 | 1.15 | -0 | 1.44 | 0 | 0.72 | 0.01 | 0.72 | 0.01 | 0.48 | 0.01 |
►Loop 255 - pack_kernel.cpp:57-59 - exec [...] | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.45 | 0.21 | 0.13 | 0.07 | 0.06 | 0.04 | 0.03 | 0.23 | 0.1 | 0.05 | 0.02 | 0.02 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 16 | 31 | 46 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.00 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.15 | -0 | 1.15 | -0 | 1.44 | 0 | 0.72 | 0.01 | 0.72 | 0.01 | 0.48 | 0.01 | |||||||||
○Loop 256 - pack_kernel.cpp:57-59 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z25clover_pack_message_rightR16global_variablesiiiiRN6clover8Buffer2DIdEERNS1_8Buffer1DIdEEiiiiiii._omp_fn.0 | exec | 0.02 | 0.02 | 0.02 | 0.03 | 0.02 | 0.02 | 0.03 | 0.39 | 0.26 | 0.14 | 0.1 | 0.05 | 0.04 | 0.04 | 0.2 | 0.12 | 0.06 | 0.03 | 0.02 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 16 | 31 | 47 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.03 | 0.03 | 0.00 | 0.05 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.83 | 0 | 0.83 | 0 | 0.83 | 0 | 0.63 | 0.01 | 0.63 | 0.01 | 0.42 | 0.02 |
►Loop 259 - pack_kernel.cpp:122-124 - exec [...] | 0.02 | 0.02 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.39 | 0.26 | 0.14 | 0.1 | 0.05 | 0.04 | 0.04 | 0.2 | 0.12 | 0.06 | 0.03 | 0.02 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 16 | 31 | 47 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.03 | 0.00 | 0.05 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.83 | 0 | 0.83 | 0 | 0.83 | 0 | 0.63 | 0.01 | 0.63 | 0.01 | 0.42 | 0.01 | |||||||||
○Loop 260 - pack_kernel.cpp:122-124 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z14generate_chunkiR16global_variables._omp_fn.1 | exec | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.31 | 0.24 | 0.19 | 0.12 | 0.06 | 0.03 | 0.03 | 0.2 | 0.11 | 0.05 | 0.03 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 49 | 68 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.15 | 0.09 | 0.06 | 0.03 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.91 | 0 | 1 | 0 | 0.83 | 0 | 1.25 | 0 | 0.63 | 0.01 | 0.42 | 0.01 |
►Loop 237 - generate_chunk.cpp:85-123 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 239 - context.h:46-69 - exec [...] | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.31 | 0.24 | 0.19 | 0.12 | 0.06 | 0.03 | 0.03 | 0.2 | 0.11 | 0.05 | 0.03 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 49 | 68 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.15 | 0.09 | 0.06 | 0.03 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.91 | 0 | 1 | 0 | 0.83 | 0 | 1.25 | 0 | 0.63 | 0.01 | 0.42 | 0.01 | |||||||||
○Loop 238 - generate_chunk.cpp:88-123 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z27clover_unpack_message_rightR16global_variablesiiiiRN6clover8Buffer2DIdEERNS1_8Buffer1DIdEEiiiiiii._omp_fn.0 | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.21 | 0.12 | 0.05 | 0.04 | 0.02 | 0.03 | 0.04 | 0.11 | 0.05 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 16 | 30 | 47 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.02 | 0.03 | 0.00 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 1.1 | -0 | 1.38 | -0 | 1.38 | -0 | 0.69 | 0 | 0.34 | 0.01 | 0.23 | 0.02 |
►Loop 261 - pack_kernel.cpp:158-160 - exec [...] | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.21 | 0.12 | 0.05 | 0.04 | 0.02 | 0.03 | 0.03 | 0.11 | 0.05 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 15 | 30 | 46 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.02 | 0.00 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.1 | -0 | 1.38 | -0 | 1.38 | -0 | 0.69 | 0 | 0.34 | 0.01 | 0.23 | 0.02 | |||||||||
○Loop 262 - pack_kernel.cpp:158-160 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►_Z26clover_unpack_message_leftR16global_variablesiiiiRN6clover8Buffer2DIdEERNS1_8Buffer1DIdEEiiiiiii._omp_fn.0 | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.21 | 0.15 | 0.07 | 0.04 | 0.04 | 0.04 | 0.04 | 0.1 | 0.06 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 12 | 32 | 47 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.06 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | GNU C++17 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp | 1 | 0 | 0.83 | 0 | 0.83 | 0 | 1.25 | -0 | 0.63 | 0 | 0.31 | 0.01 | 0.21 | 0.02 |
►Loop 257 - pack_kernel.cpp:90-92 - exec [...] | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.21 | 0.15 | 0.07 | 0.04 | 0.03 | 0.04 | 0.04 | 0.1 | 0.06 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 2 | 4 | 8 | 12 | 32 | 47 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.06 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.83 | 0 | 0.83 | 0 | 1.25 | -0 | 0.63 | 0 | 0.31 | 0.01 | 0.21 | 0.02 | |||||||||
○Loop 258 - pack_kernel.cpp:90-92 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○gomp_barrier_wait_end | libgomp.so.1.0.0 | 0 | 0.15 | 0.28 | 0.51 | 0.97 | 1.22 | 2.06 | 0 | 1.47 | 1.05 | 0.81 | 1.01 | 1.06 | 1.19 | 0 | 0.74 | 0.7 | 0.66 | 0.66 | 0.5 | 0.69 | 0 | 4 | 8 | 16 | 32 | 63 | 94 | 0.00 | 0.15 | 0.15 | 0.18 | 0.43 | 0.86 | 1.04 | 0.00 | 0.75 | 0.37 | 0.23 | 0.29 | 0.35 | 0.35 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 0 | 0.83 | 2.1 | 2.9 | 3.63 | 3.27 | 3.6 | 0 | 6.88 | 7.51 | 4.99 | 3.34 | 1.84 | 1.9 | 0 | 4.14 | 5.35 | 3.76 | 2.47 | 1.34 | 1.21 | 0 | 4 | 8 | 16 | 32 | 64 | 96 | 0.00 | 0.54 | 0.55 | 0.70 | 1.03 | 0.86 | 1.00 | 0.00 | 2.61 | 1.38 | 0.89 | 0.70 | 0.35 | 0.33 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |