Loop id | Source Location | Source Function | Level | Coverage 2x1 (%) | Coverage 2x2 (%) | Coverage 2x4 (%) | Coverage 2x8 (%) | Coverage 2x16 (%) | Coverage 2x32 (%) | Coverage 2x64 (%) | Coverage 2x96 (%) | Max Time Over Threads 2x1 (s) | Max Time Over Threads 2x2 (s) | Max Time Over Threads 2x4 (s) | Max Time Over Threads 2x8 (s) | Max Time Over Threads 2x16 (s) | Max Time Over Threads 2x32 (s) | Max Time Over Threads 2x64 (s) | Max Time Over Threads 2x96 (s) | Time w.r.t. Wall Time 2x1 (s) | Time w.r.t. Wall Time 2x2 (s) | Time w.r.t. Wall Time 2x4 (s) | Time w.r.t. Wall Time 2x8 (s) | Time w.r.t. Wall Time 2x16 (s) | Time w.r.t. Wall Time 2x32 (s) | Time w.r.t. Wall Time 2x64 (s) | Time w.r.t. Wall Time 2x96 (s) | Nb Threads 2x1 | Nb Threads 2x2 | Nb Threads 2x4 | Nb Threads 2x8 | Nb Threads 2x16 | Nb Threads 2x32 | Nb Threads 2x64 | Nb Threads 2x96 | GFLOPS 2x1 | GFLOPS 2x2 | GFLOPS 2x4 | GFLOPS 2x8 | GFLOPS 2x16 | GFLOPS 2x32 | GFLOPS 2x64 | GFLOPS 2x96 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing 2x1 | Speedup If Perfect Load Balancing 2x2 | Speedup If Perfect Load Balancing 2x4 | Speedup If Perfect Load Balancing 2x8 | Speedup If Perfect Load Balancing 2x16 | Speedup If Perfect Load Balancing 2x32 | Speedup If Perfect Load Balancing 2x64 | Speedup If Perfect Load Balancing 2x96 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (2x1) Efficiency | (2x1) Potential Speed-Up (%) | (2x2) Efficiency | (2x2) Potential Speed-Up (%) | (2x4) Efficiency | (2x4) Potential Speed-Up (%) | (2x8) Efficiency | (2x8) Potential Speed-Up (%) | (2x16) Efficiency | (2x16) Potential Speed-Up (%) | (2x32) Efficiency | (2x32) Potential Speed-Up (%) | (2x64) Efficiency | (2x64) Potential Speed-Up (%) | (2x96) Efficiency | (2x96) Potential Speed-Up (%) |
---|
246 | exec - advec_mom_kernel.f90:81-177 [...] | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 16.14 | 15.24 | 13.13 | 14.49 | 10.93 | 9.07 | 4.87 | 4.36 | 147.34 | 72.37 | 56.08 | 28.1 | 14.06 | 7.03 | 3.53 | 2.57 | 147.33 | 72.35 | 40.04 | 22.23 | 11.32 | 6.1 | 2.86 | 2.22 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 18.25 | 37.17 | 67.16 | 120.97 | 237.56 | 440.77 | 940.20 | 1210.14 | 81.88 | 22.24 | 1.06 | 1.36 | 3.38 | 1 | 1 | 1.42 | 1.27 | 1.26 | 1.16 | 1.24 | 1.16 | NA | NA | NA | NA | NA | 1 | 0 | 1.02 | 0 | 0.92 | 1.05 | 0.83 | 2.49 | 0.81 | 2.04 | 0.75 | 2.22 | 0.8 | 0.95 | 0.69 | 1.35 |
199 | exec - advec_cell_kernel.f90:112-157 [...] | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 12.5 | 11.95 | 10.18 | 11.15 | 8.41 | 6.93 | 3.72 | 3.44 | 114.18 | 56.93 | 42.58 | 21.39 | 10.75 | 5.44 | 2.73 | 1.92 | 114.15 | 56.72 | 31.04 | 17.11 | 8.71 | 4.66 | 2.18 | 1.75 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 18.79 | 37.82 | 69.11 | 125.37 | 246.28 | 460.25 | 983.72 | 1225.28 | 80.86 | 22.34 | 1.06 | 1.4 | 4.02 | 1 | 1 | 1.39 | 1.26 | 1.25 | 1.17 | 1.26 | 1.1 | 2 | 4 | 1 | 0 | 2 | 1 | 0 | 1.01 | 0 | 0.92 | 0.82 | 0.83 | 1.85 | 0.82 | 1.52 | 0.77 | 1.63 | 0.82 | 0.68 | 0.68 | 1.1 |
1039 | exec - viscosity_kernel.f90:53-89 | viscosity_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 9.83 | 8.8 | 7.32 | 7.9 | 5.95 | 4.83 | 2.62 | 2.25 | 89.94 | 42.17 | 28.74 | 14.47 | 7.3 | 3.68 | 1.94 | 1.3 | 89.69 | 41.77 | 22.3 | 12.12 | 6.16 | 3.25 | 1.54 | 1.14 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.10 | 32.42 | 60.74 | 111.75 | 219.88 | 416.74 | 879.36 | 1188.10 | 92.14 | 23.88 | 1 | 2 | 2 | 1 | 1.01 | 1.3 | 1.2 | 1.2 | 1.14 | 1.27 | 1.14 | NA | NA | NA | NA | NA | 1 | 0 | 1.07 | 0 | 1.01 | 0 | 0.93 | 0.59 | 0.91 | 0.54 | 0.86 | 0.66 | 0.91 | 0.24 | 0.82 | 0.41 |
257 | exec - advec_mom_kernel.f90:215-241 [...] | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 8.62 | 8.3 | 6.55 | 6.53 | 4.87 | 3.79 | 3.68 | 4.28 | 78.74 | 39.48 | 20.12 | 10.14 | 5.16 | 3.27 | 2.65 | 2.38 | 78.67 | 39.4 | 19.97 | 10.02 | 5.05 | 2.55 | 2.16 | 2.18 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 13.07 | 26.09 | 51.47 | 102.59 | 203.56 | 403.18 | 475.49 | 470.78 | 39.33 | 17.42 | 1.49 | 2.87 | 5.32 | 1 | 1 | 1.02 | 1.02 | 1.04 | 1.29 | 1.24 | 1.1 | 1 | 0 | 0 | 7 | 0 | 1 | 0 | 1 | 0.01 | 0.98 | 0.1 | 0.98 | 0.12 | 0.97 | 0.13 | 0.96 | 0.14 | 0.57 | 1.59 | 0.38 | 2.67 |
318 | exec - calc_dt_kernel.f90:94-129 [...] | calc_dt_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 7.01 | 6.59 | 5.19 | 5.23 | 3.9 | 3.05 | 2.86 | 3.32 | 64.09 | 31.53 | 16.41 | 8.24 | 4.12 | 2.16 | 2.12 | 1.83 | 63.97 | 31.29 | 15.83 | 8.02 | 4.04 | 2.05 | 1.68 | 1.69 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 17.00 | 34.75 | 68.70 | 135.59 | 269.13 | 530.27 | 645.71 | 641.60 | 95.28 | 24.41 | 1 | 2 | 2 | 1 | 1.01 | 1.05 | 1.03 | 1.04 | 1.06 | 1.27 | 1.08 | 2 | 7 | 5 | 0 | 0 | 1 | 0 | 1.02 | 0 | 1.01 | 0 | 1 | 0.02 | 0.99 | 0.04 | 0.98 | 0.08 | 0.59 | 1.16 | 0.39 | 2.01 |
210 | exec - advec_cell_kernel.f90:202-248 [...] | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 6.11 | 5.89 | 4.68 | 4.66 | 3.49 | 2.85 | 2.99 | 3.41 | 56.07 | 28.36 | 14.69 | 7.52 | 3.97 | 2.16 | 2.05 | 1.8 | 55.8 | 27.99 | 14.26 | 7.15 | 3.62 | 1.92 | 1.75 | 1.74 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 14.03 | 27.97 | 54.91 | 109.51 | 216.29 | 407.78 | 447.00 | 450.11 | 51.49 | 18.87 | 1.54 | 3.16 | 7.69 | 1 | 1.01 | 1.04 | 1.06 | 1.11 | 1.14 | 1.18 | 1.04 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0.02 | 0.98 | 0.1 | 0.98 | 0.11 | 0.96 | 0.13 | 0.91 | 0.26 | 0.5 | 1.5 | 0.33 | 2.27 |
179 | exec - PdV_kernel.f90:111-135 [...] | pdv_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 4.7 | 4.58 | 3.87 | 3.86 | 3.79 | 4.85 | 5.48 | 6.41 | 43.08 | 22.29 | 12.48 | 6.38 | 5.04 | 3.67 | 3.76 | 3.4 | 42.89 | 21.76 | 11.78 | 5.92 | 3.92 | 3.26 | 3.22 | 3.26 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 14.10 | 27.79 | 51.33 | 102.13 | 154.20 | 185.30 | 187.43 | 185.33 | 98.31 | 24.79 | 1.11 | 1.18 | 2.27 | 1 | 1.03 | 1.07 | 1.09 | 1.3 | 1.13 | 1.18 | 1.05 | 1 | 3 | 9 | 1 | 0 | 1 | 0 | 0.99 | 0.07 | 0.91 | 0.35 | 0.91 | 0.36 | 0.68 | 1.2 | 0.41 | 2.86 | 0.21 | 4.34 | 0.14 | 5.53 |
383 | exec - ideal_gas_kernel.f90:49-55 | ideal_gas_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 4.49 | 4.32 | 3.41 | 3.39 | 3.15 | 4.11 | 4.55 | 5.32 | 41.03 | 20.49 | 10.29 | 5.19 | 4.17 | 3.15 | 3.07 | 2.76 | 41.01 | 20.5 | 10.39 | 5.19 | 3.26 | 2.76 | 2.67 | 2.71 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 7.45 | 14.90 | 29.40 | 58.86 | 93.67 | 110.60 | 114.28 | 112.72 | 100 | 25 | 1 | 2 | 2 | 1 | 1 | 1 | 1.01 | 1.3 | 1.15 | 1.16 | 1.02 | 0 | 4 | 0 | 0 | 0 | 1 | 0 | 1 | -0 | 0.99 | 0.05 | 0.99 | 0.04 | 0.79 | 0.67 | 0.46 | 2.2 | 0.24 | 3.46 | 0.16 | 4.48 |
188 | exec - accelerate_kernel.f90:62-76 | accelerate_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 3.48 | 3.27 | 2.7 | 2.71 | 2.87 | 3.98 | 4.42 | 5.12 | 31.88 | 16.02 | 8.6 | 4.33 | 3.97 | 3.01 | 3.06 | 2.73 | 31.8 | 15.54 | 8.24 | 4.16 | 2.98 | 2.67 | 2.6 | 2.61 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 20.30 | 41.55 | 78.36 | 155.19 | 216.53 | 241.41 | 248.01 | 247.16 | 98.41 | 24.8 | 1.05 | 1.36 | 2.39 | 1 | 1.03 | 1.06 | 1.05 | 1.35 | 1.13 | 1.19 | 1.05 | 1 | 3 | 9 | 1 | 0 | 1 | 0 | 1.02 | 0 | 0.96 | 0.1 | 0.96 | 0.12 | 0.67 | 0.96 | 0.37 | 2.5 | 0.19 | 3.58 | 0.13 | 4.47 |
176 | exec - PdV_kernel.f90:69-99 [...] | pdv_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 2.71 | 2.68 | 2.63 | 2.66 | 3.01 | 4.19 | 4.69 | 5.41 | 24.91 | 13.44 | 8.68 | 4.35 | 4.28 | 3.19 | 3.19 | 2.85 | 24.74 | 12.7 | 8.02 | 4.08 | 3.12 | 2.82 | 2.75 | 2.75 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 17.81 | 34.69 | 54.93 | 107.97 | 141.15 | 155.98 | 159.92 | 159.94 | 97.67 | 24.71 | 1 | 1.52 | 2 | 1.01 | 1.06 | 1.09 | 1.07 | 1.39 | 1.14 | 1.17 | 1.04 | 1 | 7 | 6 | 0 | 0 | 1 | 0 | 0.97 | 0.07 | 0.77 | 0.6 | 0.76 | 0.64 | 0.5 | 1.52 | 0.27 | 3.04 | 0.14 | 4.03 | 0.09 | 4.9 |
255 | exec - advec_mom_kernel.f90:247-248 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.99 | 2.07 | 2.1 | 2.13 | 2.33 | 3.18 | 3.62 | 4.15 | 18.24 | 10.68 | 6.98 | 3.5 | 3.26 | 2.43 | 2.51 | 2.21 | 18.13 | 9.82 | 6.41 | 3.27 | 2.41 | 2.14 | 2.12 | 2.11 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 11.30 | 20.87 | 31.98 | 62.67 | 85.05 | 95.57 | 96.20 | 96.75 | 100 | 25 | 1 | 2 | 2 | 1.01 | 1.09 | 1.1 | 1.08 | 1.37 | 1.15 | 1.19 | 1.05 | 0 | 5 | 0 | 0 | 0 | 1 | 0 | 0.92 | 0.16 | 0.71 | 0.62 | 0.69 | 0.65 | 0.47 | 1.23 | 0.26 | 2.34 | 0.13 | 3.14 | 0.09 | 3.78 |
241 | exec - advec_mom_kernel.f90:183-184 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.96 | 2.05 | 2.07 | 2.12 | 2.33 | 3.19 | 3.67 | 4.22 | 18.01 | 10.54 | 6.89 | 3.47 | 3.25 | 2.44 | 2.49 | 2.2 | 17.91 | 9.71 | 6.32 | 3.25 | 2.41 | 2.15 | 2.16 | 2.15 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 11.44 | 21.11 | 32.43 | 63.08 | 85.09 | 95.57 | 94.69 | 95.12 | 100 | 25 | 1 | 2 | 2 | 1.01 | 1.09 | 1.1 | 1.07 | 1.37 | 1.15 | 1.16 | 1.03 | 0 | 3 | 1 | 0 | 0 | 1 | 0 | 0.92 | 0.16 | 0.71 | 0.6 | 0.69 | 0.66 | 0.46 | 1.25 | 0.26 | 2.36 | 0.13 | 3.19 | 0.09 | 3.85 |
333 | exec - flux_calc_kernel.f90:56-60 | flux_calc_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.9 | 1.96 | 2.01 | 2.04 | 2.3 | 3.19 | 3.65 | 4.11 | 17.4 | 10.1 | 6.68 | 3.37 | 3.27 | 2.45 | 2.47 | 2.17 | 17.31 | 9.32 | 6.11 | 3.13 | 2.38 | 2.15 | 2.14 | 2.09 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 14.20 | 26.36 | 40.23 | 78.51 | 103.33 | 114.28 | 114.69 | 117.69 | 95.83 | 24.48 | 1 | 1.08 | 2.36 | 1.01 | 1.09 | 1.11 | 1.08 | 1.39 | 1.15 | 1.17 | 1.04 | 0 | 8 | 2 | 0 | 0 | 1 | 0 | 0.93 | 0.14 | 0.71 | 0.59 | 0.69 | 0.63 | 0.45 | 1.25 | 0.25 | 2.39 | 0.13 | 3.19 | 0.09 | 3.76 |
249 | exec - advec_mom_kernel.f90:138-144 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.72 | 1.69 | 1.54 | 1.55 | 1.65 | 2.27 | 2.58 | 3.02 | 15.73 | 8.22 | 4.97 | 2.56 | 2.31 | 1.73 | 1.74 | 1.59 | 15.72 | 8.02 | 4.7 | 2.37 | 1.7 | 1.52 | 1.52 | 1.54 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.01 | 29.41 | 50.19 | 99.53 | 138.71 | 155.15 | 155.19 | 153.35 | 100 | 25 | 1.08 | 1.06 | 2.24 | 1 | 1.03 | 1.07 | 1.08 | 1.38 | 1.15 | 1.16 | 1.04 | 3 | 0 | 4 | 1 | 0 | 1 | 0 | 0.98 | 0.03 | 0.84 | 0.25 | 0.83 | 0.26 | 0.58 | 0.7 | 0.32 | 1.54 | 0.16 | 2.16 | 0.11 | 2.7 |
260 | exec - advec_mom_kernel.f90:203-208 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.71 | 1.68 | 1.55 | 1.54 | 1.65 | 2.27 | 2.56 | 2.97 | 15.57 | 8.18 | 5.01 | 2.54 | 2.33 | 1.73 | 1.75 | 1.57 | 15.56 | 7.96 | 4.71 | 2.36 | 1.7 | 1.53 | 1.5 | 1.51 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.16 | 29.63 | 50.08 | 99.94 | 138.71 | 153.96 | 157.09 | 156.08 | 100 | 25 | 1.12 | 1.08 | 2.27 | 1 | 1.03 | 1.08 | 1.09 | 1.39 | 1.15 | 1.17 | 1.04 | 2 | 0 | 4 | 3 | 0 | 1 | 0 | 0.98 | 0.04 | 0.83 | 0.27 | 0.82 | 0.27 | 0.57 | 0.71 | 0.32 | 1.55 | 0.16 | 2.15 | 0.11 | 2.65 |
194 | exec - advec_cell_kernel.f90:164-170 | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.56 | 1.62 | 1.64 | 1.66 | 1.85 | 2.58 | 2.94 | 3.37 | 14.26 | 8.26 | 5.45 | 2.75 | 2.62 | 1.96 | 2.01 | 1.75 | 14.24 | 7.67 | 4.99 | 2.54 | 1.92 | 1.74 | 1.73 | 1.72 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.12 | 28.08 | 43.15 | 84.76 | 112.11 | 123.61 | 124.42 | 125.02 | 100 | 25 | 1 | 2 | 2 | 1 | 1.08 | 1.11 | 1.09 | 1.39 | 1.13 | 1.18 | 1.02 | 1 | 2 | 3 | 1 | 0 | 1 | 0 | 0.93 | 0.12 | 0.71 | 0.47 | 0.7 | 0.5 | 0.46 | 0.99 | 0.26 | 1.92 | 0.13 | 2.56 | 0.09 | 3.08 |
208 | exec - advec_cell_kernel.f90:255-261 | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.53 | 1.58 | 1.62 | 1.65 | 1.85 | 2.58 | 2.91 | 3.35 | 13.99 | 8.09 | 5.39 | 2.73 | 2.63 | 2.01 | 1.98 | 1.75 | 13.95 | 7.49 | 4.93 | 2.53 | 1.91 | 1.73 | 1.71 | 1.71 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.43 | 28.75 | 43.66 | 85.10 | 112.67 | 124.26 | 125.85 | 125.54 | 100 | 25 | 1 | 2 | 2 | 1 | 1.08 | 1.11 | 1.08 | 1.39 | 1.17 | 1.16 | 1.03 | 1 | 9 | 0 | 0 | 0 | 1 | 0 | 0.93 | 0.11 | 0.71 | 0.47 | 0.69 | 0.51 | 0.46 | 1.01 | 0.25 | 1.93 | 0.13 | 2.54 | 0.08 | 3.07 |
275 | exec - advec_mom_kernel.f90:85-87 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.23 | 1.33 | 1.41 | 1.46 | 1.63 | 2.29 | 2.57 | 2.96 | 11.28 | 7.1 | 4.79 | 2.42 | 2.3 | 1.74 | 1.72 | 1.57 | 11.22 | 6.33 | 4.31 | 2.23 | 1.69 | 1.54 | 1.51 | 1.5 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 11.08 | 19.65 | 28.87 | 55.79 | 73.61 | 81.16 | 82.67 | 83.02 | 100 | 25 | 1.06 | 1.05 | 2.34 | 1.01 | 1.12 | 1.12 | 1.09 | 1.38 | 1.14 | 1.15 | 1.05 | 3 | 3 | 0 | 2 | 0 | 1 | 0 | 0.89 | 0.15 | 0.65 | 0.49 | 0.63 | 0.54 | 0.41 | 0.95 | 0.23 | 1.77 | 0.12 | 2.27 | 0.08 | 2.73 |
272 | exec - advec_mom_kernel.f90:95-97 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.2 | 1.31 | 1.38 | 1.42 | 1.6 | 2.23 | 2.51 | 2.91 | 11.03 | 6.93 | 4.67 | 2.35 | 2.26 | 1.7 | 1.7 | 1.52 | 10.97 | 6.2 | 4.22 | 2.18 | 1.66 | 1.5 | 1.48 | 1.48 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 11.08 | 19.59 | 28.80 | 55.75 | 73.23 | 81.12 | 82.38 | 82.17 | 100 | 25 | 1.06 | 1.05 | 2.34 | 1.01 | 1.12 | 1.12 | 1.08 | 1.39 | 1.14 | 1.16 | 1.03 | 3 | 2 | 1 | 1 | 0 | 1 | 0 | 0.88 | 0.15 | 0.65 | 0.48 | 0.63 | 0.53 | 0.41 | 0.94 | 0.23 | 1.72 | 0.12 | 2.22 | 0.08 | 2.69 |
266 | exec - advec_mom_kernel.f90:115-117 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.04 | 1.14 | 1.21 | 1.24 | 1.41 | 1.96 | 2.18 | 2.53 | 9.55 | 6.11 | 4.1 | 2.06 | 2 | 1.48 | 1.5 | 1.32 | 9.52 | 5.41 | 3.67 | 1.9 | 1.46 | 1.32 | 1.28 | 1.29 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 8.71 | 15.34 | 22.61 | 43.66 | 56.86 | 62.58 | 64.45 | 64.29 | 100 | 25 | 1.08 | 1.06 | 2.48 | 1 | 1.13 | 1.13 | 1.09 | 1.39 | 1.13 | 1.18 | 1.03 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 0.88 | 0.14 | 0.65 | 0.43 | 0.63 | 0.46 | 0.41 | 0.84 | 0.23 | 1.52 | 0.12 | 1.93 | 0.08 | 2.34 |
269 | exec - advec_mom_kernel.f90:105-107 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.03 | 1.12 | 1.19 | 1.22 | 1.38 | 1.92 | 2.12 | 2.51 | 9.41 | 5.99 | 4 | 2.01 | 1.97 | 1.48 | 1.45 | 1.29 | 9.38 | 5.33 | 3.62 | 1.87 | 1.43 | 1.29 | 1.24 | 1.27 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 8.63 | 15.19 | 22.37 | 43.31 | 56.65 | 63.02 | 65.60 | 64.17 | 100 | 25 | 1.08 | 1.06 | 2.48 | 1 | 1.13 | 1.12 | 1.08 | 1.4 | 1.15 | 1.18 | 1.02 | 3 | 1 | 0 | 2 | 0 | 1 | 0 | 0.88 | 0.13 | 0.65 | 0.42 | 0.63 | 0.46 | 0.41 | 0.81 | 0.23 | 1.48 | 0.12 | 1.87 | 0.08 | 2.32 |
472 | exec - reset_field_kernel.f90:61-63 | reset_field_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 1.01 | 1.11 | 1.19 | 1.22 | 1.38 | 1.94 | 2.22 | 2.56 | 9.25 | 5.95 | 4.06 | 2.03 | 1.95 | 1.5 | 1.54 | 1.35 | 9.23 | 5.25 | 3.62 | 1.87 | 1.43 | 1.31 | 1.31 | 1.3 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 4.44 | 7.80 | 11.32 | 21.93 | 28.69 | 31.29 | 31.40 | 31.57 | 100 | 25 | 1 | 1 | 2.67 | 1 | 1.14 | 1.14 | 1.09 | 1.38 | 1.15 | 1.19 | 1.04 | 0 | 4 | 0 | 0 | 0 | 1 | 0 | 0.88 | 0.13 | 0.64 | 0.43 | 0.62 | 0.47 | 0.4 | 0.82 | 0.22 | 1.51 | 0.11 | 1.98 | 0.07 | 2.37 |
475 | exec - reset_field_kernel.f90:51-53 | reset_field_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.99 | 1.1 | 1.17 | 1.21 | 1.38 | 1.91 | 2.21 | 2.5 | 9.08 | 5.87 | 4.01 | 2.01 | 1.95 | 1.46 | 1.47 | 1.3 | 9.07 | 5.2 | 3.58 | 1.85 | 1.43 | 1.28 | 1.3 | 1.27 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 4.52 | 7.89 | 11.45 | 22.17 | 28.68 | 32.01 | 31.51 | 32.07 | 100 | 25 | 1 | 1 | 2.67 | 1 | 1.13 | 1.13 | 1.09 | 1.38 | 1.15 | 1.14 | 1.02 | 0 | 4 | 0 | 0 | 0 | 1 | 0 | 0.87 | 0.14 | 0.63 | 0.43 | 0.61 | 0.47 | 0.4 | 0.83 | 0.22 | 1.49 | 0.11 | 1.97 | 0.07 | 2.31 |
490 | exec - revert_kernel.f90:46-48 | revert_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.98 | 1.08 | 1.18 | 1.21 | 1.36 | 1.91 | 2.24 | 2.56 | 8.95 | 5.83 | 4.06 | 1.98 | 1.94 | 1.47 | 1.54 | 1.34 | 8.94 | 5.14 | 3.59 | 1.85 | 1.41 | 1.29 | 1.32 | 1.3 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 4.58 | 7.97 | 11.42 | 22.14 | 29.07 | 31.75 | 30.95 | 31.55 | 100 | 25 | 1 | 1 | 2.67 | 1 | 1.14 | 1.14 | 1.08 | 1.4 | 1.15 | 1.18 | 1.03 | 0 | 4 | 0 | 0 | 0 | 1 | 0 | 0.87 | 0.14 | 0.62 | 0.45 | 0.6 | 0.48 | 0.4 | 0.82 | 0.22 | 1.5 | 0.11 | 2 | 0.07 | 2.38 |
205 | exec - advec_cell_kernel.f90:89-91 | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.69 | 0.72 | 0.73 | 0.74 | 0.82 | 1.15 | 1.29 | 1.48 | 6.32 | 3.73 | 2.41 | 1.2 | 1.16 | 0.87 | 0.88 | 0.79 | 6.32 | 3.42 | 2.23 | 1.13 | 0.85 | 0.77 | 0.76 | 0.75 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 13.10 | 24.24 | 37.18 | 73.34 | 97.64 | 107.64 | 109.36 | 110.56 | 100 | 25 | 1.13 | 1.09 | 2.34 | 1 | 1.09 | 1.1 | 1.07 | 1.38 | 1.13 | 1.17 | 1.05 | 3 | 3 | 0 | 1 | 0 | 1 | 0 | 0.92 | 0.05 | 0.71 | 0.21 | 0.7 | 0.22 | 0.46 | 0.44 | 0.26 | 0.86 | 0.13 | 1.12 | 0.09 | 1.35 |
252 | exec - advec_mom_kernel.f90:128-131 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.69 | 0.71 | 0.69 | 0.69 | 0.72 | 0.98 | 1.06 | 1.23 | 6.28 | 3.64 | 2.27 | 1.17 | 0.99 | 0.75 | 0.74 | 0.65 | 6.26 | 3.39 | 2.11 | 1.06 | 0.75 | 0.66 | 0.62 | 0.62 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 14.70 | 27.15 | 43.62 | 86.83 | 122.85 | 139.05 | 147.88 | 148.48 | 100 | 25 | 1 | 1.1 | 2.44 | 1 | 1.07 | 1.09 | 1.11 | 1.34 | 1.15 | 1.19 | 1.05 | 0 | 1 | 2 | 0 | 0 | 1 | 0 | 0.92 | 0.05 | 0.74 | 0.18 | 0.74 | 0.18 | 0.52 | 0.34 | 0.3 | 0.69 | 0.16 | 0.89 | 0.11 | 1.1 |
216 | exec - advec_cell_kernel.f90:181-183 | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.67 | 0.7 | 0.71 | 0.72 | 0.8 | 1.13 | 1.27 | 1.41 | 6.18 | 3.66 | 2.37 | 1.2 | 1.14 | 0.85 | 0.86 | 0.75 | 6.15 | 3.34 | 2.18 | 1.11 | 0.83 | 0.76 | 0.75 | 0.72 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 13.18 | 24.27 | 37.19 | 73.01 | 97.65 | 106.45 | 108.24 | 112.12 | 100 | 25 | 1.13 | 1.09 | 2.34 | 1 | 1.1 | 1.1 | 1.09 | 1.39 | 1.13 | 1.16 | 1.04 | 3 | 1 | 3 | 0 | 0 | 1 | 0 | 0.92 | 0.06 | 0.71 | 0.21 | 0.69 | 0.22 | 0.46 | 0.43 | 0.25 | 0.84 | 0.13 | 1.11 | 0.09 | 1.28 |
263 | exec - advec_mom_kernel.f90:193-196 | advec_mom_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.66 | 0.7 | 0.67 | 0.68 | 0.71 | 0.96 | 1.11 | 1.28 | 6.08 | 3.56 | 2.27 | 1.14 | 0.98 | 0.73 | 0.78 | 0.69 | 6.07 | 3.3 | 2.05 | 1.05 | 0.73 | 0.64 | 0.65 | 0.65 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.15 | 27.87 | 44.88 | 87.63 | 126.30 | 144.46 | 141.83 | 142.41 | 100 | 25 | 1 | 1.1 | 2.44 | 1 | 1.08 | 1.12 | 1.1 | 1.36 | 1.14 | 1.2 | 1.06 | 0 | 1 | 2 | 0 | 0 | 1 | 0 | 0.92 | 0.06 | 0.74 | 0.17 | 0.72 | 0.19 | 0.52 | 0.34 | 0.3 | 0.68 | 0.15 | 0.95 | 0.1 | 1.16 |
213 | exec - advec_cell_kernel.f90:191-193 | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.51 | 0.57 | 0.6 | 0.62 | 0.68 | 0.97 | 1.12 | 1.27 | 4.69 | 3.04 | 2.05 | 1.04 | 0.97 | 0.74 | 0.76 | 0.66 | 4.68 | 2.69 | 1.83 | 0.95 | 0.71 | 0.65 | 0.66 | 0.65 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 8.86 | 15.40 | 22.66 | 43.64 | 58.32 | 63.72 | 62.82 | 64.06 | 100 | 25 | 1.06 | 1.05 | 2.46 | 1 | 1.13 | 1.13 | 1.09 | 1.39 | 1.14 | 1.17 | 1.03 | 2 | 3 | 0 | 1 | 0 | 1 | 0 | 0.87 | 0.07 | 0.64 | 0.22 | 0.62 | 0.24 | 0.41 | 0.4 | 0.22 | 0.75 | 0.11 | 1 | 0.07 | 1.17 |
202 | exec - advec_cell_kernel.f90:99-101 | advec_cell_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.5 | 0.55 | 0.58 | 0.6 | 0.67 | 0.96 | 1.06 | 1.23 | 4.53 | 2.93 | 1.99 | 1.01 | 0.96 | 0.73 | 0.73 | 0.65 | 4.52 | 2.59 | 1.78 | 0.92 | 0.7 | 0.64 | 0.62 | 0.62 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 8.97 | 15.66 | 22.77 | 44.08 | 57.87 | 63.40 | 65.62 | 65.52 | 100 | 25 | 1.06 | 1.05 | 2.46 | 1 | 1.13 | 1.13 | 1.1 | 1.39 | 1.14 | 1.18 | 1.05 | 2 | 1 | 1 | 1 | 0 | 1 | 0 | 0.87 | 0.07 | 0.63 | 0.21 | 0.61 | 0.23 | 0.4 | 0.4 | 0.22 | 0.75 | 0.11 | 0.94 | 0.08 | 1.14 |
326 | exec - field_summary_kernel.f90:58-71 | field_summary_kernel_.DIR.OMP.PARALLEL.2 | Innermost | 0.32 | 0.3 | 0.25 | 0.25 | 0.2 | 0.23 | 0.26 | 0.29 | 2.94 | 1.45 | 0.83 | 0.42 | 0.23 | 0.18 | 0.18 | 0.16 | 2.93 | 1.43 | 0.75 | 0.38 | 0.2 | 0.15 | 0.16 | 0.15 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 20.92 | 42.87 | 81.74 | 161.32 | 306.43 | 408.71 | 384.98 | 406.21 | 98.08 | 24.76 | 1.48 | 1.88 | 3.81 | 1 | 1.01 | 1.12 | 1.11 | 1.15 | 1.2 | 1.2 | 1.07 | 2 | 0 | 0 | 2 | 0 | 1 | 0 | 1.02 | 0 | 0.98 | 0.01 | 0.96 | 0.01 | 0.92 | 0.02 | 0.61 | 0.09 | 0.29 | 0.19 | 0.2 | 0.23 |
342 | exec - generate_chunk_kernel.f90:87-163 [...] | generate_chunk_kernel_.DIR.OMP.PARALLEL.2 | InBetween | 0.04 | 0.04 | 0.03 | 0.03 | 0.03 | 0.02 | 0.01 | 0.01 | 0.46 | 0.27 | 0.18 | 0.09 | 0.06 | 0.03 | 0.02 | 0.01 | 0.37 | 0.19 | 0.1 | 0.05 | 0.03 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 6.05 | 11.80 | 21.96 | 43.83 | 72.96 | 209.88 | 195.75 | 201.00 | 5.88 | 12.59 | 2.38 | 1.99 | 4.25 | 1.24 | 1.42 | 1.8 | 1.8 | 2 | 3 | 2 | 1 | NA | NA | NA | NA | NA | 1 | 0 | 0.97 | 0 | 0.92 | 0 | 0.92 | 0 | 0.77 | 0.01 | 1.16 | -0 | 0.58 | 0 | 0.39 | 0.01 |
439 | exec - pack_kernel.f90:158-160 | clover_pack_message_right_.DIR.OMP.PARALLEL.LOOP.2.split100 | Innermost | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.3 | 0.19 | 0.08 | 0.05 | 0.04 | 0.03 | 0.03 | 0.03 | 0.15 | 0.09 | 0.04 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 1 | 2 | 4 | 8 | 16 | 32 | 60 | 89 | 0.50 | 1.01 | 2.09 | 4.69 | 5.75 | 14.13 | 10.88 | 0.00 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1.12 | 1 | 1.25 | 2 | 1.5 | 3 | 3 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0.83 | 0 | 0.94 | 0 | 0.94 | 0 | 0.94 | 0 | 0.47 | 0.01 | 0.23 | 0.01 | 1 | 0 |
433 | exec - pack_kernel.f90:64-66 | clover_pack_message_left_.DIR.OMP.PARALLEL.LOOP.2.split104 | Innermost | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.27 | 0.15 | 0.06 | 0.04 | 0.05 | 0.03 | 0.02 | 0.02 | 0.14 | 0.07 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0 | 1 | 2 | 4 | 8 | 16 | 31 | 58 | 86 | 0.42 | 0.93 | 3.25 | 9.38 | 4.00 | 7.00 | 0.00 | 0.00 | 0 | 12.5 | 1.25 | 1 | 8 | 1 | 1.07 | 1.2 | 1.33 | 1.67 | 3 | 2 | 2 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 1.75 | 0 | 1.75 | 0 | 0.88 | 0 | 0.44 | 0.01 | 1 | 0 | 1 | 0 |
186 | exec - accelerate_kernel.f90:60-76 | accelerate_kernel_.DIR.OMP.PARALLEL.2 | Outermost | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.04 | 0.04 | 0.06 | 0.03 | 0.04 | 0.02 | 0.03 | 0.04 | 0.05 | 0.05 | 0.06 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 12.65 | 25.08 | 38.06 | 86.63 | 118.00 | 191.13 | 92.63 | 72.81 | 0 | 11.21 | 1 | 1 | 4.95 | 1 | 1 | 2 | 2 | 3 | 4 | 2.5 | 2.5 | 1 | 0 | 0 | 14 | 0 | 1 | 0 | 1 | 0 | 0.75 | 0 | 0.75 | 0 | 0.38 | 0.01 | 0.19 | 0.02 | 0.05 | 0.04 | 0.03 | 0.04 |