Loops
▶advec_mom_kernel.f90: 81 - 57.2%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
184 | 2.35 | 2.21 | 6.79 | 91.61 | 41.89 | 343.56 | 278 | 2.98 | 2.84 | 8.94 | 94.34 | 80.64 | 111.7 | 184 | 2.03 | 2.02 | 5.79 | 91.86 | 42.13 | 335.02 | 269 | 1.19 | 1.14 | 3.67 | 97.82 | 77.38 | 288.32 | 184 | 3.32 | 3.09 | 4.02 | 91.86 | 42.13 | 219.01 | 181 | 3.02 | 2.88 | 3.83 | 94.77 | 81.02 | 114.53 | |||||||||||||||||||||
175 | 1.61 | 1.48 | 4.53 | 98.15 | 41.78 | 514.36 | 268 | 1.23 | 1.11 | 3.51 | 97.79 | 77.05 | 293.86 | 175 | 1.67 | 1.65 | 4.73 | 98.21 | 42.08 | 422.56 | 277 | 1.29 | 1.18 | 3.79 | 88.52 | 73.75 | 278.03 | 175 | 3.12 | 2.94 | 3.82 | 98.21 | 42.08 | 237.16 | 173 | 2.92 | 2.84 | 3.78 | 97.82 | 77.38 | 115.65 |
▶PdV_kernel.f90: 111 - 37.99%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
114 | 2.13 | 2.05 | 6.28 | 100 | 50 | 299.86 | 218 | 2.19 | 2.04 | 6.44 | 100 | 100 | 299.6 | 114 | 2.01 | 1.99 | 5.69 | 100 | 50 | 309.35 | 219 | 2.02 | 1.91 | 6.15 | 100 | 100 | 322.09 | 114 | 5.18 | 5.11 | 6.64 | 100 | 50 | 120.46 | 123 | 5.17 | 5.11 | 6.79 | 100 | 100 | 120.52 |
▶PdV_kernel.f90: 69 - 31.67%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
111 | 1.8 | 1.71 | 5.24 | 100 | 50 | 264.3 | 216 | 1.72 | 1.63 | 5.15 | 100 | 100 | 277.65 | 111 | 1.59 | 1.64 | 4.69 | 100 | 50 | 275.37 | 217 | 1.64 | 1.6 | 5.14 | 100 | 100 | 282.27 | 111 | 4.41 | 4.36 | 5.66 | 100 | 50 | 103.57 | 121 | 4.4 | 4.35 | 5.79 | 100 | 100 | 103.73 |
▶advec_cell_kernel.f90: 83 - 30.25%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
127 | 1.23 | 1.1 | 3.39 | 97.56 | 42.91 | 566.83 | 240 | 1.12 | 1.01 | 3.19 | 97.97 | 80.46 | 467.33 | 127 | 1.2 | 1.2 | 3.45 | 97.67 | 43.24 | 495.97 | 241 | 1.18 | 1.14 | 3.65 | 92.26 | 78.36 | 378.65 | 127 | 2.79 | 2.67 | 3.46 | 97.67 | 43.24 | 222.9 | 135 | 0.03 | 0.01 | 0.02 | 96.35 | 81.23 | 330.41 | |||||||||||||||||||||
230 | 0.02 | 0.01 | 0.02 | 96.22 | 80.55 | 392.81 | 231 | 0.02 | 0.01 | 0.02 | 96.35 | 81.23 | 399.21 | 145 | 2.77 | 2.64 | 3.51 | 98.01 | 83.06 | 163.43 | ||||||||||||||||||||||||||||||||||||||||||
232 | 0.97 | 0.91 | 2.88 | 98.63 | 86.42 | 516.02 | 233 | 1.06 | 1 | 3.2 | 98.06 | 85.36 | 427.05 | 137 | 2.66 | 2.61 | 3.46 | 97.88 | 86.19 | 163.91 |
▶accelerate_kernel.f90: 62 - 29.24%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
120 | 1.65 | 1.57 | 4.83 | 100 | 50 | 457.03 | 224 | 1.68 | 1.51 | 4.75 | 100 | 100 | 478.73 | 120 | 1.51 | 1.53 | 4.38 | 100 | 50 | 469.23 | 225 | 1.52 | 1.46 | 4.68 | 100 | 100 | 491.97 | 120 | 4.14 | 4.08 | 5.29 | 100 | 50 | 175.96 | 129 | 4.09 | 3.99 | 5.31 | 100 | 100 | 179.99 |
▶flux_calc_kernel.f90: 56 - 24.64%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
272 | 1.23 | 1.19 | 3.65 | 100 | 50 | 172.64 | 345 | 1.4 | 1.33 | 4.18 | 100 | 100 | 157.22 | 272 | 1.23 | 1.25 | 3.58 | 100 | 50 | 164.15 | 344 | 1.51 | 1.42 | 4.56 | 100 | 100 | 144.68 | 272 | 3.38 | 3.32 | 4.31 | 100 | 50 | 61.8 | 248 | 3.39 | 3.28 | 4.36 | 100 | 100 | 62.39 |
▶ideal_gas_kernel.f90: 50 - 23.96%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
390 | 1.42 | 1.33 | 4.2 | 100 | 100 | 259.78 | 330 | 1.39 | 1.4 | 4.02 | 100 | 50 | 248.86 | 389 | 1.49 | 1.39 | 4.48 | 100 | 100 | 250.71 | 330 | 4.35 | 4.29 | 5.57 | 100 | 50 | 81.21 | 293 | 4.43 | 4.28 | 5.69 | 100 | 100 | 81.55 |
▶advec_mom_kernel.f90: 183 - 22.68%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
172 | 1.2 | 1.13 | 3.48 | 100 | 50 | 145.23 | 266 | 1.19 | 1.1 | 3.48 | 100 | 100 | 151.88 | 172 | 1.19 | 1.18 | 3.38 | 100 | 50 | 139.47 | 267 | 1.16 | 1.11 | 3.58 | 100 | 100 | 147.03 | 172 | 3.37 | 3.34 | 4.33 | 100 | 50 | 49.24 | 171 | 3.39 | 3.33 | 4.43 | 100 | 100 | 48.83 |
▶advec_mom_kernel.f90: 247 - 22.6%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
182 | 1.17 | 1.12 | 3.45 | 100 | 50 | 146.69 | 274 | 1.17 | 1.1 | 3.48 | 100 | 100 | 147.67 | 182 | 1.18 | 1.16 | 3.33 | 100 | 50 | 141.81 | 275 | 1.18 | 1.12 | 3.59 | 100 | 100 | 146.37 | 182 | 3.38 | 3.34 | 4.33 | 100 | 50 | 49.22 | 179 | 3.39 | 3.33 | 4.42 | 100 | 100 | 49.33 |
▶calc_dt_kernel.f90: 99 - 19.89%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
211 | 2.78 | 2.69 | 8.13 | 9.45 | 13.68 | 170.94 | 286 | 2.81 | 2.94 | 8.14 | 9.45 | 13.68 | 230.34 | 211 | 2.71 | 2.67 | 3.62 | 9.45 | 13.68 | 253.69 |
▶PdV_kernel.f90: 114 - 17.77%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
104 | 1.99 | 1.88 | 5.68 | 95 | 95.63 | 318.31 | 14 | 1.97 | 2.04 | 5.63 | 95 | 48.13 | 294.3 | 104 | 4.85 | 4.77 | 6.46 | 95 | 95.63 | 125.32 |
▶viscosity_kernel.f90: 53 - 17.13%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1028 | 1.49 | 1.39 | 4.28 | 95.96 | 48.23 | 680.34 | 920 | 0.76 | 0.66 | 2.09 | 99.36 | 98.25 | 1155.22 | 1028 | 1.49 | 1.52 | 4.35 | 95.54 | 47.88 | 783.2 | 919 | 0.75 | 0.69 | 2.23 | 99.38 | 98.29 | 1221.87 | 1028 | 1.83 | 1.67 | 2.17 | 95.54 | 47.88 | 712.85 | 829 | 1.63 | 1.51 | 2.01 | 95.69 | 90.18 | 557.26 |
▶advec_mom_kernel.f90: 138 - 17.07%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
177 | 0.87 | 0.84 | 2.57 | 100 | 50 | 244.65 | 270 | 0.99 | 0.91 | 2.88 | 100 | 100 | 225 | 177 | 0.88 | 0.9 | 2.57 | 100 | 50 | 227.72 | 271 | 0.98 | 0.94 | 3.03 | 100 | 100 | 218.43 | 177 | 2.39 | 2.32 | 3.01 | 100 | 50 | 88.35 | 175 | 2.34 | 2.27 | 3.01 | 100 | 100 | 90.7 |
▶advec_mom_kernel.f90: 203 - 17.06%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
186 | 0.87 | 0.83 | 2.56 | 100 | 50 | 246.67 | 280 | 1.01 | 0.94 | 2.96 | 100 | 100 | 220.36 | 186 | 0.87 | 0.89 | 2.54 | 100 | 50 | 230.54 | 279 | 0.96 | 0.92 | 2.97 | 100 | 100 | 223.17 | 186 | 2.38 | 2.32 | 3.01 | 100 | 50 | 88.46 | 183 | 2.36 | 2.27 | 3.02 | 100 | 100 | 90.27 |
▶ideal_gas_kernel.f90: 49 - 16.7%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
330 | 1.39 | 1.32 | 4.04 | 100 | 50 | 265.21 | 256 | 1.35 | 1.28 | 3.87 | 100 | 100 | 272.48 | 302 | 1.38 | 1.42 | 3.93 | 100 | 50 | 245.66 | 256 | 3.66 | 3.6 | 4.86 | 100 | 100 | 96.88 |
▶calc_dt_kernel.f90: 92 - 15.89%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
253 | 1.56 | 1.52 | 4.68 | 94.36 | 47.88 | 481.92 | 253 | 1.58 | 1.61 | 4.61 | 95.32 | 48.25 | 331.39 | 253 | 2.67 | 2.52 | 3.26 | 95.32 | 48.25 | 211.71 | 227 | 2.67 | 2.51 | 3.34 | 95.25 | 95.84 | 212.68 |
▶viscosity_kernel.f90: 56 - 15.59%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
780 | 2.16 | 2.12 | 6.4 | 95.06 | 95.68 | 543.42 | 733 | 2.16 | 2.31 | 6.38 | 95.06 | 48.15 | 499.57 | 780 | 2.13 | 2.08 | 2.81 | 95.06 | 95.68 | 553.79 |
▶advec_cell_kernel.f90: 256 - 15.47%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
238 | 0.94 | 0.9 | 2.83 | 100 | 100 | 226.71 | 134 | 0.93 | 0.95 | 2.71 | 100 | 50 | 215.81 | 239 | 0.96 | 0.91 | 2.91 | 100 | 100 | 226.05 | 134 | 2.71 | 2.68 | 3.47 | 100 | 50 | 76.51 | 143 | 2.75 | 2.67 | 3.55 | 100 | 100 | 77.03 |
▶advec_cell_kernel.f90: 165 - 15.19%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
228 | 0.9 | 0.85 | 2.68 | 100 | 100 | 240.39 | 125 | 0.91 | 0.93 | 2.67 | 100 | 50 | 220.09 | 229 | 0.9 | 0.88 | 2.82 | 100 | 100 | 232.87 | 125 | 2.71 | 2.68 | 3.47 | 100 | 50 | 76.37 | 133 | 2.71 | 2.67 | 3.55 | 100 | 100 | 76.71 |
▶advec_cell_kernel.f90: 204 - 14.71%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
136 | 1.55 | 1.49 | 4.59 | 99.12 | 45.94 | 454.45 | 136 | 2.24 | 2.32 | 6.64 | 98.35 | 45.25 | 256.55 | 136 | 2.91 | 2.68 | 3.48 | 98.35 | 45.25 | 222.08 |
▶advec_mom_kernel.f90: 86 - 14.56%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
290 | 0.98 | 0.93 | 2.92 | 100 | 100 | 88.68 | 197 | 0.88 | 0.89 | 2.56 | 100 | 50 | 93.09 | 289 | 0.95 | 0.93 | 2.98 | 100 | 100 | 89.68 | 197 | 2.41 | 2.36 | 3.06 | 100 | 50 | 35.09 | 193 | 2.34 | 2.29 | 3.04 | 100 | 100 | 36.66 |
▶accelerate_kernel.f90: 67 - 14.37%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
118 | 1.65 | 1.48 | 4.47 | 100 | 100 | 512.94 | 20 | 1.58 | 1.58 | 4.37 | 100 | 50 | 480.63 | 118 | 4.23 | 4.09 | 5.53 | 100 | 100 | 185.63 |
▶advec_mom_kernel.f90: 96 - 14.05%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
284 | 0.92 | 0.88 | 2.77 | 100 | 100 | 92.34 | 195 | 0.83 | 0.85 | 2.43 | 100 | 50 | 95.58 | 287 | 0.93 | 0.9 | 2.89 | 100 | 100 | 90.04 | 195 | 2.34 | 2.3 | 2.99 | 100 | 50 | 35.33 | 191 | 2.27 | 2.23 | 2.97 | 100 | 100 | 36.14 |
▶advec_cell_kernel.f90: 202 - 12.57%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
132 | 1.73 | 1.49 | 4.49 | 14.04 | 14.14 | 162.23 | 97 | 1.69 | 1.62 | 4.48 | 14.04 | 14.14 | 165.4 | 132 | 2.82 | 2.66 | 3.6 | 14.04 | 14.14 | 100.71 |
▶advec_mom_kernel.f90: 214 - 12.2%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
166 | 1.54 | 1.39 | 4.2 | 14.63 | 14.33 | 118.07 | 123 | 1.57 | 1.54 | 4.27 | 14.63 | 14.33 | 107.16 | 166 | 2.86 | 2.76 | 3.73 | 14.63 | 14.33 | 59.74 |
▶flux_calc_kernel.f90: 58 - 12.08%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
223 | 1.44 | 1.36 | 4.11 | 80 | 82.5 | 154.18 | 173 | 1.23 | 1.25 | 3.46 | 80 | 42.5 | 171.93 | 223 | 3.42 | 3.33 | 4.51 | 80 | 82.5 | 62.98 |
▶advec_cell_kernel.f90: 255 - 11.78%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
134 | 0.93 | 0.91 | 2.78 | 100 | 50 | 224.89 | 131 | 0.94 | 0.89 | 2.68 | 100 | 100 | 230.38 | 96 | 0.94 | 0.98 | 2.7 | 100 | 50 | 209.29 | 131 | 2.73 | 2.68 | 3.62 | 100 | 100 | 76.51 |
▶advec_mom_kernel.f90: 116 - 11.62%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
286 | 0.75 | 0.67 | 2.11 | 100 | 100 | 61.37 | 190 | 0.75 | 0.72 | 2.05 | 100 | 50 | 57.48 | 283 | 0.74 | 0.7 | 2.24 | 100 | 100 | 58.08 | 190 | 2.07 | 2.02 | 2.63 | 100 | 50 | 20.49 | 187 | 2.08 | 1.95 | 2.59 | 100 | 100 | 21.08 |
▶advec_cell_kernel.f90: 164 - 11.56%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
125 | 0.94 | 0.88 | 2.71 | 100 | 50 | 233.16 | 123 | 0.93 | 0.86 | 2.61 | 100 | 100 | 238.74 | 88 | 0.94 | 0.95 | 2.62 | 100 | 50 | 215.54 | 123 | 2.72 | 2.67 | 3.62 | 100 | 100 | 76.9 |
▶reset_field_kernel.f90: 52 - 11.5%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
473 | 0.69 | 0.63 | 1.98 | 100 | 100 | 0 | 449 | 0.75 | 0.73 | 2.08 | 100 | 50 | 0 | 472 | 0.71 | 0.66 | 2.13 | 100 | 100 | 0 | 449 | 2.05 | 2.02 | 2.62 | 100 | 50 | 0 | 382 | 2.14 | 2.02 | 2.69 | 100 | 100 | 0 |
▶revert_kernel.f90: 47 - 11.39%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
483 | 0.69 | 0.63 | 1.97 | 100 | 100 | 0 | 464 | 0.72 | 0.71 | 2.03 | 100 | 50 | 0 | 482 | 0.7 | 0.65 | 2.1 | 100 | 100 | 0 | 464 | 2.04 | 2.02 | 2.61 | 100 | 50 | 0 | 392 | 2.11 | 2.02 | 2.68 | 100 | 100 | 0 |
▶advec_mom_kernel.f90: 106 - 11.34%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
288 | 0.72 | 0.64 | 2.02 | 100 | 100 | 62.55 | 193 | 0.72 | 0.7 | 2.01 | 100 | 50 | 57.95 | 285 | 0.73 | 0.69 | 2.22 | 100 | 100 | 59.34 | 193 | 2.02 | 1.97 | 2.56 | 100 | 50 | 20.59 | 189 | 1.99 | 1.9 | 2.53 | 100 | 100 | 21.31 |
▶reset_field_kernel.f90: 62 - 11.28%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
471 | 0.66 | 0.61 | 1.91 | 100 | 100 | 0 | 446 | 0.71 | 0.72 | 2.05 | 100 | 50 | 0 | 470 | 0.71 | 0.65 | 2.1 | 100 | 100 | 0 | 446 | 2.06 | 2.02 | 2.63 | 100 | 50 | 0 | 380 | 2.06 | 1.95 | 2.59 | 100 | 100 | 0 |
▶advec_mom_kernel.f90: 248 - 11.23%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
164 | 1.22 | 1.12 | 3.39 | 100 | 100 | 145.51 | 121 | 1.23 | 1.21 | 3.35 | 100 | 50 | 135.88 | 164 | 3.38 | 3.32 | 4.49 | 100 | 100 | 49.1 |
▶advec_mom_kernel.f90: 184 - 11.19%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
156 | 1.21 | 1.11 | 3.36 | 100 | 100 | 147.7 | 129 | 1.19 | 1.21 | 3.34 | 100 | 50 | 135.82 | 156 | 3.38 | 3.32 | 4.49 | 100 | 100 | 49.39 |
▶advec_cell_kernel.f90: 110 - 11.19%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
124 | 1.37 | 1.27 | 3.84 | 13.79 | 14.01 | 170.33 | 89 | 1.39 | 1.36 | 3.76 | 13.79 | 14.01 | 196.55 | 124 | 2.77 | 2.65 | 3.59 | 13.79 | 14.01 | 100.88 |
▶advec_mom_kernel.f90: 151 - 11.03%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
158 | 1.31 | 1.23 | 3.71 | 15.38 | 14.42 | 130.29 | 131 | 1.33 | 1.3 | 3.59 | 15.38 | 14.42 | 126.5 | 158 | 2.83 | 2.76 | 3.73 | 15.38 | 14.42 | 59.62 |
▶advec_mom_kernel.f90: 85 - 10.62%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
197 | 0.87 | 0.82 | 2.52 | 100 | 50 | 100.76 | 172 | 0.89 | 0.85 | 2.57 | 100 | 100 | 98.35 | 137 | 0.88 | 0.89 | 2.46 | 100 | 50 | 92.64 | 172 | 2.35 | 2.27 | 3.07 | 100 | 100 | 36.84 |
▶advec_mom_kernel.f90: 95 - 10.3%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
190 | 0.82 | 0.79 | 2.43 | 100 | 50 | 103.15 | 174 | 0.9 | 0.82 | 2.48 | 100 | 100 | 99.99 | 139 | 0.84 | 0.86 | 2.38 | 100 | 50 | 95.05 | 174 | 2.32 | 2.22 | 3.01 | 100 | 100 | 36.92 |
▶PdV_kernel.f90: 74 - 10.03%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
106 | 1.62 | 1.52 | 4.59 | 94.44 | 95.14 | 338.97 | 106 | 4.09 | 4.02 | 5.44 | 94.44 | 95.14 | 128.31 |
▶calc_dt_kernel.f90: 94 - 9.66%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
324 | 1.57 | 1.5 | 4.72 | 100 | 100 | 489.67 | 323 | 1.59 | 1.54 | 4.94 | 100 | 100 | 372.23 |
▶advec_mom_kernel.f90: 115 - 8.8%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
192 | 0.73 | 0.67 | 2.06 | 100 | 50 | 62.04 | 178 | 0.75 | 0.68 | 2.07 | 100 | 100 | 60.75 | 143 | 0.73 | 0.73 | 2.03 | 100 | 50 | 56.85 | 178 | 2 | 1.95 | 2.64 | 100 | 100 | 21.19 |
▶reset_field_kernel.f90: 61 - 8.64%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
446 | 0.71 | 0.66 | 2.04 | 100 | 50 | 0 | 354 | 0.7 | 0.63 | 1.91 | 100 | 100 | 0 | 265 | 0.75 | 0.74 | 2.05 | 100 | 50 | 0 | 354 | 2.02 | 1.95 | 2.64 | 100 | 100 | 0 |
▶advec_mom_kernel.f90: 105 - 8.62%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
195 | 0.68 | 0.65 | 2.01 | 100 | 50 | 62.13 | 176 | 0.73 | 0.67 | 2.03 | 100 | 100 | 60.31 | 141 | 0.73 | 0.73 | 2.01 | 100 | 50 | 55.32 | 176 | 1.99 | 1.9 | 2.57 | 100 | 100 | 21.24 |
▶advec_mom_kernel.f90: 143 - 8.33%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
160 | 0.95 | 0.89 | 2.68 | 100 | 100 | 229.78 | 133 | 0.92 | 0.94 | 2.6 | 100 | 50 | 218.26 | 160 | 2.35 | 2.26 | 3.05 | 100 | 100 | 90.43 |
▶advec_mom_kernel.f90: 207 - 8.3%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
168 | 0.95 | 0.88 | 2.64 | 100 | 100 | 233.06 | 125 | 0.92 | 0.93 | 2.59 | 100 | 50 | 220.28 | 168 | 2.33 | 2.27 | 3.07 | 100 | 100 | 90.3 |
▶reset_field_kernel.f90: 51 - 8%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
449 | 0.71 | 0.68 | 2.08 | 100 | 50 | 0 | 357 | 0.59 | 0.55 | 1.65 | 100 | 100 | 0 | 268 | 0.71 | 0.71 | 1.96 | 100 | 50 | 0 | 357 | 1.74 | 1.71 | 2.31 | 100 | 100 | 0 |
▶revert_kernel.f90: 46 - 7.69%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
464 | 0.7 | 0.66 | 2.02 | 100 | 50 | 0 | 368 | 0.55 | 0.5 | 1.52 | 100 | 100 | 0 | 279 | 0.68 | 0.67 | 1.85 | 100 | 50 | 0 | 368 | 1.73 | 1.7 | 2.3 | 100 | 100 | 0 |
▶advec_cell_kernel.f90: 90 - 7.41%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
236 | 0.51 | 0.48 | 1.51 | 100 | 100 | 128.83 | 131 | 0.44 | 0.45 | 1.28 | 100 | 50 | 138.55 | 237 | 0.51 | 0.48 | 1.56 | 100 | 100 | 129.55 | 131 | 1.21 | 1.18 | 1.53 | 100 | 50 | 52.83 | 141 | 1.21 | 1.15 | 1.53 | 100 | 100 | 54.16 |
▶advec_cell_kernel.f90: 182 - 7.17%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
244 | 0.5 | 0.45 | 1.43 | 100 | 100 | 132.47 | 141 | 0.43 | 0.43 | 1.24 | 100 | 50 | 141.46 | 245 | 0.51 | 0.47 | 1.52 | 100 | 100 | 129.01 | 141 | 1.17 | 1.15 | 1.49 | 100 | 50 | 52.87 | 149 | 1.19 | 1.12 | 1.49 | 100 | 100 | 54.28 |
▶advec_cell_kernel.f90: 192 - 5.79%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
242 | 0.38 | 0.34 | 1.06 | 100 | 100 | 64.38 | 139 | 0.38 | 0.36 | 1.04 | 100 | 50 | 58.16 | 243 | 0.38 | 0.35 | 1.11 | 100 | 100 | 59.82 | 139 | 1.03 | 1 | 1.3 | 100 | 50 | 20.94 | 147 | 0.99 | 0.96 | 1.28 | 100 | 100 | 21.64 |
▶advec_cell_kernel.f90: 100 - 5.61%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
234 | 0.35 | 0.31 | 0.99 | 100 | 100 | 64.98 | 129 | 0.36 | 0.35 | 1.01 | 100 | 50 | 57.21 | 235 | 0.38 | 0.34 | 1.08 | 100 | 100 | 60.5 | 129 | 1.01 | 0.98 | 1.27 | 100 | 50 | 20.45 | 139 | 1.02 | 0.94 | 1.26 | 100 | 100 | 21.38 |
▶advec_mom_kernel.f90: 193 - 5.34%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
188 | 0.3 | 0.25 | 0.78 | 100 | 50 | 330.11 | 282 | 0.3 | 0.26 | 0.8 | 100 | 100 | 313.41 | 188 | 0.35 | 0.32 | 0.91 | 100 | 50 | 256.35 | 281 | 0.36 | 0.3 | 0.97 | 100 | 100 | 272.82 | 188 | 0.76 | 0.72 | 0.93 | 100 | 50 | 113.92 | 185 | 0.78 | 0.72 | 0.95 | 100 | 100 | 114.35 |
▶advec_mom_kernel.f90: 128 - 5.33%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
179 | 0.29 | 0.25 | 0.78 | 100 | 50 | 326.9 | 272 | 0.29 | 0.25 | 0.78 | 100 | 100 | 327.53 | 179 | 0.35 | 0.32 | 0.92 | 100 | 50 | 256.44 | 273 | 0.34 | 0.3 | 0.97 | 100 | 100 | 270.78 | 179 | 0.77 | 0.71 | 0.93 | 100 | 50 | 115.54 | 177 | 0.77 | 0.72 | 0.95 | 100 | 100 | 114.24 |
▶advec_cell_kernel.f90: 89 - 5.32%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
131 | 0.44 | 0.41 | 1.27 | 100 | 50 | 151.96 | 127 | 0.45 | 0.43 | 1.29 | 100 | 100 | 143.2 | 92 | 0.44 | 0.45 | 1.23 | 100 | 50 | 138.44 | 127 | 1.2 | 1.13 | 1.53 | 100 | 100 | 54.53 |
▶advec_cell_kernel.f90: 181 - 5.16%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
141 | 0.42 | 0.4 | 1.23 | 100 | 50 | 151.48 | 135 | 0.44 | 0.41 | 1.24 | 100 | 100 | 148.72 | 100 | 0.42 | 0.43 | 1.19 | 100 | 50 | 140.66 | 135 | 1.15 | 1.11 | 1.5 | 100 | 100 | 54.94 |
▶PdV_kernel.f90: 78 - 4.63%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
16 | 1.62 | 1.67 | 4.63 | 94.44 | 47.92 | 310.26 |
▶advec_cell_kernel.f90: 191 - 4.43%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
139 | 0.38 | 0.33 | 1.03 | 100 | 50 | 62.98 | 137 | 0.4 | 0.35 | 1.05 | 100 | 100 | 59.24 | 102 | 0.38 | 0.38 | 1.04 | 100 | 50 | 54.23 | 137 | 1.02 | 0.97 | 1.31 | 100 | 100 | 21.37 |
▶advec_cell_kernel.f90: 99 - 4.3%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
129 | 0.36 | 0.33 | 1 | 100 | 50 | 61.62 | 129 | 0.39 | 0.33 | 1.01 | 100 | 100 | 60.75 | 94 | 0.38 | 0.37 | 1.01 | 100 | 50 | 55.24 | 129 | 0.99 | 0.95 | 1.28 | 100 | 100 | 21.06 |
▶advec_mom_kernel.f90: 131 - 2.56%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
162 | 0.29 | 0.24 | 0.72 | 100 | 100 | 342.98 | 135 | 0.34 | 0.31 | 0.87 | 100 | 50 | 265.27 | 162 | 0.77 | 0.72 | 0.97 | 100 | 100 | 114.35 |
▶advec_mom_kernel.f90: 196 - 2.55%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
170 | 0.31 | 0.24 | 0.71 | 100 | 100 | 340.04 | 127 | 0.34 | 0.31 | 0.87 | 100 | 50 | 263.66 | 170 | 0.76 | 0.71 | 0.97 | 100 | 100 | 114.85 |
▶field_summary_kernel.f90: 58 - 1.71%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
264 | 0.1 | 0.09 | 0.28 | 96.3 | 47.69 | 588 | 335 | 0.1 | 0.09 | 0.28 | 96.43 | 91.52 | 654.7 | 264 | 0.1 | 0.1 | 0.28 | 96.3 | 47.69 | 590.54 | 334 | 0.1 | 0.09 | 0.29 | 93.1 | 88.79 | 653.13 | 264 | 0.24 | 0.22 | 0.29 | 96.3 | 47.69 | 268.46 | 238 | 0.23 | 0.22 | 0.29 | 96.43 | 91.52 | 267.32 |
▶field_summary_kernel.f90: 62 - 0.87%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
219 | 0.1 | 0.09 | 0.28 | 100 | 100 | 649.68 | 324 | 0.11 | 0.1 | 0.29 | 100 | 50 | 588.22 | 219 | 0.24 | 0.22 | 0.3 | 100 | 100 | 265.59 |
▶pack_kernel.f90: 64 - 0.1%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
400 | 0.05 | 0.01 | 0.04 | 0 | 12.5 | 0 | 400 | 0.05 | 0.01 | 0.04 | 0 | 12.5 | 0 | 400 | 0.06 | 0.02 | 0.02 | 0 | 12.5 | 0 |
▶pack_kernel.f90: 158 - 0.1%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
406 | 0.05 | 0.01 | 0.04 | 0 | 12.5 | 0 | 406 | 0.05 | 0.01 | 0.04 | 0 | 12.5 | 0 | 406 | 0.06 | 0.02 | 0.02 | 0 | 12.5 | 0 |
▶pack_kernel.f90: 205 - 0.06%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
409 | 0.04 | 0.01 | 0.03 | 0 | 12.5 | 0 | 409 | 0.03 | 0.01 | 0.02 | 0 | 12.5 | 0 | 409 | 0.05 | 0.01 | 0.01 | 0 | 12.5 | 0 |
▶generate_chunk_kernel.f90: 105 - 0.06%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
295 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 362 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 295 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 361 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 295 | 0.01 | 0.01 | 0.01 | 100 | 50 | 0 | 265 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶generate_chunk_kernel.f90: 97 - 0.06%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
298 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 364 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 298 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 363 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 298 | 0.01 | 0.01 | 0.01 | 100 | 50 | 0 | 267 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶initialise_chunk_kernel.f90: 128 - 0.06%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| ||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
338 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 396 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 338 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 395 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 338 | 0.01 | 0.01 | 0.01 | 100 | 50 | 0 | 299 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶generate_chunk_kernel.f90: 130 - 0.05%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
232 | 0.03 | 0.01 | 0.02 | 0 | 12.5 | 0 | 290 | 0.03 | 0.01 | 0.02 | 0 | 12.5 | 0 | 232 | 0.03 | 0.01 | 0.01 | 0 | 12.5 | 0 |
▶initialise_chunk_kernel.f90: 119 - 0.05%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
341 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 398 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 341 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 341 | 0.01 | 0.01 | 0.01 | 100 | 50 | 0 | 301 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶generate_chunk_kernel.f90: 89 - 0.05%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
301 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 366 | 0 | 0 | 0.01 | 100 | 100 | NA | 301 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 301 | 0.01 | 0.01 | 0.01 | 100 | 50 | 0 | 269 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶pack_kernel.f90: 111 - 0.05%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
403 | 0.04 | 0.01 | 0.02 | 0 | 12.5 | 0 | 403 | 0.04 | 0.01 | 0.02 | 0 | 12.5 | 0 | 403 | 0.03 | 0.01 | 0.01 | 0 | 12.5 | 0 |
▶initialise_chunk_kernel.f90: 110 - 0.05%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
344 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 344 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 399 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 344 | 0.01 | 0.01 | 0.01 | 100 | 50 | 0 | 303 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶generate_chunk_kernel.f90: 113 - 0.05%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions | Loop Source Regions |
| |||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
292 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 292 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 359 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 292 | 0.01 | 0.01 | 0.01 | 100 | 50 | 0 | 263 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶generate_chunk_kernel.f90: 106 - 0.03%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
237 | 0.01 | 0 | 0.01 | 50 | 56.25 | NA | 295 | 0.01 | 0 | 0.01 | 50 | 31.25 | NA | 237 | 0.01 | 0.01 | 0.01 | 50 | 56.25 | 0 |
▶generate_chunk_kernel.f90: 98 - 0.03%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
239 | 0 | 0 | 0.01 | 50 | 56.25 | NA | 297 | 0.01 | 0 | 0.01 | 50 | 31.25 | NA | 239 | 0.01 | 0.01 | 0.01 | 50 | 56.25 | 0 |
▶initialise_chunk_kernel.f90: 129 - 0.03%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
261 | 0.01 | 0 | 0.01 | 100 | 100 | NA | 179 | 0.01 | 0 | 0.01 | 100 | 50 | NA | 261 | 0.01 | 0.01 | 0.01 | 100 | 100 | 0 |
▶generate_chunk_kernel.f90: 90 - 0.02%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
299 | 0.01 | 0 | 0.01 | 50 | 31.25 | NA | 241 | 0.01 | 0.01 | 0.01 | 50 | 56.25 | 0 |
▶initialise_chunk_kernel.f90: 120 - 0.02%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions |
| Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | ||||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
264 | 0 | 0 | 0.01 | 50 | 56.25 | NA | 264 | 0.01 | 0.01 | 0.01 | 50 | 56.25 | 0 |
▶generate_chunk_kernel.f90: 114 - 0.01%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
235 | 0.01 | 0.01 | 0.01 | 50 | 56.25 | 0 |
▶initialise_chunk_kernel.f90: 111 - 0.01%
Run orig_HBM | Run gcc_2_HBM | Run icx_10_HBM | Run orig_HBM_CACHE | Run gcc_9_HBM_CACHE | Run icx_10_HBM_CACHE | Run orig_DDR | Run gcc_2_DDR | Run icx_2_DDR | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions |
| Loop Source Regions | |||||||||||||||||||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
266 | 0.01 | 0.01 | 0.01 | 25 | 34.38 | 0 |