Loops
MultiBsplineRef.hpp: 68 - 152.13%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
873 | 23.69 | 22.88 | 28.41 | 100 | 25 | 266.32 | 676 | 21.63 | 20.91 | 24.22 | 100 | 50 | 293.08 | 849 | 23.65 | 22.66 | 28.09 | 100 | 25 | 268.94 | 873 | 29.62 | 27.69 | 24.67 | 100 | 25 | 219.96 | 690 | 28.18 | 26.92 | 22.6 | 100 | 50 | 227.64 | 846 | 28.67 | 27.25 | 24.14 | 100 | 25 | 223.58 |
SoaDistanceTableAAOMPTarget.h: 440 - 34.9%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1864 | 4.66 | 4.32 | 5.36 | 54.55 | 15.91 | 0 | 184 | 4.5 | 4.18 | 4.84 | 27.27 | 15.91 | 0 | 1852 | 4.55 | 4.25 | 5.27 | 54.55 | 15.91 | 0 | 1864 | 8.42 | 7.77 | 6.92 | 54.55 | 15.91 | 0 | 226 | 7.61 | 7.13 | 5.98 | 27.27 | 15.91 | 0 | 1858 | 8.39 | 7.37 | 6.53 | 54.55 | 15.91 | 0 |
inner_product.hpp: 155 - 5.83%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
979 | 0.11 | 0.07 | 0.09 | 33.33 | 16.67 | 181.22 | 658 | 0.18 | 0.11 | 0.13 | 100 | 44.87 | 576.35 | 973 | 0.42 | 0.36 | 0.45 | 33.33 | 16.67 | 176.51 | 979 | 0.31 | 0.14 | 0.13 | 33.33 | 16.67 | 90.55 | 672 | 0.19 | 0.13 | 0.11 | 100 | 50 | 483.5 | 960 | 0.37 | 0.28 | 0.24 | 33.33 | 16.67 | 226.42 |
981 | 0.54 | 0.38 | 0.47 | 33.33 | 16.67 | 168.01 | 958 | 0.09 | 0.07 | 0.08 | 33.33 | 16.67 | 182.01 | 982 | 0.44 | 0.32 | 0.29 | 33.33 | 16.67 | 197.17 | 972 | 0.8 | 0.61 | 0.54 | 33.33 | 16.67 | 103.7 | ||||||||||||||
982 | 0.34 | 0.27 | 0.34 | 33.33 | 16.67 | 233.46 | 960 | 0.47 | 0.35 | 0.44 | 33.33 | 16.67 | 181.82 | 981 | 0.79 | 0.63 | 0.56 | 33.33 | 16.67 | 101.28 | 957 | 0.22 | 0.13 | 0.11 | 33.33 | 16.67 | 98.18 | ||||||||||||||
994 | 0.44 | 0.38 | 0.48 | 33.33 | 16.67 | 167.21 | 961 | 0.34 | 0.23 | 0.28 | 33.33 | 16.67 | 275.11 | 994 | 0.84 | 0.65 | 0.58 | 33.33 | 16.67 | 97.72 | 959 | 0.72 | 0.58 | 0.51 | 33.33 | 16.67 | 110.63 |
einspline_spo_ref.hpp: 223 - 5.25%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
875 | 0.92 | 0.77 | 0.96 | 30 | 15.31 | 0 | 682 | 0.9 | 0.74 | 0.86 | 11.11 | 13.89 | 0 | 851 | 0.81 | 0.63 | 0.79 | 20 | 13.13 | 0 | 875 | 1.21 | 1.01 | 0.9 | 30 | 15.31 | 0 | 696 | 1.29 | 1.08 | 0.91 | 11.11 | 13.89 | 0 | 848 | 1.18 | 0.93 | 0.83 | 0 | 11.93 | 0 |
<unknown>: 0 - 4.1%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1345 | 0 | 0 | 0 | 0 | 0 | NA | 94 | 0.01 | 0 | 0 | 0 | 0 | NA | 347 | 0 | 0 | 0 | 0 | 0 | NA | 347 | 0 | 0 | 0 | 0 | 0 | NA | 86 | 0 | 0 | 0 | 0 | 0 | NA | 2345 | 1.51 | 1.22 | 1.08 | 100 | 50 | 0.27 |
2345 | 0.9 | 0.77 | 0.95 | 100 | 50 | 0.34 | 87 | 0 | 0 | 0 | 0 | 0 | NA | 831 | 0 | 0 | 0 | 0 | 0 | NA | 2345 | 1.49 | 1.25 | 1.11 | 100 | 50 | 0.24 | 84 | 0 | 0 | 0 | 0 | 0 | NA | 114 | 0.01 | 0 | 0 | 0 | 0 | NA |
353 | 0.01 | 0 | 0 | 0 | 0 | NA | 89 | 0 | 0 | 0 | 0 | 0 | NA | 353 | 0.01 | 0 | 0 | 0 | 0 | NA | 353 | 0 | 0 | 0 | 0 | 0 | NA | 80 | 0 | 0 | 0 | 0 | 0 | NA | 359 | 0 | 0 | 0 | 0 | 0 | NA |
355 | 0.02 | 0 | 0 | 0 | 0 | NA | 45 | 0.01 | 0 | 0 | 0 | 0 | NA | 355 | 0.01 | 0 | 0 | 0 | 0 | NA | 355 | 0.02 | 0 | 0 | 0 | 0 | NA | 74 | 0.04 | 0.02 | 0.01 | 30.95 | 14.96 | 4.13 | 352 | 0.01 | 0 | 0 | 0 | 0 | NA |
365 | 0.02 | 0 | 0 | 0 | 0 | NA | 85 | 0.01 | 0 | 0 | 0 | 0 | NA | 365 | 0.01 | 0 | 0 | 0 | 0 | NA | 365 | 0.01 | 0 | 0 | 0 | 0 | NA | 44 | 0 | 0 | 0 | 0 | 0 | NA | 1554 | 0.02 | 0 | 0 | 0 | 0 | NA |
1058 | 0.02 | 0 | 0 | 0 | 0 | NA | 92 | 0.01 | 0 | 0 | 0 | 0 | NA | 367 | 0.01 | 0 | 0 | 0 | 0 | NA | 1058 | 0.01 | 0 | 0 | 0 | 0 | NA | 87 | 0 | 0 | 0 | 0 | 0 | NA | 365 | 0 | 0 | 0 | 0 | 0 | NA |
367 | 0.01 | 0 | 0 | 0 | 0 | NA | 75 | 0.04 | 0.01 | 0.01 | 30.95 | 14.96 | 4.35 | 1540 | 0 | 0 | 0 | 0 | 0 | NA | 367 | 0 | 0 | 0 | 0 | 0 | NA | 91 | 0 | 0 | 0 | 0 | 0 | NA | 367 | 0 | 0 | 0 | 0 | 0 | NA |
1544 | 0 | 0 | 0 | 0 | 0 | NA | 742 | 0 | 0 | 0 | 0 | 0 | NA | 373 | 0 | 0 | 0 | 0 | 0 | NA | 1544 | 0 | 0 | 0 | 0 | 0 | NA | 79 | 0 | 0 | 0 | 0 | 0 | NA | 361 | 0 | 0 | 0 | 0 | 0 | NA |
1283 | 0.01 | 0 | 0 | 0 | 0 | NA | 206 | 0.02 | 0 | 0 | 0 | 0 | NA | 375 | 0 | 0 | 0 | 0 | 0 | NA | 1283 | 0 | 0 | 0 | 0 | 0 | NA | 490 | 0.02 | 0 | 0 | 0 | 0 | NA | 363 | 0 | 0 | 0 | 0 | 0 | NA |
372 | 0 | 0 | 0 | 0 | 0 | NA | 205 | 0.02 | 0 | 0 | 0 | 0 | NA | 1847 | 0 | 0 | 0 | 0 | 0 | NA | 372 | 0 | 0 | 0 | 0 | 0 | NA | 363 | 0.01 | 0 | 0 | 0 | 0 | NA | 242 | 0.01 | 0 | 0 | 0 | 0 | NA |
1323 | 0 | 0 | 0 | 0 | 0 | NA | 692 | 0.01 | 0 | 0 | 0 | 0 | NA | 383 | 0 | 0 | 0 | 0 | 0 | NA | 1321 | 0 | 0 | 0 | 0 | 0 | NA | 362 | 0.01 | 0 | 0 | 0 | 0 | NA | 374 | 0.02 | 0 | 0 | 0 | 0 | NA |
371 | 0 | 0 | 0 | 0 | 0 | NA | 691 | 0 | 0 | 0 | 0 | 0 | NA | 377 | 0 | 0 | 0 | 0 | 0 | NA | 371 | 0 | 0 | 0 | 0 | 0 | NA | 558 | 0.01 | 0 | 0 | 0 | 0 | NA | 369 | 0 | 0 | 0 | 0 | 0 | NA |
380 | 0 | 0 | 0 | 0 | 0 | NA | 684 | 0 | 0 | 0 | 0 | 0 | NA | 379 | 0 | 0 | 0 | 0 | 0 | NA | 380 | 0 | 0 | 0 | 0 | 0 | NA | 207 | 0.01 | 0 | 0 | 0 | 0 | NA | 1042 | 0.01 | 0 | 0 | 0 | 0 | NA |
382 | 0.01 | 0 | 0 | 0 | 0 | NA | 672 | 0.03 | 0 | 0 | 0 | 0 | NA | 101 | 0.02 | 0 | 0 | 0 | 0 | NA | 382 | 0 | 0 | 0 | 0 | 0 | NA | 204 | 0.03 | 0 | 0 | 0 | 0 | NA | 371 | 0 | 0 | 0 | 0 | 0 | NA |
388 | 0.01 | 0 | 0 | 0 | 0 | NA | 208 | 0.01 | 0 | 0 | 0 | 0 | NA | 2085 | 0.01 | 0 | 0 | 0 | 0 | NA | 388 | 0 | 0 | 0 | 0 | 0 | NA | 707 | 0 | 0 | 0 | 0 | 0 | NA | 370 | 0.03 | 0 | 0 | 0 | 0 | NA |
384 | 0 | 0 | 0 | 0 | 0 | NA | 210 | 0 | 0 | 0 | 0 | 0 | NA | 257 | 0.01 | 0 | 0 | 0 | 0 | NA | 384 | 0 | 0 | 0 | 0 | 0 | NA | 205 | 0.02 | 0 | 0 | 0 | 0 | NA | 267 | 0 | 0 | 0 | 0 | 0 | NA |
387 | 0.01 | 0 | 0 | 0 | 0 | NA | 443 | 0 | 0 | 0 | 0 | 0 | NA | 281 | 0 | 0 | 0 | 0 | 0 | NA | 387 | 0.02 | 0 | 0 | 0 | 0 | NA | 203 | 0 | 0 | 0 | 0 | 0 | NA | 268 | 0 | 0 | 0 | 0 | 0 | NA |
101 | 0.02 | 0 | 0 | 0 | 0 | NA | 442 | 0 | 0 | 0 | 0 | 0 | NA | 967 | 0 | 0 | 0 | 0 | 0 | NA | 258 | 0.01 | 0 | 0 | 0 | 0 | NA | 698 | 0 | 0 | 0 | 0 | 0 | NA | 270 | 0.01 | 0 | 0 | 0 | 0 | NA |
258 | 0.02 | 0 | 0 | 0 | 0 | NA | 444 | 0 | 0 | 0 | 0 | 0 | NA | 285 | 0 | 0 | 0 | 0 | 0 | NA | 987 | 0 | 0 | 0 | 0 | 0 | NA | 686 | 0.01 | 0 | 0 | 0 | 0 | NA | 105 | 0 | 0 | 0 | 0 | 0 | NA |
987 | 0.01 | 0 | 0 | 0 | 0 | NA | 448 | 0 | 0 | 0 | 0 | 0 | NA | 968 | 0.01 | 0 | 0 | 0 | 0 | NA | 989 | 0.01 | 0 | 0 | 0 | 0 | NA | 48 | 0 | 0 | 0 | 0 | 0 | NA | 828 | 0 | 0 | 0 | 0 | 0 | NA |
989 | 0.02 | 0 | 0 | 0 | 0 | NA | 449 | 0 | 0 | 0 | 0 | 0 | NA | 1362 | 0 | 0 | 0 | 0 | 0 | NA | 102 | 0.02 | 0 | 0 | 0 | 0 | NA | 209 | 0 | 0 | 0 | 0 | 0 | NA | 53 | 0 | 0 | 0 | 0 | 0 | NA |
284 | 0 | 0 | 0 | 0 | 0 | NA | 618 | 0 | 0 | 0 | 0 | 0 | NA | 1108 | 0 | 0 | 0 | 0 | 0 | NA | 284 | 0 | 0 | 0 | 0 | 0 | NA | 443 | 0 | 0 | 0 | 0 | 0 | NA | 862 | 0 | 0 | 0 | 0 | 0 | NA |
969 | 0.01 | 0 | 0 | 0 | 0 | NA | 617 | 0.01 | 0 | 0 | 0 | 0 | NA | 284 | 0 | 0 | 0 | 0 | 0 | NA | 969 | 0.01 | 0 | 0 | 0 | 0 | NA | 705 | 0 | 0 | 0 | 0 | 0 | NA | 967 | 0.01 | 0 | 0 | 0 | 0 | NA |
1369 | 0.01 | 0 | 0 | 0 | 0 | NA | 615 | 0.01 | 0 | 0 | 0 | 0 | NA | 295 | 0.01 | 0 | 0 | 0 | 0 | NA | 951 | 0 | 0 | 0 | 0 | 0 | NA | 445 | 0 | 0 | 0 | 0 | 0 | NA | 965 | 0 | 0 | 0 | 0 | 0 | NA |
369 | 0 | 0 | 0 | 0 | 0 | NA | 613 | 0.01 | 0 | 0 | 0 | 0 | NA | 932 | 0 | 0 | 0 | 0 | 0 | NA | 986 | 0 | 0 | 0 | 0 | 0 | NA | 444 | 0 | 0 | 0 | 0 | 0 | NA | 285 | 0 | 0 | 0 | 0 | 0 | NA |
951 | 0 | 0 | 0 | 0 | 0 | NA | 609 | 0.01 | 0 | 0 | 0 | 0 | NA | 935 | 0 | 0 | 0 | 0 | 0 | NA | 885 | 0 | 0 | 0 | 0 | 0 | NA | 449 | 0 | 0 | 0 | 0 | 0 | NA | 962 | 0 | 0 | 0 | 0 | 0 | NA |
984 | 0 | 0 | 0 | 0 | 0 | NA | 606 | 0 | 0 | 0 | 0 | 0 | NA | 965 | 0 | 0 | 0 | 0 | 0 | NA | 99 | 0.01 | 0 | 0 | 0 | 0 | NA | 627 | 0 | 0 | 0 | 0 | 0 | NA | 1106 | 0 | 0 | 0 | 0 | 0 | NA |
386 | 0 | 0 | 0 | 0 | 0 | NA | 605 | 0 | 0 | 0 | 0 | 0 | NA | 98 | 0 | 0 | 0 | 0 | 0 | NA | 369 | 0 | 0 | 0 | 0 | 0 | NA | 624 | 0 | 0 | 0 | 0 | 0 | NA | 20 | 0 | 0 | 0 | 0 | 0 | NA |
48 | 0 | 0 | 0 | 0 | 0 | NA | 619 | 0.01 | 0 | 0 | 0 | 0 | NA | 1553 | 0.06 | 0 | 0 | 0 | 0 | NA | 295 | 0.01 | 0.01 | 0 | 0 | 0 | 73.7 | 621 | 0 | 0 | 0 | 0 | 0 | NA | 947 | 0 | 0 | 0 | 0 | 0 | NA |
49 | 0 | 0 | 0 | 0 | 0 | NA | 464 | 0.03 | 0 | 0 | 0 | 0 | NA | 48 | 0 | 0 | 0 | 0 | 0 | NA | 302 | 0.01 | 0 | 0 | 0 | 0 | NA | 617 | 0.01 | 0 | 0 | 0 | 0 | NA | 278 | 0.02 | 0 | 0 | 0 | 0 | NA |
300 | 0.02 | 0 | 0 | 0 | 0 | NA | 598 | 0.01 | 0 | 0 | 0 | 0 | NA | 302 | 0 | 0 | 0 | 0 | 0 | NA | 378 | 0 | 0 | 0 | 0 | 0 | NA | 615 | 0 | 0 | 0 | 0 | 0 | NA | 279 | 0.01 | 0 | 0 | 0 | 0 | NA |
295 | 0.02 | 0 | 0 | 0 | 0 | NA | 43 | 0.01 | 0 | 0 | 0 | 0 | NA | 105 | 0 | 0 | 0 | 0 | 0 | NA | 386 | 0 | 0 | 0 | 0 | 0 | NA | 606 | 0.03 | 0 | 0 | 0 | 0 | NA | 64 | 0.01 | 0 | 0 | 0 | 0 | NA |
1242 | 0 | 0 | 0 | 0 | 0 | NA | 330 | 0.01 | 0 | 0 | 0 | 0 | NA | 300 | 0.01 | 0 | 0 | 0 | 0 | NA | 882 | 0 | 0 | 0 | 0 | 0 | NA | 467 | 0 | 0 | 0 | 0 | 0 | NA | 934 | 0 | 0 | 0 | 0 | 0 | NA |
1321 | 0 | 0 | 0 | 0 | 0 | NA | 48 | 0.01 | 0 | 0 | 0 | 0 | NA | 863 | 0 | 0 | 0 | 0 | 0 | NA | 984 | 0 | 0 | 0 | 0 | 0 | NA | 464 | 0.02 | 0 | 0 | 0 | 0 | NA | 931 | 0 | 0 | 0 | 0 | 0 | NA |
882 | 0 | 0 | 0 | 0 | 0 | NA | 331 | 0.01 | 0 | 0 | 0 | 0 | NA | 1551 | 0 | 0 | 0 | 0 | 0 | NA | 1555 | 0.01 | 0 | 0 | 0 | 0 | NA | 465 | 0.02 | 0.01 | 0 | 0 | 0 | 208.81 | 289 | 0.01 | 0 | 0 | 0 | 0 | NA |
285 | 0 | 0 | 0 | 0 | 0 | NA | 466 | 0.02 | 0 | 0 | 0 | 0 | NA | 296 | 0.01 | 0 | 0 | 0 | 0 | NA | 1559 | 0.07 | 0 | 0 | 0 | 0 | NA | 43 | 0 | 0 | 0 | 0 | 0 | NA | 1540 | 0 | 0 | 0 | 0 | 0 | NA |
1115 | 0 | 0 | 0 | 0 | 0 | NA | 683 | 0.02 | 0 | 0 | 0 | 0 | NA | 963 | 0 | 0 | 0 | 0 | 0 | NA | 1115 | 0 | 0 | 0 | 0 | 0 | NA | 468 | 0.01 | 0 | 0 | 0 | 0 | NA | 2086 | 0.01 | 0 | 0 | 0 | 0 | NA |
1555 | 0 | 0 | 0 | 0 | 0 | NA | 607 | 0 | 0 | 0 | 0 | 0 | NA | 110 | 0 | 0 | 0 | 0 | 0 | NA | 1242 | 0 | 0 | 0 | 0 | 0 | NA | 628 | 0 | 0 | 0 | 0 | 0 | NA | 2090 | 0 | 0 | 0 | 0 | 0 | NA |
1559 | 0.05 | 0 | 0 | 0 | 0 | NA | 298 | 0.01 | 0 | 0 | 0 | 0 | NA | 1338 | 0 | 0 | 0 | 0 | 0 | NA | 300 | 0.01 | 0 | 0 | 0 | 0 | NA | 450 | 0.01 | 0 | 0 | 0 | 0 | NA | 964 | 0 | 0 | 0 | 0 | 0 | NA |
2095 | 0 | 0 | 0 | 0 | 0 | NA | 297 | 0.01 | 0 | 0 | 0 | 0 | NA | 1047 | 0.01 | 0 | 0 | 0 | 0 | NA | 1534 | 0 | 0 | 0 | 0 | 0 | NA | 697 | 0.02 | 0 | 0 | 0 | 0 | NA | 1537 | 0 | 0 | 0 | 0 | 0 | NA |
1536 | 0.01 | 0 | 0 | 0 | 0 | NA | 207 | 0.01 | 0 | 0 | 0 | 0 | NA | 1316 | 0 | 0 | 0 | 0 | 0 | NA | 109 | 0 | 0 | 0 | 0 | 0 | NA | 403 | 0 | 0 | 0 | 0 | 0 | NA | 109 | 0.01 | 0 | 0 | 0 | 0 | NA |
378 | 0 | 0 | 0 | 0 | 0 | NA | 305 | 0 | 0 | 0 | 0 | 0 | NA | 331 | 0 | 0 | 0 | 0 | 0 | NA | 296 | 0.01 | 0 | 0 | 0 | 0 | NA | 219 | 0 | 0 | 0 | 0 | 0 | NA | 55 | 0 | 0 | 0 | 0 | 0 | NA |
109 | 0 | 0 | 0 | 0 | 0 | NA | 302 | 0 | 0 | 0 | 0 | 0 | NA | 966 | 0 | 0 | 0 | 0 | 0 | NA | 48 | 0 | 0 | 0 | 0 | 0 | NA | 302 | 0.02 | 0 | 0 | 0 | 0 | NA | 290 | 0 | 0 | 0 | 0 | 0 | NA |
296 | 0.02 | 0 | 0 | 0 | 0 | NA | 399 | 0 | 0 | 0 | 0 | 0 | NA | 104 | 0.01 | 0 | 0 | 0 | 0 | NA | 110 | 0.01 | 0 | 0 | 0 | 0 | NA | 301 | 0.02 | 0 | 0 | 0 | 0 | NA | 1226 | 0 | 0 | 0 | 0 | 0 | NA |
53 | 0 | 0 | 0 | 0 | 0 | NA | 345 | 0.01 | 0 | 0 | 0 | 0 | NA | 49 | 0 | 0 | 0 | 0 | 0 | NA | 1332 | 0 | 0 | 0 | 0 | 0 | NA | 349 | 0 | 0 | 0 | 0 | 0 | NA | 966 | 0 | 0 | 0 | 0 | 0 | NA |
110 | 0.01 | 0 | 0 | 0 | 0 | NA | 208 | 0.01 | 0 | 0 | 0 | 0 | NA | 1361 | 0 | 0 | 0 | 0 | 0 | NA | 49 | 0.01 | 0 | 0 | 0 | 0 | NA | 38 | 0.02 | 0 | 0 | 0 | 0 | NA | 354 | 0 | 0 | 0 | 0 | 0 | NA |
1332 | 0.01 | 0 | 0 | 0 | 0 | NA | 35 | 0.01 | 0 | 0 | 0 | 0 | NA | 1230 | 0.01 | 0 | 0 | 0 | 0 | NA | 306 | 0.01 | 0 | 0 | 0 | 0 | NA | 39 | 0.09 | 0 | 0 | 0 | 0 | NA | 1272 | 0 | 0 | 0 | 0 | 0 | NA |
281 | 0 | 0 | 0 | 0 | 0 | NA | 39 | 0.06 | 0 | 0 | 0 | 0 | NA | 1224 | 0.01 | 0 | 0 | 0 | 0 | NA | 285 | 0 | 0 | 0 | 0 | 0 | NA | 117 | 0 | 0 | 0 | 0 | 0 | NA | 111 | 0 | 0 | 0 | 0 | 0 | NA |
306 | 0.01 | 0 | 0 | 0 | 0 | NA | 38 | 0.01 | 0 | 0 | 0 | 0 | NA | 1554 | 0.01 | 0 | 0 | 0 | 0 | NA | 954 | 0 | 0 | 0 | 0 | 0 | NA | 227 | 0 | 0 | 0 | 0 | 0 | NA | 1351 | 0.01 | 0 | 0 | 0 | 0 | NA |
988 | 0 | 0 | 0 | 0 | 0 | NA | 208 | 0 | 0 | 0 | 0 | 0 | NA | 1276 | 0 | 0 | 0 | 0 | 0 | NA | 104 | 0.01 | 0 | 0 | 0 | 0 | NA | 404 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 865 | 0.01 | 0 | 0 | 0 | 0 | NA |
1859 | 0 | 0 | 0 | 0 | 0 | NA | 185 | 0.01 | 0 | 0 | 0 | 0 | NA | 106 | 0 | 0 | 0 | 0 | 0 | NA | 1238 | 0.01 | 0 | 0 | 0 | 0 | NA | 316 | 0 | 0 | 0 | 0 | 0 | NA | 1553 | 0.05 | 0 | 0 | 0 | 0 | NA |
1560 | 0.01 | 0 | 0 | 0 | 0 | NA | 207 | 0 | 0 | 0 | 0 | 0 | NA | 2352 | 0.9 | 0.76 | 0.94 | 100 | 50 | 0.39 | 1232 | 0.01 | 0 | 0 | 0 | 0 | NA | 1549 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
104 | 0.01 | 0 | 0 | 0 | 0 | NA | 290 | 0.02 | 0 | 0 | 0 | 0 | NA | 948 | 0 | 0 | 0 | 0 | 0 | NA | 1557 | 0.01 | 0 | 0 | 0 | 0 | NA | 1220 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1238 | 0.02 | 0 | 0 | 0 | 0 | NA | 129 | 0 | 0 | 0 | 0 | 0 | NA | 866 | 0 | 0 | 0 | 0 | 0 | NA | 281 | 0 | 0 | 0 | 0 | 0 | NA | 1853 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1232 | 0.02 | 0 | 0 | 0 | 0 | NA | 442 | 0 | 0 | 0 | 0 | 0 | NA | 1549 | 0 | 0 | 0 | 0 | 0 | NA | 106 | 0 | 0 | 0 | 0 | 0 | NA | 1551 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1557 | 0.01 | 0 | 0 | 0 | 0 | NA | 306 | 0 | 0 | 0 | 0 | 0 | NA | 1859 | 0.01 | 0 | 0 | 0 | 0 | NA | 106 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | ||||||||||||||
986 | 0.01 | 0 | 0 | 0 | 0 | NA | 258 | 0.01 | 0 | 0 | 0 | 0 | NA | 869 | 0.02 | 0 | 0 | 0 | 0 | NA | 107 | 0.02 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
106 | 0.01 | 0 | 0 | 0 | 0 | NA | 845 | 0.04 | 0 | 0 | 0 | 0 | NA | 988 | 0.01 | 0 | 0 | 0 | 0 | NA | 333 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
302 | 0.01 | 0 | 0 | 0 | 0 | NA | 57 | 0 | 0 | 0 | 0 | 0 | NA | 1368 | 0 | 0 | 0 | 0 | 0 | NA | 341 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
869 | 0.03 | 0 | 0 | 0 | 0 | NA | 1560 | 0.03 | 0 | 0 | 0 | 0 | NA | 339 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||
885 | 0.01 | 0 | 0 | 0 | 0 | NA | 2091 | 0.01 | 0 | 0 | 0 | 0 | NA | 842 | 0.04 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||
52 | 0 | 0 | 0 | 0 | 0 | NA | 855 | 0 | 0 | 0 | 0 | 0 | NA | 283 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||
1368 | 0.01 | 0 | 0 | 0 | 0 | NA | 1369 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
2091 | 0.01 | 0 | 0 | 0 | 0 | NA | 57 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||||||||||||||||
855 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||
331 | 0 | 0 | 0 | 0 | 0 | NA | |||||||||||||||||||||||||||||||||||
57 | 0.01 | 0 | 0 | 0 | 0 | NA |
inner_product.hpp: 82 - 3.95%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
985 | 0.12 | 0.08 | 0.1 | 100 | 50 | 262.76 | 730 | 0.12 | 0.07 | 0.08 | 100 | 50 | 299.55 | 964 | 0.09 | 0.06 | 0.08 | 100 | 50 | 356.14 | 985 | 0.17 | 0.11 | 0.1 | 100 | 50 | 193.44 | 41 | 0.45 | 0.31 | 0.26 | 100 | 50 | 269.87 | 963 | 0.19 | 0.1 | 0.09 | 100 | 50 | 205.49 |
883 | 0.39 | 0.28 | 0.35 | 100 | 50 | 301.82 | 728 | 0.22 | 0.17 | 0.19 | 100 | 50 | 123.93 | 975 | 0.24 | 0.17 | 0.21 | 100 | 50 | 123.68 | 883 | 0.46 | 0.34 | 0.3 | 100 | 50 | 249.37 | 745 | 0.14 | 0.1 | 0.08 | 100 | 50 | 213.17 | 955 | 0.09 | 0.03 | 0.03 | 100 | 50 | 139.52 |
996 | 0.24 | 0.16 | 0.2 | 100 | 50 | 131.46 | 733 | 0.04 | 0.02 | 0.02 | 100 | 50 | 209.91 | 956 | 0.04 | 0.02 | 0.02 | 100 | 50 | 211.23 | 996 | 0.41 | 0.29 | 0.26 | 100 | 50 | 72.48 | 748 | 0.05 | 0.03 | 0.03 | 100 | 50 | 136.37 | 974 | 0.45 | 0.28 | 0.24 | 100 | 50 | 75.24 |
977 | 0.04 | 0.02 | 0.02 | 100 | 50 | 213.51 | 41 | 0.39 | 0.3 | 0.35 | 100 | 50 | 280.76 | 864 | 0.35 | 0.28 | 0.35 | 100 | 50 | 303.17 | 977 | 0.07 | 0.03 | 0.03 | 100 | 50 | 142.25 | 743 | 0.44 | 0.3 | 0.25 | 100 | 50 | 69.92 | 863 | 0.48 | 0.35 | 0.31 | 100 | 50 | 242.32 |
MultiBsplineRef.hpp: 276 - 3.68%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
877 | 0.46 | 0.36 | 0.44 | 100 | 50 | 304.46 | 677 | 1.28 | 0.95 | 1.11 | 0 | 12.5 | 141.03 | 853 | 0.41 | 0.3 | 0.37 | 100 | 50 | 379.02 | 877 | 0.58 | 0.43 | 0.38 | 100 | 50 | 266.46 | 691 | 1.95 | 1.26 | 1.05 | 0 | 12.5 | 103.32 | 851 | 0.5 | 0.37 | 0.33 | 100 | 50 | 309.6 |
BsplineFunctor.h: 291 - 3.16%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
332 | 0.56 | 0.42 | 0.52 | 86.96 | 44.57 | 0.56 | 485 | 0.25 | 0.17 | 0.19 | 0 | 9.38 | 0.32 | 332 | 0.55 | 0.39 | 0.49 | 83.48 | 42.77 | 0.37 | 332 | 0.76 | 0.58 | 0.52 | 86.96 | 44.57 | 0.46 | 489 | 0.35 | 0.26 | 0.22 | 0 | 9.38 | 0.09 | 317 | 0.73 | 0.63 | 0.55 | 0 | 9.94 | 0.04 |
551 | 0.33 | 0.25 | 0.29 | 0 | 9.38 | 0.19 | 557 | 0.48 | 0.37 | 0.31 | 0 | 9.38 | 0.06 | ||||||||||||||||||||||||||||
465 | 0.07 | 0.04 | 0.04 | 0 | 9.38 | 0.45 | 466 | 0.07 | 0.04 | 0.03 | 0 | 9.38 | 0.11 |
TwoBodyJastrowRef.h: 324 - 1.86%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
363 | 0.29 | 0.2 | 0.25 | 100 | 50 | 470.14 | 490 | 0.4 | 0.29 | 0.34 | 0 | 12.5 | 315.62 | 363 | 0.26 | 0.19 | 0.24 | 100 | 50 | 499.5 | 363 | 0.44 | 0.36 | 0.32 | 100 | 50 | 264.79 | 495 | 0.59 | 0.46 | 0.39 | 0 | 12.5 | 205.03 | 350 | 0.45 | 0.36 | 0.32 | 100 | 50 | 263.64 |
inner_product.hpp: 211 - 1.2%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
962 | 0.17 | 0.12 | 0.15 | 85.71 | 41.07 | 0 | 695 | 0.21 | 0.2 | 0.23 | 0 | 12.5 | 0 | 941 | 0.06 | 0.06 | 0.07 | 85.71 | 41.07 | 0 | 962 | 0.31 | 0.15 | 0.14 | 85.71 | 41.07 | 0 | 711 | 0.42 | 0.26 | 0.22 | 33.33 | 16.67 | 0 | 940 | 0.31 | 0.16 | 0.14 | 33.33 | 16.67 | 0 |
961 | 0.16 | 0.11 | 0.13 | 85.71 | 41.07 | 0 | 961 | 0.28 | 0.14 | 0.12 | 85.71 | 41.07 | 0 |
TwoBodyJastrowRef.h: 381 - 0.44%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
381 | 0.05 | 0.02 | 0.03 | 100 | 50 | 209.48 | 462 | 0.11 | 0.07 | 0.08 | 100 | 50 | 177.33 | 374 | 0.05 | 0.02 | 0.03 | 100 | 50 | 207.46 | 381 | 0.05 | 0.02 | 0.02 | 100 | 50 | 213.98 | 463 | 0.13 | 0.07 | 0.06 | 100 | 50 | 176.44 | 364 | 0.06 | 0.02 | 0.02 | 100 | 50 | 213.41 |
383 | 0.05 | 0.02 | 0.03 | 100 | 50 | 213.81 | 376 | 0.05 | 0.02 | 0.03 | 100 | 50 | 212.78 | 383 | 0.05 | 0.02 | 0.02 | 100 | 50 | 214.61 | 366 | 0.05 | 0.02 | 0.02 | 100 | 50 | 214.23 | ||||||||||||||
379 | 0.06 | 0.02 | 0.03 | 100 | 50 | 213.93 | 378 | 0.05 | 0.02 | 0.03 | 100 | 50 | 205.88 | 379 | 0.05 | 0.02 | 0.02 | 100 | 50 | 210.26 | 362 | 0.05 | 0.02 | 0.02 | 100 | 50 | 205.21 |
TwoBodyJastrowRef.h: 388 - 0.07%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
377 | 0.03 | 0.01 | 0.02 | 100 | 50 | 818.57 | 461 | 0.03 | 0.01 | 0.01 | 100 | 50 | 846.13 | 372 | 0.03 | 0.01 | 0.01 | 100 | 50 | 839.18 | 377 | 0.04 | 0.01 | 0.01 | 100 | 50 | 819.62 | 462 | 0.03 | 0.01 | 0.01 | 100 | 50 | 841.43 | 360 | 0.04 | 0.01 | 0.01 | 100 | 50 | 825.82 |
OneBodyJastrowRef.h: 214 - 0.06%
Run orig_HBM_CACHE | Run gcc_11_HBM_CACHE | Run icx_1_HBM_CACHE | Run orig_DDR | Run gcc_9_DDR | Run icx_3_DDR | ||||||||||||||||||||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| ||||||||||||||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
329 | 0.03 | 0.01 | 0.01 | 0 | 11.11 | 0.6 | 596 | 0.03 | 0.01 | 0.01 | 0 | 12.5 | 0.85 | 329 | 0.03 | 0.01 | 0.01 | 0 | 11.61 | 1.6 | 329 | 0.03 | 0.01 | 0.01 | 0 | 11.11 | 3 | 604 | 0.04 | 0.01 | 0.01 | 0 | 12.5 | 0.1 | 313 | 0.03 | 0.01 | 0.01 | 0 | 11.61 | 0.7 |