Loops
MultiBsplineRef.hpp: 68 - 69%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
873 | 27.97 | 26.69 | 23.66 | 100 | 25 | 228.14 | 676 | 27.45 | 26.12 | 22.24 | 100 | 50 | 234.6 | 748 | 27.55 | 26 | 23.1 | 100 | 25 | 225.8 |
SoaDistanceTableAAOMPTarget.h: 440 - 19.51%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1864 | 8.17 | 7.71 | 6.83 | 54.55 | 15.91 | 0 | 184 | 7.49 | 7.09 | 6.03 | 27.27 | 15.91 | 0 | 1724 | 8.22 | 7.49 | 6.65 | 54.55 | 15.91 | 0 |
BsplineFunctor.h: 236 - 4.51%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
393 | 1.98 | 1.47 | 1.3 | 89.47 | 44.08 | 0.53 | 642 | 0.12 | 0.05 | 0.04 | 0 | 10 | 0.57 | 252 | 2.15 | 1.7 | 1.51 | 0 | 11.16 | 0.18 |
308 | 0.1 | 0.04 | 0.04 | 88.24 | 43.38 | 1.76 | 558 | 2.2 | 1.91 | 1.62 | 0 | 10 | 0.25 |
inner_product.hpp: 155 - 3.06%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
979 | 0.28 | 0.13 | 0.12 | 33.33 | 16.67 | 97.54 | 658 | 0.22 | 0.14 | 0.12 | 100 | 44.87 | 451.06 | 866 | 0.81 | 0.62 | 0.55 | 33.33 | 16.67 | 102 |
981 | 0.78 | 0.62 | 0.55 | 33.33 | 16.67 | 103.17 | 854 | 0.41 | 0.27 | 0.24 | 33.33 | 16.67 | 234.17 | |||||||
982 | 0.42 | 0.32 | 0.28 | 33.33 | 16.67 | 197.18 | 853 | 0.72 | 0.59 | 0.52 | 33.33 | 16.67 | 108.63 | |||||||
994 | 0.83 | 0.65 | 0.58 | 33.33 | 16.67 | 97.65 | 851 | 0.18 | 0.11 | 0.1 | 33.33 | 16.67 | 116.08 |
einspline_spo_ref.hpp: 223 - 2.59%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
875 | 1.23 | 1.02 | 0.91 | 30 | 15.31 | 0 | 682 | 1.16 | 1.02 | 0.87 | 11.11 | 13.89 | 0 | 750 | 1.13 | 0.91 | 0.81 | 0 | 11.93 | 0 |
<unknown>: 0 - 2.2%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2345 | 1.39 | 1.24 | 1.1 | 100 | 50 | 0.28 | 92 | 0 | 0 | 0 | 0 | 0 | NA | 830 | 0 | 0 | 0 | 0 | 0 | NA |
353 | 0 | 0 | 0 | 0 | 0 | NA | 81 | 0 | 0 | 0 | 0 | 0 | NA | 82 | 0.01 | 0 | 0 | 0 | 0 | NA |
355 | 0.02 | 0 | 0 | 0 | 0 | NA | 80 | 0 | 0 | 0 | 0 | 0 | NA | 178 | 0 | 0 | 0 | 0 | 0 | NA |
365 | 0.01 | 0 | 0 | 0 | 0 | NA | 87 | 0 | 0 | 0 | 0 | 0 | NA | 179 | 0.02 | 0 | 0 | 0 | 0 | NA |
1058 | 0.01 | 0 | 0 | 0 | 0 | NA | 45 | 0 | 0 | 0 | 0 | 0 | NA | 212 | 0.02 | 0 | 0 | 0 | 0 | NA |
367 | 0 | 0 | 0 | 0 | 0 | NA | 85 | 0.01 | 0 | 0 | 0 | 0 | NA | 858 | 0 | 0 | 0 | 0 | 0 | NA |
1544 | 0 | 0 | 0 | 0 | 0 | NA | 94 | 0.01 | 0 | 0 | 0 | 0 | NA | 1198 | 0 | 0 | 0 | 0 | 0 | NA |
1283 | 0 | 0 | 0 | 0 | 0 | NA | 75 | 0.03 | 0.01 | 0.01 | 30.95 | 14.96 | 8.75 | 29 | 0 | 0 | 0 | 0 | 0 | NA |
372 | 0 | 0 | 0 | 0 | 0 | NA | 742 | 0 | 0 | 0 | 0 | 0 | NA | 744 | 0.01 | 0 | 0 | 0 | 0 | NA |
1323 | 0 | 0 | 0 | 0 | 0 | NA | 206 | 0.02 | 0 | 0 | 0 | 0 | NA | 1441 | 0.07 | 0 | 0 | 0 | 0 | NA |
371 | 0 | 0 | 0 | 0 | 0 | NA | 204 | 0 | 0 | 0 | 0 | 0 | NA | 204 | 0.01 | 0 | 0 | 0 | 0 | NA |
380 | 0 | 0 | 0 | 0 | 0 | NA | 205 | 0.01 | 0 | 0 | 0 | 0 | NA | 976 | 0 | 0 | 0 | 0 | 0 | NA |
382 | 0 | 0 | 0 | 0 | 0 | NA | 692 | 0 | 0 | 0 | 0 | 0 | NA | 730 | 0.01 | 0 | 0 | 0 | 0 | NA |
388 | 0 | 0 | 0 | 0 | 0 | NA | 691 | 0 | 0 | 0 | 0 | 0 | NA | 728 | 0 | 0 | 0 | 0 | 0 | NA |
384 | 0 | 0 | 0 | 0 | 0 | NA | 684 | 0.01 | 0 | 0 | 0 | 0 | NA | 984 | 0 | 0 | 0 | 0 | 0 | NA |
387 | 0.02 | 0.01 | 0 | 0 | 0 | 233.81 | 672 | 0.01 | 0 | 0 | 0 | 0 | NA | 1966 | 0.02 | 0 | 0 | 0 | 0 | NA |
101 | 0.02 | 0 | 0 | 0 | 0 | NA | 208 | 0.01 | 0 | 0 | 0 | 0 | NA | 280 | 0.03 | 0 | 0 | 0 | 0 | NA |
258 | 0.02 | 0 | 0 | 0 | 0 | NA | 210 | 0 | 0 | 0 | 0 | 0 | NA | 872 | 0 | 0 | 0 | 0 | 0 | NA |
986 | 0 | 0 | 0 | 0 | 0 | NA | 443 | 0 | 0 | 0 | 0 | 0 | NA | 282 | 0 | 0 | 0 | 0 | 0 | NA |
989 | 0.01 | 0 | 0 | 0 | 0 | NA | 442 | 0 | 0 | 0 | 0 | 0 | NA | 297 | 0 | 0 | 0 | 0 | 0 | NA |
102 | 0.01 | 0 | 0 | 0 | 0 | NA | 605 | 0 | 0 | 0 | 0 | 0 | NA | 1719 | 0 | 0 | 0 | 0 | 0 | NA |
284 | 0 | 0 | 0 | 0 | 0 | NA | 444 | 0 | 0 | 0 | 0 | 0 | NA | 1093 | 0 | 0 | 0 | 0 | 0 | NA |
969 | 0.01 | 0 | 0 | 0 | 0 | NA | 448 | 0 | 0 | 0 | 0 | 0 | NA | 287 | 0 | 0 | 0 | 0 | 0 | NA |
951 | 0 | 0 | 0 | 0 | 0 | NA | 449 | 0 | 0 | 0 | 0 | 0 | NA | 272 | 0.02 | 0 | 0 | 0 | 0 | NA |
987 | 0 | 0 | 0 | 0 | 0 | NA | 618 | 0 | 0 | 0 | 0 | 0 | NA | 268 | 0 | 0 | 0 | 0 | 0 | NA |
984 | 0 | 0 | 0 | 0 | 0 | NA | 617 | 0 | 0 | 0 | 0 | 0 | NA | 270 | 0.01 | 0 | 0 | 0 | 0 | NA |
49 | 0 | 0 | 0 | 0 | 0 | NA | 615 | 0 | 0 | 0 | 0 | 0 | NA | 295 | 0 | 0 | 0 | 0 | 0 | NA |
1534 | 0 | 0 | 0 | 0 | 0 | NA | 613 | 0 | 0 | 0 | 0 | 0 | NA | 856 | 0 | 0 | 0 | 0 | 0 | NA |
295 | 0.02 | 0 | 0 | 0 | 0 | NA | 463 | 0.02 | 0.01 | 0 | 0 | 0 | 630.47 | 61 | 0 | 0 | 0 | 0 | 0 | NA |
369 | 0 | 0 | 0 | 0 | 0 | NA | 609 | 0 | 0 | 0 | 0 | 0 | NA | 1089 | 0 | 0 | 0 | 0 | 0 | NA |
1857 | 0 | 0 | 0 | 0 | 0 | NA | 606 | 0 | 0 | 0 | 0 | 0 | NA | 293 | 0 | 0 | 0 | 0 | 0 | NA |
1115 | 0 | 0 | 0 | 0 | 0 | NA | 466 | 0.02 | 0 | 0 | 0 | 0 | NA | 291 | 0 | 0 | 0 | 0 | 0 | NA |
2095 | 0 | 0 | 0 | 0 | 0 | NA | 598 | 0.01 | 0 | 0 | 0 | 0 | NA | 289 | 0 | 0 | 0 | 0 | 0 | NA |
1555 | 0 | 0 | 0 | 0 | 0 | NA | 43 | 0 | 0 | 0 | 0 | 0 | NA | 1191 | 0.01 | 0 | 0 | 0 | 0 | NA |
1557 | 0.01 | 0 | 0 | 0 | 0 | NA | 619 | 0 | 0 | 0 | 0 | 0 | NA | 205 | 0 | 0 | 0 | 0 | 0 | NA |
378 | 0 | 0 | 0 | 0 | 0 | NA | 683 | 0.02 | 0 | 0 | 0 | 0 | NA | 301 | 0.01 | 0 | 0 | 0 | 0 | NA |
48 | 0 | 0 | 0 | 0 | 0 | NA | 330 | 0.01 | 0 | 0 | 0 | 0 | NA | 77 | 0.01 | 0 | 0 | 0 | 0 | NA |
53 | 0 | 0 | 0 | 0 | 0 | NA | 331 | 0.01 | 0 | 0 | 0 | 0 | NA | 299 | 0 | 0 | 0 | 0 | 0 | NA |
109 | 0 | 0 | 0 | 0 | 0 | NA | 48 | 0 | 0 | 0 | 0 | 0 | NA | 924 | 0.01 | 0 | 0 | 0 | 0 | NA |
296 | 0.01 | 0 | 0 | 0 | 0 | NA | 607 | 0 | 0 | 0 | 0 | 0 | NA | 1083 | 0.01 | 0 | 0 | 0 | 0 | NA |
285 | 0 | 0 | 0 | 0 | 0 | NA | 207 | 0 | 0 | 0 | 0 | 0 | NA | 59 | 0 | 0 | 0 | 0 | 0 | NA |
110 | 0 | 0 | 0 | 0 | 0 | NA | 298 | 0 | 0 | 0 | 0 | 0 | NA | 764 | 0 | 0 | 0 | 0 | 0 | NA |
1332 | 0 | 0 | 0 | 0 | 0 | NA | 297 | 0.01 | 0 | 0 | 0 | 0 | NA | 1428 | 0 | 0 | 0 | 0 | 0 | NA |
1560 | 0.01 | 0 | 0 | 0 | 0 | NA | 223 | 0 | 0 | 0 | 0 | 0 | NA | 79 | 0 | 0 | 0 | 0 | 0 | NA |
306 | 0 | 0 | 0 | 0 | 0 | NA | 399 | 0 | 0 | 0 | 0 | 0 | NA | 1431 | 0 | 0 | 0 | 0 | 0 | NA |
300 | 0.01 | 0 | 0 | 0 | 0 | NA | 220 | 0 | 0 | 0 | 0 | 0 | NA | 220 | 0.01 | 0 | 0 | 0 | 0 | NA |
954 | 0 | 0 | 0 | 0 | 0 | NA | 345 | 0.01 | 0 | 0 | 0 | 0 | NA | 766 | 0 | 0 | 0 | 0 | 0 | NA |
988 | 0 | 0 | 0 | 0 | 0 | NA | 38 | 0.03 | 0 | 0 | 0 | 0 | NA | 1442 | 0.02 | 0 | 0 | 0 | 0 | NA |
104 | 0.01 | 0 | 0 | 0 | 0 | NA | 39 | 0.08 | 0 | 0 | 0 | 0 | NA | 1437 | 0 | 0 | 0 | 0 | 0 | NA |
1238 | 0.01 | 0 | 0 | 0 | 0 | NA | 185 | 0 | 0 | 0 | 0 | 0 | NA | 222 | 0.01 | 0 | 0 | 0 | 0 | NA |
1232 | 0.01 | 0 | 0 | 0 | 0 | NA | 152 | 0 | 0 | 0 | 0 | 0 | NA | 1145 | 0 | 0 | 0 | 0 | 0 | NA |
1368 | 0.01 | 0 | 0 | 0 | 0 | NA | 443 | 0 | 0 | 0 | 0 | 0 | NA | 2247 | 1.44 | 1.23 | 1.09 | 100 | 50 | 0.26 |
882 | 0 | 0 | 0 | 0 | 0 | NA | 442 | 0 | 0 | 0 | 0 | 0 | NA | 216 | 0.02 | 0 | 0 | 0 | 0 | NA |
106 | 0 | 0 | 0 | 0 | 0 | NA | 130 | 0 | 0 | 0 | 0 | 0 | NA | 860 | 0 | 0 | 0 | 0 | 0 | NA |
281 | 0.01 | 0 | 0 | 0 | 0 | NA | 150 | 0 | 0 | 0 | 0 | 0 | NA | 207 | 0.01 | 0 | 0 | 0 | 0 | NA |
869 | 0.03 | 0 | 0 | 0 | 0 | NA | 74 | 0.03 | 0 | 0 | 0 | 0 | NA | |||||||
1559 | 0.04 | 0 | 0 | 0 | 0 | NA | 1226 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1859 | 0.01 | 0 | 0 | 0 | 0 | NA | 1439 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
302 | 0 | 0 | 0 | 0 | 0 | NA | 75 | 0.02 | 0 | 0 | 0 | 0 | NA | |||||||
2091 | 0.01 | 0 | 0 | 0 | 0 | NA | 52 | 0 | 0 | 0 | 0 | 0 | NA | |||||||
855 | 0 | 0 | 0 | 0 | 0 | NA | 249 | 0.03 | 0.01 | 0 | 0 | 0 | 270.71 | |||||||
1369 | 0.01 | 0 | 0 | 0 | 0 | NA | 225 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1126 | 0 | 0 | 0 | 0 | 0 | NA | 861 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
57 | 0 | 0 | 0 | 0 | 0 | NA | 859 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
298 | 0.04 | 0.01 | 0 | 0 | 0 | 205.26 | ||||||||||||||
841 | 0.01 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
215 | 0.02 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
827 | 0 | 0 | 0 | 0 | 0 | NA |
inner_product.hpp: 82 - 2.04%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
985 | 0.16 | 0.11 | 0.1 | 100 | 50 | 186.17 | 730 | 0.15 | 0.1 | 0.09 | 100 | 50 | 209.73 | 765 | 0.46 | 0.36 | 0.32 | 100 | 50 | 234.25 |
883 | 0.46 | 0.36 | 0.31 | 100 | 50 | 234.08 | 728 | 0.44 | 0.29 | 0.24 | 100 | 50 | 72.52 | 849 | 0.07 | 0.03 | 0.03 | 100 | 50 | 139.74 |
996 | 0.44 | 0.3 | 0.26 | 100 | 50 | 70.12 | 41 | 0.44 | 0.36 | 0.3 | 100 | 50 | 232.96 | 857 | 0.16 | 0.1 | 0.09 | 100 | 50 | 210.05 |
977 | 0.08 | 0.04 | 0.03 | 100 | 50 | 106.99 | 733 | 0.06 | 0.03 | 0.02 | 100 | 50 | 140.67 | 868 | 0.46 | 0.28 | 0.25 | 100 | 50 | 75.39 |
MultiBsplineRef.hpp: 276 - 1.69%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
877 | 0.55 | 0.42 | 0.37 | 100 | 50 | 273.62 | 677 | 1.47 | 1.18 | 1 | 0 | 12.5 | 119.33 | 753 | 0.45 | 0.36 | 0.32 | 100 | 50 | 305.85 |
BsplineFunctor.h: 291 - 1.6%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
332 | 0.73 | 0.59 | 0.52 | 86.96 | 44.57 | 0.44 | 485 | 0.34 | 0.26 | 0.22 | 0 | 9.38 | 0.18 | 248 | 0.75 | 0.62 | 0.55 | 0 | 9.94 | 0.03 |
551 | 0.4 | 0.33 | 0.28 | 0 | 9.38 | 0.1 | ||||||||||||||
465 | 0.07 | 0.03 | 0.03 | 0 | 9.38 | 0.6 |
TwoBodyJastrowRef.h: 324 - 1.03%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
363 | 0.45 | 0.37 | 0.33 | 100 | 50 | 257.2 | 490 | 0.55 | 0.45 | 0.38 | 0 | 12.5 | 198.77 | 278 | 0.47 | 0.36 | 0.32 | 100 | 50 | 263.08 |
inner_product.hpp: 211 - 0.6%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
962 | 0.22 | 0.15 | 0.13 | 85.71 | 41.07 | 0 | 695 | 0.35 | 0.27 | 0.23 | 0 | 12.5 | 0 | 835 | 0.32 | 0.15 | 0.13 | 66.67 | 31.25 | 0 |
961 | 0.19 | 0.13 | 0.11 | 85.71 | 41.07 | 0 |
TwoBodyJastrowRef.h: 381 - 0.17%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
381 | 0.06 | 0.02 | 0.02 | 100 | 50 | 214.96 | 462 | 0.11 | 0.06 | 0.05 | 100 | 50 | 207.86 | 294 | 0.05 | 0.02 | 0.02 | 100 | 50 | 205.98 |
383 | 0.05 | 0.02 | 0.02 | 100 | 50 | 209.88 | 292 | 0.06 | 0.02 | 0.02 | 100 | 50 | 212.93 | |||||||
379 | 0.05 | 0.02 | 0.02 | 100 | 50 | 209.71 | 290 | 0.05 | 0.02 | 0.02 | 100 | 50 | 209.93 |
OneBodyJastrowRef.h: 214 - 0.03%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
329 | 0.03 | 0.01 | 0.01 | 0 | 11.11 | 1.8 | 596 | 0.04 | 0.01 | 0.01 | 0 | 12.5 | 0.25 | 244 | 0.03 | 0.01 | 0.01 | 0 | 11.61 | 1.5 |
TwoBodyJastrowRef.h: 388 - 0.03%
Run orig | Run gcc_11 | Run icx_7 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
377 | 0.04 | 0.02 | 0.01 | 100 | 50 | 416.69 | 461 | 0.04 | 0.02 | 0.01 | 100 | 50 | 425.76 | 288 | 0.03 | 0.02 | 0.01 | 100 | 50 | 422.51 |