Loops
MultiBsplineRef.hpp: 68 - 71.41%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
873 | 29.62 | 27.69 | 24.67 | 100 | 25 | 219.96 | 690 | 28.18 | 26.92 | 22.6 | 100 | 50 | 227.64 | 846 | 28.67 | 27.25 | 24.14 | 100 | 25 | 223.58 |
SoaDistanceTableAAOMPTarget.h: 440 - 19.43%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1864 | 8.42 | 7.77 | 6.92 | 54.55 | 15.91 | 0 | 226 | 7.61 | 7.13 | 5.98 | 27.27 | 15.91 | 0 | 1858 | 8.39 | 7.37 | 6.53 | 54.55 | 15.91 | 0 |
BsplineFunctor.h: 236 - 4.55%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
308 | 0.09 | 0.04 | 0.04 | 88.24 | 43.38 | 1.65 | 656 | 0.11 | 0.05 | 0.04 | 0 | 10 | 0.56 | 377 | 2.02 | 1.66 | 1.47 | 0 | 10.85 | 0.28 |
393 | 2 | 1.48 | 1.32 | 89.47 | 44.08 | 0.46 | 565 | 2.37 | 1.95 | 1.63 | 0 | 10 | 0.25 | 292 | 0.09 | 0.05 | 0.05 | 0 | 10.98 | 1.09 |
inner_product.hpp: 155 - 3.07%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
979 | 0.31 | 0.14 | 0.13 | 33.33 | 16.67 | 90.55 | 672 | 0.19 | 0.13 | 0.11 | 100 | 50 | 483.5 | 960 | 0.37 | 0.28 | 0.24 | 33.33 | 16.67 | 226.42 |
982 | 0.44 | 0.32 | 0.29 | 33.33 | 16.67 | 197.17 | 972 | 0.8 | 0.61 | 0.54 | 33.33 | 16.67 | 103.7 | |||||||
981 | 0.79 | 0.63 | 0.56 | 33.33 | 16.67 | 101.28 | 957 | 0.22 | 0.13 | 0.11 | 33.33 | 16.67 | 98.18 | |||||||
994 | 0.84 | 0.65 | 0.58 | 33.33 | 16.67 | 97.72 | 959 | 0.72 | 0.58 | 0.51 | 33.33 | 16.67 | 110.63 |
einspline_spo_ref.hpp: 223 - 2.64%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
875 | 1.21 | 1.01 | 0.9 | 30 | 15.31 | 0 | 696 | 1.29 | 1.08 | 0.91 | 11.11 | 13.89 | 0 | 848 | 1.18 | 0.93 | 0.83 | 0 | 11.93 | 0 |
TwoBodyJastrowRef.h: 342 - 2.28%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
357 | 0.41 | 0.3 | 0.27 | 100 | 50 | 211.17 | 494 | 0.99 | 0.81 | 0.68 | 100 | 50 | 235.54 | 347 | 0.43 | 0.29 | 0.26 | 100 | 50 | 218.17 |
359 | 0.41 | 0.31 | 0.27 | 100 | 50 | 202.88 | 345 | 0.44 | 0.3 | 0.26 | 100 | 50 | 211.51 | |||||||
361 | 0.43 | 0.31 | 0.28 | 100 | 50 | 204.36 | 343 | 0.42 | 0.29 | 0.26 | 100 | 50 | 218.46 |
<unknown>: 0 - 2.2%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions | Loop Source Regions | Loop Source Regions | ||||||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
347 | 0 | 0 | 0 | 0 | 0 | NA | 86 | 0 | 0 | 0 | 0 | 0 | NA | 2345 | 1.51 | 1.22 | 1.08 | 100 | 50 | 0.27 |
2345 | 1.49 | 1.25 | 1.11 | 100 | 50 | 0.24 | 84 | 0 | 0 | 0 | 0 | 0 | NA | 114 | 0.01 | 0 | 0 | 0 | 0 | NA |
353 | 0 | 0 | 0 | 0 | 0 | NA | 80 | 0 | 0 | 0 | 0 | 0 | NA | 359 | 0 | 0 | 0 | 0 | 0 | NA |
355 | 0.02 | 0 | 0 | 0 | 0 | NA | 74 | 0.04 | 0.02 | 0.01 | 30.95 | 14.96 | 4.13 | 352 | 0.01 | 0 | 0 | 0 | 0 | NA |
365 | 0.01 | 0 | 0 | 0 | 0 | NA | 44 | 0 | 0 | 0 | 0 | 0 | NA | 1554 | 0.02 | 0 | 0 | 0 | 0 | NA |
1058 | 0.01 | 0 | 0 | 0 | 0 | NA | 87 | 0 | 0 | 0 | 0 | 0 | NA | 365 | 0 | 0 | 0 | 0 | 0 | NA |
367 | 0 | 0 | 0 | 0 | 0 | NA | 91 | 0 | 0 | 0 | 0 | 0 | NA | 367 | 0 | 0 | 0 | 0 | 0 | NA |
1544 | 0 | 0 | 0 | 0 | 0 | NA | 79 | 0 | 0 | 0 | 0 | 0 | NA | 361 | 0 | 0 | 0 | 0 | 0 | NA |
1283 | 0 | 0 | 0 | 0 | 0 | NA | 490 | 0.02 | 0 | 0 | 0 | 0 | NA | 363 | 0 | 0 | 0 | 0 | 0 | NA |
372 | 0 | 0 | 0 | 0 | 0 | NA | 363 | 0.01 | 0 | 0 | 0 | 0 | NA | 242 | 0.01 | 0 | 0 | 0 | 0 | NA |
1321 | 0 | 0 | 0 | 0 | 0 | NA | 362 | 0.01 | 0 | 0 | 0 | 0 | NA | 374 | 0.02 | 0 | 0 | 0 | 0 | NA |
371 | 0 | 0 | 0 | 0 | 0 | NA | 558 | 0.01 | 0 | 0 | 0 | 0 | NA | 369 | 0 | 0 | 0 | 0 | 0 | NA |
380 | 0 | 0 | 0 | 0 | 0 | NA | 207 | 0.01 | 0 | 0 | 0 | 0 | NA | 1042 | 0.01 | 0 | 0 | 0 | 0 | NA |
382 | 0 | 0 | 0 | 0 | 0 | NA | 204 | 0.03 | 0 | 0 | 0 | 0 | NA | 371 | 0 | 0 | 0 | 0 | 0 | NA |
388 | 0 | 0 | 0 | 0 | 0 | NA | 707 | 0 | 0 | 0 | 0 | 0 | NA | 370 | 0.03 | 0 | 0 | 0 | 0 | NA |
384 | 0 | 0 | 0 | 0 | 0 | NA | 205 | 0.02 | 0 | 0 | 0 | 0 | NA | 267 | 0 | 0 | 0 | 0 | 0 | NA |
387 | 0.02 | 0 | 0 | 0 | 0 | NA | 203 | 0 | 0 | 0 | 0 | 0 | NA | 268 | 0 | 0 | 0 | 0 | 0 | NA |
258 | 0.01 | 0 | 0 | 0 | 0 | NA | 698 | 0 | 0 | 0 | 0 | 0 | NA | 270 | 0.01 | 0 | 0 | 0 | 0 | NA |
987 | 0 | 0 | 0 | 0 | 0 | NA | 686 | 0.01 | 0 | 0 | 0 | 0 | NA | 105 | 0 | 0 | 0 | 0 | 0 | NA |
989 | 0.01 | 0 | 0 | 0 | 0 | NA | 48 | 0 | 0 | 0 | 0 | 0 | NA | 828 | 0 | 0 | 0 | 0 | 0 | NA |
102 | 0.02 | 0 | 0 | 0 | 0 | NA | 209 | 0 | 0 | 0 | 0 | 0 | NA | 53 | 0 | 0 | 0 | 0 | 0 | NA |
284 | 0 | 0 | 0 | 0 | 0 | NA | 443 | 0 | 0 | 0 | 0 | 0 | NA | 862 | 0 | 0 | 0 | 0 | 0 | NA |
969 | 0.01 | 0 | 0 | 0 | 0 | NA | 705 | 0 | 0 | 0 | 0 | 0 | NA | 967 | 0.01 | 0 | 0 | 0 | 0 | NA |
951 | 0 | 0 | 0 | 0 | 0 | NA | 445 | 0 | 0 | 0 | 0 | 0 | NA | 965 | 0 | 0 | 0 | 0 | 0 | NA |
986 | 0 | 0 | 0 | 0 | 0 | NA | 444 | 0 | 0 | 0 | 0 | 0 | NA | 285 | 0 | 0 | 0 | 0 | 0 | NA |
885 | 0 | 0 | 0 | 0 | 0 | NA | 449 | 0 | 0 | 0 | 0 | 0 | NA | 962 | 0 | 0 | 0 | 0 | 0 | NA |
99 | 0.01 | 0 | 0 | 0 | 0 | NA | 627 | 0 | 0 | 0 | 0 | 0 | NA | 1106 | 0 | 0 | 0 | 0 | 0 | NA |
369 | 0 | 0 | 0 | 0 | 0 | NA | 624 | 0 | 0 | 0 | 0 | 0 | NA | 20 | 0 | 0 | 0 | 0 | 0 | NA |
295 | 0.01 | 0.01 | 0 | 0 | 0 | 73.7 | 621 | 0 | 0 | 0 | 0 | 0 | NA | 947 | 0 | 0 | 0 | 0 | 0 | NA |
302 | 0.01 | 0 | 0 | 0 | 0 | NA | 617 | 0.01 | 0 | 0 | 0 | 0 | NA | 278 | 0.02 | 0 | 0 | 0 | 0 | NA |
378 | 0 | 0 | 0 | 0 | 0 | NA | 615 | 0 | 0 | 0 | 0 | 0 | NA | 279 | 0.01 | 0 | 0 | 0 | 0 | NA |
386 | 0 | 0 | 0 | 0 | 0 | NA | 606 | 0.03 | 0 | 0 | 0 | 0 | NA | 64 | 0.01 | 0 | 0 | 0 | 0 | NA |
882 | 0 | 0 | 0 | 0 | 0 | NA | 467 | 0 | 0 | 0 | 0 | 0 | NA | 934 | 0 | 0 | 0 | 0 | 0 | NA |
984 | 0 | 0 | 0 | 0 | 0 | NA | 464 | 0.02 | 0 | 0 | 0 | 0 | NA | 931 | 0 | 0 | 0 | 0 | 0 | NA |
1555 | 0.01 | 0 | 0 | 0 | 0 | NA | 465 | 0.02 | 0.01 | 0 | 0 | 0 | 208.81 | 289 | 0.01 | 0 | 0 | 0 | 0 | NA |
1559 | 0.07 | 0 | 0 | 0 | 0 | NA | 43 | 0 | 0 | 0 | 0 | 0 | NA | 1540 | 0 | 0 | 0 | 0 | 0 | NA |
1115 | 0 | 0 | 0 | 0 | 0 | NA | 468 | 0.01 | 0 | 0 | 0 | 0 | NA | 2086 | 0.01 | 0 | 0 | 0 | 0 | NA |
1242 | 0 | 0 | 0 | 0 | 0 | NA | 628 | 0 | 0 | 0 | 0 | 0 | NA | 2090 | 0 | 0 | 0 | 0 | 0 | NA |
300 | 0.01 | 0 | 0 | 0 | 0 | NA | 450 | 0.01 | 0 | 0 | 0 | 0 | NA | 964 | 0 | 0 | 0 | 0 | 0 | NA |
1534 | 0 | 0 | 0 | 0 | 0 | NA | 697 | 0.02 | 0 | 0 | 0 | 0 | NA | 1537 | 0 | 0 | 0 | 0 | 0 | NA |
109 | 0 | 0 | 0 | 0 | 0 | NA | 403 | 0 | 0 | 0 | 0 | 0 | NA | 109 | 0.01 | 0 | 0 | 0 | 0 | NA |
296 | 0.01 | 0 | 0 | 0 | 0 | NA | 219 | 0 | 0 | 0 | 0 | 0 | NA | 55 | 0 | 0 | 0 | 0 | 0 | NA |
48 | 0 | 0 | 0 | 0 | 0 | NA | 302 | 0.02 | 0 | 0 | 0 | 0 | NA | 290 | 0 | 0 | 0 | 0 | 0 | NA |
110 | 0.01 | 0 | 0 | 0 | 0 | NA | 301 | 0.02 | 0 | 0 | 0 | 0 | NA | 1226 | 0 | 0 | 0 | 0 | 0 | NA |
1332 | 0 | 0 | 0 | 0 | 0 | NA | 349 | 0 | 0 | 0 | 0 | 0 | NA | 966 | 0 | 0 | 0 | 0 | 0 | NA |
49 | 0.01 | 0 | 0 | 0 | 0 | NA | 38 | 0.02 | 0 | 0 | 0 | 0 | NA | 354 | 0 | 0 | 0 | 0 | 0 | NA |
306 | 0.01 | 0 | 0 | 0 | 0 | NA | 39 | 0.09 | 0 | 0 | 0 | 0 | NA | 1272 | 0 | 0 | 0 | 0 | 0 | NA |
285 | 0 | 0 | 0 | 0 | 0 | NA | 117 | 0 | 0 | 0 | 0 | 0 | NA | 111 | 0 | 0 | 0 | 0 | 0 | NA |
954 | 0 | 0 | 0 | 0 | 0 | NA | 227 | 0 | 0 | 0 | 0 | 0 | NA | 1351 | 0.01 | 0 | 0 | 0 | 0 | NA |
104 | 0.01 | 0 | 0 | 0 | 0 | NA | 404 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 865 | 0.01 | 0 | 0 | 0 | 0 | NA |
1238 | 0.01 | 0 | 0 | 0 | 0 | NA | 316 | 0 | 0 | 0 | 0 | 0 | NA | 1553 | 0.05 | 0 | 0 | 0 | 0 | NA |
1232 | 0.01 | 0 | 0 | 0 | 0 | NA | 1549 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1557 | 0.01 | 0 | 0 | 0 | 0 | NA | 1220 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
281 | 0 | 0 | 0 | 0 | 0 | NA | 1853 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
106 | 0 | 0 | 0 | 0 | 0 | NA | 1551 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1859 | 0.01 | 0 | 0 | 0 | 0 | NA | 106 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | |||||||
869 | 0.02 | 0 | 0 | 0 | 0 | NA | 107 | 0.02 | 0 | 0 | 0 | 0 | NA | |||||||
988 | 0.01 | 0 | 0 | 0 | 0 | NA | 333 | 0 | 0 | 0 | 0 | 0 | NA | |||||||
1368 | 0 | 0 | 0 | 0 | 0 | NA | 341 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1560 | 0.03 | 0 | 0 | 0 | 0 | NA | 339 | 0 | 0 | 0 | 0 | 0 | NA | |||||||
2091 | 0.01 | 0 | 0 | 0 | 0 | NA | 842 | 0.04 | 0 | 0 | 0 | 0 | NA | |||||||
855 | 0 | 0 | 0 | 0 | 0 | NA | 283 | 0.01 | 0 | 0 | 0 | 0 | NA | |||||||
1369 | 0 | 0 | 0 | 0 | 0 | NA | ||||||||||||||
57 | 0 | 0 | 0 | 0 | 0 | NA |
inner_product.hpp: 82 - 1.98%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
985 | 0.17 | 0.11 | 0.1 | 100 | 50 | 193.44 | 41 | 0.45 | 0.31 | 0.26 | 100 | 50 | 269.87 | 963 | 0.19 | 0.1 | 0.09 | 100 | 50 | 205.49 |
883 | 0.46 | 0.34 | 0.3 | 100 | 50 | 249.37 | 745 | 0.14 | 0.1 | 0.08 | 100 | 50 | 213.17 | 955 | 0.09 | 0.03 | 0.03 | 100 | 50 | 139.52 |
996 | 0.41 | 0.29 | 0.26 | 100 | 50 | 72.48 | 748 | 0.05 | 0.03 | 0.03 | 100 | 50 | 136.37 | 974 | 0.45 | 0.28 | 0.24 | 100 | 50 | 75.24 |
977 | 0.07 | 0.03 | 0.03 | 100 | 50 | 142.25 | 743 | 0.44 | 0.3 | 0.25 | 100 | 50 | 69.92 | 863 | 0.48 | 0.35 | 0.31 | 100 | 50 | 242.32 |
MultiBsplineRef.hpp: 276 - 1.76%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
877 | 0.58 | 0.43 | 0.38 | 100 | 50 | 266.46 | 691 | 1.95 | 1.26 | 1.05 | 0 | 12.5 | 103.32 | 851 | 0.5 | 0.37 | 0.33 | 100 | 50 | 309.6 |
BsplineFunctor.h: 291 - 1.63%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
332 | 0.76 | 0.58 | 0.52 | 86.96 | 44.57 | 0.46 | 489 | 0.35 | 0.26 | 0.22 | 0 | 9.38 | 0.09 | 317 | 0.73 | 0.63 | 0.55 | 0 | 9.94 | 0.04 |
557 | 0.48 | 0.37 | 0.31 | 0 | 9.38 | 0.06 | ||||||||||||||
466 | 0.07 | 0.04 | 0.03 | 0 | 9.38 | 0.11 |
TwoBodyJastrowRef.h: 324 - 1.03%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
363 | 0.44 | 0.36 | 0.32 | 100 | 50 | 264.79 | 495 | 0.59 | 0.46 | 0.39 | 0 | 12.5 | 205.03 | 350 | 0.45 | 0.36 | 0.32 | 100 | 50 | 263.64 |
inner_product.hpp: 211 - 0.62%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
962 | 0.31 | 0.15 | 0.14 | 85.71 | 41.07 | 0 | 711 | 0.42 | 0.26 | 0.22 | 33.33 | 16.67 | 0 | 940 | 0.31 | 0.16 | 0.14 | 33.33 | 16.67 | 0 |
961 | 0.28 | 0.14 | 0.12 | 85.71 | 41.07 | 0 |
BsplineFunctor.h: 246 - 0.25%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
391 | 0.16 | 0.09 | 0.08 | 100 | 46.88 | 657.15 | 566 | 0.14 | 0.08 | 0.07 | 100 | 48.46 | 764.95 | 375 | 0.18 | 0.11 | 0.1 | 55.26 | 30.26 | 559.07 |
TwoBodyJastrowRef.h: 381 - 0.18%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
381 | 0.05 | 0.02 | 0.02 | 100 | 50 | 213.98 | 463 | 0.13 | 0.07 | 0.06 | 100 | 50 | 176.44 | 364 | 0.06 | 0.02 | 0.02 | 100 | 50 | 213.41 |
383 | 0.05 | 0.02 | 0.02 | 100 | 50 | 214.61 | 366 | 0.05 | 0.02 | 0.02 | 100 | 50 | 214.23 | |||||||
379 | 0.05 | 0.02 | 0.02 | 100 | 50 | 210.26 | 362 | 0.05 | 0.02 | 0.02 | 100 | 50 | 205.21 |
OneBodyJastrowRef.h: 214 - 0.03%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
329 | 0.03 | 0.01 | 0.01 | 0 | 11.11 | 3 | 604 | 0.04 | 0.01 | 0.01 | 0 | 12.5 | 0.1 | 313 | 0.03 | 0.01 | 0.01 | 0 | 11.61 | 0.7 |
TwoBodyJastrowRef.h: 388 - 0.03%
Run orig | Run gcc_9 | Run icx_3 | ||||||||||||||||||
Loop Source Regions |
| Loop Source Regions |
| Loop Source Regions |
| |||||||||||||||
ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | Assembly Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
377 | 0.04 | 0.01 | 0.01 | 100 | 50 | 819.62 | 462 | 0.03 | 0.01 | 0.01 | 100 | 50 | 841.43 | 360 | 0.04 | 0.01 | 0.01 | 100 | 50 | 825.82 |