ID | Module | Source Location | Source Function | Level | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Coverage (% app. time) | Speedup if no scalar integer | Speedup if FP arith vectorized | Speedup if fully vectorized | Speedup if FP only | Number of paths | Vectorization Ratio (%) | Vector Length Use (%) | Flops (GFLOP/s) | CQA cycles | CQA cycles if no scalar integer | CQA cycles if FP arith vectorized | CQA cycles if fully vectorized | CQA cycles if FP only |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Loop 109 | exec | ljForce.c:191-191,ljForce.c:197-216 | .omp_outlined..5.120 | Innermost | 4.12 | 3.42 | 30.22 | 1.02 | 1.71 | 4.32 | 1.24 | 3 | 30.51 | 16.31 | 914.33 | 4.83 | 4.72 | 2.83 | 1.12 | 3.89 |
Loop 66 | exec | haloExchange.c:621-630 | sortAtomsInCell | Single | 0.45 | 0.24 | 2.15 | 1.33 | 1.00 | 6.10 | 5.33 | 1 | 0.00 | 10.94 | 21.78 | 5.33 | 4.00 | 5.33 | 0.88 | 1.00 |
Loop 104 | exec | mytype.h:23-23,ljForce.c:158-161 | .omp_outlined..119 | Single | 0.38 | 0.24 | 2.08 | 1.00 | 1.00 | 8.00 | 1.00 | 1 | 33.33 | 12.50 | 7.24 | 1.00 | 1.00 | 1.00 | 0.13 | 1.00 |
Loop 122 | exec | timestep.c:74-78 | .omp_outlined..244 | Innermost | 0.24 | 0.13 | 1.16 | 1.00 | 1.00 | 1.02 | 6.17 | 1 | 97.30 | 97.64 | 30.63 | 18.50 | 18.50 | 18.50 | 18.06 | 3.00 |
Loop 125 | exec | timestep.c:88-94 | .omp_outlined..2 | Innermost | 0.17 | 0.1 | 0.85 | 1.00 | 1.13 | 4.00 | 1.00 | 1 | 0.00 | 12.50 | 51.94 | 5.00 | 5.00 | 4.42 | 1.25 | 5.00 |
Loop 108 | exec | ljForce.c:187-187,ljForce.c:191-191 | .omp_outlined..5.120 | InBetween | 0.1 | 0.05 | 0.41 | 1.00 | 1.00 | 5.67 | 2.13 | 4 | 0.00 | 12.50 | 1106.76 | 2.13 | 2.13 | 2.13 | 0.38 | 1.00 |
Loop 99 | exec | linkCells.c:295-301,linkCells.c:352-352,linkCells.c:359-365,linkCells.c:371-371,linkCells.c:378-378 | updateLinkCells | Innermost | 3.5 | 0.03 | 0.29 | 1.55 | 1.86 | 5.53 | 4.47 | 4 | 44.51 | 16.57 | 219.25 | 7.08 | 4.58 | 3.81 | 1.28 | 1.58 |
Loop 52 | exec | haloExchange.c:414-418,haloExchange.c:424-424,linkCells.c:178-194,linkCells.c:352-352,linkCells.c:359-365,linkCells.c:371-371,linkCells.c:378-378 | unloadAtomsBuffer | Single | 1.22 | 0.01 | 0.11 | 1.80 | 1.30 | 4.08 | 9.03 | 4 | 44.59 | 16.55 | 161.00 | 14.67 | 8.17 | 11.30 | 3.59 | 1.63 |
Loop 107 | exec | ljForce.c:178-184,ljForce.c:187-187 | .omp_outlined..5.120 | InBetween | 0.03 | 0.01 | 0.1 | 1.00 | 1.00 | 5.87 | 2.44 | 5 | 0.00 | 11.25 | 1544.52 | 3.67 | 3.67 | 3.67 | 0.63 | 1.50 |
Loop 119 | exec | timestep.c:110-116 | .omp_outlined..4 | Innermost | 0.02 | 0.01 | 0.08 | 1.00 | 2.86 | 4.00 | 1.00 | 1 | 0.00 | 12.50 | 43.75 | 5.00 | 5.00 | 1.75 | 1.25 | 5.00 |
Loop 67 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.03 | 0.01 | 0.04 | 1.33 | 1.00 | 6.10 | 5.33 | 1 | 0.00 | 10.94 | 4.50 | 5.33 | 4.00 | 5.33 | 0.88 | 1.00 |
Loop 121 | exec | timestep.c:74-78 | .omp_outlined..244 | Innermost | 0.03 | 0 | 0.04 | 1.00 | 1.00 | 4.57 | 2.67 | 1 | 0.00 | 12.50 | NA | 4.00 | 4.00 | 4.00 | 0.88 | 1.50 |
Loop 68 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.02 | 0 | 0.04 | 1.00 | 1.00 | 1.49 | 17.00 | 1 | 65.96 | 61.70 | NA | 17.00 | 17.00 | 17.00 | 11.38 | 1.00 |
Loop 51 | exec | haloExchange.c:380-389 | loadAtomsBuffer | Innermost | 0.39 | 0 | 0.03 | 1.31 | 1.17 | 5.91 | 3.78 | 1 | 0.00 | 11.25 | NA | 5.67 | 4.33 | 4.83 | 0.96 | 1.50 |
Loop 69 | exec | haloExchange.c:633-642 | sortAtomsInCell | Single | 0.03 | 0 | 0.03 | 1.00 | 1.00 | 2.06 | 6.00 | 1 | 66.67 | 34.72 | NA | 6.00 | 6.00 | 6.00 | 2.92 | 1.00 |
Loop 123 | exec | timestep.c:74-78 | .omp_outlined..244 | Innermost | 0.01 | 0 | 0.02 | 1.00 | 1.89 | 1.03 | 5.67 | 1 | 95.83 | 81.77 | NA | 8.50 | 8.50 | 4.50 | 8.25 | 1.50 |
Loop 106 | exec | ljForce.c:173-175,ljForce.c:178-178 | .omp_outlined..5.120 | Outermost | 0.01 | 0 | 0.02 | 1.00 | 1.00 | 5.67 | 3.09 | 2 | 0.00 | 10.21 | NA | 2.83 | 2.83 | 2.83 | 0.50 | 0.92 |
Loop 120 | exec | timestep.c:72-74,timestep.c:78-78 | .omp_outlined..244 | Outermost | 0.01 | 0 | 0.01 | 1.00 | 1.00 | 12.07 | 3.19 | 10 | 0.00 | 10.23 | NA | 14.33 | 14.33 | 14.33 | 1.19 | 4.50 |
Loop 73 | exec | initAtoms.c:91-100,linkCells.c:178-193,linkCells.c:352-352,linkCells.c:359-365,linkCells.c:371-371,linkCells.c:378-378 | createFccLattice | Innermost | 0.04 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 126 | exec | timestep.c:153-154 | .omp_outlined..6.245 | Single | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 124 | exec | timestep.c:86-88 | .omp_outlined..2 | Outermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 79 | exec | initAtoms.c:221-228 | .omp_outlined..3 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 98 | exec | linkCells.c:291-291,linkCells.c:295-295 | updateLinkCells | Outermost | 0.02 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 81 | exec | initAtoms.c:126-133 | .omp_outlined..84 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 50 | exec | haloExchange.c:376-380 | loadAtomsBuffer | Outermost | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 100 | exec | linkCells.c:384-385 | updateLinkCells | Single | 0.01 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 92 | exec | random.c:45-48,random.c:67-70,initAtoms.c:197-202 | .omp_outlined..9.90 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 86 | exec | random.c:29-31,random.c:45-46,random.c:67-70,initAtoms.c:154-162 | .omp_outlined..5 | InBetween | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 65 | exec | memUtils.h:13-13,memUtils.h:28-28,parallel.c:99-104,haloExchange.c:249-249,haloExchange.c:283-302,performanceTimers.c:94-102,performanceTimers.c:247-248 | haloExchange | Single | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 85 | exec | random.c:26-29,random.c:45-48 | .omp_outlined..5 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 89 | exec | initAtoms.c:177-181 | .omp_outlined..7 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 84 | exec | random.c:26-29,random.c:45-48 | .omp_outlined..5 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 118 | exec | timestep.c:108-110 | .omp_outlined..4 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 83 | exec | random.c:26-29,random.c:45-48 | .omp_outlined..5 | Innermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
Loop 91 | exec | initAtoms.c:195-199 | .omp_outlined..9.90 | Outermost | 0 | 0 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
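
In the rows above, each "Speedup if ..." value appears to equal "CQA cycles" divided by the matching "CQA cycles if ..." value (for Loop 109, 4.83 / 1.12 is roughly the reported 4.32). That relationship is inferred from the numbers in this table, not taken from the tool's documentation, and small differences are expected because the cycle counts are rounded to two decimals. The sketch below recomputes the loop-level speedup for Loop 109 and then uses Amdahl's law with the "Coverage (% app. time)" column to estimate a rough upper bound on the whole-application gain. The function names `loop_speedup` and `app_speedup` are hypothetical helpers introduced for illustration.

```python
def loop_speedup(cqa_cycles: float, cqa_cycles_if: float) -> float:
    """Projected loop speedup, assuming speedup = baseline cycles / projected cycles.

    This relation is inferred from the table rows, not from tool documentation.
    """
    return cqa_cycles / cqa_cycles_if


def app_speedup(coverage_pct: float, speedup: float) -> float:
    """Amdahl's law: whole-application speedup when a fraction of the run time
    (coverage_pct, in percent) is accelerated by the given factor."""
    fraction = coverage_pct / 100.0
    return 1.0 / ((1.0 - fraction) + fraction / speedup)


# Values copied from the Loop 109 row (ljForce.c:191-191,ljForce.c:197-216).
loop109 = {
    "coverage_pct": 30.22,                 # Coverage (% app. time)
    "cqa_cycles": 4.83,                    # CQA cycles
    "cqa_cycles_fully_vectorized": 1.12,   # CQA cycles if fully vectorized
    "reported_speedup_fully_vectorized": 4.32,
}

recomputed = loop_speedup(
    loop109["cqa_cycles"], loop109["cqa_cycles_fully_vectorized"]
)
whole_app = app_speedup(
    loop109["coverage_pct"], loop109["reported_speedup_fully_vectorized"]
)

print(f"Loop 109 recomputed speedup if fully vectorized: {recomputed:.2f}")
print(f"Estimated whole-application speedup (Amdahl):    {whole_app:.2f}")
```

Under these assumptions, fully vectorizing Loop 109 would speed up the whole run by about 1.3x at most, even though the loop-level projection is above 4x. The CQA figures are static estimates, so the measured gain may well be lower.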