| Run G++ O3 + Funroll | Run Clang++ O3 + Ffastmath | Run ICPX O3 + More Aggressive Flags | 
| Loop Source Regions | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 61-67
 | Loop Source Regions | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 61-67
 | Loop Source Regions | /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 61-67
 | 
| ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | ASM Loop ID | Max Time Over Threads (s) | Time w.r.t. Wall Time (s) | Cov (%) | Vect. Ratio (%) | Vector Length Use (%) | GFLOP/s | 
|---|
| 1 | 5.57 | 13.45 | 80.37 | 14.29 | 13.39 | 43.61 | 9 | 6.71 | 10.09 | 56.05 | 55 | 18.59 | 60.26 | 18 | 5.25 | 8.32 | 50.59 | 57.89 | 18.86 | 74.22 | 
|  |  |  | 
| Sum on 1 analyzed binary loop (kmeans-gcc-O3-funroll - 1) | Sum on 1 analyzed binary loop (kmeans-clang-O3-ffast-math - 9) | Sum on 1 analyzed binary loop (kmeans-icpx-O3-aggressive - 18) | 
| Analysis | Count | Analysis | Count | Analysis | Count | 
|---|
| Loop Computation Issues |  | Loop Computation Issues |  | Loop Computation Issues |  | 
| Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA |  | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | Less than 10% of the FP ADD/SUB/MUL arithmetic operations are performed using FMA | 1 | 
| Presence of a large number of scalar integer instructions |  | Presence of a large number of scalar integer instructions | 1 | Presence of a large number of scalar integer instructions | 0 | 
| Control Flow Issues |  | Control Flow Issues |  | Control Flow Issues |  | 
| Presence of 2 to 4 paths | 0 | Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 0 | 
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 0 | Presence of more than 4 paths | 1 | 
| Data Access Issues |  | Data Access Issues |  | Data Access Issues |  | 
| Presence of special instructions executing on a single port |  | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 | 
| Vectorization Roadblocks |  | Vectorization Roadblocks |  | Vectorization Roadblocks |  | 
| Presence of 2 to 4 paths | 0 | Presence of 2 to 4 paths | 1 | Presence of 2 to 4 paths | 0 | 
| Presence of more than 4 paths | 1 | Presence of more than 4 paths | 0 | Presence of more than 4 paths | 1 | 
| Inefficient Vectorization |  | Inefficient Vectorization |  | Inefficient Vectorization |  | 
| Presence of special instructions executing on a single port |  | Presence of special instructions executing on a single port | 1 | Presence of special instructions executing on a single port | 1 |