Help is available by moving the cursor above any  symbol or by checking MAQAO website.
 symbol or by checking MAQAO website.
  - r0: run_1_thread
- r1: run_2_threads
- r2: run_4_threads
- r3: run_8_threads
- r4: run_16_threads
- r5: run_32_threads
- r6: run_64_threads
- r7: run_96_threads
| Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | 
|---|
| Total Time (s) | 86.34 | 47.77 | 23.48 | 12.42 | 6.63 | 3.42 | 1.76 | 2.16 | 
| Max (Thread Active Time) (s) | 78.72 | 43.31 | 21.49 | 11.36 | 6.06 | 3.13 | 1.34 | 1.07 | 
| Average Active Time (s) | 78.72 | 43.20 | 21.30 | 11.20 | 5.92 | 3.06 | 1.31 | 1.00 | 
| Activity Ratio (%) | 91.2 | 90.4 | 90.8 | 90.2 | 89.5 | 89.5 | 74.7 | 46.7 | 
| Average number of active threads | 0.912 | 1.809 | 3.630 | 7.216 | 14.303 | 28.560 | 47.502 | 44.543 | 
| Affinity Stability (%) | 99.6 | 99.5 | 99.5 | 99.4 | 99.2 | 98.7 | 97.7 | 97.6 | 
| GFLOPS | 4.236 | 8.198 | 16.073 | 30.920 | 59.380 | 120.819 | 214.349 | 105.928 | 
| Time in analyzed loops (%) | 100.0 | 97.6 | 95.0 | 91.5 | 88.4 | 90.2 | 93.4 | 48.1 | 
| Time in analyzed innermost loops (%) | 98.9 | 96.3 | 93.9 | 90.4 | 87.4 | 89.1 | 92.4 | 47.6 | 
| Time in user code (%) | 100 | 97.6 | 95.0 | 91.5 | 88.4 | 90.2 | 93.4 | 48.1 | 
| Compilation Options Score (%) | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 
| Array Access Efficiency (%) | 99.7 | 99.7 | 99.7 | 99.7 | 99.7 | 99.7 | 99.7 | 99.7 | 
|  | 
| Potential Speedups |  | 
| Perfect Flow Complexity | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 
| Perfect OpenMP/MPI/Pthread/TBB | 1.00 | 1.02 | 1.00 | 1.01 | 1.02 | 1.00 | 1.02 | 1.89 | 
| Perfect OpenMP/MPI/Pthread/TBB + Perfect Load Distribution | 1.00 | 1.03 | 1.06 | 1.11 | 1.16 | 1.14 | 1.10 | 2.22 | 
| Scalability - Gap | 1.00 | 1.11 | 1.09 | 1.15 | 1.23 | 1.27 | 1.31 | 2.40 | 
| No Scalar Integer | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 
| Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 
| FP Vectorised | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 
| Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 
| Fully Vectorised | Potential Speedup | 1.18 | 1.17 | 1.17 | 1.16 | 1.15 | 1.15 | 1.15 | 1.07 | 
| Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 
| Only FP Arithmetic | Potential Speedup | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 
| Nb Loops to get 80% | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
| Source Object | Issue | 
|---|
| ▼kmeans-acfl-O3-all– |  | 
| ▼main.cpp– |  | 
| ○ |  | 
 
 
 
|  | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | 
|---|
| Experiment Name | K-Means scalability acfl-O3-all 100000000 | K-Means scalability acfl-O3-all 100000000 | K-Means scalability acfl-O3-all 100000000 | K-Means scalability acfl-O3-all 100000000 | K-Means scalability acfl-O3-all 100000000 | K-Means scalability acfl-O3-all 100000000 | K-Means scalability acfl-O3-all 100000000 | K-Means scalability acfl-O3-all 100000000 | 
|---|
| Application | ./kmeans/kmeans-acfl-O3-all | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Timestamp | 2025-07-17 11:10:29 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Experiment Type | Sequential | OpenMP; | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | same as r1 | 
|---|
| Machine | ip-172-31-47-249.ec2.internal | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Architecture | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Micro Architecture | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Model Name |  |  |  |  |  |  |  |  | 
|---|
| Cache Size |  |  |  |  |  |  |  |  | 
|---|
| Number of Cores |  |  |  |  |  |  |  |  | 
|---|
| Maximal Frequency | 0 GHz | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| OS Version | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Architecture used during static analysis | aarch64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Micro Architecture used during static analysis | ARM_NEOVERSE_V2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Compilation Options | kmeans-acfl-O3-all: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-loops -ffast-math -grecord-command-line -c -o main.o main.cpp | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Number of threads observed | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 
|---|
| Frequency Driver | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Frequency Governor | NA | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Huge Pages | madvise | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Hyperthreading | off | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Number of sockets | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Number of cores per socket | 96 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| MAQAO version | 2025.1.1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| MAQAO build | 2302fb4b01f3b07cccb215042f4e5c7e9fcc3718::20250717-122749 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|
| Comments | AWS Graviton 4 (Neoverse V2) CPU, 1-96 threads runs | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | 
|---|