Run Neoverse V2 GCC O2 | Run Neoverse V2 GCC O3 | Run Neoverse V2 GCC Ofast | Run Neoverse V2 ACFL O2 | Run Neoverse V2 ACFL O3 | Run Neoverse V2 ACFL Ofast |
| | | | | | | | | | | |
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
-1 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | -1 | 0.00 | 0.00 | 0.00 | 13 | 0.00 | 0.00 | NA | -1 | 0.00 | 0.00 | 0.00 | 25 | 0.00 | 0.00 | NA | 1 | 0.23 | 0.00 | 0.02 | 45 | 0.31 | 0.00 | 0.00 | 1 | 0.18 | 0.00 | 0.01 | 36 | 0.21 | 0.00 | 0.00 | 1 | 0.19 | 0.00 | 0.02 | 33 | 0.31 | 0.00 | 0.00 |
10 | 7.15 | 0.11 | 0.12 | 95 | 1.87 | 0.02 | 42.12 | 280 | 59.05 | 0.78 | 0.97 | 95 | 9.88 | 0.10 | 0.00 | 11 | 3.35 | 0.05 | 0.09 | 95 | 1.72 | 0.02 | 39.39 | 529 | 3.31 | 0.05 | 0.09 | 96 | 1.46 | 0.01 | 0.00 | 529 | 3.00 | 0.05 | 0.10 | 95 | 1.51 | 0.02 | 0.00 | 529 | 3.14 | 0.04 | 0.07 | 96 | 1.56 | 0.01 | 0.00 |
280 | 62.22 | 0.93 | 1.14 | 95 | 10.12 | 0.10 | 0.00 | 276 | 0.64 | 0.01 | 0.03 | 35 | 0.59 | 0.01 | 0.00 | 280 | 43.76 | 0.67 | 0.87 | 96 | 6.60 | 0.10 | 0.00 | 1242 | 0.09 | 0.00 | 0.01 | 21 | 0.17 | 0.00 | 0.00 | 1242 | 0.08 | 0.00 | 0.01 | 20 | 0.10 | 0.00 | 0.00 | 1242 | 0.06 | 0.00 | 0.01 | 13 | 0.21 | 0.00 | 0.00 |
737 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | NA | 10 | 7.82 | 0.10 | 0.13 | 95 | 1.97 | 0.02 | 37.74 | 276 | 11.25 | 0.17 | 0.28 | 95 | 6.54 | 0.09 | 0.00 | 855 | 47.85 | 0.73 | 0.89 | 96 | 17.74 | 0.13 | 0.00 | 855 | 52.12 | 0.85 | 0.98 | 95 | 14.39 | 0.11 | 0.00 | 855 | 46.66 | 0.66 | 0.86 | 96 | 20.04 | 0.13 | 0.00 |
276 | 0.19 | 0.00 | 0.03 | 25 | 0.53 | 0.01 | 0.00 | | 65 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | -1 | 0.00 | 0.00 | 0.00 | 30 | 0.00 | 0.00 | NA | 15 | 6.46 | 0.10 | 0.13 | 93 | 1.88 | 0.02 | 43.42 | 321 | 0.00 | 0.00 | 0.01 | 1 | 0.00 | 0.00 | 0.00 |
-1 | 0.00 | 0.00 | 0.00 | 21 | 0.00 | 0.00 | NA | | | 71 | 1.65 | 0.03 | 0.05 | 95 | 0.96 | 0.01 | 0.00 | 71 | 1.56 | 0.03 | 0.05 | 95 | 0.95 | 0.01 | 0.00 | 71 | 1.39 | 0.02 | 0.04 | 87 | 0.98 | 0.01 | 0.00 |
| | | 15 | 4.65 | 0.07 | 0.09 | 96 | 1.81 | 0.02 | 40.79 | -1 | 0.00 | 0.00 | 0.00 | 24 | 0.00 | 0.00 | NA | 16 | 1.34 | 0.02 | 0.09 | 30 | 3.64 | 0.03 | 39.42 |
| | | -1 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00 | | -1 | 0.00 | 0.00 | 0.00 | 35 | 0.00 | 0.00 | NA |
Run Neoverse V2 GCC O2 | Run Neoverse V2 GCC O3 | Run Neoverse V2 GCC Ofast | Run Neoverse V2 ACFL O2 | Run Neoverse V2 ACFL O3 | Run Neoverse V2 ACFL Ofast |
| - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122
| | - /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 113-122
|
ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads(s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s |
9 | 30.44 | 0.45 | 0.81 | 96 | 13.40 | 0.19 | 654.33 | 9 | 32.49 | 0.43 | 0.66 | 96 | 12.34 | 0.14 | 437.95 | 10 | 41.64 | 0.63 | 0.96 | 96 | 10.73 | 0.17 | 410.89 | 13 | 42.21 | 0.64 | 1.13 | 96 | 20.81 | 0.34 | 591.92 | 13 | 36.60 | 0.59 | 1.15 | 96 | 18.53 | 0.29 | 614.32 | 14 | 47.21 | 0.67 | 1.20 | 96 | 24.29 | 0.34 | 583.06 |
Name | Module | Coverage (%) | Inclusive Time w.r.t. Wall Time(s) | Max Inc. Time over Threads(s) | Nb Threads | GFLOP/s | Deviation (coverage) | Deviation (time) |
Neoverse V2 GCC O2 | Neoverse V2 GCC O3 | Neoverse V2 GCC Ofast | Neoverse V2 ACFL O2 | Neoverse V2 ACFL O3 | Neoverse V2 ACFL Ofast | Neoverse V2 GCC O2 | Neoverse V2 GCC O3 | Neoverse V2 GCC Ofast | Neoverse V2 ACFL O2 | Neoverse V2 ACFL O3 | Neoverse V2 ACFL Ofast | Neoverse V2 GCC O2 | Neoverse V2 GCC O3 | Neoverse V2 GCC Ofast | Neoverse V2 ACFL O2 | Neoverse V2 ACFL O3 | Neoverse V2 ACFL Ofast | Neoverse V2 GCC O2 | Neoverse V2 GCC O3 | Neoverse V2 GCC Ofast | Neoverse V2 ACFL O2 | Neoverse V2 ACFL O3 | Neoverse V2 ACFL Ofast | Neoverse V2 GCC O2 | Neoverse V2 GCC O3 | Neoverse V2 GCC Ofast | Neoverse V2 ACFL O2 | Neoverse V2 ACFL O3 | Neoverse V2 ACFL Ofast | Neoverse V2 GCC O2 | Neoverse V2 GCC O3 | Neoverse V2 GCC Ofast | Neoverse V2 ACFL O2 | Neoverse V2 ACFL O3 | Neoverse V2 ACFL Ofast | Neoverse V2 GCC O2 | Neoverse V2 GCC O3 | Neoverse V2 GCC Ofast | Neoverse V2 ACFL O2 | Neoverse V2 ACFL O3 | Neoverse V2 ACFL Ofast |
gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 62.22 | 59.05 | 43.76 | NA | NA | NA | 0.93 | 0.78 | 0.67 | NA | NA | NA | 1.14 | 0.97 | 0.87 | NA | NA | NA | 95 | 95 | 96 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 10.12 | 9.88 | 6.60 | NA | NA | NA | 0.10 | 0.10 | 0.10 | NA | NA | NA |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | NA | NA | NA | 47.85 | 52.12 | 46.66 | NA | NA | NA | 0.73 | 0.85 | 0.66 | NA | NA | NA | 0.89 | 0.98 | 0.86 | NA | NA | NA | 96 | 95 | 96 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 17.74 | 14.39 | 20.04 | NA | NA | NA | 0.13 | 0.11 | 0.13 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined] | binary | NA | NA | NA | 42.21 | 36.60 | 47.21 | NA | NA | NA | 0.64 | 0.59 | 0.67 | NA | NA | NA | 1.13 | 1.15 | 1.20 | NA | NA | NA | 96 | 96 | 96 | NA | NA | NA | 591.92 | 614.32 | 583.06 | NA | NA | NA | 20.81 | 18.53 | 24.29 | NA | NA | NA | 0.34 | 0.29 | 0.34 |
k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | binary | 30.44 | 32.49 | 41.64 | NA | NA | NA | 0.45 | 0.43 | 0.63 | NA | NA | NA | 0.81 | 0.66 | 0.96 | NA | NA | NA | 96 | 96 | 96 | NA | NA | NA | 654.33 | 437.95 | 410.89 | NA | NA | NA | 13.40 | 12.34 | 10.73 | NA | NA | NA | 0.19 | 0.14 | 0.17 | NA | NA | NA |
k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.1] | binary | 7.15 | 7.82 | 3.35 | NA | NA | NA | 0.11 | 0.10 | 0.05 | NA | NA | NA | 0.12 | 0.13 | 0.09 | NA | NA | NA | 95 | 95 | 95 | NA | NA | NA | 42.12 | 37.74 | 39.39 | NA | NA | NA | 1.87 | 1.97 | 1.72 | NA | NA | NA | 0.02 | 0.02 | 0.02 | NA | NA | NA |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined.3] | binary | NA | NA | NA | 4.65 | 6.46 | 1.34 | NA | NA | NA | 0.07 | 0.10 | 0.02 | NA | NA | NA | 0.09 | 0.13 | 0.09 | NA | NA | NA | 96 | 93 | 30 | NA | NA | NA | 40.79 | 43.42 | 39.42 | NA | NA | NA | 1.81 | 1.88 | 3.64 | NA | NA | NA | 0.02 | 0.02 | 0.03 |
gomp_barrier_wait_end | libgomp.so.1.0.0 | 0.19 | 0.64 | 11.25 | NA | NA | NA | 0.00 | 0.01 | 0.17 | NA | NA | NA | 0.03 | 0.03 | 0.28 | NA | NA | NA | 25 | 35 | 95 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.53 | 0.59 | 6.54 | NA | NA | NA | 0.01 | 0.01 | 0.09 | NA | NA | NA |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | NA | NA | NA | 3.31 | 3.00 | 3.14 | NA | NA | NA | 0.05 | 0.05 | 0.04 | NA | NA | NA | 0.09 | 0.10 | 0.07 | NA | NA | NA | 96 | 95 | 96 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 1.46 | 1.51 | 1.56 | NA | NA | NA | 0.01 | 0.02 | 0.01 |
__sched_yield | libc.so.6 | NA | NA | NA | 1.65 | 1.56 | 1.39 | NA | NA | NA | 0.03 | 0.03 | 0.02 | NA | NA | NA | 0.05 | 0.05 | 0.04 | NA | NA | NA | 95 | 95 | 87 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.96 | 0.95 | 0.98 | NA | NA | NA | 0.01 | 0.01 | 0.01 |
@plt_start@ | libomp.so | NA | NA | NA | 0.23 | 0.18 | 0.19 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.02 | 0.01 | 0.02 | NA | NA | NA | 45 | 36 | 33 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.31 | 0.21 | 0.31 | NA | NA | NA | 0.00 | 0.00 | 0.00 |
__kmp_yield | libomp.so | NA | NA | NA | 0.09 | 0.08 | 0.06 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.01 | 0.01 | 0.01 | NA | NA | NA | 21 | 20 | 13 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.17 | 0.10 | 0.21 | NA | NA | NA | 0.00 | 0.00 | 0.00 |
unknown_function | binary | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 1 | NA | NA | 1 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA | 0.00 | NA | NA |
__kmpc_for_static_fini | libomp.so | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.01 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 |
do_lookup_x | ld-linux-aarch64.so.1 | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA |
unknown_kernel_region | kernel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 21 | 13 | 25 | 30 | 24 | 35 | NA | NA | NA | NA | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__aarch64_cas4_acq | libgomp.so.1.0.0 | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA |