- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 114-123
Run | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s
Neoverse V1 GCC O2 | 9 | 63.75 | 1.42 | 1.75 | 64 | 7.98 | 0.16 | 24.00
Neoverse V1 GCC O3 | 9 | 68.66 | 1.40 | 1.87 | 64 | 8.83 | 0.16 | 25.94
Neoverse V1 GCC Ofast | 10 | 70.58 | 2.07 | 2.39 | 64 | 8.54 | 0.22 | 355.59
Neoverse V1 ACFL O2 | 13 | 70.87 | 1.56 | 1.71 | 64 | 6.96 | 0.14 | 23.96
Neoverse V1 ACFL O3 | 13 | 75.18 | 1.52 | 1.75 | 64 | 6.72 | 0.14 | 24.95
Neoverse V1 ACFL Ofast | 15 | 70.87 | 1.49 | 1.75 | 64 | 8.16 | 0.18 | 17.04
Run | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s
Neoverse V1 GCC O2 | 280 | 29.16 | 0.65 | 0.95 | 63 | 6.31 | 0.14 | 0.00
Neoverse V1 GCC O2 | 276 | 0.26 | 0.01 | 0.02 | 35 | 0.22 | 0.00 | 0.00
Neoverse V1 GCC O2 | -1 | 0.00 | 0.00 | 0.00 | 8 | 0.00 | 0.00 | NA
Neoverse V1 GCC O3 | -1 | 0.00 | 0.00 | 0.00 | 5 | 0.00 | 0.00 | NA
Neoverse V1 GCC O3 | 280 | 23.41 | 0.48 | 0.82 | 64 | 7.75 | 0.16 | 0.00
Neoverse V1 GCC O3 | 745 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 0.00
Neoverse V1 GCC O3 | 276 | 0.33 | 0.01 | 0.02 | 40 | 0.23 | 0.00 | 0.00
Neoverse V1 GCC O3 | 10 | 7.60 | 0.16 | 0.19 | 64 | 2.04 | 0.04 | 1.79
Neoverse V1 GCC Ofast | 280 | 21.87 | 0.64 | 1.30 | 64 | 7.71 | 0.22 | 0.00
Neoverse V1 GCC Ofast | 745 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00
Neoverse V1 GCC Ofast | 276 | 3.08 | 0.09 | 0.13 | 63 | 0.95 | 0.03 | 0.00
Neoverse V1 GCC Ofast | 11 | 4.47 | 0.13 | 0.20 | 63 | 2.42 | 0.07 | 27.64
Neoverse V1 GCC Ofast | -1 | 0.00 | 0.00 | 0.00 | 24 | 0.00 | 0.00 | NA
Neoverse V1 ACFL O2 | 1 | 0.07 | 0.00 | 0.01 | 16 | 0.09 | 0.00 | 0.00
Neoverse V1 ACFL O2 | 528 | 1.18 | 0.03 | 0.07 | 63 | 0.68 | 0.01 | 0.00
Neoverse V1 ACFL O2 | 1901 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00
Neoverse V1 ACFL O2 | 1241 | 0.05 | 0.00 | 0.01 | 12 | 0.11 | 0.00 | 0.00
Neoverse V1 ACFL O2 | 854 | 19.98 | 0.44 | 0.67 | 64 | 5.21 | 0.10 | 0.00
Neoverse V1 ACFL O2 | 837 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00
Neoverse V1 ACFL O2 | -1 | 0.00 | 0.00 | 0.00 | 9 | 0.00 | 0.00 | NA
Neoverse V1 ACFL O2 | 1137 | 1.05 | 0.02 | 0.05 | 63 | 0.55 | 0.01 | 0.00
Neoverse V1 ACFL O3 | 1 | 0.05 | 0.00 | 0.01 | 12 | 0.08 | 0.00 | 0.00
Neoverse V1 ACFL O3 | 528 | 0.95 | 0.02 | 0.05 | 62 | 0.56 | 0.01 | 0.00
Neoverse V1 ACFL O3 | 1241 | 0.04 | 0.00 | 0.01 | 8 | 0.09 | 0.00 | 0.00
Neoverse V1 ACFL O3 | 854 | 16.93 | 0.34 | 0.54 | 64 | 4.76 | 0.09 | 0.00
Neoverse V1 ACFL O3 | 320 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00
Neoverse V1 ACFL O3 | 1137 | 0.76 | 0.02 | 0.05 | 58 | 0.54 | 0.01 | 0.00
Neoverse V1 ACFL O3 | -1 | 0.00 | 0.00 | 0.00 | 10 | 0.00 | 0.00 | NA
Neoverse V1 ACFL Ofast | 1 | 0.05 | 0.00 | 0.01 | 12 | 0.02 | 0.00 | 0.00
Neoverse V1 ACFL Ofast | 528 | 0.99 | 0.02 | 0.05 | 62 | 0.56 | 0.01 | 0.00
Neoverse V1 ACFL Ofast | 1241 | 0.04 | 0.00 | 0.01 | 8 | 0.11 | 0.00 | 0.00
Neoverse V1 ACFL Ofast | 854 | 19.33 | 0.41 | 0.63 | 64 | 6.21 | 0.11 | 0.00
Neoverse V1 ACFL Ofast | 437 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 0.00
Neoverse V1 ACFL Ofast | 1137 | 0.83 | 0.02 | 0.04 | 59 | 0.53 | 0.01 | 0.00
Neoverse V1 ACFL Ofast | -1 | 0.00 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | NA
- /home/fmusial/KMEANS_Benchmarks/kmeans/main.cpp: 139-144
Run | ASM Fct ID | Coverage (%) | Inc Time w.r.t. Wall Time (s) | Max Inc. Time over Threads (s) | Nb Threads | Deviation (cov) | Deviation (tps) | GFLOP/s
Neoverse V1 GCC O2 | 10 | 6.83 | 0.15 | 0.19 | 63 | 1.88 | 0.04 | 2.39
Neoverse V1 ACFL O2 | 14 | 0.00 | 0.00 | 0.01 | 1 | 0.00 | 0.00 | 0.00
Neoverse V1 ACFL O2 | 15 | 6.79 | 0.15 | 0.20 | 64 | 2.30 | 0.05 | 0.93
Neoverse V1 ACFL O3 | 15 | 6.09 | 0.12 | 0.20 | 54 | 3.16 | 0.06 | 2.61
Neoverse V1 ACFL Ofast | 17 | 7.87 | 0.17 | 0.19 | 64 | 2.42 | 0.05 | 1.20
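Name | Module | Coverage (%) - Neoverse V1 GCC O2 | Coverage (%) - Neoverse V1 GCC O3 | Coverage (%) - Neoverse V1 GCC Ofast | Coverage (%) - Neoverse V1 ACFL O2 | Coverage (%) - Neoverse V1 ACFL O3 | Coverage (%) - Neoverse V1 ACFL Ofast | Inclusive Time w.r.t. Wall Time (s) - Neoverse V1 GCC O2 | Inclusive Time w.r.t. Wall Time (s) - Neoverse V1 GCC O3 | Inclusive Time w.r.t. Wall Time (s) - Neoverse V1 GCC Ofast | Inclusive Time w.r.t. Wall Time (s) - Neoverse V1 ACFL O2 | Inclusive Time w.r.t. Wall Time (s) - Neoverse V1 ACFL O3 | Inclusive Time w.r.t. Wall Time (s) - Neoverse V1 ACFL Ofast | Max Inc. Time over Threads (s) - Neoverse V1 GCC O2 | Max Inc. Time over Threads (s) - Neoverse V1 GCC O3 | Max Inc. Time over Threads (s) - Neoverse V1 GCC Ofast | Max Inc. Time over Threads (s) - Neoverse V1 ACFL O2 | Max Inc. Time over Threads (s) - Neoverse V1 ACFL O3 | Max Inc. Time over Threads (s) - Neoverse V1 ACFL Ofast | Nb Threads - Neoverse V1 GCC O2 | Nb Threads - Neoverse V1 GCC O3 | Nb Threads - Neoverse V1 GCC Ofast | Nb Threads - Neoverse V1 ACFL O2 | Nb Threads - Neoverse V1 ACFL O3 | Nb Threads - Neoverse V1 ACFL Ofast | GFLOP/s - Neoverse V1 GCC O2 | GFLOP/s - Neoverse V1 GCC O3 | GFLOP/s - Neoverse V1 GCC Ofast | GFLOP/s - Neoverse V1 ACFL O2 | GFLOP/s - Neoverse V1 ACFL O3 | GFLOP/s - Neoverse V1 ACFL Ofast | Deviation (coverage) - Neoverse V1 GCC O2 | Deviation (coverage) - Neoverse V1 GCC O3 | Deviation (coverage) - Neoverse V1 GCC Ofast | Deviation (coverage) - Neoverse V1 ACFL O2 | Deviation (coverage) - Neoverse V1 ACFL O3 | Deviation (coverage) - Neoverse V1 ACFL Ofast | Deviation (time) - Neoverse V1 GCC O2 | Deviation (time) - Neoverse V1 GCC O3 | Deviation (time) - Neoverse V1 GCC Ofast | Deviation (time) - Neoverse V1 ACFL O2 | Deviation (time) - Neoverse V1 ACFL O3 | Deviation (time) - Neoverse V1 ACFL Ofast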
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined] | binary | NA | NA | NA | 70.87 | 75.18 | 70.87 | NA | NA | NA | 1.56 | 1.52 | 1.49 | NA | NA | NA | 1.71 | 1.75 | 1.75 | NA | NA | NA | 64 | 64 | 64 | NA | NA | NA | 23.96 | 24.95 | 17.04 | NA | NA | NA | 6.96 | 6.72 | 8.16 | NA | NA | NA | 0.14 | 0.14 | 0.18 |
k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.0] | binary | 63.75 | 68.66 | 70.58 | NA | NA | NA | 1.42 | 1.40 | 2.07 | NA | NA | NA | 1.75 | 1.87 | 2.39 | NA | NA | NA | 64 | 64 | 64 | NA | NA | NA | 24.00 | 25.94 | 355.59 | NA | NA | NA | 7.98 | 8.83 | 8.54 | NA | NA | NA | 0.16 | 0.16 | 0.22 | NA | NA | NA |
gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 29.16 | 23.41 | 21.87 | NA | NA | NA | 0.65 | 0.48 | 0.64 | NA | NA | NA | 0.95 | 0.82 | 1.30 | NA | NA | NA | 63 | 64 | 64 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 6.31 | 7.75 | 7.71 | NA | NA | NA | 0.14 | 0.16 | 0.22 | NA | NA | NA |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | NA | NA | NA | 19.98 | 16.93 | 19.33 | NA | NA | NA | 0.44 | 0.34 | 0.41 | NA | NA | NA | 0.67 | 0.54 | 0.63 | NA | NA | NA | 64 | 64 | 64 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 5.21 | 4.76 | 6.21 | NA | NA | NA | 0.10 | 0.09 | 0.11 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined.3] | binary | NA | NA | NA | 6.79 | 6.09 | 7.87 | NA | NA | NA | 0.15 | 0.12 | 0.17 | NA | NA | NA | 0.20 | 0.20 | 0.19 | NA | NA | NA | 64 | 54 | 64 | NA | NA | NA | 0.93 | 2.61 | 1.20 | NA | NA | NA | 2.30 | 3.16 | 2.42 | NA | NA | NA | 0.05 | 0.06 | 0.05 |
k_means(int, point_t*, point_t*, int*, int, int) [clone ._omp_fn.1] | binary | 6.83 | 7.60 | 4.47 | NA | NA | NA | 0.15 | 0.16 | 0.13 | NA | NA | NA | 0.19 | 0.19 | 0.20 | NA | NA | NA | 63 | 64 | 63 | NA | NA | NA | 2.39 | 1.79 | 27.64 | NA | NA | NA | 1.88 | 2.04 | 2.42 | NA | NA | NA | 0.04 | 0.04 | 0.07 | NA | NA | NA |
gomp_barrier_wait_end | libgomp.so.1.0.0 | 0.26 | 0.33 | 3.08 | NA | NA | NA | 0.01 | 0.01 | 0.09 | NA | NA | NA | 0.02 | 0.02 | 0.13 | NA | NA | NA | 35 | 40 | 63 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.22 | 0.23 | 0.95 | NA | NA | NA | 0.00 | 0.00 | 0.03 | NA | NA | NA |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | NA | NA | NA | 1.18 | 0.95 | 0.99 | NA | NA | NA | 0.03 | 0.02 | 0.02 | NA | NA | NA | 0.07 | 0.05 | 0.05 | NA | NA | NA | 63 | 62 | 62 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.68 | 0.56 | 0.56 | NA | NA | NA | 0.01 | 0.01 | 0.01 |
__sched_yield | libc.so.6 | NA | NA | NA | 1.05 | 0.76 | 0.83 | NA | NA | NA | 0.02 | 0.02 | 0.02 | NA | NA | NA | 0.05 | 0.05 | 0.04 | NA | NA | NA | 63 | 58 | 59 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.55 | 0.54 | 0.53 | NA | NA | NA | 0.01 | 0.01 | 0.01 |
@plt_start@ | libomp.so | NA | NA | NA | 0.07 | 0.05 | 0.05 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.01 | 0.01 | 0.01 | NA | NA | NA | 16 | 12 | 12 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.09 | 0.08 | 0.02 | NA | NA | NA | 0.00 | 0.00 | 0.00 |
__kmp_yield | libomp.so | NA | NA | NA | 0.05 | 0.04 | 0.04 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.01 | 0.01 | 0.01 | NA | NA | NA | 12 | 8 | 8 | NA | NA | NA | 0.00 | 0.00 | 0.00 | NA | NA | NA | 0.11 | 0.09 | 0.11 | NA | NA | NA | 0.00 | 0.00 | 0.00 |
__aarch64_ldadd4_acq_rel | libgomp.so.1.0.0 | NA | 0.01 | 0.00 | NA | NA | NA | NA | 0.00 | 0.00 | NA | NA | NA | NA | 0.00 | 0.00 | NA | NA | NA | NA | 2 | 1 | NA | NA | NA | NA | 0.00 | 0.00 | NA | NA | NA | NA | 0.00 | 0.00 | NA | NA | NA | NA | 0.00 | 0.00 | NA | NA | NA |
__kmp_resume_if_soft_paused | libomp.so | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 |
__kmpc_for_static_fini | libomp.so | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA |
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA |
__aarch64_ldadd8_acq_rel | libomp.so | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined_debug__.2] [clone .omp] [clone .reduction] [clone .reduction_func] | binary | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.01 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA | NA | NA | NA | 0.00 | NA | NA |
unknown_kernel_region | kernel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 8 | 5 | 24 | 9 | 10 | 3 | NA | NA | NA | NA | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |