All runs: Neoverse V1, ACFL, Ofast, 250 iterations, 64 threads. Each metric is reported for four variants, in this order: Base, SoA, Manual Unroll, Manual Unroll + SoA.
Name | Module | Coverage (%): Base | Coverage (%): SoA | Coverage (%): Manual Unroll | Coverage (%): Manual Unroll + SoA | Inclusive Time w.r.t. Wall Time (s): Base | Inclusive Time w.r.t. Wall Time (s): SoA | Inclusive Time w.r.t. Wall Time (s): Manual Unroll | Inclusive Time w.r.t. Wall Time (s): Manual Unroll + SoA | Max Inc. Time over Threads (s): Base | Max Inc. Time over Threads (s): SoA | Max Inc. Time over Threads (s): Manual Unroll | Max Inc. Time over Threads (s): Manual Unroll + SoA | Nb Threads: Base | Nb Threads: SoA | Nb Threads: Manual Unroll | Nb Threads: Manual Unroll + SoA | GFLOP/s: Base | GFLOP/s: SoA | GFLOP/s: Manual Unroll | GFLOP/s: Manual Unroll + SoA | Deviation (coverage): Base | Deviation (coverage): SoA | Deviation (coverage): Manual Unroll | Deviation (coverage): Manual Unroll + SoA | Deviation (time): Base | Deviation (time): SoA | Deviation (time): Manual Unroll | Deviation (time): Manual Unroll + SoA |
k_means(int, point_t&, point_t&, int*, int, int) [clone .omp_outlined] | binary | NA | 71.09 | NA | 70.85 | NA | 13.49 | NA | 11.96 | NA | 15.92 | NA | 13.44 | NA | 64 | NA | 64 | NA | 1.86 | NA | 2.08 | NA | 6.36 | NA | 5.06 | NA | 1.06 | NA | 0.76 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined] | binary | 74.13 | NA | 66.55 | NA | 14.37 | NA | 10.80 | NA | 18.60 | NA | 12.53 | NA | 64 | NA | 64 | NA | 1.74 | NA | 2.26 | NA | 8.72 | NA | 6.32 | NA | 1.80 | NA | 0.92 | NA |
kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 17.77 | 20.23 | 23.21 | 20.24 | 3.44 | 3.84 | 3.77 | 3.42 | 5.53 | 4.18 | 4.05 | 3.67 | 64 | 64 | 64 | 63 | 0.00 | 0.00 | 0.00 | 0.00 | 5.55 | 2.99 | 3.06 | 1.36 | 0.99 | 0.56 | 0.50 | 0.22 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined.3] | binary | 6.09 | NA | 7.63 | NA | 1.18 | NA | 1.24 | NA | 1.75 | NA | 1.71 | NA | 64 | NA | 64 | NA | 1.53 | NA | 2.03 | NA | 3.81 | NA | 3.86 | NA | 0.71 | NA | 0.61 | NA |
k_means(int, point_t&, point_t&, int*, int, int) [clone .omp_outlined.3] | binary | NA | 6.39 | NA | 6.59 | NA | 1.21 | NA | 1.11 | NA | 1.77 | NA | 1.50 | NA | 64 | NA | 64 | NA | 1.31 | NA | 3.27 | NA | 3.64 | NA | 2.85 | NA | 0.68 | NA | 0.47 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 1.07 | 1.21 | 1.39 | 1.23 | 0.21 | 0.23 | 0.23 | 0.21 | 0.35 | 0.28 | 0.28 | 0.25 | 64 | 64 | 64 | 63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.39 | 0.20 | 0.20 | 0.13 | 0.07 | 0.04 | 0.03 | 0.02 |
__sched_yield | libc.so.6 | 0.85 | 0.97 | 1.11 | 0.97 | 0.16 | 0.18 | 0.18 | 0.16 | 0.27 | 0.22 | 0.22 | 0.22 | 64 | 64 | 64 | 63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.29 | 0.16 | 0.16 | 0.11 | 0.05 | 0.03 | 0.03 | 0.02 |
@plt_start@ | libomp.so | 0.06 | 0.07 | 0.07 | 0.07 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 63 | 63 | 63 | 63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_yield | libomp.so | 0.03 | 0.03 | 0.04 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 62 | 62 | 64 | 63 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmpc_for_static_fini | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5 | 3 | 10 | 5 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
unknown_function | binary | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 10 | 3 | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__aarch64_ldadd8_acq_rel | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 6 | 8 | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 4 | 5 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_invoke_task_func | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5 | 3 | 3 | 6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 7 | 5 | 4 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5 | 3 | 2 | 5 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_join_barrier(int) | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 7 | 3 | 4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmpc_for_static_init_4 | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 4 | 2 | 6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_finish_implicit_task | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 3 | 1 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmpc_reduce_nowait | libomp.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 3 | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
k_means(int, point_t&, point_t&, int*, int, int) [clone .omp_outlined_debug__.2] [clone .omp] [clone .reduction] [clone .reduction_func] | binary | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 4 | NA | 2 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_launch_thread | libomp.so | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 2 | NA | 2 | 2 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 |
k_means(int, point_t*, point_t*, int*, int, int) [clone .omp_outlined_debug__.2] [clone .omp] [clone .reduction] [clone .reduction_func] | binary | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 3 | NA | 2 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
__kmp_fork_barrier(int, int) | libomp.so | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 1 | 1 | 2 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 |
__kmp_barrier | libomp.so | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 1 | 3 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA |
__kmp_task_team_sync | libomp.so | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 1 | 1 | 1 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | NA |
__kmp_resume_if_soft_paused | libomp.so | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 2 | 1 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA |
__kmp_init_implicit_task | libomp.so | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 2 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA |
__kmp_join_call | libomp.so | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA |
getenv | libc.so.6 | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 1 | 1 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA | NA | 0.00 | 0.00 | NA |
__kmp_fork_call | libomp.so | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 1 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 | NA | 0.00 |
__kmp_determine_reduction_method | libomp.so | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 2 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA |
__memset | libastring.so | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 2 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA |
__aarch64_ldadd4_rel | libomp.so | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA |
__libc_free | libc.so.6 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 |
__kmp_alloc_argv_entries(int, kmp_team*, int) | libomp.so | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 |
__default_morecore | libc.so.6 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 |
__tunable_get_val | ld-linux-aarch64.so.1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 |
k_means(int, point_t*, point_t*, int*, int, int) | binary | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA |
__kmp_free_team | libomp.so | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA |
__kmp_task_team_setup | libomp.so | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 1 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA | NA | 0.00 | NA | NA |
unknown_kernel_region | kernel | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2 | 2 | 5 | 4 | NA | NA | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |