Run baseline | Run locus440 |
| | | - /home/eoseret/3dtransp_code/permute3d_1.locus440.cpp: 98-104
|
ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (cov) | Deviation (tps) | ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (cov) | Deviation (tps) |
| 13 | 84.65 | 11.20 | 38 | 13.32 | 1.79 |
Run baseline | Run locus440 |
| | | |
ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (cov) | Deviation (tps) | ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (cov) | Deviation (tps) |
2 | 0.00 | 0.00 | 16 | 0.01 | 0.00 | -1 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
2244 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 2 | 0.00 | 0.00 |
1903 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1138 | 0.01 | 0.00 | 5 | 0.03 | 0.00 |
1891 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 1896 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
1059 | 4.34 | 2.51 | 21 | 17.19 | 9.85 | 1903 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
1063 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | 1059 | 0.38 | 0.05 | 13 | 1.39 | 0.18 |
1880 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1062 | 0.02 | 0.00 | 13 | 0.02 | 0.00 |
1086 | 1.22 | 0.71 | 38 | 0.26 | 0.15 | 1897 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
1960 | 0.00 | 0.00 | 11 | 0.00 | 0.00 | 1902 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
2690 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | 1086 | 0.40 | 0.05 | 38 | 0.35 | 0.05 |
1853 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1961 | 0.02 | 0.00 | 17 | 0.02 | 0.00 |
2196 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 2690 | 0.00 | 0.00 | 5 | 0.00 | 0.00 |
1042 | 45.93 | 26.53 | 37 | 12.63 | 7.20 | 1853 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
2715 | 0.03 | 0.02 | 36 | 0.01 | 0.01 | 2244 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
2194 | 0.00 | 0.00 | 5 | 0.00 | 0.00 | 1042 | 14.37 | 1.90 | 37 | 13.27 | 1.73 |
1084 | 0.00 | 0.00 | 12 | 0.00 | 0.00 | 2354 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
1062 | 0.00 | 0.00 | 15 | 0.01 | 0.00 | 2194 | 0.00 | 0.00 | 5 | 0.00 | 0.00 |
1883 | 0.00 | 0.00 | 5 | 0.00 | 0.00 | 1084 | 0.00 | 0.00 | 3 | 0.02 | 0.00 |
1048 | 0.00 | 0.00 | 15 | 0.01 | 0.00 | 1048 | 0.02 | 0.00 | 11 | 0.05 | 0.01 |
1138 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | 1883 | 0.01 | 0.00 | 7 | 0.01 | 0.00 |
22 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1063 | 0.01 | 0.00 | 6 | 0.00 | 0.00 |
12 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 2715 | 0.01 | 0.00 | 6 | 0.03 | 0.00 |
40 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 22 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
74 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 12 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
1943 | 0.05 | 0.03 | 37 | 0.02 | 0.01 | 476 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
635 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1943 | 0.02 | 0.00 | 14 | 0.03 | 0.00 |
1704 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 40 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
-1 | 0.19 | 0.11 | 38 | 0.04 | 0.02 | -1 | 0.07 | 0.01 | 35 | 0.04 | 0.01 |
Run baseline | Run locus440 |
| - /home/eoseret/3dtransp_code/permute3d_1.omp.cpp: 89-93
- /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_vector.h: 1046-1046
| | |
ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (cov) | Deviation (tps) | ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (cov) | Deviation (tps) |
13 | 48.21 | 27.85 | 38 | 10.63 | 6.19 | |
Name | Module | Coverage (%) | Time (s) | Nb Threads | Deviation (coverage) | Deviation (time) |
baseline | locus440 | baseline | locus440 | baseline | locus440 | baseline | locus440 | baseline | locus440 |
main.DIR.OMP.PARALLEL.LOOP.2.split178 | binary | NA | 84.65 | NA | 11.20 | NA | 38 | NA | 13.32 | NA | 1.79 |
bool _INTERNAL021345c1::__kmp_wait_template<kmp_flag_64<false, true>, true, false, true>(kmp_info*, kmp_flag_64<false, true>*, void*) | libiomp5.so | 45.93 | 14.37 | 26.53 | 1.90 | 37 | 37 | 12.63 | 13.27 | 7.20 | 1.73 |
main.DIR.OMP.PARALLEL.LOOP.2.split145 | binary | 48.21 | NA | 27.85 | NA | 38 | NA | 10.63 | NA | 6.19 | NA |
__kmp_hyper_barrier_gather | libiomp5.so | 4.34 | 0.38 | 2.51 | 0.05 | 21 | 13 | 17.19 | 1.39 | 9.85 | 0.18 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 1.22 | 0.40 | 0.71 | 0.05 | 38 | 38 | 0.26 | 0.35 | 0.15 | 0.05 |
unknown_kernel_region | kernel | 0.19 | 0.07 | 0.11 | 0.01 | 38 | 35 | 0.04 | 0.04 | 0.02 | 0.01 |
__GI___sched_yield | libc.so.6 | 0.05 | 0.02 | 0.03 | 0.00 | 37 | 14 | 0.02 | 0.03 | 0.01 | 0.00 |
__kmp_yield | libiomp5.so | 0.03 | 0.01 | 0.02 | 0.00 | 36 | 6 | 0.01 | 0.03 | 0.01 | 0.00 |
_INTERNAL021345c1::__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libiomp5.so | 0.00 | 0.02 | 0.00 | 0.00 | 15 | 11 | 0.01 | 0.05 | 0.00 | 0.01 |
__kmp_join_barrier(int) | libiomp5.so | 0.00 | 0.02 | 0.00 | 0.00 | 15 | 13 | 0.01 | 0.02 | 0.00 | 0.00 |
__kmpc_for_static_init_8 | libiomp5.so | NA | 0.02 | NA | 0.00 | NA | 17 | NA | 0.02 | NA | 0.00 |
__kmp_launch_thread | libiomp5.so | 0.00 | 0.01 | 0.00 | 0.00 | 5 | 7 | 0.00 | 0.01 | 0.00 | 0.00 |
__kmpc_for_static_fini | libiomp5.so | 0.00 | 0.01 | 0.00 | 0.00 | 4 | 5 | 0.00 | 0.03 | 0.00 | 0.00 |
kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 12 | 3 | 0.00 | 0.02 | 0.00 | 0.00 |
__kmp_fork_barrier(int, int) | libiomp5.so | 0.00 | 0.01 | 0.00 | 0.00 | 4 | 6 | 0.00 | 0.00 | 0.00 | 0.00 |
.plt.sec@start | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 16 | 2 | 0.01 | 0.00 | 0.00 | 0.00 |
__kmp_finish_implicit_task | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 5 | 5 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_invoke_microtask | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 5 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmpc_for_static_init_8u | libiomp5.so | 0.00 | NA | 0.00 | NA | 11 | NA | 0.00 | NA | 0.00 | NA |
__kmp_join_call | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
main | binary | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
do_lookup_x | ld-linux-x86-64.so.2 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_get_global_thread_id_reg | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_init_implicit_task | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_fork_call | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
__intel_avx_rep_memset | binary | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
__kmp_allocate_team | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
__ompt_team_assign_id(kmp_team*, ompt_data_t) | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
__kmp_internal_fork | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
__GI___strtoul_internal | libc.so.6 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
unknown_function | [vdso] | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
__kmp_invoke_task_func | libiomp5.so | 0.00 | NA | 0.00 | NA | 2 | NA | 0.00 | NA | 0.00 | NA |
_dl_relocate_object | ld-linux-x86-64.so.2 | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
__kmp_task_team_setup | libiomp5.so | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
__GI___clock_gettime | libc.so.6 | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
__kmp_free_team | libiomp5.so | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
__vfscanf_internal | libc.so.6 | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |