| Run baseline | Run locus440 |
| | - /home/eoseret/3dtransp_code/permute3d_1.locus440.cpp: 98-104 |
| ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (coverage) | Deviation (time) | ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (coverage) | Deviation (time) |
| | | | | | | 13 | 84.65 | 11.20 | 38 | 13.32 | 1.79 |
| Run baseline | Run locus440 |
| | | |
| ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (coverage) | Deviation (time) | ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (coverage) | Deviation (time) |
| 2 | 0.00 | 0.00 | 16 | 0.01 | 0.00 | -1 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 2244 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 2 | 0.00 | 0.00 |
| 1903 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1138 | 0.01 | 0.00 | 5 | 0.03 | 0.00 |
| 1891 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | 1896 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 1059 | 4.34 | 2.51 | 21 | 17.19 | 9.85 | 1903 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 1063 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | 1059 | 0.38 | 0.05 | 13 | 1.39 | 0.18 |
| 1880 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1062 | 0.02 | 0.00 | 13 | 0.02 | 0.00 |
| 1086 | 1.22 | 0.71 | 38 | 0.26 | 0.15 | 1897 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 1960 | 0.00 | 0.00 | 11 | 0.00 | 0.00 | 1902 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 2690 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | 1086 | 0.40 | 0.05 | 38 | 0.35 | 0.05 |
| 1853 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1961 | 0.02 | 0.00 | 17 | 0.02 | 0.00 |
| 2196 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 2690 | 0.00 | 0.00 | 5 | 0.00 | 0.00 |
| 1042 | 45.93 | 26.53 | 37 | 12.63 | 7.20 | 1853 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 2715 | 0.03 | 0.02 | 36 | 0.01 | 0.01 | 2244 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 2194 | 0.00 | 0.00 | 5 | 0.00 | 0.00 | 1042 | 14.37 | 1.90 | 37 | 13.27 | 1.73 |
| 1084 | 0.00 | 0.00 | 12 | 0.00 | 0.00 | 2354 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 1062 | 0.00 | 0.00 | 15 | 0.01 | 0.00 | 2194 | 0.00 | 0.00 | 5 | 0.00 | 0.00 |
| 1883 | 0.00 | 0.00 | 5 | 0.00 | 0.00 | 1084 | 0.00 | 0.00 | 3 | 0.02 | 0.00 |
| 1048 | 0.00 | 0.00 | 15 | 0.01 | 0.00 | 1048 | 0.02 | 0.00 | 11 | 0.05 | 0.01 |
| 1138 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | 1883 | 0.01 | 0.00 | 7 | 0.01 | 0.00 |
| 22 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1063 | 0.01 | 0.00 | 6 | 0.00 | 0.00 |
| 12 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 2715 | 0.01 | 0.00 | 6 | 0.03 | 0.00 |
| 40 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 22 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 74 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 12 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 1943 | 0.05 | 0.03 | 37 | 0.02 | 0.01 | 476 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| 635 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 1943 | 0.02 | 0.00 | 14 | 0.03 | 0.00 |
| 1704 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | 40 | 0.00 | 0.00 | 1 | 0.00 | 0.00 |
| -1 | 0.19 | 0.11 | 38 | 0.04 | 0.02 | -1 | 0.07 | 0.01 | 35 | 0.04 | 0.01 |
| Run baseline | Run locus440 |
| - /home/eoseret/3dtransp_code/permute3d_1.omp.cpp: 89-93 | |
| - /usr/lib/gcc/x86_64-redhat-linux/11/../../../../include/c++/11/bits/stl_vector.h: 1046-1046 | |
| ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (coverage) | Deviation (time) | ASM Fct ID | Coverage (%) | Time (s) | Nb Threads | Deviation (coverage) | Deviation (time) |
| 13 | 48.21 | 27.85 | 38 | 10.63 | 6.19 | | | | | | |
| Name | Module | Coverage (%) baseline | Coverage (%) locus440 | Time (s) baseline | Time (s) locus440 | Nb Threads baseline | Nb Threads locus440 | Deviation (coverage) baseline | Deviation (coverage) locus440 | Deviation (time) baseline | Deviation (time) locus440 |
| main.DIR.OMP.PARALLEL.LOOP.2.split178 | binary | NA | 84.65 | NA | 11.20 | NA | 38 | NA | 13.32 | NA | 1.79 |
| bool _INTERNAL021345c1::__kmp_wait_template<kmp_flag_64<false, true>, true, false, true>(kmp_info*, kmp_flag_64<false, true>*, void*) | libiomp5.so | 45.93 | 14.37 | 26.53 | 1.90 | 37 | 37 | 12.63 | 13.27 | 7.20 | 1.73 |
| main.DIR.OMP.PARALLEL.LOOP.2.split145 | binary | 48.21 | NA | 27.85 | NA | 38 | NA | 10.63 | NA | 6.19 | NA |
| __kmp_hyper_barrier_gather | libiomp5.so | 4.34 | 0.38 | 2.51 | 0.05 | 21 | 13 | 17.19 | 1.39 | 9.85 | 0.18 |
| kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 1.22 | 0.40 | 0.71 | 0.05 | 38 | 38 | 0.26 | 0.35 | 0.15 | 0.05 |
| unknown_kernel_region | kernel | 0.19 | 0.07 | 0.11 | 0.01 | 38 | 35 | 0.04 | 0.04 | 0.02 | 0.01 |
| __GI___sched_yield | libc.so.6 | 0.05 | 0.02 | 0.03 | 0.00 | 37 | 14 | 0.02 | 0.03 | 0.01 | 0.00 |
| __kmp_yield | libiomp5.so | 0.03 | 0.01 | 0.02 | 0.00 | 36 | 6 | 0.01 | 0.03 | 0.01 | 0.00 |
| _INTERNAL021345c1::__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libiomp5.so | 0.00 | 0.02 | 0.00 | 0.00 | 15 | 11 | 0.01 | 0.05 | 0.00 | 0.01 |
| __kmp_join_barrier(int) | libiomp5.so | 0.00 | 0.02 | 0.00 | 0.00 | 15 | 13 | 0.01 | 0.02 | 0.00 | 0.00 |
| __kmpc_for_static_init_8 | libiomp5.so | NA | 0.02 | NA | 0.00 | NA | 17 | NA | 0.02 | NA | 0.00 |
| __kmp_launch_thread | libiomp5.so | 0.00 | 0.01 | 0.00 | 0.00 | 5 | 7 | 0.00 | 0.01 | 0.00 | 0.00 |
| __kmpc_for_static_fini | libiomp5.so | 0.00 | 0.01 | 0.00 | 0.00 | 4 | 5 | 0.00 | 0.03 | 0.00 | 0.00 |
| kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 12 | 3 | 0.00 | 0.02 | 0.00 | 0.00 |
| __kmp_fork_barrier(int, int) | libiomp5.so | 0.00 | 0.01 | 0.00 | 0.00 | 4 | 6 | 0.00 | 0.00 | 0.00 | 0.00 |
| .plt.sec@start | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 16 | 2 | 0.01 | 0.00 | 0.00 | 0.00 |
| __kmp_finish_implicit_task | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 5 | 5 | 0.00 | 0.00 | 0.00 | 0.00 |
| __kmp_invoke_microtask | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 5 | 0.00 | 0.00 | 0.00 | 0.00 |
| __kmpc_for_static_init_8u | libiomp5.so | 0.00 | NA | 0.00 | NA | 11 | NA | 0.00 | NA | 0.00 | NA |
| __kmp_join_call | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
| main | binary | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
| do_lookup_x | ld-linux-x86-64.so.2 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
| __kmp_get_global_thread_id_reg | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
| __kmp_init_implicit_task | libiomp5.so | 0.00 | 0.00 | 0.00 | 0.00 | 4 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
| __kmp_fork_call | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
| __intel_avx_rep_memset | binary | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 |
| __kmp_allocate_team | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
| __ompt_team_assign_id(kmp_team*, ompt_data_t) | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
| __kmp_internal_fork | libiomp5.so | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
| __GI___strtoul_internal | libc.so.6 | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
| unknown_function | [vdso] | NA | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 |
| __kmp_invoke_task_func | libiomp5.so | 0.00 | NA | 0.00 | NA | 2 | NA | 0.00 | NA | 0.00 | NA |
| _dl_relocate_object | ld-linux-x86-64.so.2 | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
| __kmp_task_team_setup | libiomp5.so | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
| __GI___clock_gettime | libc.so.6 | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
| __kmp_free_team | libiomp5.so | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |
| __vfscanf_internal | libc.so.6 | 0.00 | NA | 0.00 | NA | 1 | NA | 0.00 | NA | 0.00 | NA |