Name | Module | Coverage run_0 (%) | Max Time Over Threads run_0 (s) | Time w.r.t. Wall Time run_0 (s) | Nb Threads run_0 | Deviation (coverage) run_0 | Deviation (walltime) run_0 | Categories run_0 | GFLOPS run_0 | Compilation Options |
►.omp_outlined..5.120+ | exec | 30.75 | 4.17 | 3.48 | 192 | 3.43 | 0.39 | Exe (%): 100.00 | 919.09 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
►Loop 106 - ljForce.c:173-216 - exec [...]+ | | 0.02 | 0.01 | 0 | 180 | 0.03 | 0.00 | | 0.00 | |
►Loop 107 - ljForce.c:178-216 - exec [...]+ | | 0.1 | 0.03 | 0.01 | 192 | 0.07 | 0.01 | | 1544.52 | |
►Loop 108 - ljForce.c:187-216 - exec [...]+ | | 0.41 | 0.1 | 0.05 | 192 | 0.14 | 0.02 | | 1106.76 | |
○Loop 109 - ljForce.c:191-216 - exec [...] | | 30.22 | 4.12 | 3.42 | 192 | 3.37 | 0.38 | | 914.33 | |
○Loop 105 - ljForce.c:172-172 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
○__kmp_hardware_timestamp | libomp.so | 30.23 | 3.88 | 3.42 | 192 | 3.36 | 0.36 | OMP (%): 100.00 | 0.00 | |
○_ZL27__kmp_hyper_barrier_release12barrier_typeP8kmp_infoiiiPv | libomp.so | 30 | 4.01 | 3.4 | 190 | 2.59 | 0.28 | OMP (%): 100.00 | 0.00 | |
►sortAtomsInCell+ | exec | 2.31 | 0.47 | 0.26 | 192 | 1.21 | 0.13 | Exe (%): 100.00 | 21.32 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
○Loop 66 - haloExchange.c:621-630 - exec | | 2.15 | 0.45 | 0.24 | 192 | 1.15 | 0.13 | | 21.78 | |
○Loop 67 - haloExchange.c:633-642 - exec | | 0.04 | 0.03 | 0.01 | 98 | 0.07 | 0.01 | | 4.50 | |
○Loop 68 - haloExchange.c:633-642 - exec | | 0.04 | 0.02 | 0 | 118 | 0.04 | 0.00 | | 0.00 | |
○Loop 69 - haloExchange.c:633-642 - exec | | 0.03 | 0.03 | 0 | 98 | 0.06 | 0.01 | | 0.00 | |
►.omp_outlined..119+ | exec | 2.08 | 0.38 | 0.24 | 192 | 0.85 | 0.09 | Exe (%): 100.00 | 7.24 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
○Loop 104 - ljForce.c:158-161 - exec [...] | | 2.08 | 0.38 | 0.24 | 192 | 0.85 | 0.09 | | 7.24 | |
►.omp_outlined..244+ | exec | 1.22 | 0.25 | 0.14 | 192 | 0.61 | 0.07 | Exe (%): 100.00 | 32.56 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
►Loop 120 - timestep.c:72-78 - exec+ | | 0.01 | 0.01 | 0 | 119 | 0.02 | 0.00 | | 0.00 | |
○Loop 122 - timestep.c:74-78 - exec | | 1.16 | 0.24 | 0.13 | 192 | 0.58 | 0.06 | | 30.63 | |
○Loop 121 - timestep.c:74-78 - exec | | 0.04 | 0.03 | 0 | 162 | 0.05 | 0.01 | | 0.00 | |
○Loop 123 - timestep.c:74-78 - exec | | 0.02 | 0.01 | 0 | 112 | 0.03 | 0.00 | | 0.00 | |
○_ZL26__kmp_hyper_barrier_gather12barrier_typeP8kmp_infoiiPFvPvS2_ES2_ | libomp.so | 1.07 | 1.03 | 0.12 | 73 | 1.99 | 0.22 | OMP (%): 100.00 | 0.05 | |
►.omp_outlined..2+ | exec | 0.85 | 0.18 | 0.1 | 192 | 0.34 | 0.04 | Exe (%): 100.00 | 52.51 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
►Loop 124 - timestep.c:86-94 - exec+ | | 0 | 0.01 | 0 | 44 | 0.02 | 0.00 | | 0.00 | |
○Loop 125 - timestep.c:88-94 - exec | | 0.85 | 0.17 | 0.1 | 192 | 0.34 | 0.04 | | 51.94 | |
○__GI___sched_yield | libc.so.6 | 0.56 | 0.11 | 0.06 | 192 | 0.17 | 0.02 | System (%): 100.00 | 0.00 | |
►updateLinkCells+ | exec | 0.29 | 3.52 | 0.03 | 2 | 6.09 | 0.68 | Exe (%): 100.00 | 220.92 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
►Loop 98 - linkCells.c:291-378 - exec [...]+ | | 0 | 0.02 | 0 | 2 | 0.09 | 0.01 | | 0.00 | |
○Loop 99 - linkCells.c:295-378 - exec [...] | | 0.29 | 3.5 | 0.03 | 2 | 6.09 | 0.68 | | 219.25 | |
○Loop 100 - linkCells.c:384-385 - exec | | 0 | 0.01 | 0 | 1 | 0.00 | 0.00 | | 0.00 | |
○unknown_function | Unknown module | 0.18 | 0.34 | 0.02 | 192 | 0.22 | 0.03 | Others (%): 100.00 | 188.13 | |
►unloadAtomsBuffer+ | exec | 0.11 | 1.22 | 0.01 | 2 | 1.27 | 0.14 | Exe (%): 100.00 | 161.00 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
○Loop 52 - linkCells.c:178-378 - exec [...] | | 0.11 | 1.22 | 0.01 | 2 | 1.27 | 0.14 | | 161.00 | |
►.omp_outlined..4+ | exec | 0.08 | 0.02 | 0.01 | 189 | 0.05 | 0.01 | Exe (%): 100.00 | 44.00 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
○Loop 117 - timestep.c:107-107 - exec | | 0 | 0 | 0 | 0 | 0.00 | 0.00 | | 0.00 | |
►Loop 118 - timestep.c:108-116 - exec+ | | 0 | 0 | 0 | 2 | 0.00 | 0.00 | | 0.00 | |
○Loop 119 - timestep.c:110-116 - exec | | 0.08 | 0.02 | 0.01 | 189 | 0.05 | 0.01 | | 43.75 | |
○getBoxFromTuple | exec | 0.05 | 0.51 | 0.01 | 2 | 0.38 | 0.04 | Exe (%): 100.00 | 347.63 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
○__kmp_yield | libomp.so | 0.05 | 0.03 | 0.01 | 124 | 0.04 | 0.00 | OMP (%): 100.00 | 0.00 | |
○sortAtomsById | exec | 0.05 | 0.02 | 0.01 | 179 | 0.05 | 0.01 | Exe (%): 100.00 | 42.88 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
►loadAtomsBuffer+ | exec | 0.03 | 0.4 | 0 | 2 | 0.73 | 0.08 | Exe (%): 100.00 | 0.00 | AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /beegfs/hackathon/users/eoseret/qaas_runs/170-850-7424/intel/CoMD/build/CoMD/CoMD/src-openmp -I . -D DO_MPI -O2 -march=znver4 -flto=full -g -grecord-... |
►Loop 50 - haloExchange.c:376-389 - exec+ | | 0 | 0.01 | 0 | 2 | 0.03 | 0.00 | | 0.00 | |
○Loop 51 - haloExchange.c:380-389 - exec | | 0.03 | 0.39 | 0 | 2 | 0.69 | 0.08 | | 0.00 | |
○ucp_worker_progress | libucp.so.0.0.0 | 0.01 | 0.1 | 0 | 2 | 0.59 | 0.07 | Others (%): 100.00 | 0.00 | |
○MPID_Progress_wait | libmpi.so.12.0.0 | 0.01 | 0.24 | 0 | 2 | 1.50 | 0.17 | MPI (%): 100.00 | 0.00 | |
○ofi_cq_readfrom | libmlx-fi.so | 0.01 | 0.1 | 0 | 1 | 0.00 | 0.00 | Others (%): 100.00 | 0.00 | |
○MPIDI_OFI_progress | libmpi.so.12.0.0 | 0.01 | 0.16 | 0 | 1 | 0.00 | 0.00 | MPI (%): 100.00 | 0.00 | |
○uct_iface_mpool_empty_warn | libuct.so.0.0.0 | 0.01 | 0.23 | 0 | 2 | 1.37 | 0.16 | Others (%): 100.00 | 0.00 | |