Name | Module | Coverage run_0 (%) | Max Time Over Threads run_0 (s) | Time w.r.t. Wall Time run_0 (s) | Nb Threads run_0 | Deviation (coverage) run_0 | Deviation (walltime) run_0 | Categories run_0 | GFLOPS run_0 | Compilation Options |
---|---|---|---|---|---|---|---|---|---|---|
►_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS... | exec | 84.56 | 54.51 | 53.54 | 96 | 1.06 | 2.39 | Exe (%): 100.00 | 69.44 | GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
►Loop 2044 - RangeSegment.hpp:120-120 - exec [...] | 0 | 0.01 | 0 | 30 | 0.01 | 0.00 | 0.00 | |||
►Loop 2046 - forall.hpp:59-59 - exec [...] | 0.06 | 0.11 | 0.04 | 96 | 0.04 | 0.02 | 111.79 | |||
►Loop 2045 - forall.hpp:59-59 - exec [...] | 84.5 | 54.46 | 53.5 | 96 | 1.07 | 2.39 | 69.41 | |||
○Loop 2047 - Scattering.cpp:91-95 - exec [...] | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | |||
►_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl2EEEEJNS2_3ForILl1ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS... | exec | 4.73 | 3.29 | 2.99 | 96 | 0.43 | 0.31 | Exe (%): 100.00 | 57.20 | GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
►Loop 1903 - RangeSegment.hpp:120-120 - exec [...] | 0 | 0.01 | 0 | 24 | 0.01 | 0.00 | 0.00 | |||
►Loop 1904 - forall.hpp:59-59 - exec [...] | 0 | 0 | 0 | 29 | 0.00 | 0.00 | 0.00 | |||
○Loop 1905 - forall.hpp:59-59 - exec [...] | 4.72 | 3.29 | 2.99 | 96 | 0.43 | 0.30 | 57.20 | |||
►_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl2EEEEJNS2_3ForILl1ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS... | exec | 4.5 | 3.21 | 2.85 | 96 | 0.48 | 0.35 | Exe (%): 100.00 | 60.32 | GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
►Loop 1830 - RangeSegment.hpp:120-120 - exec [...] | 0 | 0 | 0 | 12 | 0.00 | 0.00 | 0.00 | |||
►Loop 1831 - forall.hpp:59-59 - exec [...] | 0 | 0 | 0 | 29 | 0.00 | 0.00 | 0.00 | |||
○Loop 1832 - forall.hpp:59-59 - exec [...] | 4.5 | 3.21 | 2.85 | 96 | 0.48 | 0.35 | 60.31 | |||
○gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 2.94 | 2.57 | 1.86 | 96 | 1.42 | 0.78 | OMP (%): 100.00 | 0.00 | |
►_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS8_ILl4ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSH_E... | exec | 1.84 | 1.31 | 1.17 | 96 | 0.23 | 0.16 | Exe (%): 100.00 | 62.18 | GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
►Loop 2198 - RangeSegment.hpp:120-120 - exec [...] | 0 | 0.01 | 0 | 22 | 0.01 | 0.00 | 0.00 | |||
►Loop 2199 - forall.hpp:59-59 - exec [...] | 0.01 | 0.02 | 0.01 | 96 | 0.01 | 0.01 | 18.50 | |||
►Loop 2200 - forall.hpp:59-59 - exec [...] | 0.07 | 0.08 | 0.05 | 96 | 0.03 | 0.02 | 50.79 | |||
○Loop 2201 - forall.hpp:59-59 - exec [...] | 1.76 | 1.26 | 1.11 | 96 | 0.23 | 0.15 | 63.08 | |||
○gomp_barrier_wait_end | libgomp.so.1.0.0 | 0.79 | 0.56 | 0.5 | 96 | 0.12 | 0.07 | OMP (%): 100.00 | 0.00 | |
►_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS2_6LambdaILl0EJEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSF_EEENS5_5tupleIJNS_4impl4SpanINS_9... | exec | 0.47 | 0.35 | 0.3 | 96 | 0.07 | 0.04 | Exe (%): 100.00 | 39.87 | GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
►Loop 1977 - RangeSegment.hpp:120-120 - exec [...] | 0 | 0.01 | 0 | 11 | 0.01 | 0.00 | 0.00 | |||
○Loop 1978 - forall.hpp:59-59 - exec [...] | 0.47 | 0.35 | 0.3 | 96 | 0.07 | 0.04 | 39.87 | |||
○impi_pause | libmpi.so.12.0.0 | 0.1 | 4.57 | 0.06 | 2 | 4.24 | 2.68 | MPI (%): 100.00 | 0.00 | |
○__memset_avx512_unaligned_erms | libc.so.6 | 0.04 | 1.15 | 0.03 | 2 | 0.00 | 0.01 | Memory (%): 100.00 | 0.00 | |
○unknown_kernel_region | kernel | 0.02 | 0.04 | 0.01 | 90 | 0.01 | 0.01 | System (%): 97.66 OMP (%): 2.34 | 5.05 | |
○MPI_Testany | libmpi.so.12.0.0 | 0.01 | 0.34 | 0 | 1 | 0.00 | 0.00 | MPI (%): 100.00 | 0.00 | |
○MPL_gpu_cuda_init | libmpi.so.12.0.0 | 0.01 | 0.27 | 0 | 2 | 0.26 | 0.16 | MPI (%): 100.00 | 0.00 |