options

Functions and Loops

13 loops and 4 functions have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 0% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis

Colums Filter

Coverage 2x1 (%) Coverage 2x2 (%) Coverage 2x4 (%) Coverage 2x8 (%) Coverage 2x16 (%) Coverage 2x32 (%) Coverage 2x48 (%) Max Time Over Threads 2x1 (s) Max Time Over Threads 2x2 (s) Max Time Over Threads 2x4 (s) Max Time Over Threads 2x8 (s) Max Time Over Threads 2x16 (s) Max Time Over Threads 2x32 (s) Max Time Over Threads 2x48 (s) Time w.r.t. Wall Time 2x1 (s) Time w.r.t. Wall Time 2x2 (s) Time w.r.t. Wall Time 2x4 (s) Time w.r.t. Wall Time 2x8 (s) Time w.r.t. Wall Time 2x16 (s) Time w.r.t. Wall Time 2x32 (s) Time w.r.t. Wall Time 2x48 (s) Nb Threads 2x1 Nb Threads 2x2 Nb Threads 2x4 Nb Threads 2x8 Nb Threads 2x16 Nb Threads 2x32 Nb Threads 2x48 Deviation (coverage) 2x1 Deviation (coverage) 2x2 Deviation (coverage) 2x4 Deviation (coverage) 2x8 Deviation (coverage) 2x16 Deviation (coverage) 2x32 Deviation (coverage) 2x48 Deviation (walltime) 2x1 Deviation (walltime) 2x2 Deviation (walltime) 2x4 Deviation (walltime) 2x8 Deviation (walltime) 2x16 Deviation (walltime) 2x32 Deviation (walltime) 2x48 Categories 2x1 Categories 2x2 Categories 2x4 Categories 2x8 Categories 2x16 Categories 2x32 Categories 2x48 GFLOPS 2x1 GFLOPS 2x2 GFLOPS 2x4 GFLOPS 2x8 GFLOPS 2x16 GFLOPS 2x32 GFLOPS 2x48 Compilation Options (2x1) Efficiency (2x1) Potential Speed-Up (%) (2x2) Efficiency (2x2) Potential Speed-Up (%) (2x4) Efficiency (2x4) Potential Speed-Up (%) (2x8) Efficiency (2x8) Potential Speed-Up (%) (2x16) Efficiency (2x16) Potential Speed-Up (%) (2x32) Efficiency (2x32) Potential Speed-Up (%) (2x48) Efficiency (2x48) Potential Speed-Up (%)
NameModuleCoverage 2x1 (%)Coverage 2x2 (%)Coverage 2x4 (%)Coverage 2x8 (%)Coverage 2x16 (%)Coverage 2x32 (%)Coverage 2x48 (%)Max Time Over Threads 2x1 (s)Max Time Over Threads 2x2 (s)Max Time Over Threads 2x4 (s)Max Time Over Threads 2x8 (s)Max Time Over Threads 2x16 (s)Max Time Over Threads 2x32 (s)Max Time Over Threads 2x48 (s)Time w.r.t. Wall Time 2x1 (s)Time w.r.t. Wall Time 2x2 (s)Time w.r.t. Wall Time 2x4 (s)Time w.r.t. Wall Time 2x8 (s)Time w.r.t. Wall Time 2x16 (s)Time w.r.t. Wall Time 2x32 (s)Time w.r.t. Wall Time 2x48 (s)Nb Threads 2x1Nb Threads 2x2Nb Threads 2x4Nb Threads 2x8Nb Threads 2x16Nb Threads 2x32Nb Threads 2x48Deviation (coverage) 2x1Deviation (coverage) 2x2Deviation (coverage) 2x4Deviation (coverage) 2x8Deviation (coverage) 2x16Deviation (coverage) 2x32Deviation (coverage) 2x48Deviation (walltime) 2x1Deviation (walltime) 2x2Deviation (walltime) 2x4Deviation (walltime) 2x8Deviation (walltime) 2x16Deviation (walltime) 2x32Deviation (walltime) 2x48Categories 2x1Categories 2x2Categories 2x4Categories 2x8Categories 2x16Categories 2x32Categories 2x48GFLOPS 2x1GFLOPS 2x2GFLOPS 2x4GFLOPS 2x8GFLOPS 2x16GFLOPS 2x32GFLOPS 2x48Compilation Options(2x1) Efficiency(2x1) Potential Speed-Up (%)(2x2) Efficiency(2x2) Potential Speed-Up (%)(2x4) Efficiency(2x4) Potential Speed-Up (%)(2x8) Efficiency(2x8) Potential Speed-Up (%)(2x16) Efficiency(2x16) Potential Speed-Up (%)(2x32) Efficiency(2x32) Potential Speed-Up (%)(2x48) Efficiency(2x48) Potential Speed-Up (%)
_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS...+exec89.0386.8586.4286.0785.5886.684.51922.25464.6241.34133.6273.8953.7653.88921.87465.42242.83133.6773.1153.2653.15248163264960.050.530.840.780.961.180.931.711.512.244.394.132.842.43Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.004.809.5218.2433.1360.5883.1683.33GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp 100.990.840.954.40.8611.870.7918.140.5439.760.3653.97
Loop 2044 - RangeSegment.hpp:120-120 - exec [...]+00000000.040.030.020.020.010.010.010.040.020.010.01000247152316230.000.000.000.000.000.010.000.010.010.010.000.000.000.000.290.700.650.850.000.000.001010100.50101010
Loop 2046 - forall.hpp:59-59 - exec [...]+0.20.170.160.120.070.060.062.170.950.550.240.130.120.142.040.910.460.190.060.040.04248163264960.020.010.020.030.040.040.040.200.050.050.040.030.020.025.0710.5919.6136.4076.1492.89119.00101.1201.1101.3402.1301.5901.06-0
Loop 2045 - forall.hpp:59-59 - exec [...]+88.8386.6786.2585.9585.5186.5484.45920.33463.65240.91133.4573.8853.7453.85919.8464.49242.36133.4773.0453.2253.11248163264960.070.530.840.780.991.210.931.921.492.284.414.152.852.424.809.5118.2433.1360.5883.1583.31100.990.860.954.420.8611.910.7918.210.5439.80.3653.98
Loop 2047 - Scattering.cpp:91-95 - exec [...]00000000000000000000000000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl2EEEEJNS2_3ForILl1ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS...+exec4.665.165.45.886.14.794.4848.427.8215.019.35.032.963.2448.2627.6815.179.135.212.942.82248163264960.030.060.080.320.290.320.470.250.290.110.430.110.190.35Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.004.177.2713.2722.0538.6468.4971.39GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp 100.870.660.81.110.661.990.582.570.512.330.362.88
Loop 1830 - RangeSegment.hpp:120-120 - exec [...]+00000000000000.010000000248911570.000.000.000.000.000.000.010.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 1831 - forall.hpp:59-59 - exec [...]+00000000.010.020.010000.010.010.0100000248161418230.000.000.000.000.000.000.000.000.010.000.000.000.000.003.954.500.000.000.000.000.00
Loop 1832 - forall.hpp:59-59 - exec [...]4.665.165.45.886.14.794.4848.3927.8115.019.35.032.963.2448.2527.6615.169.135.212.942.82248163264960.030.050.080.320.290.320.470.250.290.110.430.110.190.354.177.2813.2822.0538.6468.4871.38100.870.660.81.10.6620.582.570.512.330.362.88
_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl2EEEEJNS2_3ForILl1ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS...+exec4.555.535.525.295.3354.7547.5430.3915.768.314.513.133.3747.1329.6115.58.224.563.072.99248163264960.070.130.200.240.290.290.410.650.670.430.270.130.110.31Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.004.276.8012.9924.4944.1565.5767.32GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp 100.81.130.761.320.721.50.651.890.482.60.333.19
Loop 1903 - RangeSegment.hpp:120-120 - exec [...]+00000000.0100.0100000.0100000024581711390.000.000.000.000.000.000.000.000.000.000.000.000.000.003.850.000.000.000.000.000.00
Loop 1904 - forall.hpp:59-59 - exec [...]+000000000.0100.01000.0100.0100000248141835460.000.000.000.000.000.000.000.000.010.000.000.000.000.000.002.400.000.000.000.000.00
Loop 1905 - forall.hpp:59-59 - exec [...]4.555.525.515.295.3354.7547.5230.3815.768.314.513.133.3747.1129.6115.58.214.553.072.98248163264960.070.130.200.240.290.280.410.640.660.430.270.130.110.314.276.8012.9924.5244.2465.5667.53100.81.130.761.320.721.50.651.880.482.60.333.19
_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS8_ILl4ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSH_E...+exec1.341.541.531.441.351.121.8513.948.524.52.291.20.761.3313.898.244.292.231.150.691.16248163264960.010.080.070.070.090.090.220.080.360.160.070.060.050.16Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.006.3810.7520.6539.7277.02128.3776.38GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp 100.840.240.810.290.780.320.750.330.630.420.251.39
Loop 2198 - RangeSegment.hpp:120-120 - exec [...]+00000000.01000.01000.010.01000000247121216470.000.000.000.000.000.000.010.010.000.000.000.000.000.000.250.000.000.000.000.000.00
Loop 2199 - forall.hpp:59-59 - exec [...]+00.010.010.010.010.010.010.030.050.030.030.010.020.030.030.040.020.01000.01248163264960.000.000.000.000.010.010.010.000.010.010.010.000.000.019.126.2312.4323.600.000.0021.45100.380.010.380.010.380.0110100.060.01
Loop 2200 - forall.hpp:59-59 - exec [...]+0.030.050.050.040.040.040.070.350.310.160.090.060.050.080.330.250.130.070.030.020.04248163264960.000.010.010.010.020.020.030.040.060.030.020.020.010.026.639.8518.7535.0283.05127.0064.29100.660.020.630.020.590.020.690.010.520.020.170.06
Loop 2201 - forall.hpp:59-59 - exec [...]1.311.481.481.391.31.071.7713.548.174.332.241.180.721.313.537.954.152.161.110.661.11248163264960.010.060.060.070.090.090.220.040.300.140.070.050.050.156.3710.8020.6939.7677.32129.9677.29100.850.220.820.270.780.30.760.310.640.380.251.32
_ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS2_6LambdaILl0EJEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSF_EEENS5_5tupleIJNS_4impl4SpanINS_9...+exec0.190.220.230.220.210.490.471.951.230.660.350.190.330.351.921.170.640.340.180.30.3248163264960.000.010.010.020.020.050.070.040.060.020.020.010.030.04Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.006.2910.3218.8735.5167.1140.3240.09GNU C++14 13.2.0 -march=sapphirerapids -mprefer-vector-width=512 -g -O2 -std=c++14 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp 100.820.040.750.060.710.060.670.070.20.390.130.41
Loop 1977 - RangeSegment.hpp:120-120 - exec [...]+00000000000000.0100000002452117120.000.000.000.000.000.000.010.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 1978 - forall.hpp:59-59 - exec [...]0.190.220.230.220.210.490.471.951.220.660.350.190.330.351.921.160.640.340.180.30.3248163264960.000.010.010.020.020.050.070.040.050.020.020.010.030.046.2910.4118.8735.5167.1140.2940.09100.830.040.750.060.710.060.670.070.20.390.130.41
__memset_avx512_unaligned_ermslibc.so.60.10.10.10.090.080.060.041.121.091.131.11.091.11.151.090.530.280.140.070.040.0322222220.000.010.010.040.070.050.040.040.050.040.010.040.020.03Memory (%): 100.00Memory (%): 100.00Memory (%): 100.00Memory (%): 100.00Memory (%): 100.00Memory (%): 100.00Memory (%): 100.000.000.000.000.000.000.000.00101.03-00.9700.9700.9700.850.010.760.01
impi_pauselibmpi.so.12.0.00.090.360.340.170.050.10.081.235.615.662.650.742.843.680.891.920.950.270.040.060.0522222220.050.470.970.510.172.173.300.472.532.740.850.131.272.08MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.000.000.000.000.000.000.000.00100.230.280.230.260.410.11.3900.460.050.370.05
unknown_kernel_regionkernel0.010.010.010.010.010.010.020.130.070.050.020.050.040.040.120.050.030.020.010.010.01248163160930.000.000.010.000.010.010.010.010.020.010.010.010.010.01System (%): 97.96
MPI (%): 2.04
System (%): 97.44
OMP (%): 2.56
System (%): 100.00System (%): 100.00System (%): 100.00System (%): 100.00System (%): 97.24
OMP (%): 2.76
0.320.761.102.103.004.255.85101.2-0100.7500.7500.380.010.250.01
MPL_gpu_cuda_initlibmpi.so.12.0.00.010.020.020.010000.080.260.340.130.040.120.170.050.090.060.0200022222220.000.020.050.020.010.090.150.040.100.140.030.010.050.09MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.00MPI (%): 100.000.000.000.000.000.000.000.00100.280.010.210.020.310.01101010
gomp_barrier_wait_endlibgomp.so.1.0.000.040.10.210.450.690.800.410.410.420.430.50.5500.20.270.330.390.430.5047163264960.000.040.050.080.110.140.130.000.220.140.120.090.080.07NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.0010101010101010
gomp_team_barrier_wait_endlibgomp.so.1.0.000.130.290.570.811.122.9701.241.131.31.061.392.600.680.80.890.70.691.87048163264960.000.090.080.170.270.411.440.000.500.240.250.200.220.79NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.000.000.0010101010101010
×