options

Loops Index

7 loops have been discarded from the report because their coverage is lower than the threshold set by object_coverage_threshold (0.01%). It represents about 0.01% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis

Columns Filter

Level Exclusive Coverage tbb_1 (%) Exclusive Coverage tbb_2 (%) Exclusive Coverage tbb_4 (%) Exclusive Coverage tbb_8 (%) Exclusive Coverage tbb_16 (%) Exclusive Coverage tbb_32 (%) Exclusive Coverage tbb_64 (%) Exclusive Coverage tbb_128 (%) Inclusive Coverage tbb_1 (%) Inclusive Coverage tbb_2 (%) Inclusive Coverage tbb_4 (%) Inclusive Coverage tbb_8 (%) Inclusive Coverage tbb_16 (%) Inclusive Coverage tbb_32 (%) Inclusive Coverage tbb_64 (%) Inclusive Coverage tbb_128 (%) Max Exclusive Time Over Threads tbb_1 (s) Max Exclusive Time Over Threads tbb_2 (s) Max Exclusive Time Over Threads tbb_4 (s) Max Exclusive Time Over Threads tbb_8 (s) Max Exclusive Time Over Threads tbb_16 (s) Max Exclusive Time Over Threads tbb_32 (s) Max Exclusive Time Over Threads tbb_64 (s) Max Exclusive Time Over Threads tbb_128 (s) Max Inclusive Time Over Threads tbb_1 (s) Max Inclusive Time Over Threads tbb_2 (s) Max Inclusive Time Over Threads tbb_4 (s) Max Inclusive Time Over Threads tbb_8 (s) Max Inclusive Time Over Threads tbb_16 (s) Max Inclusive Time Over Threads tbb_32 (s) Max Inclusive Time Over Threads tbb_64 (s) Max Inclusive Time Over Threads tbb_128 (s) Exclusive Time w.r.t. Wall Time tbb_1 (s) Exclusive Time w.r.t. Wall Time tbb_2 (s) Exclusive Time w.r.t. Wall Time tbb_4 (s) Exclusive Time w.r.t. Wall Time tbb_8 (s) Exclusive Time w.r.t. Wall Time tbb_16 (s) Exclusive Time w.r.t. Wall Time tbb_32 (s) Exclusive Time w.r.t. Wall Time tbb_64 (s) Exclusive Time w.r.t. Wall Time tbb_128 (s) Inclusive Time w.r.t. Wall Time tbb_1 (s) Inclusive Time w.r.t. Wall Time tbb_2 (s) Inclusive Time w.r.t. Wall Time tbb_4 (s) Inclusive Time w.r.t. Wall Time tbb_8 (s) Inclusive Time w.r.t. Wall Time tbb_16 (s) Inclusive Time w.r.t. Wall Time tbb_32 (s) Inclusive Time w.r.t. Wall Time tbb_64 (s) Inclusive Time w.r.t. Wall Time tbb_128 (s) Nb Threads tbb_1 Nb Threads tbb_2 Nb Threads tbb_4 Nb Threads tbb_8 Nb Threads tbb_16 Nb Threads tbb_32 Nb Threads tbb_64 Nb Threads tbb_128 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing tbb_1 Speedup If Perfect Load Balancing tbb_2 Speedup If Perfect Load Balancing tbb_4 Speedup If Perfect Load Balancing tbb_8 Speedup If Perfect Load Balancing tbb_16 Speedup If Perfect Load Balancing tbb_32 Speedup If Perfect Load Balancing tbb_64 Speedup If Perfect Load Balancing tbb_128 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency (tbb_1) Efficiency (tbb_1) Potential Speed-Up (%) (tbb_2) Efficiency (tbb_2) Potential Speed-Up (%) (tbb_4) Efficiency (tbb_4) Potential Speed-Up (%) (tbb_8) Efficiency (tbb_8) Potential Speed-Up (%) (tbb_16) Efficiency (tbb_16) Potential Speed-Up (%) (tbb_32) Efficiency (tbb_32) Potential Speed-Up (%) (tbb_64) Efficiency (tbb_64) Potential Speed-Up (%) (tbb_128) Efficiency (tbb_128) Potential Speed-Up (%)
Loop idSource LocationSource FunctionLevelExclusive Coverage tbb_1 (%)Exclusive Coverage tbb_2 (%)Exclusive Coverage tbb_4 (%)Exclusive Coverage tbb_8 (%)Exclusive Coverage tbb_16 (%)Exclusive Coverage tbb_32 (%)Exclusive Coverage tbb_64 (%)Exclusive Coverage tbb_128 (%)Inclusive Coverage tbb_1 (%)Inclusive Coverage tbb_2 (%)Inclusive Coverage tbb_4 (%)Inclusive Coverage tbb_8 (%)Inclusive Coverage tbb_16 (%)Inclusive Coverage tbb_32 (%)Inclusive Coverage tbb_64 (%)Inclusive Coverage tbb_128 (%)Max Exclusive Time Over Threads tbb_1 (s)Max Exclusive Time Over Threads tbb_2 (s)Max Exclusive Time Over Threads tbb_4 (s)Max Exclusive Time Over Threads tbb_8 (s)Max Exclusive Time Over Threads tbb_16 (s)Max Exclusive Time Over Threads tbb_32 (s)Max Exclusive Time Over Threads tbb_64 (s)Max Exclusive Time Over Threads tbb_128 (s)Max Inclusive Time Over Threads tbb_1 (s)Max Inclusive Time Over Threads tbb_2 (s)Max Inclusive Time Over Threads tbb_4 (s)Max Inclusive Time Over Threads tbb_8 (s)Max Inclusive Time Over Threads tbb_16 (s)Max Inclusive Time Over Threads tbb_32 (s)Max Inclusive Time Over Threads tbb_64 (s)Max Inclusive Time Over Threads tbb_128 (s)Exclusive Time w.r.t. Wall Time tbb_1 (s)Exclusive Time w.r.t. Wall Time tbb_2 (s)Exclusive Time w.r.t. Wall Time tbb_4 (s)Exclusive Time w.r.t. Wall Time tbb_8 (s)Exclusive Time w.r.t. Wall Time tbb_16 (s)Exclusive Time w.r.t. Wall Time tbb_32 (s)Exclusive Time w.r.t. Wall Time tbb_64 (s)Exclusive Time w.r.t. Wall Time tbb_128 (s)Inclusive Time w.r.t. Wall Time tbb_1 (s)Inclusive Time w.r.t. Wall Time tbb_2 (s)Inclusive Time w.r.t. Wall Time tbb_4 (s)Inclusive Time w.r.t. Wall Time tbb_8 (s)Inclusive Time w.r.t. Wall Time tbb_16 (s)Inclusive Time w.r.t. Wall Time tbb_32 (s)Inclusive Time w.r.t. Wall Time tbb_64 (s)Inclusive Time w.r.t. Wall Time tbb_128 (s)Nb Threads tbb_1Nb Threads tbb_2Nb Threads tbb_4Nb Threads tbb_8Nb Threads tbb_16Nb Threads tbb_32Nb Threads tbb_64Nb Threads tbb_128Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing tbb_1Speedup If Perfect Load Balancing tbb_2Speedup If Perfect Load Balancing tbb_4Speedup If Perfect Load Balancing tbb_8Speedup If Perfect Load Balancing tbb_16Speedup If Perfect Load Balancing tbb_32Speedup If Perfect Load Balancing tbb_64Speedup If Perfect Load Balancing tbb_128Stride 0Stride 1Stride nStride UnknownStride IndirectArray Access Efficiency(tbb_1) Efficiency(tbb_1) Potential Speed-Up (%)(tbb_2) Efficiency(tbb_2) Potential Speed-Up (%)(tbb_4) Efficiency(tbb_4) Potential Speed-Up (%)(tbb_8) Efficiency(tbb_8) Potential Speed-Up (%)(tbb_16) Efficiency(tbb_16) Potential Speed-Up (%)(tbb_32) Efficiency(tbb_32) Potential Speed-Up (%)(tbb_64) Efficiency(tbb_64) Potential Speed-Up (%)(tbb_128) Efficiency(tbb_128) Potential Speed-Up (%)
290multithreading_assembly_perf_test - assembler.hpp:744-746 [...]auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as...InBetween9.679.769.8510.268.816.253.411.5419.8220.7720.6622.1319.1913.487.223.3213.016.753.571.981.150.560.330.2126.6614.527.364.242.241.180.700.3913.016.753.471.870.940.480.240.1326.6614.377.284.032.041.040.520.291248163264128NANANANANA111.031.061.241.161.361.64NANANANANA0.00100.960.350.940.620.871.330.871.150.850.960.830.570.770.36
292multithreading_assembly_perf_test - stl_algo.h:2030-2041 [...]auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as...Innermost9.3110.079.9711.099.616.773.561.669.3110.079.9711.099.616.773.561.6612.537.103.532.161.120.620.350.2412.537.103.532.161.120.620.350.2412.536.963.512.021.020.520.260.1412.536.963.512.021.020.520.260.14124816326412806.25111611.021.011.071.11.191.381.7410000100.00100.91.010.891.080.782.490.772.240.751.690.770.830.680.52
5783libfinite_elements.so - AssignEvaluator.h:480-480 [...]void Eigen::internal::call_dense_assignment_loop<Eigen::Matrix<double, 24, 24, 0, 24, 24>, Eigen::CwiseBinaryOp<Eigen::internal::scalar_product_op<double, double>, Eigen::Product<Eigen::Product<Eigen::Transpose<Eigen::Matrix<d...Single2.692.442.422.431.991.410.760.332.692.442.422.431.991.410.760.333.621.690.910.490.310.160.090.083.621.690.910.490.310.160.090.083.621.690.850.440.210.110.050.033.621.690.850.440.210.110.050.031248163264127100100111111.061.121.51.481.652.980120083.33101.0701.0601.0201.0701.0501.0400.990
5808libfinite_elements.so - finite_elements.tpp:77-77 [...]aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::...Single1.251.411.441.341.120.850.410.171.251.411.441.341.120.850.410.171.681.020.550.280.180.100.050.051.681.020.550.280.180.100.050.051.680.980.510.240.120.070.030.011.680.980.510.240.120.070.030.01124816326412029.622.311.0213.0211.041.081.151.521.611.893.313120091.67100.860.20.830.250.860.180.880.130.80.170.90.040.910.01
5809libfinite_elements.so - generic_elements.hpp:634-635 [...]aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::...Single0.861.151.141.081.020.690.340.160.861.151.141.081.020.690.340.161.150.800.550.240.190.110.050.041.150.800.550.240.190.110.050.041.150.800.400.200.110.050.020.011.150.800.400.200.110.050.020.011248163263119NANANANANA11.011.381.21.722.082.212.74NANANANANA0.00100.730.320.720.320.730.290.670.340.680.220.730.090.650.06
291multithreading_assembly_perf_test - stl_algo.h:2030-2041 [...]auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as...InBetween0.830.950.840.770.770.460.260.1110.1511.0210.8111.8610.387.233.811.771.120.660.330.160.130.080.050.0313.657.773.862.291.210.670.370.251.120.660.290.140.080.040.020.0113.657.623.812.161.100.560.270.151248163263105NANA1NANA11.021.121.171.542.122.673.02NANANANANA0.00100.850.140.950.040.9900.860.110.990.010.940.010.910.01
4828libfinite_elements.so - generic_elements.hpp:596-600 [...]auto aset::asolve::GenericFiniteElement<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceE...Single0.550.470.510.440.430.280.150.050.550.470.510.440.430.280.150.050.730.360.240.120.100.030.020.020.730.360.240.120.100.030.020.020.730.330.180.080.050.020.010.000.730.330.180.080.050.020.010.0012481632567344.6426.7111.272.1111.111.331.492.221.6522.731000100.00101.1301.0201.1401.0201.0801.0501.340
293multithreading_assembly_perf_test - xmmintrin.h:1337-1337 [...]auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as...Innermost0.470.430.370.390.400.250.130.060.470.430.370.390.400.250.130.060.630.340.150.090.080.040.030.020.630.340.150.090.080.040.030.020.630.300.130.070.040.020.010.000.630.300.130.070.040.020.010.0012481632587606.25111611.151.151.271.792.382.952.47NANANANANA0.00101.0501.2101.1100.940.031.0401.0601.01-0
289multithreading_assembly_perf_test - assembler.hpp:740-746 [...]auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as...Outermost0.370.330.320.370.360.270.140.0720.6621.5321.3522.8819.9514.007.493.450.500.240.120.070.060.040.030.0327.7915.087.604.392.311.220.710.390.500.230.110.070.040.020.010.0127.8014.897.534.172.121.080.540.30124816305888010.73.5415.8611.071.11.121.691.832.813.38NANANANANA0.00101.1101.0900.930.020.810.070.760.060.80.030.630.03
5807libfinite_elements.so - finite_elements.hpp:159-160 [...]aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::...Single0.300.320.320.330.300.230.100.060.300.320.320.330.300.230.100.060.410.230.120.080.050.030.020.020.410.230.120.080.050.030.020.020.410.220.110.060.030.020.010.010.410.220.110.060.030.020.010.01124816315081011.881.2915.7611.061.081.321.741.652.222.441101083.33100.920.030.920.030.840.050.810.060.720.060.910.010.610.02
287multithreading_assembly_perf_test - CoreEvaluators.h:217-217 [...]auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as...Single0.220.160.220.190.190.160.070.030.220.160.220.190.190.160.070.030.290.140.100.040.040.030.010.020.290.140.100.040.040.030.010.020.290.110.080.040.020.010.010.000.290.110.080.040.020.010.010.00124815314349012.511.285.3311.291.291.141.92.711.982.680208060.00101.3100.950.011.0500.930.010.740.040.90.010.80.01
5810libfinite_elements.so - generic_elements.hpp:442-442 [...]aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::...Single0.130.100.110.100.140.070.030.010.130.100.110.100.140.070.030.010.170.090.070.040.050.020.010.010.170.090.070.040.050.020.010.010.180.070.040.020.010.010.000.000.180.070.040.020.010.010.000.00124814202526012.52.331711.411.812.132.682.222.421.7311000100.00101.301.1201.1600.740.040.9701.12-01.15-0
5811libfinite_elements.so - generic_elements.hpp:435-435 [...]aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::...Single0.110.090.090.110.080.040.020.010.110.090.090.110.080.040.020.010.140.060.050.030.030.010.010.010.140.060.050.030.030.010.010.010.140.060.030.020.010.000.000.000.140.060.030.020.010.000.000.00124713141728012.52.331711.041.61.312.3221.791.9311000100.00101.1601.1600.90.011.03-01.3801.5200.990
189multithreading_assembly_perf_test - finite_elements.hpp:288-289 [...]aset::asolve::FiniteElement::dofs_rank() constSingle0.080.050.050.060.040.040.010.010.080.050.050.060.040.040.010.010.100.040.030.020.020.010.000.010.100.040.030.020.020.010.000.010.100.040.020.010.000.000.000.000.100.040.020.010.000.000.000.0012479151220010.42116.7411.291.711.942.571.6711.8231000100.00101.501.4901.1601.4901.1601.7400.940
×