Loop id | Source Location | Source Function | Level | Exclusive Coverage tbb_128 (%) | Inclusive Coverage tbb_128 (%) | Max Exclusive Time Over Threads tbb_128 (s) | Max Inclusive Time Over Threads tbb_128 (s) | Exclusive Time w.r.t. Wall Time tbb_128 (s) | Inclusive Time w.r.t. Wall Time tbb_128 (s) | Nb Threads tbb_128 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing tbb_128 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | Array Access Efficiency |
---|
292 | multithreading_assembly_perf_test - stl_algo.h:2030-2041 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Innermost | 1.67 | 1.67 | 0.27 | 0.27 | 0.14 | 0.14 | 128 | 0 | 6.25 | 1 | 1 | 16 | 1.89 | 1 | 0 | 0 | 0 | 0 | 100.00 |
290 | multithreading_assembly_perf_test - assembler.hpp:744-746 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | InBetween | 1.55 | 3.32 | 0.27 | 0.50 | 0.13 | 0.28 | 128 | NA | NA | NA | NA | NA | 2.05 | NA | NA | NA | NA | NA | 0.00 |
5783 | libfinite_elements.so - AssignEvaluator.h:480-480 [...] | void Eigen::internal::call_dense_assignment_loop<Eigen::Matrix<double, 24, 24, 0, 24, 24>, Eigen::CwiseBinaryOp<Eigen::internal::scalar_product_op<double, double>, Eigen::Product<Eigen::Product<Eigen::Transpose<Eigen::Matrix<d... | Single | 0.34 | 0.34 | 0.06 | 0.06 | 0.03 | 0.03 | 128 | 100 | 100 | 1 | 1 | 1 | 2.26 | 0 | 1 | 2 | 0 | 0 | 83.33 |
5808 | libfinite_elements.so - finite_elements.tpp:77-77 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.17 | 0.17 | 0.05 | 0.05 | 0.01 | 0.01 | 121 | 29.6 | 22.31 | 1.02 | 1 | 3.02 | 2.87 | 3 | 1 | 2 | 0 | 0 | 91.67 |
5809 | libfinite_elements.so - generic_elements.hpp:634-635 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.17 | 0.17 | 0.04 | 0.04 | 0.01 | 0.01 | 123 | NA | NA | NA | NA | NA | 2.59 | NA | NA | NA | NA | NA | 0.00 |
291 | multithreading_assembly_perf_test - stl_algo.h:2030-2041 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | InBetween | 0.10 | 1.78 | 0.04 | 0.28 | 0.01 | 0.15 | 101 | NA | NA | 1 | NA | NA | 3.13 | NA | NA | NA | NA | NA | 0.00 |
289 | multithreading_assembly_perf_test - assembler.hpp:740-746 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Outermost | 0.08 | 3.47 | 0.03 | 0.52 | 0.01 | 0.30 | 96 | 0 | 10.7 | 3.54 | 1 | 5.86 | 4.02 | NA | NA | NA | NA | NA | 0.00 |
293 | multithreading_assembly_perf_test - xmmintrin.h:1337-1337 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Innermost | 0.07 | 0.07 | 0.03 | 0.03 | 0.01 | 0.01 | 81 | 0 | 6.25 | 1 | 1 | 16 | 2.7 | NA | NA | NA | NA | NA | 0.00 |
4828 | libfinite_elements.so - generic_elements.hpp:596-600 [...] | auto aset::asolve::GenericFiniteElement<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceE... | Single | 0.06 | 0.06 | 0.02 | 0.02 | 0.01 | 0.01 | 79 | 44.64 | 26.71 | 1 | 1.27 | 2.11 | 2.45 | 3 | 1 | 0 | 0 | 0 | 100.00 |
287 | multithreading_assembly_perf_test - CoreEvaluators.h:217-217 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Single | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 56 | 0 | 12.5 | 1 | 1.28 | 5.33 | 2.33 | 0 | 2 | 0 | 8 | 0 | 60.00 |
5807 | libfinite_elements.so - finite_elements.hpp:159-160 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 58 | 0 | 11.88 | 1.29 | 1 | 5.76 | 2.45 | 1 | 1 | 0 | 1 | 0 | 83.33 |
5810 | libfinite_elements.so - generic_elements.hpp:442-442 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 36 | 0 | 12.5 | 2.33 | 1 | 7 | 1.71 | 1 | 1 | 0 | 0 | 0 | 100.00 |
5811 | libfinite_elements.so - generic_elements.hpp:435-435 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 28 | 0 | 12.5 | 2.33 | 1 | 7 | 1.65 | 1 | 1 | 0 | 0 | 0 | 100.00 |
189 | multithreading_assembly_perf_test - finite_elements.hpp:288-289 [...] | aset::asolve::FiniteElement::dofs_rank() const | Single | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 23 | 0 | 10.42 | 1 | 1 | 6.74 | 1.92 | 3 | 1 | 0 | 0 | 0 | 100.00 |