Loop id | Source Location | Source Function | Level | Exclusive Coverage tbb_1 (%) | Inclusive Coverage tbb_1 (%) | Max Exclusive Time Over Threads tbb_1 (s) | Max Inclusive Time Over Threads tbb_1 (s) | Exclusive Time w.r.t. Wall Time tbb_1 (s) | Inclusive Time w.r.t. Wall Time tbb_1 (s) | Nb Threads tbb_1 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing tbb_1 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (tbb_1) Efficiency | (tbb_1) Potential Speed-Up (%) |
---|
292 | multithreading_assembly_perf_test - stl_algo.h:2030-2041 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Innermost | 6.89 | 6.89 | 13.35 | 13.35 | 13.35 | 13.35 | 1 | 0 | 6.25 | 1 | 1 | 16 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
290 | multithreading_assembly_perf_test - assembler.hpp:744-746 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | InBetween | 6.81 | 14.22 | 13.20 | 27.57 | 13.20 | 27.57 | 1 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 1 | 0 |
5783 | libfinite_elements.so - AssignEvaluator.h:480-480 [...] | void Eigen::internal::call_dense_assignment_loop<Eigen::Matrix<double, 24, 24, 0, 24, 24>, Eigen::CwiseBinaryOp<Eigen::internal::scalar_product_op<double, double>, Eigen::Product<Eigen::Product<Eigen::Transpose<Eigen::Matrix<d... | Single | 1.71 | 1.71 | 3.33 | 3.33 | 3.33 | 3.33 | 1 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 1 | 2 | 0 | 0 | 1 | 0 |
320 | multithreading_assembly_perf_test - sparse_matrix.hpp:708-714 [...] | aset::asolve::StorageCSC<int, double> aset::asolve::extract_modify_storage_constraint<int, double>(aset::asolve::StorageCSC<int, double>&, std::vector<bool, std::allocator<bool> > const&, double) | InBetween | 1.40 | 1.41 | 2.72 | 2.72 | 2.72 | 2.73 | 1 | 28.86 | 31.87 | 2.37 | 1 | 1.3 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
38 | multithreading_assembly_perf_test - stl_algo.h:1877-1882 [...] | void std::__introsort_loop<__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::detail::d1::scalable_allocator<int> > >, long, __gnu_cxx::__ops::_Iter_less_iter>(__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::det... | InBetween | 1.34 | 2.56 | 2.59 | 4.95 | 2.59 | 4.95 | 1 | 0 | 8.75 | 1 | 1 | 13 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
691 | multithreading_assembly_perf_test - stl_algo.h:1796-1839 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | InBetween | 1.34 | 2.38 | 2.59 | 4.60 | 2.59 | 4.60 | 1 | 0 | 6.25 | 1 | 1 | 13.6 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
692 | multithreading_assembly_perf_test - stl_algo.h:1799-1801 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Innermost | 1.04 | 1.04 | 2.01 | 2.01 | 2.01 | 2.01 | 1 | 0 | 9.38 | 1 | 1 | 16 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
5808 | libfinite_elements.so - finite_elements.tpp:77-77 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.99 | 0.99 | 1.91 | 1.91 | 1.91 | 1.91 | 1 | 29.6 | 22.31 | 1.02 | 1 | 3.02 | 1 | 3 | 1 | 2 | 0 | 0 | 1 | 0 |
686 | multithreading_assembly_perf_test - stl_algo.h:916-1799 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Innermost | 0.97 | 0.97 | 1.88 | 1.88 | 1.88 | 1.88 | 1 | 0 | 7.81 | 1 | 1 | 15.09 | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
36 | multithreading_assembly_perf_test - stl_algo.h:88-1932 [...] | void std::__introsort_loop<__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::detail::d1::scalable_allocator<int> > >, long, __gnu_cxx::__ops::_Iter_less_iter>(__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::det... | Outermost | 0.74 | 3.32 | 1.44 | 6.43 | 1.44 | 6.43 | 1 | 3.7 | 8.8 | 2.13 | 1 | 12.36 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
37 | multithreading_assembly_perf_test - stl_algo.h:1877-1877 [...] | void std::__introsort_loop<__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::detail::d1::scalable_allocator<int> > >, long, __gnu_cxx::__ops::_Iter_less_iter>(__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::det... | Innermost | 0.74 | 0.74 | 1.43 | 1.43 | 1.43 | 1.43 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
5809 | libfinite_elements.so - generic_elements.hpp:634-635 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.71 | 0.71 | 1.37 | 1.37 | 1.37 | 1.37 | 1 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 1 | 0 |
454 | libfe_space.so - compare:1223-1223 [...] | aset::asolve::Part::create_elements(aset::asolve::FESpace&) | Innermost | 0.65 | 0.65 | 1.25 | 1.25 | 1.25 | 1.25 | 1 | 0 | 9.9 | 1 | 1 | 12.44 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
173 | libfe_space.so - compare:1223-1223 [...] | aset::asolve::FESpace::elements(std::basic_string_view<char, std::char_traits<char> >) const | Innermost | 0.59 | 0.59 | 1.14 | 1.14 | 1.14 | 1.14 | 1 | 0 | 11.2 | 1 | 1 | 10.23 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
291 | multithreading_assembly_perf_test - stl_algo.h:2030-2041 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | InBetween | 0.53 | 7.41 | 1.02 | 14.37 | 1.02 | 14.37 | 1 | NA | NA | 1 | NA | NA | 1 | NA | NA | NA | NA | NA | 1 | 0 |
379 | multithreading_assembly_perf_test - vector.tcc:114-836 [...] | aset::asolve::StorageCSC<int, double>::StorageCSC<tbb::detail::d1::scalable_allocator<int> >(int, int, std::vector<std::vector<int, tbb::detail::d1::scalable_allocator<int> >, std::allocator<std::vector<int, tbb::deta... | Outermost | 0.51 | 0.51 | 0.99 | 0.99 | 0.99 | 0.99 | 1 | 26.69 | 30.99 | 2.13 | 1 | 1.29 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
40 | multithreading_assembly_perf_test - stl_algo.h:1880-1880 [...] | void std::__introsort_loop<__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::detail::d1::scalable_allocator<int> > >, long, __gnu_cxx::__ops::_Iter_less_iter>(__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::det... | Innermost | 0.48 | 0.48 | 0.94 | 0.94 | 0.93 | 0.93 | 1 | NA | NA | 1 | NA | NA | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
690 | multithreading_assembly_perf_test - stl_algo.h:1880-1880 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Innermost | 0.47 | 0.47 | 0.92 | 0.92 | 0.92 | 0.92 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
3484 | libfinite_elements.so - char_traits.h:368-445 [...] | std::array<std::pair<aset::asolve::DofType, int>, (total_nb_dofs<TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFl... | InBetween | 0.47 | 0.47 | 0.91 | 0.91 | 0.91 | 0.91 | 1 | 5.97 | 10.73 | 2.43 | 1 | 6.99 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
240 | libdofs.so - | aset::asolve::Node::add_dof(aset::asolve::DofType const&, aset::asolve::DofCollection&) | Single | 0.42 | 0.42 | 0.81 | 0.81 | 0.81 | 0.81 | 1 | 5.41 | 12.67 | 2.81 | 1 | 6.2 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
4828 | libfinite_elements.so - generic_elements.hpp:596-600 [...] | auto aset::asolve::GenericFiniteElement<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceE... | Single | 0.34 | 0.34 | 0.66 | 0.66 | 0.66 | 0.66 | 1 | 44.64 | 26.71 | 1 | 1.27 | 2.11 | 1 | 3 | 1 | 0 | 0 | 0 | 1 | 0 |
480 | multithreading_assembly_perf_test - stl_uninitialized.h:351-351 [...] | auto assemble_func<aset::asolve::SparseMatrixCOO<int, double> >(std::shared_ptr<aset::asolve::FESpace>, aset::asolve::AssemblyMultithreadingMethod, aset::asolve::AssemblyFlag, int) | Single | 0.33 | 0.33 | 0.64 | 0.64 | 0.64 | 0.64 | 1 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 1 | 0 |
293 | multithreading_assembly_perf_test - xmmintrin.h:1337-1337 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Innermost | 0.31 | 0.31 | 0.61 | 0.61 | 0.61 | 0.61 | 1 | 0 | 6.25 | 1 | 1 | 16 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
289 | multithreading_assembly_perf_test - assembler.hpp:740-746 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Outermost | 0.29 | 14.83 | 0.56 | 28.74 | 0.56 | 28.74 | 1 | 0 | 10.7 | 3.54 | 1 | 5.86 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
704 | multithreading_assembly_perf_test - stl_algo.h:1877-1882 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | InBetween | 0.26 | 0.50 | 0.50 | 0.96 | 0.50 | 0.96 | 1 | 0 | 7.03 | 1 | 1 | 13 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
695 | multithreading_assembly_perf_test - stl_algo.h:88-1932 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | InBetween | 0.24 | 1.22 | 0.47 | 2.37 | 0.47 | 2.37 | 1 | 3.57 | 10.04 | 2.15 | 1 | 13.52 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
703 | multithreading_assembly_perf_test - stl_algo.h:1877-1877 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Innermost | 0.24 | 0.24 | 0.46 | 0.46 | 0.46 | 0.46 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
314 | multithreading_assembly_perf_test - assembler.hpp:298-299 [...] | aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::compute_matrix_profile_() | Innermost | 0.23 | 0.23 | 0.45 | 0.45 | 0.44 | 0.44 | 1 | NA | NA | 1 | NA | NA | 1 | 1 | 0 | 0 | 6 | 0 | 1 | 0 |
693 | multithreading_assembly_perf_test - stl_algo.h:1799-1824 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | InBetween | 0.22 | 0.35 | 0.42 | 0.67 | 0.42 | 0.67 | 1 | 0 | 9.38 | 1 | 1 | 13.65 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
4039 | libfinite_elements.so - generic_elements.tpp:42-45 [...] | aset::asolve::GenericFiniteElement<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElemen... | Single | 0.20 | 0.20 | 0.39 | 0.39 | 0.39 | 0.39 | 1 | 50 | 18.75 | 2.68 | 1 | 4.55 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
5807 | libfinite_elements.so - finite_elements.hpp:159-160 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.20 | 0.20 | 0.38 | 0.38 | 0.38 | 0.38 | 1 | 0 | 11.88 | 1.29 | 1 | 5.76 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0 |
484 | multithreading_assembly_perf_test - stl_uninitialized.h:351-351 [...] | auto assemble_func<aset::asolve::SparseMatrixCOO<int, double> >(std::shared_ptr<aset::asolve::FESpace>, aset::asolve::AssemblyMultithreadingMethod, aset::asolve::AssemblyFlag, int) | Single | 0.19 | 0.19 | 0.36 | 0.36 | 0.36 | 0.36 | 1 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 1 | 0 |
482 | multithreading_assembly_perf_test - stl_uninitialized.h:351-351 [...] | auto assemble_func<aset::asolve::SparseMatrixCOO<int, double> >(std::shared_ptr<aset::asolve::FESpace>, aset::asolve::AssemblyMultithreadingMethod, aset::asolve::AssemblyFlag, int) | Single | 0.18 | 0.18 | 0.35 | 0.35 | 0.35 | 0.35 | 1 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 1 | 0 |
3482 | libfinite_elements.so - char_traits.h:368-445 [...] | std::array<std::pair<aset::asolve::DofType, int>, (total_nb_dofs<TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFl... | Outermost | 0.18 | 0.18 | 0.34 | 0.34 | 0.34 | 0.34 | 1 | 0 | 9.33 | 1 | 1 | 8.88 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
694 | multithreading_assembly_perf_test - stl_algo.h:1799-1801 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Innermost | 0.13 | 0.13 | 0.25 | 0.25 | 0.25 | 0.25 | 1 | 0 | 9.38 | 1 | 1 | 16 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
189 | multithreading_assembly_perf_test - finite_elements.hpp:288-289 [...] | aset::asolve::FiniteElement::dofs_rank() const | Single | 0.11 | 0.11 | 0.22 | 0.22 | 0.22 | 0.22 | 1 | 0 | 10.42 | 1 | 1 | 6.74 | 1 | 3 | 1 | 0 | 0 | 0 | 1 | 0 |
287 | multithreading_assembly_perf_test - CoreEvaluators.h:217-217 [...] | auto aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::parallel_assemble_colmutexes_(aset::asolve::AssemblyFlag)::{lambda()#2}::operator()() const::{lambda(auto:1 const&)#1}::operator()<aset::as... | Single | 0.11 | 0.11 | 0.20 | 0.20 | 0.20 | 0.20 | 1 | 0 | 12.5 | 1 | 1.28 | 5.33 | 1 | 0 | 2 | 0 | 8 | 0 | 1 | 0 |
681 | multithreading_assembly_perf_test - stl_algo.h:1799-1801 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Innermost | 0.10 | 0.10 | 0.20 | 0.20 | 0.20 | 0.20 | 1 | 0 | 9.38 | 1 | 1 | 16 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
583 | multithreading_assembly_perf_test - parallel_for.h:210-210 [...] | tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<int>, tbb::detail::d1::parallel_for_body_wrapper<aset::asolve::SparseMatrixCOO<int, double>::set_from_csc_parallel(aset::asolve::StorageCSC<int, double>&)::{lambda(int)#1... | InBetween | 0.10 | 0.10 | 0.19 | 0.19 | 0.19 | 0.19 | 1 | 22.86 | 27.5 | 1.88 | 1 | 1.31 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
680 | multithreading_assembly_perf_test - stl_algo.h:1796-1839 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | InBetween | 0.10 | 0.20 | 0.19 | 0.39 | 0.19 | 0.39 | 1 | 0 | 6.25 | 1 | 1 | 13.6 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
455 | libfe_space.so - stl_construct.h:97-97 [...] | aset::asolve::Part::create_elements(aset::asolve::FESpace&) | InBetween | 0.09 | 0.09 | 0.17 | 0.17 | 0.17 | 0.17 | 1 | 25.45 | 15.26 | 2.39 | 1 | 6.96 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
676 | multithreading_assembly_perf_test - stl_algo.h:916-1799 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Innermost | 0.09 | 0.09 | 0.17 | 0.17 | 0.17 | 0.17 | 1 | 0 | 7.81 | 1 | 1 | 15.09 | 1 | 0 | 1 | 0 | 0.5 | 0 | 1 | 0 |
5810 | libfinite_elements.so - generic_elements.hpp:442-442 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.08 | 0.08 | 0.16 | 0.16 | 0.16 | 0.16 | 1 | 0 | 12.5 | 2.33 | 1 | 7 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 |
5811 | libfinite_elements.so - generic_elements.hpp:435-435 [...] | aset::asolve::Element_U<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::... | Single | 0.08 | 0.08 | 0.15 | 0.15 | 0.15 | 0.15 | 1 | 0 | 12.5 | 2.33 | 1 | 7 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 |
685 | multithreading_assembly_perf_test - parallel_for_each.h:401-401 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | InBetween | 0.08 | 4.99 | 0.15 | 9.67 | 0.15 | 9.67 | 1 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 1 | 0 |
410 | libfe_space.so - atomicity.h:71-108 [...] | aset::asolve::Part::split(aset::asolve::Mesh const*) const | InBetween | 0.07 | 0.09 | 0.14 | 0.18 | 0.14 | 0.18 | 1 | 8.86 | 11.5 | 2.67 | 1 | 7.27 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
3479 | libfinite_elements.so - basic_string.h:218-230 [...] | std::array<std::pair<aset::asolve::DofType, int>, (total_nb_dofs<TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFl... | Single | 0.07 | 0.07 | 0.13 | 0.13 | 0.14 | 0.14 | 1 | 0 | 6.56 | 1 | 1 | 15.24 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 |
227 | libmesh.so - mesh.cpp:497-498 [...] | aset::asolve::Mesh::setup_id_to_rank_index() | InBetween | 0.06 | 0.10 | 0.11 | 0.18 | 0.11 | 0.19 | 1 | 0 | 10.94 | 6 | 1 | 4 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
3481 | libfinite_elements.so - array:94-94 [...] | std::array<std::pair<aset::asolve::DofType, int>, (total_nb_dofs<TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFl... | Single | 0.06 | 0.06 | 0.11 | 0.11 | 0.11 | 0.11 | 1 | 0 | 12.5 | 1 | 1 | 6.55 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
3480 | libfinite_elements.so - generic_elements.hpp:421-425 [...] | std::array<std::pair<aset::asolve::DofType, int>, (total_nb_dofs<TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFl... | Outermost | 0.05 | 0.52 | 0.10 | 1.01 | 0.10 | 1.01 | 1 | 0 | 12.5 | 1 | 1 | 5 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
61 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | Outermost | 0.04 | 0.21 | 0.08 | 0.41 | 0.08 | 0.41 | 1 | 0 | 11.25 | 1 | 1 | 5.33 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
221 | libmesh.so - hashtable.h:1364-1371 [...] | aset::asolve::Mesh::setup_id_to_rank_index() | Single | 0.04 | 0.04 | 0.08 | 0.08 | 0.08 | 0.08 | 1 | 0 | 10.52 | 1 | 1 | 4 | 1 | 1 | 0 | 0 | 4 | 0 | 1 | 0 |
401 | libfe_space.so - numeric:444-448 [...] | aset::asolve::Part::nb_ips() const | Single | 0.04 | 0.04 | 0.07 | 0.07 | 0.08 | 0.08 | 1 | 0 | 6.25 | 1 | 1 | 4.15 | 1 | 1 | 1 | 1 | 8 | 0 | 1 | 0 |
228 | libmesh.so - hashtable.h:2575-2592 [...] | aset::asolve::Mesh::setup_id_to_rank_index() | Innermost | 0.04 | 0.04 | 0.07 | 0.07 | 0.07 | 0.07 | 1 | 0 | 12.08 | 1 | 1 | 4 | 1 | 0.67 | 0 | 0 | 4 | 0 | 1 | 0 |
453 | libfe_space.so - partitions.cpp:89-117 [...] | aset::asolve::Part::create_elements(aset::asolve::FESpace&) | Outermost | 0.03 | 0.77 | 0.06 | 1.49 | 0.07 | 1.49 | 1 | 8.7 | 12.98 | 3.3 | 1 | 6.98 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
68 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | InBetween | 0.03 | 0.06 | 0.06 | 0.11 | 0.06 | 0.11 | 1 | 0 | 9.38 | 1 | 1 | 14.55 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 |
63 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | InBetween | 0.03 | 0.15 | 0.06 | 0.29 | 0.06 | 0.29 | 1 | 0 | 12.5 | 1 | 1 | 5.2 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
75 | libasolve_test_helpers.so - structured_grid.cpp:130-139 [...] | aset::asolve::StructuredGrid::make_mesh() | Innermost | 0.03 | 0.03 | 0.06 | 0.06 | 0.06 | 0.06 | 1 | 8.87 | 10.15 | 4.42 | 1 | 8.54 | 1 | 2 | 0 | 0 | 2.5 | 0 | 1 | 0 |
244 | multithreading_assembly_perf_test - enumerable_thread_specific.h:105-218 [...] | tbb::detail::d1::ets_base<(tbb::detail::d1::ets_key_usage_type)1>::table_lookup(bool&) | Outermost | 0.03 | 0.04 | 0.05 | 0.09 | 0.06 | 0.09 | 1 | 0 | 11.46 | 1 | 1 | 9.41 | 1 | 0.5 | 0 | 0 | 1 | 0 | 1 | 0 |
4038 | libfinite_elements.so - array:94-94 [...] | aset::asolve::GenericFiniteElement<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElemen... | Single | 0.03 | 0.03 | 0.05 | 0.05 | 0.05 | 0.05 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
69 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | Innermost | 0.03 | 0.03 | 0.05 | 0.05 | 0.05 | 0.05 | 1 | 0 | 11.25 | 1 | 1 | 9 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 |
175 | libmesh.so - Redux.h:245-482 [...] | aset::asolve::Mesh::centers() const | InBetween | 0.02 | 0.02 | 0.05 | 0.05 | 0.05 | 0.05 | 1 | 3.45 | 12.72 | 1.59 | 1.9 | 7 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
171 | libmesh.so - emmintrin.h:134-287 [...] | aset::asolve::Mesh::centers() const | InBetween | 0.02 | 0.02 | 0.04 | 0.04 | 0.04 | 0.04 | 1 | 63.64 | 20.31 | 1.89 | 2.2 | 3.79 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
317 | multithreading_assembly_perf_test - vector.tcc:114-523 [...] | aset::asolve::StorageCSC<int, double> aset::asolve::extract_modify_storage_constraint<int, double>(aset::asolve::StorageCSC<int, double>&, std::vector<bool, std::allocator<bool> > const&, double) | Outermost | 0.02 | 1.44 | 0.04 | 2.80 | 0.04 | 2.80 | 1 | 24 | 26.68 | 2.52 | 1 | 1.28 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
65 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | InBetween | 0.02 | 0.10 | 0.04 | 0.19 | 0.04 | 0.19 | 1 | 0 | 10.42 | 1 | 1 | 10.67 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
3512 | libfinite_elements.so - stl_algobase.h:262-389 [...] | aset::asolve::GenericFiniteElement<aset::asolve::ReferenceElement<aset::asolve::quadrature::HexaGauss8, aset::asolve::Hexa8Interpolation1, 3, (aset::asolve::FormulationFlag)0>, TypeList<aset::asolve::ElementRule<aset::asolve::ReferenceElemen... | Outermost | 0.02 | 0.02 | 0.04 | 0.04 | 0.04 | 0.04 | 1 | 5.95 | 13.17 | 2.77 | 1 | 7.03 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
39 | multithreading_assembly_perf_test - stl_algo.h:1877-1877 | void std::__introsort_loop<__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::detail::d1::scalable_allocator<int> > >, long, __gnu_cxx::__ops::_Iter_less_iter>(__gnu_cxx::__normal_iterator<int*, std::vector<int, tbb::det... | Innermost | 0.02 | 0.02 | 0.04 | 0.04 | 0.04 | 0.04 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
657 | libfinite_elements.so - stl_tree.h:790-1953 [...] | aset::amat::TypedStateVariable<Eigen::TensorFixedSize<double, Eigen::Sizes<3l, 3l>, 0, long>, (aset::amat::var_type)2, (aset::amat::TensorProperty)0> aset::amat::MaterialBrick::get_variable<(aset::amat::var_type)2, Eigen::TensorFixedSi... | Single | 0.02 | 0.02 | 0.04 | 0.04 | 0.04 | 0.04 | 1 | 0 | 11.03 | 1 | 1 | 10.67 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
62 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | InBetween | 0.02 | 0.17 | 0.04 | 0.33 | 0.04 | 0.33 | 1 | 0 | 11.25 | 1 | 1 | 5.33 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
203 | libdofs.so - alloc_traits.h:532-532 [...] | aset::asolve::DofCollection::push_back_dof(aset::asolve::DofType const&) | Single | 0.02 | 0.00 | 0.04 | 0.00 | 0.04 | 0.00 | 1 | 50 | 28.13 | 1 | 1 | 2.4 | 1 | 0 | 3 | 0 | 0 | 0 | 1 | 0 |
313 | multithreading_assembly_perf_test - assembler.hpp:296-299 [...] | aset::asolve::Assembler<aset::asolve::FESpace, aset::asolve::SparseMatrixCOO<int, double> >::compute_matrix_profile_() | Outermost | 0.02 | 0.00 | 0.04 | 0.00 | 0.04 | 0.00 | 1 | 0 | 12.02 | 1 | 1 | 8.68 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
451 | libfe_space.so - atomicity.h:71-108 [...] | aset::asolve::Part::setup_material(aset::asolve::FESpace&) | Single | 0.02 | 0.02 | 0.04 | 0.04 | 0.04 | 0.04 | 1 | 0 | 10.86 | 1 | 1 | 6.44 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
64 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | InBetween | 0.02 | 0.12 | 0.03 | 0.23 | 0.04 | 0.23 | 1 | 0 | 10.42 | 1 | 1 | 10.67 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
656 | libfinite_elements.so - stl_tree.h:790-1953 [...] | aset::amat::TypedStateVariable<Eigen::TensorFixedSize<double, Eigen::Sizes<3l, 3l>, 0, long>, (aset::amat::var_type)1, (aset::amat::TensorProperty)0> aset::amat::MaterialBrick::get_variable<(aset::amat::var_type)1, Eigen::TensorFixedSi... | Single | 0.02 | 0.02 | 0.03 | 0.03 | 0.04 | 0.04 | 1 | 0 | 11.03 | 1 | 1 | 10.67 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
172 | libfe_space.so - compare:1223-1223 [...] | aset::asolve::FESpace::elements(std::basic_string_view<char, std::char_traits<char> >) const | Outermost | 0.02 | 0.60 | 0.03 | 1.17 | 0.03 | 1.17 | 1 | 7.89 | 12.99 | 2.31 | 1 | 7.2 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
243 | multithreading_assembly_perf_test - enumerable_thread_specific.h:105-218 [...] | tbb::detail::d1::ets_base<(tbb::detail::d1::ets_key_usage_type)1>::table_lookup(bool&) | Innermost | 0.02 | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 1 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 |
169 | libmesh.so - AssignEvaluator.h:379-424 [...] | aset::asolve::Mesh::centers() const | InBetween | 0.01 | 0.06 | 0.02 | 0.12 | 0.02 | 0.12 | 1 | 3.92 | 12.01 | 2.17 | 2.23 | 10.76 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
167 | libmesh.so - IndexedView.h:272-272 [...] | aset::asolve::Mesh::cell_coordinates(std::pair<int, int> const&) const | Single | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 1 | 0 | 12.5 | 1 | 1 | 5.6 | 1 | 0 | 2 | 0 | 0 | 1 | 1 | 0 |
675 | multithreading_assembly_perf_test - parallel_for_each.h:401-401 [...] | void tbb::detail::d1::dynamic_grainsize_mode<tbb::detail::d1::adaptive_mode<tbb::detail::d1::auto_partition_type> >::work_balance<tbb::detail::d1::start_for<tbb::detail::d1::blocked_range<unsigned long>, tbb::detail::d2::parallel_fo... | Outermost | 0.01 | 0.31 | 0.02 | 0.60 | 0.02 | 0.60 | 1 | NA | NA | NA | NA | NA | 1 | NA | NA | NA | NA | NA | 1 | 0 |
409 | libfe_space.so - hashtable.h:311-2172 [...] | aset::asolve::Part::split(aset::asolve::Mesh const*) const | Innermost | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 1 | 0 | 11.45 | 1 | 1 | 4.97 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
67 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | InBetween | 0.01 | 0.07 | 0.02 | 0.13 | 0.02 | 0.14 | 1 | 0 | 11.46 | 1 | 1 | 7.47 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
183 | libfe_space.so - fespace.cpp:334-335 | aset::asolve::FESpace::create_elements() | Single | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 1 | 0 | 12.5 | 1 | 1 | 4.44 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
226 | libdofs.so - dof_list.hpp:61-64 [...] | aset::asolve::NodeCollection::NodeCollection(aset::asolve::DofCollection&, Eigen::Map<Eigen::Matrix<double, -1, 3, 1, -1, 3> const, 0, Eigen::Stride<0, 0> > const&) | Outermost | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 1 | 23.81 | 23.74 | 2.75 | 1 | 1.41 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
66 | libfe_space.so - stl_tree.h:781-1936 [...] | std::_Rb_tree<std::pair<int, int>, std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*>, std::_Select1st<std::pair<std::pair<int, int> const, aset::asolve::FiniteElement*> >, std::less<std::pair<in... | InBetween | 0.01 | 0.08 | 0.02 | 0.16 | 0.02 | 0.16 | 1 | 0 | 10.42 | 1 | 1 | 10.67 | 1 | NA | NA | NA | NA | NA | 1 | 0 |
323 | multithreading_assembly_perf_test - stl_uninitialized.h:351-351 [...] | aset::asolve::StorageCSC<int, double> aset::asolve::extract_modify_storage_constraint<int, double>(aset::asolve::StorageCSC<int, double>&, std::vector<bool, std::allocator<bool> > const&, double) | Innermost | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 1 | 100 | 100 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 0 | 1 | 0 |