Loop id | Source Location | Source Function | Level | Coverage run_0 (%) | Max Time Over Threads run_0 (s) | Time w.r.t. Wall Time run_0 (s) | Nb Threads run_0 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing run_0 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect |
---|
768 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.4 | 2.63 | 1.86 | 52 | 5.56 | 10.76 | 1 | 1 | 12.08 | 1.43 | 0 | 0 | 0 | 1 | 0 |
778 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.38 | 2.39 | 1.78 | 52 | 5.56 | 10.76 | 1 | 1 | 12.08 | 1.35 | 0 | 0 | 0 | 1 | 0 |
258 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Outermost | 0.32 | 1.62 | 1.48 | 52 | 6.67 | 11.67 | 1 | 1 | 11.69 | 1.1 | NA | NA | NA | NA | NA |
403 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Outermost | 0.31 | 1.72 | 1.46 | 52 | 6.67 | 11.67 | 1 | 1 | 11.69 | 1.19 | NA | NA | NA | NA | NA |
257 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.3 | 1.67 | 1.39 | 52 | 12.5 | 10.94 | 1 | 1 | 14.64 | 1.21 | 0 | 0 | 0 | 2 | 0 |
402 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.3 | 1.62 | 1.41 | 52 | 12.5 | 10.94 | 1 | 1 | 14.64 | 1.16 | 0 | 0 | 0 | 2 | 0 |
769 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.29 | 1.77 | 1.34 | 52 | 6.67 | 8.33 | 1 | 3.81 | 14.9 | 1.33 | 0 | 0 | 0 | 2 | 0 |
779 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Single | 0.29 | 1.81 | 1.34 | 52 | 6.67 | 8.33 | 1 | 3.81 | 14.9 | 1.35 | 0 | 0 | 0 | 2 | 0 |
770 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Outermost | 0.24 | 1.55 | 1.13 | 52 | 9.09 | 11.93 | 3 | 1 | 12.44 | 1.38 | NA | NA | NA | NA | NA |
780 | picongpu - ForEach.hpp:202-279 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Outermost | 0.24 | 1.63 | 1.13 | 52 | 9.09 | 11.93 | 3 | 1 | 12.44 | 1.46 | NA | NA | NA | NA | NA |
401 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.2 | 1.16 | 0.92 | 52 | 5.56 | 7.99 | 4.83 | 1 | 16 | 1.27 | 1 | 0 | 0 | 2 | 0 |
256 | picongpu - ForEach.hpp:202-202 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Single | 0.19 | 1.13 | 0.87 | 52 | 5.56 | 7.99 | 4.83 | 1 | 16 | 1.31 | 1 | 0 | 0 | 2 | 0 |
291 | picongpu - ParticlesBase.kernel:552-563 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.18 | 1.21 | 0.84 | 52 | 0 | 9.05 | 3.35 | 1 | 16 | 1.46 | 0 | 2.33 | 3.33 | 0.67 | 2 |
290 | picongpu - ForEach.hpp:278-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.16 | 1.05 | 0.76 | 52 | 0 | 11.61 | 1 | 1 | 12.76 | 1.4 | NA | NA | NA | NA | NA |
293 | picongpu - ParticlesBase.kernel:487-490 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.15 | 1.01 | 0.72 | 52 | 0 | 6.77 | 1 | 1 | 15.73 | 1.4 | NA | NA | NA | NA | NA |
884 | picongpu - TaskSetValue.hpp:79-89 [...] | void alpaka::detail::ParallelForImpl<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::trait::OmpSchedule<cupla::cupla_omp2_seq_sync::CuplaKernel<pmacc::KernelSetValue<256u> >, alpaka::AccCpuOmp2Bl... | Innermost | 0.15 | 0.79 | 0.69 | 52 | 6.62 | 9.08 | 3.29 | 1 | 13.18 | 1.14 | 2.5 | 0 | 0 | 0.5 | 0 |
771 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Innermost | 0.15 | 1.05 | 0.71 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.48 | NA | NA | NA | NA | NA |
259 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Innermost | 0.14 | 0.77 | 0.64 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.22 | NA | NA | NA | NA | NA |
781 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu13currentSolver20KernelComputeCurrentINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESD_SC_IiLi4EEEENSB_ISC_IiLi2EESG_SG_EENSB_ISC_IiLi3EESI_SI_EEEEEE... | Innermost | 0.14 | 0.95 | 0.65 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.48 | NA | NA | NA | NA | NA |
279 | picongpu - ParticlesBase.kernel:265-269 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.13 | 0.94 | 0.6 | 52 | 0 | 4.79 | 1 | 1 | 24 | 1.57 | NA | NA | NA | NA | NA |
404 | picongpu - ForEach.hpp:276-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelIN8picongpu26KernelMoveAndMarkParticlesINS0_20SuperCellDescriptionINS0_4math2CT6VectorISt17integral_constantIiLi8EESC_SB_IiLi4EEEENSA_ISB_IiLi1EESF_SF_EENSA_ISB_IiLi2EESH_SH_EEEEEENS1_9Work... | Innermost | 0.13 | 0.78 | 0.61 | 52 | 0 | 11.16 | 1 | 1 | 18.16 | 1.28 | NA | NA | NA | NA | NA |
281 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.1 | 0.68 | 0.46 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.48 | NA | NA | NA | NA | NA |
434 | picongpu - ForEach.hpp:278-284 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.09 | 0.63 | 0.43 | 52 | 0 | 11.61 | 1 | 1 | 12.76 | 1.47 | NA | NA | NA | NA | NA |
278 | picongpu - ParticlesBase.kernel:279-291 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.09 | 0.61 | 0.4 | 52 | 0 | 10.16 | 2.53 | 1 | 15.29 | 1.52 | 2 | 1 | 0 | 3 | 1.5 |
437 | picongpu - ParticlesBase.kernel:487-490 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.08 | 0.59 | 0.38 | 52 | 0 | 6.77 | 1 | 1 | 15.73 | 1.55 | NA | NA | NA | NA | NA |
292 | picongpu - ParticlesBase.kernel:514-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.07 | 0.48 | 0.32 | 52 | 38.1 | 19.64 | 2.15 | 1 | 4.37 | 1.5 | NA | NA | NA | NA | NA |
435 | picongpu - ParticlesBase.kernel:552-563 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.07 | 0.52 | 0.31 | 52 | 0 | 9.05 | 3.35 | 1 | 16 | 1.68 | 0 | 2.33 | 3.33 | 0.67 | 2 |
590 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0.06 | 0.41 | 0.27 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.52 | NA | NA | NA | NA | NA |
667 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_14KernelFillGapsENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj256EEEN... | Innermost | 0.05 | 0.35 | 0.25 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.4 | NA | NA | NA | NA | NA |
425 | picongpu - Op.hpp:24-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.05 | 0.37 | 0.22 | 52 | 0 | 5.6 | 1 | 1 | 21.5 | 1.68 | NA | NA | NA | NA | NA |
752 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.03 | 0.21 | 0.14 | 52 | 7.14 | 8.48 | 1 | 1 | 14.79 | 1.5 | 0 | 0 | 0 | 2 | 0 |
3562 | picongpu - | __intel_avx_rep_memcpy | Single | 0.03 | 2.21 | 0.12 | 52 | 100 | 50 | 1 | 1 | 2 | 18.42 | 0 | 2 | 0 | 0 | 0 |
910 | picongpu - AddExchangeToBorder.hpp:95-126 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<pmacc::fields::operations::KernelAddExchangeToBorder, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsigned l... | Single | 0.03 | 0.26 | 0.16 | 52 | 6.85 | 8.56 | 3.55 | 2.52 | 14.21 | 1.63 | NA | NA | NA | NA | NA |
725 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.03 | 0.18 | 0.13 | 52 | 0 | 7.03 | 1 | 1 | 16 | 1.38 | 0 | 0 | 0 | 2 | 0 |
916 | picongpu - CopyGuardToExchange.hpp:92-121 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<pmacc::fields::operations::KernelCopyGuardToExchange, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsigned l... | Single | 0.03 | 0.26 | 0.13 | 52 | 10.42 | 9.64 | 4.21 | 1 | 14.34 | 2 | NA | NA | NA | NA | NA |
726 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.03 | 0.25 | 0.15 | 52 | 25 | 11.13 | 1.82 | 3.06 | 11.95 | 1.67 | 0 | 0 | 0 | 2 | 0 |
753 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.03 | 0.19 | 0.13 | 52 | 31.25 | 12.3 | 1.79 | 2.96 | 10.98 | 1.46 | 0 | 0 | 0 | 2 | 0 |
855 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.22 | 0.08 | 52 | 0 | 8.93 | 1 | 1 | 14.77 | 2.75 | 0 | 0 | 0 | 2 | 0 |
760 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.25 | 0.11 | 52 | 7.14 | 8.48 | 1 | 1 | 14.79 | 2.27 | 0 | 0 | 0 | 2 | 0 |
422 | picongpu - ParticlesBase.kernel:279-291 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.02 | 0.19 | 0.08 | 52 | 0 | 10.16 | 2.53 | 1 | 15.29 | 2.38 | 2 | 1 | 0 | 3 | 1.5 |
789 | picongpu - ForEach.hpp:202-202 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Innermost | 0.02 | 0.16 | 0.08 | 52 | 20 | 11 | 2.6 | 1.18 | 12.06 | 2 | 3 | 0 | 1 | 1 | 0 |
423 | picongpu - ParticlesBase.kernel:265-269 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.02 | 0.18 | 0.11 | 52 | 0 | 4.79 | 1 | 1 | 24 | 1.64 | NA | NA | NA | NA | NA |
844 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.14 | 0.1 | 52 | 25 | 11.13 | 1.82 | 3.06 | 11.95 | 1.56 | 0 | 0 | 0 | 2 | 0 |
856 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.24 | 0.09 | 52 | 25 | 11.13 | 1.82 | 3.06 | 11.95 | 2.67 | 0 | 0 | 0 | 2 | 0 |
761 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.25 | 0.1 | 52 | 27.03 | 11.66 | 1.79 | 2.6 | 10.85 | 2.5 | 0 | 0 | 0 | 2 | 0 |
843 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::fdtd::KernelUpdateField, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.02 | 0.13 | 0.08 | 52 | 0 | 7.03 | 1 | 1 | 16 | 1.63 | 0 | 0 | 0 | 2 | 0 |
3563 | picongpu - | __intel_avx_rep_memset | Single | 0.02 | 0.12 | 0.08 | 52 | 100 | 50 | 1 | 1 | 2 | 1.71 | 0 | 1 | 0 | 0 | 0 |
413 | picongpu - ParticlesBase.kernel:138-153 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.07 | 0.03 | 51 | 0 | 7.62 | 2.32 | 1 | 20.44 | 2.33 | 3 | 1 | 0 | 4.5 | 0 |
289 | picongpu - ParticlesBase.kernel:487-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Outermost | 0.01 | 0.1 | 0.06 | 52 | 10.16 | 11.53 | 3.23 | 1 | 7.7 | 1.67 | NA | NA | NA | NA | NA |
294 | picongpu - ParticlesBase.kernel:440-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.06 | 0.03 | 52 | 4 | 8.25 | 3.58 | 1 | 14.86 | 2 | NA | NA | NA | NA | NA |
802 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.01 | 0.13 | 0.05 | 52 | 0 | 9.38 | 1 | 1 | 15.48 | 2.6 | 0 | 0 | 0 | 2 | 0 |
1486 | picongpu - Kernel.hpp:161-164 [...] | void std::__invoke_impl<void, pmacc::exec::detail::KernelWithDynSharedMem<pmacc::lockstep::exec::detail::LockStepKernel<pmacc::device::reduce::Kernel<pmacc::math::Vector<double, 3u, pmacc::math::StandardAccessor, pmacc::math::StandardNavigat... | Innermost | 0.01 | 0.08 | 0.03 | 42 | 12.5 | 10.16 | 3.29 | 1.17 | 4.67 | 2 | 0 | 0 | 0 | 1 | 0 |
788 | picongpu - ForEach.hpp:202-202 [...] | void alpaka::detail::ParallelForImpl<pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> >, alpaka::trait::OmpSchedule<pmacc::lockstep::exec::detail::Loc... | Innermost | 0.01 | 0.09 | 0.05 | 52 | 0 | 8.04 | 1 | 1 | 14.93 | 1.8 | 3 | 0 | 0 | 2 | 0 |
803 | picongpu - ForEach.hpp:202-202 [...] | void std::__invoke_impl<void, pmacc::lockstep::exec::detail::LockStepKernel<picongpu::fields::maxwellSolver::KernelAddCurrentDensity, pmacc::lockstep::WorkerCfg<256u> > const&, alpaka::AccCpuOmp2Blocks<std::integral_constant<unsign... | Single | 0.01 | 0.15 | 0.06 | 52 | 18.75 | 11.72 | 2.38 | 2.85 | 12.16 | 2.5 | 0 | 0 | 0 | 2 | 0 |
271 | picongpu - Op.hpp:30-30 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.06 | 0.03 | 51 | 0 | 4.3 | 1 | 1 | 24.57 | 2 | NA | NA | NA | NA | NA |
270 | picongpu - ParticlesBase.kernel:124-128 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.08 | 0.04 | 52 | 0 | 7.95 | 1 | 1 | 23.19 | 2 | NA | NA | NA | NA | NA |
436 | picongpu - ParticlesBase.kernel:514-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Innermost | 0.01 | 0.1 | 0.05 | 52 | 38.1 | 19.64 | 2.15 | 1 | 4.37 | 2 | NA | NA | NA | NA | NA |
269 | picongpu - ParticlesBase.kernel:138-153 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Single | 0.01 | 0.13 | 0.06 | 52 | 0 | 7.62 | 2.32 | 1 | 20.44 | 2.17 | 3 | 1 | 0 | 4.5 | 0 |
277 | picongpu - FramePointer.hpp:55-77 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Outermost | 0.01 | 0.08 | 0.03 | 51 | 12.82 | 11.9 | 4.47 | 1 | 21.19 | 2.67 | NA | NA | NA | NA | NA |
433 | picongpu - ParticlesBase.kernel:487-629 [...] | _ZSt13__invoke_implIvRKN5pmacc8lockstep4exec6detail14LockStepKernelINS0_20KernelShiftParticlesENS1_9WorkerCfgILj256EEEEEJRN6alpaka16AccCpuOmp2BlocksISt17integral_constantImLm3EEjEENS0_12ParticlesBoxINS0_5FrameINS0_6detail29OperatorCreatePairStaticArrayILj2... | Outermost | 0.01 | 0.08 | 0.03 | 52 | 10.16 | 11.53 | 3.23 | 1 | 7.7 | 2.67 | NA | NA | NA | NA | NA |