* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112842)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112847)
miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 1
Number of walkers per rank = 1
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.3270 0.3270 1 0.326982379
ParticleSet:::update 0.0000 0.0000 1 0.000004383
Total 42.3765 0.0003 1 42.376504688
Diffusion 22.2184 0.0174 5 4.443688108
Complete Updates 0.1707 0.0000 5 0.034147655
DeterminantRef::update 0.1707 0.1707 10 0.017072748
Current Gradient 0.9853 0.0132 30720 0.000032074
DeterminantRef::ratio 0.9646 0.9646 30720 0.000031399
OneBodyJastrowRef 0.0044 0.0044 30720 0.000000144
TwoBodyJastrowRef 0.0031 0.0031 30720 0.000000101
Kinetic Energy 0.2368 0.2365 5 0.047352978
OneBodyJastrowRef 0.0001 0.0001 5 0.000027305
TwoBodyJastrowRef 0.0001 0.0001 5 0.000016840
New Gradient 5.7037 0.0195 30720 0.000185666
DeterminantRef::ratio 0.1425 0.1425 30720 0.000004638
DeterminantRef::spovgl 4.9398 0.2763 30720 0.000160802
Single-Particle Orbitals 4.6636 4.6636 30720 0.000151809
OneBodyJastrowRef 0.0546 0.0546 30720 0.000001779
TwoBodyJastrowRef 0.5472 0.5472 30720 0.000017813
ParticleSet:::acceptMove 2.4571 0.0209 15371 0.000159856
DTAAOMPTarget::update_e_e 2.4017 2.4017 15371 0.000156249
DTABOMPTarget::update_ion_e 0.0345 0.0345 15371 0.000002246
ParticleSet:::computeNewPosDT 0.7233 0.0136 30720 0.000023546
DTAAOMPTarget::move_e_e 0.6170 0.6170 30720 0.000020085
DTABOMPTarget::move_ion_e 0.0927 0.0927 30720 0.000003017
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001525
Update 11.9241 0.0110 15371 0.000775751
DeterminantRef::update 11.3936 11.3936 15371 0.000741241
OneBodyJastrowRef 0.0016 0.0016 15371 0.000000107
TwoBodyJastrowRef 0.5179 0.5179 15371 0.000033690
Initialization 2.5927 0.3555 1 2.592695627
DeterminantRef::inverse 0.9849 0.9849 2 0.492473422
DeterminantRef::spovgl 1.0287 0.1093 2 0.514360128
Single-Particle Orbitals 0.9194 0.9194 6144 0.000149638
OneBodyJastrowRef 0.0072 0.0072 1 0.007192175
ParticleSet:::update 0.1205 0.0205 2 0.060259089
DTAAOMPTarget::evaluate_e_e 0.0812 0.0812 1 0.081233726
DTABOMPTarget::evaluate_ion_e 0.0187 0.0001 1 0.018741625
DTABOMPTarget::offload_ion_e 0.0187 0.0187 1 0.018674736
TwoBodyJastrowRef 0.0959 0.0959 1 0.095857051
Pseudopotential 17.5651 0.0287 5 3.513020973
DeterminantRef::spoval 12.6068 0.2454 10215 0.001234149
Single-Particle Orbitals 12.3614 12.3614 122580 0.000100844
OneBodyJastrowRef 0.0133 0.0133 10215 0.000001298
ParticleSet:::update 4.4489 0.0058 10215 0.000435522
DTABOMPTarget::evaluate_e_virtual 4.0952 0.0021 10215 0.000400896
DTABOMPTarget::offload_e_virtual 4.0931 4.0931 10215 0.000400693
DTABOMPTarget::evaluate_ion_virtual 0.3479 0.0023 10215 0.000034059
DTABOMPTarget::offload_ion_virtual 0.3456 0.3456 10215 0.000033830
TwoBodyJastrowRef 0.4674 0.4674 10215 0.000045761
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.09461e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.08771e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 4.29815e+06
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112847)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112842)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_0 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112889)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112894)
miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 2
Number of walkers per rank = 2
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.1934 0.1934 1 0.193427395
ParticleSet:::update 0.0000 0.0000 1 0.000004714
Total 44.9653 0.0008 1 44.965347594
Diffusion 23.9697 0.0201 5 4.793940323
Complete Updates 0.1783 0.0000 5 0.035661945
DeterminantRef::update 0.1783 0.1783 10 0.017829476
Current Gradient 1.0094 0.0155 30720 0.000032857
DeterminantRef::ratio 0.9865 0.9865 30720 0.000032113
OneBodyJastrowRef 0.0047 0.0047 30720 0.000000153
TwoBodyJastrowRef 0.0027 0.0027 30720 0.000000088
Kinetic Energy 0.2456 0.2453 5 0.049118265
OneBodyJastrowRef 0.0002 0.0002 5 0.000045645
TwoBodyJastrowRef 0.0001 0.0001 5 0.000015824
New Gradient 6.9855 0.0199 30720 0.000227392
DeterminantRef::ratio 0.1431 0.1431 30720 0.000004660
DeterminantRef::spovgl 6.2215 0.2951 30720 0.000202524
Single-Particle Orbitals 5.9264 5.9264 30720 0.000192918
OneBodyJastrowRef 0.0549 0.0549 30720 0.000001787
TwoBodyJastrowRef 0.5460 0.5460 30720 0.000017774
ParticleSet:::acceptMove 2.4587 0.0211 15371 0.000159957
DTAAOMPTarget::update_e_e 2.4035 2.4035 15371 0.000156363
DTABOMPTarget::update_ion_e 0.0341 0.0341 15371 0.000002218
ParticleSet:::computeNewPosDT 0.9472 0.0147 30720 0.000030833
DTAAOMPTarget::move_e_e 0.8395 0.8395 30720 0.000027327
DTABOMPTarget::move_ion_e 0.0930 0.0930 30720 0.000003028
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001219
Update 12.1249 0.0119 15371 0.000788818
DeterminantRef::update 11.5872 11.5872 15371 0.000753833
OneBodyJastrowRef 0.0015 0.0015 15371 0.000000100
TwoBodyJastrowRef 0.5244 0.5244 15371 0.000034113
Initialization 2.9373 0.3632 1 2.937284381
DeterminantRef::inverse 1.0550 1.0550 2 0.527521101
DeterminantRef::spovgl 1.2947 0.1367 2 0.647364808
Single-Particle Orbitals 1.1580 1.1580 6144 0.000188475
OneBodyJastrowRef 0.0077 0.0077 1 0.007711939
ParticleSet:::update 0.1210 0.0205 2 0.060503032
DTAAOMPTarget::evaluate_e_e 0.0821 0.0821 1 0.082080402
DTABOMPTarget::evaluate_ion_e 0.0184 0.0001 1 0.018412011
DTABOMPTarget::offload_ion_e 0.0183 0.0183 1 0.018346678
TwoBodyJastrowRef 0.0956 0.0956 1 0.095571281
Pseudopotential 18.0576 0.0317 5 3.611520465
DeterminantRef::spoval 13.0365 0.2414 10215 0.001276211
Single-Particle Orbitals 12.7951 12.7951 122580 0.000104381
OneBodyJastrowRef 0.0134 0.0134 10215 0.000001311
ParticleSet:::update 4.5020 0.0062 10215 0.000440720
DTABOMPTarget::evaluate_e_virtual 4.1452 0.0022 10215 0.000405800
DTABOMPTarget::offload_e_virtual 4.1431 4.1431 10215 0.000405588
DTABOMPTarget::evaluate_ion_virtual 0.3505 0.0024 10215 0.000034316
DTABOMPTarget::offload_ion_virtual 0.3482 0.3482 10215 0.000034082
TwoBodyJastrowRef 0.4741 0.4741 10215 0.000046413
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.06317e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.87036e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 8.36185e+06
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112894)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112889)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_1 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112968)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 112973)
miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 4
Number of walkers per rank = 4
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.1057 0.1057 1 0.105698166
ParticleSet:::update 0.0000 0.0000 1 0.000005669
Total 43.3824 0.5277 1 43.382426670
Diffusion 22.7186 0.0218 5 4.543717029
Complete Updates 0.1774 0.0000 5 0.035475395
DeterminantRef::update 0.1774 0.1774 10 0.017736625
Current Gradient 1.0281 0.0150 30720 0.000033467
DeterminantRef::ratio 1.0059 1.0059 30720 0.000032743
OneBodyJastrowRef 0.0041 0.0041 30720 0.000000133
TwoBodyJastrowRef 0.0031 0.0031 30720 0.000000102
Kinetic Energy 0.2496 0.2494 5 0.049923323
OneBodyJastrowRef 0.0002 0.0002 5 0.000034812
TwoBodyJastrowRef 0.0001 0.0001 5 0.000017503
New Gradient 5.7590 0.0211 30720 0.000187466
DeterminantRef::ratio 0.1453 0.1453 30720 0.000004728
DeterminantRef::spovgl 4.9864 0.3152 30720 0.000162318
Single-Particle Orbitals 4.6712 4.6712 30720 0.000152057
OneBodyJastrowRef 0.0557 0.0557 30720 0.000001812
TwoBodyJastrowRef 0.5505 0.5505 30720 0.000017920
ParticleSet:::acceptMove 2.3681 0.0179 15371 0.000154061
DTAAOMPTarget::update_e_e 2.3160 2.3160 15371 0.000150676
DTABOMPTarget::update_ion_e 0.0342 0.0342 15371 0.000002224
ParticleSet:::computeNewPosDT 1.0390 0.0164 30720 0.000033823
DTAAOMPTarget::move_e_e 0.9283 0.9283 30720 0.000030217
DTABOMPTarget::move_ion_e 0.0944 0.0944 30720 0.000003074
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001362
Update 12.0756 0.0120 15371 0.000785610
DeterminantRef::update 11.5242 11.5242 15371 0.000749737
OneBodyJastrowRef 0.0015 0.0015 15371 0.000000098
TwoBodyJastrowRef 0.5379 0.5379 15371 0.000034993
Initialization 2.7977 0.6954 1 2.797670594
DeterminantRef::inverse 0.9207 0.9207 2 0.460328774
DeterminantRef::spovgl 0.9601 0.0950 2 0.480033883
Single-Particle Orbitals 0.8650 0.8650 6144 0.000140795
OneBodyJastrowRef 0.0077 0.0077 1 0.007654826
ParticleSet:::update 0.1187 0.0207 2 0.059350259
DTAAOMPTarget::evaluate_e_e 0.0795 0.0795 1 0.079456313
DTABOMPTarget::evaluate_ion_e 0.0186 0.0001 1 0.018584611
DTABOMPTarget::offload_ion_e 0.0185 0.0185 1 0.018526706
TwoBodyJastrowRef 0.0952 0.0952 1 0.095239758
Pseudopotential 17.3385 0.0305 5 3.467695513
DeterminantRef::spoval 12.3557 0.2324 10215 0.001209565
Single-Particle Orbitals 12.1233 12.1233 122580 0.000098901
OneBodyJastrowRef 0.0130 0.0130 10215 0.000001276
ParticleSet:::update 4.4694 0.0058 10215 0.000437532
DTABOMPTarget::evaluate_e_virtual 4.1122 0.0021 10215 0.000402561
DTABOMPTarget::offload_e_virtual 4.1101 4.1101 10215 0.000402359
DTABOMPTarget::evaluate_ion_virtual 0.3514 0.0025 10215 0.000034405
DTABOMPTarget::offload_ion_virtual 0.3489 0.3489 10215 0.000034156
TwoBodyJastrowRef 0.4699 0.4699 10215 0.000045998
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 4.27691e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 8.167e+10
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.74173e+07
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112968)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 112973)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_2 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113040)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113045)
miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 8
Number of walkers per rank = 8
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0714 0.0714 1 0.071419500
ParticleSet:::update 0.0000 0.0000 1 0.000004271
Total 44.9975 0.4686 1 44.997495777
Diffusion 23.7474 0.0209 5 4.749477106
Complete Updates 0.1966 0.0000 5 0.039318672
DeterminantRef::update 0.1966 0.1966 10 0.019657799
Current Gradient 1.0368 0.0143 30720 0.000033750
DeterminantRef::ratio 1.0153 1.0153 30720 0.000033050
OneBodyJastrowRef 0.0042 0.0042 30720 0.000000138
TwoBodyJastrowRef 0.0030 0.0030 30720 0.000000097
Kinetic Energy 0.2654 0.2651 5 0.053080173
OneBodyJastrowRef 0.0002 0.0002 5 0.000033901
TwoBodyJastrowRef 0.0001 0.0001 5 0.000019325
New Gradient 5.8018 0.0212 30720 0.000188861
DeterminantRef::ratio 0.1420 0.1420 30720 0.000004622
DeterminantRef::spovgl 5.0528 0.2804 30720 0.000164480
Single-Particle Orbitals 4.7724 4.7724 30720 0.000155352
OneBodyJastrowRef 0.0547 0.0547 30720 0.000001779
TwoBodyJastrowRef 0.5312 0.5312 30720 0.000017291
ParticleSet:::acceptMove 2.4966 0.0213 15371 0.000162420
DTAAOMPTarget::update_e_e 2.4422 2.4422 15371 0.000158882
DTABOMPTarget::update_ion_e 0.0331 0.0331 15371 0.000002155
ParticleSet:::computeNewPosDT 0.8688 0.0151 30720 0.000028282
DTAAOMPTarget::move_e_e 0.7640 0.7640 30720 0.000024871
DTABOMPTarget::move_ion_e 0.0896 0.0896 30720 0.000002918
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000000789
Update 13.0605 0.0133 15371 0.000849687
DeterminantRef::update 12.4937 12.4937 15371 0.000812808
OneBodyJastrowRef 0.0017 0.0017 15371 0.000000107
TwoBodyJastrowRef 0.5519 0.5519 15371 0.000035909
Initialization 2.8181 0.4923 1 2.818070391
DeterminantRef::inverse 1.0637 1.0637 2 0.531851775
DeterminantRef::spovgl 1.0282 0.1124 2 0.514102669
Single-Particle Orbitals 0.9158 0.9158 6144 0.000149054
OneBodyJastrowRef 0.0077 0.0077 1 0.007687805
ParticleSet:::update 0.1303 0.0219 2 0.065167320
DTAAOMPTarget::evaluate_e_e 0.0898 0.0898 1 0.089799057
DTABOMPTarget::evaluate_ion_e 0.0186 0.0001 1 0.018612597
DTABOMPTarget::offload_ion_e 0.0185 0.0185 1 0.018548890
TwoBodyJastrowRef 0.0958 0.0958 1 0.095818948
Pseudopotential 17.9635 0.0319 5 3.592695909
DeterminantRef::spoval 12.8554 0.2195 10215 0.001258478
Single-Particle Orbitals 12.6359 12.6359 122580 0.000103083
OneBodyJastrowRef 0.0151 0.0151 10215 0.000001478
ParticleSet:::update 4.5304 0.0072 10215 0.000443505
DTABOMPTarget::evaluate_e_virtual 4.1616 0.0025 10215 0.000407404
DTABOMPTarget::offload_e_virtual 4.1592 4.1592 10215 0.000407161
DTABOMPTarget::evaluate_ion_virtual 0.3616 0.0029 10215 0.000035400
DTABOMPTarget::offload_ion_virtual 0.3588 0.3588 10215 0.000035121
TwoBodyJastrowRef 0.5308 0.5308 10215 0.000051960
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 8.2468e+10
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 1.56264e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 3.36226e+07
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113045)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113040)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_3 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113144)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113149)
miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 16
Number of walkers per rank = 16
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0668 0.0668 1 0.066807586
ParticleSet:::update 0.0000 0.0000 1 0.000004194
Total 51.4021 0.7414 1 51.402070551
Diffusion 27.4747 0.0205 5 5.494941222
Complete Updates 0.2323 0.0000 5 0.046455391
DeterminantRef::update 0.2323 0.2323 10 0.023226010
Current Gradient 1.1426 0.0158 30720 0.000037195
DeterminantRef::ratio 1.1193 1.1193 30720 0.000036436
OneBodyJastrowRef 0.0044 0.0044 30720 0.000000144
TwoBodyJastrowRef 0.0031 0.0031 30720 0.000000102
Kinetic Energy 0.3076 0.3073 5 0.061517052
OneBodyJastrowRef 0.0002 0.0002 5 0.000032285
TwoBodyJastrowRef 0.0001 0.0001 5 0.000018519
New Gradient 6.6775 0.0193 30720 0.000217366
DeterminantRef::ratio 0.1465 0.1465 30720 0.000004767
DeterminantRef::spovgl 5.8941 0.2783 30720 0.000191867
Single-Particle Orbitals 5.6158 5.6158 30720 0.000182808
OneBodyJastrowRef 0.0580 0.0580 30720 0.000001887
TwoBodyJastrowRef 0.5596 0.5596 30720 0.000018217
ParticleSet:::acceptMove 3.1231 0.0255 15371 0.000203180
DTAAOMPTarget::update_e_e 3.0635 3.0635 15371 0.000199304
DTABOMPTarget::update_ion_e 0.0341 0.0341 15371 0.000002219
ParticleSet:::computeNewPosDT 0.7777 0.0148 30720 0.000025316
DTAAOMPTarget::move_e_e 0.6684 0.6684 30720 0.000021758
DTABOMPTarget::move_ion_e 0.0945 0.0945 30720 0.000003075
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000001637
Update 15.1934 0.0141 15371 0.000988446
DeterminantRef::update 14.5033 14.5033 15371 0.000943550
OneBodyJastrowRef 0.0016 0.0016 15371 0.000000101
TwoBodyJastrowRef 0.6745 0.6745 15371 0.000043878
Initialization 3.4351 0.8485 1 3.435084857
DeterminantRef::inverse 1.1304 1.1304 2 0.565204658
DeterminantRef::spovgl 1.1878 0.1288 2 0.593908241
Single-Particle Orbitals 1.0590 1.0590 6144 0.000172360
OneBodyJastrowRef 0.0083 0.0083 1 0.008285255
ParticleSet:::update 0.1514 0.0210 2 0.075708665
DTAAOMPTarget::evaluate_e_e 0.1101 0.1101 1 0.110121997
DTABOMPTarget::evaluate_ion_e 0.0203 0.0001 1 0.020336798
DTABOMPTarget::offload_ion_e 0.0203 0.0203 1 0.020266407
TwoBodyJastrowRef 0.1087 0.1087 1 0.108688709
Pseudopotential 19.7509 0.0407 5 3.950177296
DeterminantRef::spoval 14.1817 0.2771 10215 0.001388323
Single-Particle Orbitals 13.9046 13.9046 122580 0.000113433
OneBodyJastrowRef 0.0219 0.0219 10215 0.000002142
ParticleSet:::update 4.7974 0.0103 10215 0.000469646
DTABOMPTarget::evaluate_e_virtual 4.3966 0.0039 10215 0.000430404
DTABOMPTarget::offload_e_virtual 4.3927 4.3927 10215 0.000430020
DTABOMPTarget::evaluate_ion_virtual 0.3905 0.0033 10215 0.000038231
DTABOMPTarget::offload_ion_virtual 0.3872 0.3872 10215 0.000037904
TwoBodyJastrowRef 0.7092 0.7092 10215 0.000069424
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.44385e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 2.70129e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 6.11598e+07
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113144)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113149)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_4 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113274)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113279)
miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 32
Number of walkers per rank = 32
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0624 0.0624 1 0.062409120
ParticleSet:::update 0.0000 0.0000 1 0.000004199
Total 74.5545 1.1318 1 74.554523638
Diffusion 41.5660 0.0328 5 8.313199207
Complete Updates 0.3603 0.0000 5 0.072069559
DeterminantRef::update 0.3603 0.3603 10 0.036032264
Current Gradient 1.7267 0.0234 30720 0.000056208
DeterminantRef::ratio 1.6926 1.6926 30720 0.000055098
OneBodyJastrowRef 0.0065 0.0065 30720 0.000000212
TwoBodyJastrowRef 0.0042 0.0042 30720 0.000000137
Kinetic Energy 0.4461 0.4457 5 0.089215936
OneBodyJastrowRef 0.0002 0.0002 5 0.000043055
TwoBodyJastrowRef 0.0001 0.0001 5 0.000024284
New Gradient 10.0496 0.0279 30720 0.000327135
DeterminantRef::ratio 0.2205 0.2205 30720 0.000007178
DeterminantRef::spovgl 8.8648 0.4327 30720 0.000288567
Single-Particle Orbitals 8.4321 8.4321 30720 0.000274481
OneBodyJastrowRef 0.0880 0.0880 30720 0.000002865
TwoBodyJastrowRef 0.8484 0.8484 30720 0.000027616
ParticleSet:::acceptMove 5.6717 0.0379 15371 0.000368988
DTAAOMPTarget::update_e_e 5.5908 5.5908 15371 0.000363721
DTABOMPTarget::update_ion_e 0.0431 0.0431 15371 0.000002802
ParticleSet:::computeNewPosDT 1.1776 0.0197 30720 0.000038333
DTAAOMPTarget::move_e_e 1.0302 1.0302 30720 0.000033534
DTABOMPTarget::move_ion_e 0.1277 0.1277 30720 0.000004158
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000002399
Update 22.1012 0.0194 15371 0.001437849
DeterminantRef::update 20.9085 20.9085 15371 0.001360258
OneBodyJastrowRef 0.0021 0.0021 15371 0.000000136
TwoBodyJastrowRef 1.1712 1.1712 15371 0.000076195
Initialization 4.7395 1.0510 1 4.739470701
DeterminantRef::inverse 1.5892 1.5892 2 0.794613049
DeterminantRef::spovgl 1.7197 0.1758 2 0.859845106
Single-Particle Orbitals 1.5439 1.5439 6144 0.000251289
OneBodyJastrowRef 0.0103 0.0103 1 0.010297933
ParticleSet:::update 0.2473 0.0904 2 0.123629081
DTAAOMPTarget::evaluate_e_e 0.1287 0.1287 1 0.128688133
DTABOMPTarget::evaluate_ion_e 0.0281 0.0031 1 0.028120763
DTABOMPTarget::offload_ion_e 0.0250 0.0250 1 0.025004214
TwoBodyJastrowRef 0.1220 0.1220 1 0.122019323
Pseudopotential 27.1172 0.0756 5 5.423441918
DeterminantRef::spoval 19.6306 0.4108 10215 0.001921738
Single-Particle Orbitals 19.2198 19.2198 122580 0.000156794
OneBodyJastrowRef 0.0422 0.0422 10215 0.000004127
ParticleSet:::update 6.1264 0.0183 10215 0.000599746
DTABOMPTarget::evaluate_e_virtual 5.6166 0.0069 10215 0.000549837
DTABOMPTarget::offload_e_virtual 5.6097 5.6097 10215 0.000549164
DTABOMPTarget::evaluate_ion_virtual 0.4916 0.0076 10215 0.000048122
DTABOMPTarget::offload_ion_virtual 0.4839 0.4839 10215 0.000047376
TwoBodyJastrowRef 1.2425 1.2425 10215 0.000121631
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 1.99095e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.57105e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 8.90917e+07
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113279)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113274)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_5 #
###################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113478)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 113483)
miniqmc not built from git repository
number of ranks : 2, number of accelerators : 0
Number of orbitals/splines = 3072
Tile size = 3072
Number of tiles = 1
Number of electrons = 6144
Rmax = 1.7
AcceptanceRatio = 0.5
Iterations = 5
MPI processes = 2
OpenMP threads = 48
Number of walkers per rank = 48
SPO coefficients size = 1572864000 bytes (1500 MB)
delayed update rank = 32
Using the reference implementation for Jastrow,
determinant update, and distance table + einspline of the
reference implementation
==================================
Use --enable-timers= command line option to increase or decrease level of timing information
Stack timer profile
Timer Inclusive_time Exclusive_time Calls Time_per_call
Setup 0.0874 0.0874 1 0.087375768
ParticleSet:::update 0.0000 0.0000 1 0.000006093
Total 106.2653 0.2648 1 106.265251368
Diffusion 63.4955 0.0463 5 12.699109602
Complete Updates 0.3674 0.0000 5 0.073473642
DeterminantRef::update 0.3673 0.3673 10 0.036733776
Current Gradient 2.8390 0.0383 30720 0.000092415
DeterminantRef::ratio 2.7828 2.7828 30720 0.000090585
OneBodyJastrowRef 0.0114 0.0114 30720 0.000000372
TwoBodyJastrowRef 0.0065 0.0065 30720 0.000000213
Kinetic Energy 0.5897 0.5891 5 0.117936987
OneBodyJastrowRef 0.0004 0.0004 5 0.000075926
TwoBodyJastrowRef 0.0002 0.0002 5 0.000041192
New Gradient 15.2225 0.0441 30720 0.000495523
DeterminantRef::ratio 0.3757 0.3757 30720 0.000012231
DeterminantRef::spovgl 13.2822 0.7859 30720 0.000432362
Single-Particle Orbitals 12.4963 12.4963 30720 0.000406781
OneBodyJastrowRef 0.1437 0.1437 30720 0.000004678
TwoBodyJastrowRef 1.3768 1.3768 30720 0.000044817
ParticleSet:::acceptMove 8.6575 0.0243 15371 0.000563234
DTAAOMPTarget::update_e_e 8.5621 8.5621 15371 0.000557031
DTABOMPTarget::update_ion_e 0.0711 0.0711 15371 0.000004624
ParticleSet:::computeNewPosDT 1.8938 0.0244 30720 0.000061646
DTAAOMPTarget::move_e_e 1.6771 1.6771 30720 0.000054594
DTABOMPTarget::move_ion_e 0.1922 0.1922 30720 0.000006256
ParticleSet:::donePbyP 0.0000 0.0000 5 0.000002495
Update 33.8795 0.0157 15371 0.002204118
DeterminantRef::update 32.1439 32.1439 15371 0.002091204
OneBodyJastrowRef 0.0069 0.0069 15371 0.000000448
TwoBodyJastrowRef 1.7130 1.7130 15371 0.000111444
Initialization 6.4792 1.5664 1 6.479216213
DeterminantRef::inverse 2.2005 2.2005 2 1.100271850
DeterminantRef::spovgl 2.2997 0.1676 2 1.149850815
Single-Particle Orbitals 2.1321 2.1321 6144 0.000347025
OneBodyJastrowRef 0.0164 0.0164 1 0.016353389
ParticleSet:::update 0.2252 0.1021 2 0.112589045
DTAAOMPTarget::evaluate_e_e 0.0889 0.0889 1 0.088856692
DTABOMPTarget::evaluate_ion_e 0.0343 0.0080 1 0.034266822
DTABOMPTarget::offload_ion_e 0.0262 0.0262 1 0.026219456
TwoBodyJastrowRef 0.1711 0.1711 1 0.171064253
Pseudopotential 36.0257 0.1055 5 7.205130145
DeterminantRef::spoval 25.9077 0.6149 10215 0.002536239
Single-Particle Orbitals 25.2928 25.2928 122580 0.000206337
OneBodyJastrowRef 0.0622 0.0622 10215 0.000006089
ParticleSet:::update 8.2552 0.0268 10215 0.000808142
DTABOMPTarget::evaluate_e_virtual 7.5218 0.0088 10215 0.000736349
DTABOMPTarget::offload_e_virtual 7.5130 7.5130 10215 0.000735488
DTABOMPTarget::evaluate_ion_virtual 0.7066 0.0097 10215 0.000069169
DTABOMPTarget::offload_ion_virtual 0.6968 0.6968 10215 0.000068216
TwoBodyJastrowRef 1.6951 1.6951 10215 0.000165939
========== Throughput ============
Total throughput ( N_walkers * N_elec^3 / Total time ) = 2.09524e+11
Diffusion throughput ( N_walkers * N_elec^3 / Diffusion time ) = 3.50656e+11
Pseudopotential throughput ( N_walkers * N_elec^2 / Pseudopotential time ) = 1.00592e+08
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113478)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 113483)
Info: 1/2 lprof instances finished
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6
To display your profiling results:
###################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
###################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/miniqmc_DDR/intel/miniqmc/run/oneview_runs/compilers/icx_3/oneview_results_scal/tools/lprof_npsu_run_6 #
###################################################################################################################################################################################################