Run | Configuration |
---|---|
Run 2x1 | Number processes: 2; Number nodes: 1; Number processes per node: 2; Run Command: <executable> -g "4 2 2" -b; MPI Command: mpirun -np <number_processes>; Dataset: ; Run Directory: /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/run/oneview_runs/compilers/icx_7/oneview_run_1712860826; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread; OMP_NUM_THREADS: 1 |
Run 2x2 | OMP_NUM_THREADS: 2; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x4 | OMP_NUM_THREADS: 4; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x8 | OMP_NUM_THREADS: 8; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x16 | OMP_NUM_THREADS: 16; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x32 | OMP_NUM_THREADS: 32; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x56 | OMP_NUM_THREADS: 56; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
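
The run labels read as "MPI processes x OpenMP threads per process", which agrees with the Nb Threads columns (2, 4, 8, ..., 112). As a rough sketch of how the seven configurations listed above could be reproduced, the snippet below assembles one launch line per run. It assumes the launch is simply the MPI command followed by the run command with the environment variables exported beforehand; the report does not say how the tool combines them. The helper name `launch_line` is hypothetical, and `<executable>` is kept as the placeholder used in the report.

```python
import shlex

# Environment settings shared by every run in the configuration table.
COMMON_ENV = {
    "I_MPI_PIN_DOMAIN": "auto:scatter",
    "OMP_PLACES": "threads",
    "OMP_PROC_BIND": "spread",
}

# OMP_NUM_THREADS values for runs 2x1 through 2x56.
THREADS_PER_PROCESS = [1, 2, 4, 8, 16, 32, 56]


def launch_line(threads, processes=2, executable="<executable>"):
    """Build a shell-style launch line for one run (hypothetical helper).

    Assumption: the MPI command is prepended to the run command as-is.
    """
    env = dict(COMMON_ENV, OMP_NUM_THREADS=str(threads))
    env_part = " ".join(f"{key}={value}" for key, value in env.items())
    command = ["mpirun", "-np", str(processes), executable, "-g", "4 2 2", "-b"]
    return f"{env_part} {shlex.join(command)}"


if __name__ == "__main__":
    for threads in THREADS_PER_PROCESS:
        print(f"Run 2x{threads}: {launch_line(threads)}")
```

The executable placeholder is quoted by `shlex.join` because it contains shell metacharacters; substitute the real binary path before running anything.
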
Name | Module | Coverage 2x1 (%) | Coverage 2x2 (%) | Coverage 2x4 (%) | Coverage 2x8 (%) | Coverage 2x16 (%) | Coverage 2x32 (%) | Coverage 2x56 (%) | Max Time Over Threads 2x1 (s) | Max Time Over Threads 2x2 (s) | Max Time Over Threads 2x4 (s) | Max Time Over Threads 2x8 (s) | Max Time Over Threads 2x16 (s) | Max Time Over Threads 2x32 (s) | Max Time Over Threads 2x56 (s) | Time w.r.t. Wall Time 2x1 (s) | Time w.r.t. Wall Time 2x2 (s) | Time w.r.t. Wall Time 2x4 (s) | Time w.r.t. Wall Time 2x8 (s) | Time w.r.t. Wall Time 2x16 (s) | Time w.r.t. Wall Time 2x32 (s) | Time w.r.t. Wall Time 2x56 (s) | Nb Threads 2x1 | Nb Threads 2x2 | Nb Threads 2x4 | Nb Threads 2x8 | Nb Threads 2x16 | Nb Threads 2x32 | Nb Threads 2x56 | Deviation (coverage) 2x1 | Deviation (coverage) 2x2 | Deviation (coverage) 2x4 | Deviation (coverage) 2x8 | Deviation (coverage) 2x16 | Deviation (coverage) 2x32 | Deviation (coverage) 2x56 | Deviation (walltime) 2x1 | Deviation (walltime) 2x2 | Deviation (walltime) 2x4 | Deviation (walltime) 2x8 | Deviation (walltime) 2x16 | Deviation (walltime) 2x32 | Deviation (walltime) 2x56 | Categories 2x1 | Categories 2x2 | Categories 2x4 | Categories 2x8 | Categories 2x16 | Categories 2x32 | Categories 2x56 | GFLOPS 2x1 | GFLOPS 2x2 | GFLOPS 2x4 | GFLOPS 2x8 | GFLOPS 2x16 | GFLOPS 2x32 | GFLOPS 2x56 | Compilation Options | (2x1) Efficiency | (2x1) Potential Speed-Up (%) | (2x2) Efficiency | (2x2) Potential Speed-Up (%) | (2x4) Efficiency | (2x4) Potential Speed-Up (%) | (2x8) Efficiency | (2x8) Potential Speed-Up (%) | (2x16) Efficiency | (2x16) Potential Speed-Up (%) | (2x32) Efficiency | (2x32) Potential Speed-Up (%) | (2x56) Efficiency | (2x56) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
►miniqmcreference::einspline_spo_ref<double>::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<qmcplusplus::TinyVector<double, 3u>, std::allocator<... | exec | 21.27 | 20.98 | 21.01 | 21.45 | 21.11 | 18.96 | 17.44 | 13.53 | 13.02 | 13.62 | 13.57 | 14.24 | 15.24 | 21.2 | 13.18 | 13.03 | 12.97 | 13.09 | 13.29 | 14.14 | 19.75 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 1.03 | 0.50 | 0.77 | 0.66 | 0.60 | 0.50 | 0.42 | 0.56 | 0.19 | 0.52 | 0.39 | 0.38 | 0.40 | 0.42 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 11.70 | 23.69 | 47.59 | 94.27 | 185.68 | 349.11 | 437.40 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.01 | 0 | 1.02 | 0 | 1.01 | 0 | 0.99 | 0.17 | 0.93 | 1.29 | 0.67 | 5.8 |
►Loop 751 - MultiBsplineRef.hpp:187-286 - exec [...] | 0.03 | 0.01 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.03 | 0.03 | 0.05 | 0.05 | 0.02 | 0.01 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.30 | 1.65 | 1.48 | 2.70 | 12.25 | 11.68 | 20.78 | 1 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 753 - MultiBsplineRef.hpp:276-286 - exec | 0.44 | 0.44 | 0.45 | 0.43 | 0.45 | 0.37 | 0.32 | 0.27 | 0.29 | 0.32 | 0.3 | 0.41 | 0.36 | 0.52 | 0.27 | 0.27 | 0.28 | 0.26 | 0.28 | 0.28 | 0.36 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.02 | 0.05 | 0.04 | 0.08 | 0.05 | 0.04 | 0.00 | 0.01 | 0.03 | 0.02 | 0.05 | 0.04 | 0.04 | 7.29 | 14.90 | 29.30 | 63.39 | 116.07 | 232.75 | 318.41 | 1 | 0 | 1 | 0 | 0.96 | 0.02 | 1.04 | 0 | 0.96 | 0.02 | 0.96 | 0.01 | 0.75 | 0.08 | |||||||||
►Loop 754 - MultiBsplineRef.hpp:226-262 - exec [...] | 0 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.02 | 0.03 | 0.01 | 0.03 | 0.03 | 0.03 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 2 | 4 | 7 | 13 | 25 | 50 | 103 | 0.00 | 0.00 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.20 | 0.00 | 0.40 | 1.20 | 1.65 | 5.25 | 1 | 0 | 0 | 0.02 | 0 | 0.02 | 0 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0 | 0.01 | |||||||||
►Loop 755 - MultiBsplineRef.hpp:227-262 - exec [...] | 0.03 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.02 | 0.04 | 0.04 | 0.05 | 0.05 | 0.05 | 0.08 | 0.02 | 0.03 | 0.02 | 0.03 | 0.02 | 0.03 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.70 | 0.90 | 2.53 | 3.48 | 10.58 | 15.90 | 29.57 | 1 | 0 | 0.67 | 0.02 | 1 | 0 | 0.67 | 0.01 | 1 | 0 | 0.67 | 0.01 | 0.67 | 0.01 | |||||||||
►Loop 760 - MultiBsplineRef.hpp:242-262 - exec [...] | 1.17 | 1.32 | 1.26 | 1.35 | 1.27 | 1.1 | 0.88 | 0.73 | 1.01 | 0.88 | 1 | 0.93 | 1 | 1.19 | 0.72 | 0.82 | 0.78 | 0.83 | 0.8 | 0.82 | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.25 | 0.12 | 0.17 | 0.11 | 0.10 | 0.10 | 0.01 | 0.15 | 0.07 | 0.11 | 0.07 | 0.07 | 0.11 | 8.31 | 16.60 | 36.14 | 53.93 | 102.77 | 208.81 | 297.91 | 1 | 0 | 0.88 | 0.16 | 0.92 | 0.1 | 0.87 | 0.18 | 0.9 | 0.13 | 0.88 | 0.13 | 0.72 | 0.25 | |||||||||
○Loop 762 - MultiBsplineRef.hpp:242-262 - exec [...] | 10.22 | 11.22 | 10.8 | 11.16 | 11.15 | 10.29 | 10.13 | 6.49 | 7.39 | 8.04 | 8.04 | 8.22 | 9.41 | 14.67 | 6.34 | 6.97 | 6.67 | 6.81 | 7.02 | 7.67 | 11.47 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.44 | 1.07 | 1.35 | 1.46 | 1.31 | 1.03 | 0.88 | 0.24 | 0.59 | 0.83 | 0.87 | 0.82 | 0.74 | 0.97 | 14.88 | 26.80 | 55.02 | 113.81 | 222.75 | 406.22 | 477.78 | 1 | 0 | 0.91 | 1.01 | 0.95 | 0.53 | 0.93 | 0.77 | 0.9 | 1.08 | 0.83 | 1.78 | 0.55 | 4.53 | |||||||||
○Loop 761 - MultiBsplineRef.hpp:242-261 - exec [...] | 8.39 | 6.98 | 7.36 | 7.47 | 7.22 | 6.27 | 5.23 | 5.44 | 4.68 | 5.38 | 5.6 | 6.63 | 6.92 | 7.97 | 5.2 | 4.33 | 4.54 | 4.56 | 4.55 | 4.68 | 5.93 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.64 | 0.50 | 1.09 | 1.19 | 1.34 | 1.08 | 0.80 | 0.36 | 0.34 | 0.68 | 0.72 | 0.84 | 0.80 | 0.89 | 9.98 | 24.06 | 47.10 | 87.18 | 173.37 | 338.37 | 462.81 | 1 | 0 | 1.2 | 0 | 1.15 | 0 | 1.14 | 0 | 1.14 | 0 | 1.11 | 0 | 0.88 | 0.64 | |||||||||
○Loop 759 - MultiBsplineRef.hpp:242-261 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 758 - MultiBsplineRef.hpp:242-262 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 757 - MultiBsplineRef.hpp:242-262 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 756 - MultiBsplineRef.hpp:242-262 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 752 - MultiBsplineRef.hpp:276-286 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 749 - einspline_spo_ref.hpp:219-227 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 | 3 | 6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 750 - einspline_spo_ref.hpp:223-227 - exec [...] | 0.99 | 0.93 | 1.05 | 0.95 | 0.94 | 0.85 | 0.81 | 0.63 | 0.6 | 0.76 | 0.71 | 0.76 | 0.8 | 1.14 | 0.61 | 0.58 | 0.65 | 0.58 | 0.59 | 0.63 | 0.92 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.06 | 0.10 | 0.11 | 0.11 | 0.08 | 0.07 | 0.02 | 0.04 | 0.06 | 0.07 | 0.07 | 0.06 | 0.08 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.05 | 0 | 0.94 | 0.06 | 1.05 | 0 | 1.03 | 0 | 0.97 | 0.03 | 0.66 | 0.27 | |||||||||
►miniqmcreference::einspline_spo_ref<double>::evaluate(qmcplusplus::ParticleSet const&, int, qmcplusplus::Vector<double, std::allocator<double> >&) | exec | 21.01 | 21.43 | 20.9 | 21.16 | 21.35 | 22.52 | 23.51 | 13 | 13.36 | 12.94 | 13.02 | 13.74 | 17.17 | 28.12 | 13.02 | 13.31 | 12.91 | 12.91 | 13.44 | 16.79 | 26.62 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.06 | 0.12 | 0.28 | 0.26 | 0.33 | 0.40 | 0.94 | 0.04 | 0.19 | 0.17 | 0.16 | 0.21 | 0.29 | 1.17 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 8.33 | 16.33 | 33.78 | 67.80 | 130.20 | 208.06 | 230.28 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.98 | 0.47 | 1.01 | 0 | 1.01 | 0 | 0.97 | 0.67 | 0.78 | 5.06 | 0.49 | 12.01 |
►Loop 745 - MultiBsplineRef.hpp:42-71 - exec [...] | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.02 | 0.01 | 0.02 | 0.03 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 1.15 | 0.00 | 0.00 | 0.00 | 0.00 | 26.30 | 26.65 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
►Loop 746 - MultiBsplineRef.hpp:63-71 - exec | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0 | 0.01 | 0 | 0 | 0.01 | 0 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 13.70 | 0.00 | 0.00 | 106.00 | 0.00 | 272.31 | |||||||||||||||||||||||
►Loop 747 - MultiBsplineRef.hpp:64-71 - exec | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 7.80 | 11.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 748 - MultiBsplineRef.hpp:68-70 - exec | 20.94 | 21.39 | 20.88 | 21.14 | 21.32 | 22.49 | 23.48 | 12.96 | 13.32 | 12.92 | 13.01 | 13.73 | 17.16 | 28.1 | 12.98 | 13.29 | 12.89 | 12.9 | 13.42 | 16.77 | 26.59 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.04 | 0.12 | 0.28 | 0.27 | 0.33 | 0.40 | 0.94 | 0.04 | 0.18 | 0.16 | 0.16 | 0.21 | 0.29 | 1.17 | 8.34 | 16.34 | 33.79 | 67.77 | 130.23 | 208.05 | 230.30 | 1 | 0 | 0.98 | 0.5 | 1.01 | 0 | 1.01 | 0 | 0.97 | 0.7 | 0.77 | 5.08 | 0.49 | 12.02 | |||||||||
○Loop 744 - einspline_spo_ref.hpp:183-187 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 0 | 4 | 4 | 11 | 39 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►qmcplusplus::SoaDistanceTableABOMPTarget<double, 3u, 40>::evaluate(qmcplusplus::ParticleSet&) | exec | 13.01 | 13.06 | 13.12 | 13.41 | 13.08 | 11.43 | 8.29 | 8.1 | 8.16 | 8.16 | 8.2 | 8.39 | 8.78 | 10.04 | 8.06 | 8.11 | 8.1 | 8.18 | 8.23 | 8.52 | 9.39 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.21 | 0.19 | 0.21 | 0.12 | 0.21 | 0.29 | 0.27 | 0.08 | 0.10 | 0.12 | 0.07 | 0.12 | 0.20 | 0.34 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 9.14 | 18.22 | 36.59 | 72.73 | 144.51 | 278.67 | 378.03 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.99 | 0.08 | 1 | 0.06 | 0.99 | 0.2 | 0.98 | 0.27 | 0.95 | 0.62 | 0.86 | 1.17 |
○Loop 1972 - Mallocator.hpp:78-78 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 1969 - SoaDistanceTableABOMPTarget.h:194-196 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1968 - SoaDistanceTableABOMPTarget.h:195-196 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1970 - VectorSoAContainer.h:151-176 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1971 - Mallocator.hpp:78-78 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1967 - SoaDistanceTableABOMPTarget.h:194-196 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 1963 - SoaDistanceTableABOMPTarget.h:214-228 - exec [...] | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 1965 - SoaDistanceTableABOMPTarget.h:215-228 - exec [...] | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1964 - SoaDistanceTableABOMPTarget.h:228-228 - exec [...] | 12.99 | 13.03 | 13.11 | 13.4 | 13.07 | 11.41 | 8.28 | 8.09 | 8.15 | 8.16 | 8.2 | 8.38 | 8.78 | 10.01 | 8.05 | 8.1 | 8.1 | 8.17 | 8.22 | 8.51 | 9.37 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.22 | 0.20 | 0.21 | 0.13 | 0.21 | 0.29 | 0.27 | 0.09 | 0.11 | 0.12 | 0.07 | 0.12 | 0.20 | 0.34 | 9.15 | 18.24 | 36.58 | 72.79 | 144.63 | 278.89 | 378.74 | 1 | 0 | 0.99 | 0.08 | 0.99 | 0.08 | 0.99 | 0.2 | 0.98 | 0.27 | 0.95 | 0.62 | 0.86 | 1.17 | |||||||||
○Loop 1966 - SoaDistanceTableABOMPTarget.h:194-196 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 3 | 10 | 27 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1973 - stl_construct.h:98-107 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○mkl_blas_avx512_dgemm_kernel_nocopy_TN_b1 | libmkl_avx512.so.2 | 12.64 | 12.57 | 12.71 | 10.71 | 11.18 | 12.65 | 14.25 | 7.87 | 7.88 | 7.9 | 6.61 | 7.11 | 9.52 | 16.55 | 7.84 | 7.81 | 7.85 | 6.53 | 7.03 | 9.43 | 16.13 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.19 | 0.17 | 0.13 | 0.10 | 0.14 | 0.18 | 0.35 | 0.08 | 0.11 | 0.09 | 0.07 | 0.07 | 0.13 | 0.34 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 69.92 | 139.68 | 278.80 | 670.06 | 1244.77 | 1856.99 | 1851.24 | 1 | 0 | 1 | 0 | 1 | 0.02 | 1.2 | 0 | 1.12 | 0 | 0.83 | 2.13 | 0.49 | 7.32 | |
○mkl_blas_avx512_dgemm_kernel_0 | libmkl_avx512.so.2 | 11.96 | 12.14 | 12.31 | 12.81 | 12.82 | 12.49 | 12.93 | 7.39 | 7.58 | 7.64 | 7.91 | 8.21 | 9.4 | 15.29 | 7.41 | 7.54 | 7.6 | 7.82 | 8.07 | 9.32 | 14.64 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.08 | 0.18 | 0.17 | 0.17 | 0.17 | 0.20 | 0.29 | 0.01 | 0.10 | 0.08 | 0.09 | 0.09 | 0.13 | 0.28 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 89.55 | 175.21 | 348.75 | 677.59 | 1313.29 | 2275.57 | 2208.15 | 1 | 0 | 0.98 | 0.21 | 0.98 | 0.31 | 0.95 | 0.67 | 0.92 | 1.05 | 0.8 | 2.56 | 0.51 | 6.39 | |
►qmcplusplus::SoaDistanceTableAAOMPTarget<double, 3u, 40>::update(int) | exec | 3.39 | 3.5 | 3.42 | 3.39 | 3.55 | 4.87 | 6.57 | 2.13 | 2.28 | 2.3 | 2.18 | 2.42 | 4.07 | 8.07 | 2.1 | 2.17 | 2.11 | 2.07 | 2.23 | 3.63 | 7.44 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.07 | 0.20 | 0.21 | 0.15 | 0.17 | 0.27 | 0.29 | 0.06 | 0.13 | 0.13 | 0.09 | 0.11 | 0.20 | 0.31 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.97 | 0.11 | 1 | 0.02 | 1.01 | 0 | 0.94 | 0.21 | 0.58 | 2.05 | 0.28 | 4.72 |
○Loop 1724 - SoaDistanceTableAAOMPTarget.h:440-442 - exec [...] | 3.39 | 3.5 | 3.42 | 3.38 | 3.55 | 4.87 | 6.57 | 2.13 | 2.28 | 2.29 | 2.18 | 2.42 | 4.07 | 8.07 | 2.1 | 2.17 | 2.11 | 2.06 | 2.23 | 3.63 | 7.44 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.07 | 0.20 | 0.21 | 0.15 | 0.17 | 0.27 | 0.29 | 0.06 | 0.13 | 0.13 | 0.09 | 0.11 | 0.20 | 0.32 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.97 | 0.11 | 1 | 0.02 | 1.02 | 0 | 0.94 | 0.21 | 0.58 | 2.05 | 0.28 | 4.72 | |||||||||
►void qmcplusplus::DTD_BConds<double, 3u, 40>::computeDistances<qmcplusplus::TinyVector<double, 3u>, qmcplusplus::VectorSoAContainer<double, 3u, qmcplusplus::Mallocator<double, 32ul> >, qmcplusplus::VectorSoAContainer<double, 3... | exec | 2 | 1.9 | 2.1 | 2.08 | 2.03 | 1.91 | 1.77 | 1.23 | 1.25 | 1.47 | 1.5 | 1.62 | 1.75 | 2.48 | 1.24 | 1.18 | 1.3 | 1.27 | 1.27 | 1.42 | 2.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.17 | 0.19 | 0.19 | 0.19 | 0.16 | 0.14 | 0.00 | 0.10 | 0.11 | 0.11 | 0.12 | 0.12 | 0.15 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 30.99 | 65.08 | 118.09 | 242.10 | 484.26 | 867.00 | 1071.67 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.05 | 0 | 0.95 | 0.1 | 0.98 | 0.05 | 0.98 | 0.05 | 0.87 | 0.24 | 0.62 | 0.68 |
○Loop 1227 - ParticleBConds3DSoa.h:235-256 - exec | 1.99 | 1.88 | 2.09 | 2.07 | 2.02 | 1.9 | 1.76 | 1.23 | 1.25 | 1.46 | 1.49 | 1.61 | 1.75 | 2.47 | 1.23 | 1.17 | 1.29 | 1.26 | 1.27 | 1.42 | 2 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.17 | 0.19 | 0.19 | 0.20 | 0.16 | 0.14 | 0.01 | 0.10 | 0.11 | 0.11 | 0.12 | 0.12 | 0.15 | 31.22 | 65.60 | 118.95 | 243.92 | 484.03 | 866.67 | 1076.57 | 1 | 0 | 1.05 | 0 | 0.95 | 0.1 | 0.98 | 0.05 | 0.97 | 0.06 | 0.87 | 0.25 | 0.61 | 0.68 | |||||||||
○Loop 1226 - ParticleBConds3DSoa.h:235-255 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 31 | 58 | 110 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►__intel_avx_rep_memset | exec | 1.59 | 1.5 | 1.56 | 1.59 | 1.59 | 1.8 | 1.96 | 1.02 | 0.99 | 1.13 | 1.06 | 1.15 | 1.6 | 2.46 | 0.98 | 0.93 | 0.96 | 0.97 | 1 | 1.34 | 2.21 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.08 | 0.10 | 0.16 | 0.11 | 0.12 | 0.12 | 0.12 | 0.06 | 0.06 | 0.10 | 0.06 | 0.07 | 0.09 | 0.13 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | 0.01 | 0.03 | 0.06 | 0.11 | 0.24 | 0.32 | 0.20 | 1 | 0 | 1.05 | 0 | 1.02 | 0 | 1.01 | 0 | 0.98 | 0.03 | 0.73 | 0.48 | 0.44 | 1.09 | |
○Loop 2247 - - exec | 0.93 | 0.9 | 0.94 | 0.95 | 0.91 | 1.01 | 1.09 | 0.63 | 0.58 | 0.7 | 0.7 | 0.64 | 0.94 | 1.45 | 0.58 | 0.56 | 0.58 | 0.58 | 0.57 | 0.76 | 1.24 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.13 | 0.04 | 0.13 | 0.11 | 0.09 | 0.09 | 0.08 | 0.07 | 0.02 | 0.08 | 0.06 | 0.06 | 0.07 | 0.09 | 0.02 | 0.04 | 0.09 | 0.15 | 0.35 | 0.47 | 0.25 | 1 | 0 | 1.04 | 0 | 1 | 0 | 1 | 0 | 1.02 | 0 | 0.76 | 0.24 | 0.47 | 0.58 | |||||||||
►qmcplusplus::BsplineFunctor<double>::evaluateV(int, int, int, double const*, double*) const | exec | 1.52 | 1.62 | 1.55 | 1.57 | 1.61 | 1.72 | 1.61 | 0.99 | 1.12 | 1.09 | 1.04 | 1.18 | 1.55 | 2.32 | 0.94 | 1.01 | 0.95 | 0.96 | 1.01 | 1.28 | 1.82 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.12 | 0.23 | 0.18 | 0.08 | 0.12 | 0.13 | 0.15 | 0.08 | 0.14 | 0.11 | 0.05 | 0.08 | 0.09 | 0.18 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.24 | 2.29 | 4.93 | 9.65 | 18.46 | 29.25 | 36.26 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.93 | 0.11 | 0.99 | 0.02 | 0.98 | 0.03 | 0.93 | 0.11 | 0.73 | 0.46 | 0.52 | 0.78 |
○Loop 252 - BsplineFunctor.h:236-241 - exec | 1.31 | 1.29 | 1.29 | 1.31 | 1.39 | 1.54 | 1.5 | 0.85 | 0.9 | 0.92 | 0.88 | 1.02 | 1.37 | 2.19 | 0.81 | 0.8 | 0.79 | 0.8 | 0.87 | 1.15 | 1.69 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.08 | 0.20 | 0.15 | 0.06 | 0.12 | 0.13 | 0.15 | 0.06 | 0.12 | 0.09 | 0.04 | 0.07 | 0.09 | 0.17 | 0.00 | 0.01 | 0.02 | 0.05 | 0.10 | 0.19 | 0.21 | 1 | 0 | 1.01 | 0 | 1.03 | 0 | 1.01 | 0 | 0.93 | 0.1 | 0.7 | 0.46 | 0.48 | 0.78 | |||||||||
○Loop 250 - BsplineFunctor.h:246-260 - exec [...] | 0.2 | 0.3 | 0.23 | 0.23 | 0.2 | 0.15 | 0.09 | 0.13 | 0.2 | 0.22 | 0.24 | 0.18 | 0.17 | 0.18 | 0.12 | 0.18 | 0.14 | 0.14 | 0.12 | 0.12 | 0.1 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.02 | 0.07 | 0.07 | 0.05 | 0.05 | 0.03 | 0.02 | 0.01 | 0.04 | 0.04 | 0.03 | 0.04 | 0.03 | 9.02 | 12.03 | 31.34 | 61.86 | 144.94 | 291.14 | 614.02 | 1 | 0 | 0.67 | 0.1 | 0.86 | 0.03 | 0.86 | 0.03 | 1 | 0 | 1 | 0 | 1.2 | 0 | |||||||||
○Loop 251 - BsplineFunctor.h:236-241 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 249 - BsplineFunctor.h:246-260 - exec | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 38.40 | 0.00 | 0.00 | 282.81 | |||||||||||||||||||||||
○mkl_blas_avx512_dgemv_t_intrinsics | libmkl_avx512.so.2 | 1.17 | 1.25 | 1.3 | 1.44 | 1.37 | 1.22 | 1.31 | 0.8 | 0.85 | 0.9 | 0.97 | 0.97 | 1.09 | 1.69 | 0.73 | 0.78 | 0.8 | 0.88 | 0.86 | 0.91 | 1.49 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.18 | 0.10 | 0.10 | 0.10 | 0.10 | 0.10 | 0.08 | 0.11 | 0.07 | 0.06 | 0.06 | 0.06 | 0.07 | 0.09 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 12.47 | 23.30 | 45.29 | 81.70 | 167.30 | 317.24 | 333.59 | 1 | 0 | 0.94 | 0.08 | 0.91 | 0.11 | 0.83 | 0.25 | 0.85 | 0.21 | 0.8 | 0.24 | 0.49 | 0.67 | |
○mkl_blas_avx512_dgemv_n_intrinsics | libmkl_avx512.so.2 | 1.04 | 0.94 | 0.97 | 1 | 0.98 | 0.86 | 0.84 | 0.65 | 0.61 | 0.68 | 0.71 | 0.77 | 0.76 | 1.08 | 0.64 | 0.58 | 0.6 | 0.61 | 0.62 | 0.64 | 0.95 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.06 | 0.10 | 0.10 | 0.11 | 0.07 | 0.06 | 0.02 | 0.03 | 0.06 | 0.06 | 0.07 | 0.05 | 0.07 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 9.96 | 21.78 | 42.37 | 83.77 | 164.17 | 316.00 | 365.10 | 1 | 0 | 1.1 | 0 | 1.07 | 0 | 1.05 | 0 | 1.03 | 0 | 1 | 0 | 0.67 | 0.27 | |
○mkl_blas_avx512_dgemm_kernel_nocopy_TN_b0 | libmkl_avx512.so.2 | 0.87 | 0.98 | 0.96 | 0.83 | 0.83 | 0.9 | 0.94 | 0.6 | 0.62 | 0.72 | 0.6 | 0.63 | 0.76 | 1.18 | 0.54 | 0.61 | 0.59 | 0.5 | 0.52 | 0.67 | 1.07 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.16 | 0.03 | 0.11 | 0.08 | 0.07 | 0.08 | 0.05 | 0.10 | 0.02 | 0.07 | 0.05 | 0.05 | 0.06 | 0.05 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 67.28 | 118.71 | 246.79 | 582.18 | 1119.06 | 1737.93 | 1870.83 | 1 | 0 | 0.89 | 0.11 | 0.92 | 0.08 | 1.08 | 0 | 1.04 | 0 | 0.81 | 0.17 | 0.5 | 0.47 | |
►qmcplusplus::BsplineFunctor<double>::evaluateVGL(int, int, int, double const*, double*, double*, double*, double*, int*) const | exec | 0.81 | 0.74 | 0.76 | 0.73 | 0.73 | 0.68 | 0.66 | 0.51 | 0.49 | 0.51 | 0.54 | 0.55 | 0.59 | 0.84 | 0.5 | 0.46 | 0.47 | 0.44 | 0.46 | 0.51 | 0.74 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.04 | 0.06 | 0.08 | 0.08 | 0.08 | 0.07 | 0.05 | 0.02 | 0.03 | 0.05 | 0.05 | 0.05 | 0.05 | 0.06 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 2.22 | 4.78 | 9.33 | 19.79 | 37.53 | 66.71 | 79.62 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.09 | 0 | 1.06 | 0 | 1.14 | 0 | 1.09 | 0 | 0.98 | 0.01 | 0.68 | 0.21 |
○Loop 248 - BsplineFunctor.h:291-298 - exec | 0.65 | 0.61 | 0.6 | 0.6 | 0.6 | 0.57 | 0.55 | 0.42 | 0.39 | 0.43 | 0.45 | 0.48 | 0.51 | 0.71 | 0.4 | 0.38 | 0.37 | 0.36 | 0.38 | 0.43 | 0.63 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.06 | 0.04 | 0.08 | 0.08 | 0.07 | 0.06 | 0.05 | 0.04 | 0.02 | 0.05 | 0.05 | 0.04 | 0.04 | 0.05 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.02 | 0.05 | 1 | 0 | 1.05 | 0 | 1.08 | 0 | 1.11 | 0 | 1.05 | 0 | 0.93 | 0.04 | 0.63 | 0.2 | |||||||||
○Loop 246 - BsplineFunctor.h:303-338 - exec [...] | 0.13 | 0.1 | 0.11 | 0.1 | 0.09 | 0.08 | 0.07 | 0.08 | 0.08 | 0.09 | 0.08 | 0.1 | 0.11 | 0.14 | 0.08 | 0.06 | 0.07 | 0.06 | 0.06 | 0.06 | 0.07 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.03 | 0.02 | 0.02 | 0.03 | 0.02 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 13.33 | 35.24 | 59.94 | 139.35 | 275.56 | 544.47 | 808.84 | 1 | 0 | 1.33 | 0 | 1.14 | 0 | 1.33 | 0 | 1.33 | 0 | 1.33 | 0 | 1.14 | 0 | |||||||||
○Loop 245 - BsplineFunctor.h:303-338 - exec | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0 | 0.01 | 0 | 0 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 3.85 | 0.00 | 15.45 | 0.00 | 0.00 | 103.45 | 175.06 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 247 - BsplineFunctor.h:291-298 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::acceptMove(qmcplusplus::ParticleSet&, int) | exec | 0.78 | 0.81 | 0.75 | 0.76 | 0.77 | 0.91 | 1.13 | 0.52 | 0.54 | 0.51 | 0.54 | 0.58 | 0.77 | 1.47 | 0.48 | 0.5 | 0.47 | 0.46 | 0.48 | 0.68 | 1.28 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.09 | 0.05 | 0.05 | 0.07 | 0.09 | 0.07 | 0.08 | 0.06 | 0.03 | 0.03 | 0.04 | 0.05 | 0.05 | 0.09 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 10.78 | 20.55 | 43.50 | 89.14 | 170.61 | 239.35 | 223.10 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.96 | 0.03 | 1.02 | 0 | 1.04 | 0 | 1 | 0 | 0.71 | 0.27 | 0.38 | 0.71 |
○Loop 278 - TwoBodyJastrowRef.h:324-331 - exec | 0.2 | 0.21 | 0.2 | 0.21 | 0.22 | 0.25 | 0.32 | 0.13 | 0.15 | 0.16 | 0.16 | 0.18 | 0.25 | 0.46 | 0.12 | 0.13 | 0.12 | 0.13 | 0.14 | 0.19 | 0.37 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.01 | 0.03 | 0.03 | 0.03 | 0.02 | 0.03 | 0.04 | 15.38 | 28.31 | 59.94 | 106.60 | 195.55 | 288.91 | 257.39 | 1 | 0 | 0.92 | 0.02 | 1 | 0 | 0.92 | 0.02 | 0.86 | 0.03 | 0.63 | 0.09 | 0.32 | 0.22 | |||||||||
►Loop 273 - TwoBodyJastrowRef.h:324-349 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 4 | 6 | 10 | 25 | 34 | 51 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 275 - TwoBodyJastrowRef.h:342-347 - exec | 0.56 | 0.6 | 0.55 | 0.54 | 0.54 | 0.64 | 0.79 | 0.39 | 0.39 | 0.37 | 0.38 | 0.44 | 0.53 | 1.08 | 0.35 | 0.37 | 0.34 | 0.33 | 0.34 | 0.47 | 0.9 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.11 | 0.07 | 0.04 | 0.06 | 0.08 | 0.06 | 0.07 | 0.07 | 0.04 | 0.02 | 0.03 | 0.05 | 0.04 | 0.07 | 9.44 | 17.72 | 38.81 | 81.88 | 159.66 | 228.55 | 210.65 | 1 | 0 | 0.95 | 0.03 | 1.03 | 0 | 1.06 | 0 | 1.03 | 0 | 0.74 | 0.16 | 0.39 | 0.48 | |||||||||
○Loop 274 - TwoBodyJastrowRef.h:342-347 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 280 - TwoBodyJastrowRef.h:269-274 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 4 | 12 | 15 | 38 | 79 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 276 - TwoBodyJastrowRef.h:324-331 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 279 - TwoBodyJastrowRef.h:269-274 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 277 - TwoBodyJastrowRef.h:324-331 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○unknown_function | Unknown module | 0.66 | 0.66 | 0.73 | 0.7 | 0.7 | 0.7 | 0.72 | 0.43 | 0.5 | 0.52 | 0.51 | 0.59 | 0.63 | 0.93 | 0.41 | 0.41 | 0.45 | 0.43 | 0.44 | 0.52 | 0.81 | 3 | 4 | 9 | 17 | 33 | 65 | 114 | 4.43 | 0.10 | 1.84 | 1.66 | 1.33 | 0.48 | 7.48 | 0.23 | 0.06 | 0.15 | 0.11 | 0.09 | 0.08 | 0.12 | Others (%): 99.39 OMP (%): 0.61 | Others (%): 100.00 | Others (%): 99.86 MPI (%): 0.14 | Others (%): 100.00 | Others (%): 99.93 MPI (%): 0.04 OMP (%): 0.04 | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.03 | 0.07 | 0.05 | 1 | 0 | 1 | 0 | 0.91 | 0.06 | 0.95 | 0.03 | 0.93 | 0.05 | 0.79 | 0.15 | 0.51 | 0.36 | |
►miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | exec | 0.54 | 0.57 | 0.58 | 0.6 | 0.6 | 0.51 | 0.47 | 0.35 | 0.37 | 0.4 | 0.46 | 0.46 | 0.43 | 0.67 | 0.34 | 0.35 | 0.36 | 0.36 | 0.38 | 0.38 | 0.53 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.07 | 0.05 | 0.08 | 0.07 | 0.05 | 0.04 | 0.02 | 0.04 | 0.03 | 0.05 | 0.04 | 0.04 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 7.63 | 14.80 | 29.47 | 59.22 | 112.10 | 223.26 | 279.71 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.97 | 0.02 | 0.94 | 0.03 | 0.94 | 0.03 | 0.89 | 0.06 | 0.89 | 0.05 | 0.64 | 0.17 |
○Loop 265 - TwoBodyJastrowRef.h:155-156 - exec | 0.17 | 0.17 | 0.16 | 0.16 | 0.16 | 0.13 | 0.13 | 0.11 | 0.12 | 0.14 | 0.16 | 0.17 | 0.14 | 0.21 | 0.1 | 0.11 | 0.1 | 0.1 | 0.1 | 0.1 | 0.15 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.02 | 0.03 | 0.04 | 0.04 | 0.03 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.03 | 0.02 | 0.02 | 6.58 | 12.89 | 29.33 | 59.36 | 119.24 | 241.10 | 281.52 | 1 | 0 | 0.91 | 0.02 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.67 | 0.04 | |||||||||
○Loop 269 - TwoBodyJastrowRef.h:155-156 - exec | 0.16 | 0.15 | 0.17 | 0.16 | 0.17 | 0.15 | 0.13 | 0.1 | 0.12 | 0.13 | 0.14 | 0.16 | 0.16 | 0.19 | 0.1 | 0.1 | 0.1 | 0.1 | 0.11 | 0.11 | 0.15 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 7.76 | 15.68 | 31.12 | 62.36 | 110.69 | 217.50 | 277.98 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.91 | 0.02 | 0.91 | 0.01 | 0.67 | 0.04 | |||||||||
○Loop 267 - TwoBodyJastrowRef.h:155-156 - exec | 0.15 | 0.14 | 0.16 | 0.17 | 0.16 | 0.14 | 0.12 | 0.1 | 0.12 | 0.13 | 0.14 | 0.17 | 0.16 | 0.2 | 0.09 | 0.09 | 0.1 | 0.1 | 0.1 | 0.1 | 0.14 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.03 | 0.02 | 0.02 | 0.03 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 8.93 | 16.85 | 31.48 | 61.22 | 122.83 | 242.01 | 304.45 | 1 | 0 | 1 | 0 | 0.9 | 0.02 | 0.9 | 0.02 | 0.9 | 0.02 | 0.9 | 0.01 | 0.64 | 0.04 | |||||||||
○Loop 271 - stl_numeric.h:126-127 - exec [...] | 0.06 | 0.08 | 0.07 | 0.08 | 0.09 | 0.07 | 0.06 | 0.04 | 0.06 | 0.07 | 0.09 | 0.08 | 0.09 | 0.11 | 0.04 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.07 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.02 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02 | 0.02 | 8.43 | 12.96 | 27.33 | 58.02 | 120.41 | 243.76 | 299.07 | 1 | 0 | 0.8 | 0.02 | 0.8 | 0.01 | 0.8 | 0.02 | 0.8 | 0.02 | 0.8 | 0.01 | 0.57 | 0.03 | |||||||||
○Loop 268 - TwoBodyJastrowRef.h:155-156 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 264 - TwoBodyJastrowRef.h:155-156 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 270 - TwoBodyJastrowRef.h:0-0 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 5 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 272 - TwoBodyJastrowRef.h:269-274 - exec [...] | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.02 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 8 | 14 | 29 | 57 | 100 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 266 - TwoBodyJastrowRef.h:155-156 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<... | exec | 0.53 | 0.52 | 0.51 | 0.53 | 0.56 | 0.67 | 0.8 | 0.33 | 0.33 | 0.34 | 0.34 | 0.37 | 0.56 | 1.12 | 0.33 | 0.32 | 0.32 | 0.32 | 0.35 | 0.5 | 0.91 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.02 | 0.02 | 0.02 | 0.02 | 0.05 | 0.13 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.14 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 4.58 | 9.44 | 18.87 | 37.76 | 69.07 | 96.64 | 92.89 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.03 | 0 | 1.03 | 0 | 1.03 | 0 | 0.94 | 0.03 | 0.66 | 0.23 | 0.36 | 0.51 |
►Loop 864 - inner_product.hpp:82-155 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 7 | 13 | 30 | 48 | 80 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 866 - inner_product.hpp:155-155 - exec [...] | 0.41 | 0.37 | 0.36 | 0.38 | 0.39 | 0.46 | 0.55 | 0.26 | 0.24 | 0.24 | 0.26 | 0.3 | 0.42 | 0.83 | 0.25 | 0.23 | 0.22 | 0.23 | 0.25 | 0.35 | 0.63 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.02 | 0.03 | 0.04 | 0.03 | 0.04 | 0.09 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 0.10 | 4.53 | 9.81 | 20.53 | 39.31 | 72.37 | 103.44 | 100.50 | 1 | 0 | 1.09 | 0 | 1.14 | 0 | 1.09 | 0 | 1 | 0 | 0.71 | 0.13 | 0.4 | 0.33 | |||||||||
○Loop 868 - inner_product.hpp:82-83 - exec | 0.12 | 0.14 | 0.15 | 0.15 | 0.16 | 0.21 | 0.24 | 0.09 | 0.11 | 0.12 | 0.13 | 0.13 | 0.19 | 0.38 | 0.08 | 0.09 | 0.09 | 0.09 | 0.1 | 0.15 | 0.28 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.03 | 0.02 | 0.02 | 0.03 | 0.03 | 0.03 | 0.05 | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.05 | 4.73 | 8.46 | 16.75 | 33.57 | 60.32 | 80.27 | 75.38 | 1 | 0 | 0.89 | 0.02 | 0.89 | 0.02 | 0.89 | 0.02 | 0.8 | 0.03 | 0.53 | 0.1 | 0.29 | 0.17 | |||||||||
○Loop 865 - inner_product.hpp:155-155 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 867 - inner_product.hpp:82-83 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evalGrad(qmcplusplus::ParticleSet&, int) | exec | 0.48 | 0.41 | 0.39 | 0.41 | 0.4 | 0.46 | 0.52 | 0.3 | 0.29 | 0.28 | 0.3 | 0.38 | 0.43 | 0.69 | 0.3 | 0.25 | 0.24 | 0.25 | 0.25 | 0.35 | 0.59 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.06 | 0.05 | 0.06 | 0.07 | 0.06 | 0.05 | 0.01 | 0.04 | 0.03 | 0.03 | 0.04 | 0.04 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 3.85 | 9.14 | 18.84 | 36.19 | 73.29 | 105.08 | 108.95 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.2 | 0 | 1.25 | 0 | 1.2 | 0 | 1.2 | 0 | 0.86 | 0.07 | 0.51 | 0.26 |
○Loop 853 - inner_product.hpp:155-155 - exec [...] | 0.48 | 0.41 | 0.39 | 0.41 | 0.4 | 0.46 | 0.52 | 0.3 | 0.28 | 0.28 | 0.3 | 0.38 | 0.43 | 0.69 | 0.3 | 0.25 | 0.24 | 0.25 | 0.25 | 0.34 | 0.59 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.06 | 0.05 | 0.06 | 0.07 | 0.06 | 0.05 | 0.01 | 0.04 | 0.03 | 0.03 | 0.04 | 0.04 | 0.05 | 3.79 | 9.12 | 18.82 | 36.10 | 72.90 | 107.83 | 108.68 | 1 | 0 | 1.2 | 0 | 1.25 | 0 | 1.2 | 0 | 1.2 | 0 | 0.88 | 0.05 | 0.51 | 0.26 | |||||||||
○Loop 852 - inner_product.hpp:155-155 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○unknown_kernel_region | kernel | 0.44 | 0.36 | 0.33 | 0.34 | 0.32 | 0.31 | 0.31 | 0.27 | 0.25 | 0.21 | 0.24 | 0.27 | 0.32 | 0.55 | 0.27 | 0.22 | 0.21 | 0.21 | 0.2 | 0.23 | 0.35 | 3 | 5 | 9 | 17 | 33 | 65 | 113 | 9.37 | 22.20 | 4.06 | 4.04 | 2.85 | 1.65 | 0.91 | 0.15 | 0.09 | 0.06 | 0.04 | 0.04 | 0.05 | 0.12 | System (%): 100.00 | System (%): 97.16 Math (%): 2.84 | System (%): 97.55 Math (%): 1.53 MPI (%): 0.61 OMP (%): 0.31 | System (%): 96.82 Math (%): 3.03 MPI (%): 0.15 | System (%): 96.28 Math (%): 3.40 OMP (%): 0.32 | System (%): 92.20 Math (%): 6.95 OMP (%): 0.85 | System (%): 84.89 Math (%): 13.63 OMP (%): 1.47 Pthread (%): 0.01 | 0.05 | 0.17 | 0.43 | 0.79 | 1.71 | 3.28 | 3.12 | 1 | 0 | 1.23 | 0 | 1.29 | 0 | 1.29 | 0 | 1.35 | 0 | 1.17 | 0 | 0.77 | 0.07 | |
►miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | exec | 0.44 | 0.37 | 0.41 | 0.43 | 0.41 | 0.34 | 0.33 | 0.28 | 0.28 | 0.31 | 0.34 | 0.37 | 0.32 | 0.52 | 0.27 | 0.23 | 0.26 | 0.26 | 0.26 | 0.25 | 0.37 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.06 | 0.06 | 0.06 | 0.06 | 0.05 | 0.04 | 0.01 | 0.04 | 0.04 | 0.03 | 0.04 | 0.03 | 0.04 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 5.58 | 13.14 | 23.45 | 46.66 | 92.75 | 192.24 | 228.23 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.17 | 0 | 1.04 | 0 | 1.04 | 0 | 1.04 | 0 | 1.08 | 0 | 0.73 | 0.09 |
○Loop 854 - inner_product.hpp:155-155 - exec [...] | 0.33 | 0.31 | 0.32 | 0.33 | 0.32 | 0.26 | 0.24 | 0.21 | 0.23 | 0.23 | 0.29 | 0.28 | 0.25 | 0.38 | 0.21 | 0.19 | 0.2 | 0.2 | 0.2 | 0.19 | 0.27 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.06 | 0.05 | 0.06 | 0.05 | 0.05 | 0.03 | 0.01 | 0.04 | 0.03 | 0.04 | 0.03 | 0.03 | 0.04 | 5.41 | 11.86 | 22.76 | 45.47 | 90.25 | 190.35 | 234.59 | 1 | 0 | 1.11 | 0 | 1.05 | 0 | 1.05 | 0 | 1.05 | 0 | 1.11 | 0 | 0.78 | 0.05 | |||||||||
○Loop 857 - inner_product.hpp:82-83 - exec | 0.1 | 0.06 | 0.08 | 0.09 | 0.09 | 0.08 | 0.09 | 0.06 | 0.04 | 0.07 | 0.08 | 0.1 | 0.1 | 0.15 | 0.06 | 0.04 | 0.05 | 0.06 | 0.06 | 0.06 | 0.1 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.00 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 6.13 | 19.20 | 30.80 | 50.33 | 100.54 | 197.14 | 209.83 | 1 | 0 | 1.5 | 0 | 1.2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.6 | 0.04 | |||||||||
○Loop 855 - inner_product.hpp:155-155 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 856 - DiracDeterminantRef.cpp:0-0 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►qmcplusplus::SPOSet::evaluateDetRatios(qmcplusplus::VirtualParticleSet const&, qmcplusplus::Vector<double, std::allocator<double> >&, qmcplusplus::Vector<double, std::allocator<double> > const&, std::vector<double, st... | exec | 0.4 | 0.42 | 0.35 | 0.39 | 0.38 | 0.35 | 0.32 | 0.28 | 0.28 | 0.27 | 0.31 | 0.29 | 0.36 | 0.44 | 0.25 | 0.26 | 0.22 | 0.24 | 0.24 | 0.26 | 0.36 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.07 | 0.02 | 0.06 | 0.07 | 0.05 | 0.05 | 0.04 | 0.05 | 0.02 | 0.04 | 0.04 | 0.03 | 0.04 | 0.05 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 5.97 | 11.68 | 27.60 | 51.23 | 101.41 | 186.83 | 236.54 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.96 | 0.02 | 1.14 | 0 | 1.04 | 0 | 1.04 | 0 | 0.96 | 0.01 | 0.69 | 0.1 |
►Loop 763 - inner_product.hpp:82-83 - exec [...] | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 3 | 6 | 12 | 26 | 49 | 93 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 765 - inner_product.hpp:82-83 - exec | 0.39 | 0.42 | 0.35 | 0.39 | 0.38 | 0.35 | 0.31 | 0.28 | 0.27 | 0.27 | 0.31 | 0.29 | 0.36 | 0.44 | 0.24 | 0.26 | 0.22 | 0.24 | 0.24 | 0.26 | 0.35 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.09 | 0.03 | 0.06 | 0.06 | 0.05 | 0.05 | 0.04 | 0.06 | 0.02 | 0.04 | 0.04 | 0.03 | 0.04 | 0.05 | 6.22 | 11.66 | 27.47 | 50.98 | 100.92 | 186.11 | 242.27 | 1 | 0 | 0.92 | 0.03 | 1.09 | 0 | 1 | 0 | 1 | 0 | 0.92 | 0.03 | 0.69 | 0.1 | |||||||||
○Loop 764 - inner_product.hpp:82-83 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 3 | 5 | 5 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►qmcplusplus::BsplineAllocator<double, 32ul, qmcplusplus::Mallocator<double, 32ul> >::setCoefficientsForOrbitals(int, int, Array<double, 3u>&, multi_UBspline_3d_d*) [clone .extracted] | exec | 0.34 | 0.2 | 0.09 | 0.05 | 0.02 | 0.01 | 0 | 0.22 | 0.14 | 0.06 | 0.04 | 0.02 | 0.01 | 0 | 0.21 | 0.12 | 0.06 | 0.03 | 0.02 | 0.01 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.03 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.86 | 3.26 | 6.53 | 13.07 | 19.25 | 38.40 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.75 | 0 | 3.5 | 0 | 7 | 0 | 10.5 | 0 | 21 | 0 | 1 | 0 |
►Loop 726 - BsplineAllocator.hpp:172-181 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 727 - BsplineAllocator.hpp:179-180 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 728 - BsplineAllocator.hpp:179-180 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 110 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 729 - BsplineAllocator.hpp:172-181 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 3 | 1 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 730 - BsplineAllocator.hpp:179-180 - exec | 0.34 | 0.2 | 0.09 | 0.05 | 0.02 | 0.01 | 0 | 0.22 | 0.14 | 0.06 | 0.04 | 0.02 | 0.01 | 0 | 0.21 | 0.12 | 0.06 | 0.03 | 0.02 | 0.01 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.03 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 1.86 | 3.25 | 6.51 | 13.05 | 19.25 | 38.40 | 0.00 | 1 | 0 | 1.75 | 0 | 3.5 | 0 | 7 | 0 | 10.5 | 0 | 21 | 0 | 1 | 0 | |||||||||
►__intel_avx_rep_memcpy | exec | 0.26 | 0.24 | 0.28 | 0.32 | 0.31 | 0.29 | 0.26 | 0.17 | 0.19 | 0.18 | 0.25 | 0.25 | 0.28 | 0.39 | 0.16 | 0.15 | 0.17 | 0.19 | 0.2 | 0.22 | 0.3 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.05 | 0.03 | 0.04 | 0.05 | 0.04 | 0.04 | 0.01 | 0.03 | 0.02 | 0.03 | 0.03 | 0.03 | 0.04 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | Memory (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.02 | 0.00 | 0.07 | 1 | 0 | 1.07 | 0 | 0.94 | 0.02 | 0.84 | 0.05 | 0.8 | 0.06 | 0.73 | 0.08 | 0.53 | 0.12 | |
○Loop 2245 - - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 2246 - - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○MPL_gpu_cuda_finalize | libmpi.so.12.0.0 | 0.23 | 0.09 | 0.01 | 0.02 | 0 | 0 | 0 | 0.28 | 0.22 | 0.05 | 0.23 | 0.04 | 0.05 | 0.24 | 0.14 | 0.06 | 0.01 | 0.01 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.03 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 2.33 | 0 | 14 | 0 | 14 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
►miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<double, ... | exec | 0.15 | 0.18 | 0.17 | 0.19 | 0.18 | 0.16 | 0.12 | 0.1 | 0.12 | 0.12 | 0.13 | 0.13 | 0.14 | 0.2 | 0.09 | 0.11 | 0.11 | 0.11 | 0.11 | 0.12 | 0.14 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 8.38 | 13.75 | 27.53 | 55.06 | 111.20 | 202.41 | 303.97 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.82 | 0.03 | 0.82 | 0.03 | 0.82 | 0.03 | 0.82 | 0.03 | 0.75 | 0.04 | 0.64 | 0.04 |
○Loop 282 - TwoBodyJastrowRef.h:423-427 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 3 | 9 | 15 | 26 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 283 - TwoBodyJastrowRef.h:268-420 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 284 - TwoBodyJastrowRef.h:268-420 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 3 | 6 | 13 | 22 | 42 | 67 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 286 - TwoBodyJastrowRef.h:393-398 - exec | 0.05 | 0.06 | 0.07 | 0.06 | 0.07 | 0.06 | 0.04 | 0.05 | 0.04 | 0.05 | 0.06 | 0.07 | 0.07 | 0.07 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.05 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.03 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 7.50 | 11.55 | 22.95 | 45.26 | 91.04 | 182.02 | 318.31 | 1 | 0 | 0.75 | 0.01 | 0.75 | 0.02 | 0.75 | 0.01 | 0.75 | 0.02 | 0.75 | 0.01 | 0.75 | 0.01 | |||||||||
○Loop 296 - TwoBodyJastrowRef.h:375-376 - exec | 0.02 | 0.03 | 0.02 | 0.04 | 0.03 | 0.03 | 0.01 | 0.02 | 0.03 | 0.03 | 0.04 | 0.04 | 0.04 | 0.04 | 0.01 | 0.02 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 14.00 | 12.80 | 49.20 | 47.25 | 93.85 | 182.23 | 319.98 | 1 | 0 | 0.5 | 0.01 | 1 | 0 | 0.5 | 0.02 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | |||||||||
○Loop 288 - TwoBodyJastrowRef.h:388-391 - exec | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.03 | 0.04 | 0.03 | 0.04 | 0.04 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 5.80 | 26.20 | 54.30 | 114.00 | 228.71 | 466.81 | 417.16 | 1 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 1 | 0 | |||||||||
○Loop 292 - TwoBodyJastrowRef.h:381-382 - exec | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.02 | 0.03 | 0.03 | 0.05 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 3.80 | 16.80 | 36.80 | 68.90 | 130.10 | 257.41 | 214.56 | 1 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 1 | 0 | |||||||||
○Loop 290 - TwoBodyJastrowRef.h:381-382 - exec | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.03 | 0.02 | 0.03 | 0.02 | 0.06 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 6.85 | 15.20 | 28.45 | 58.40 | 129.00 | 242.41 | 213.03 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 | |||||||||
○Loop 298 - stl_numeric.h:126-127 - exec [...] | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 5.60 | 8.80 | 14.40 | 0.00 | 57.70 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 293 - TwoBodyJastrowRef.h:381-382 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 9 | 10 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 295 - TwoBodyJastrowRef.h:375-376 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 2 | 2 | 2 | 3 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 287 - TwoBodyJastrowRef.h:388-391 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 291 - TwoBodyJastrowRef.h:381-382 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 289 - TwoBodyJastrowRef.h:381-382 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 3 | 6 | 6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 297 - stl_numeric.h:126-127 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 299 - TwoBodyJastrowRef.h:269-274 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 1 | 3 | 8 | 16 | 30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 294 - TwoBodyJastrowRef.h:381-382 - exec | 0 | 0.02 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0 | 0.03 | 0.02 | 0.02 | 0.03 | 0.04 | 0.06 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 10.80 | 26.00 | 54.80 | 120.20 | 237.31 | 213.73 | 1 | 0 | 0 | 0.02 | 0 | 0.01 | 0 | 0.02 | 0 | 0.02 | 0 | 0.02 | 0 | 0.02 | |||||||||
○Loop 285 - TwoBodyJastrowRef.h:388-391 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○MPIR_Progress_hook_exec_on_vci | libmpi.so.12.0.0 | 0.15 | 0.07 | 0.01 | 0.01 | 0 | 0 | 0 | 0.18 | 0.18 | 0.04 | 0.14 | 0.03 | 0.04 | 0.2 | 0.09 | 0.05 | 0.01 | 0.01 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.8 | 0 | 9 | 0 | 9 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○__GI___pthread_mutex_lock | libpthread-2.28.so | 0.13 | 0.08 | 0 | 0 | 0 | 0 | 0 | 0.16 | 0.19 | 0.01 | 0.04 | 0 | 0.03 | 0.17 | 0.08 | 0.05 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.6 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○mkl_lapack_xdlaswp | libmkl_core.so.2 | 0.13 | 0.12 | 0.11 | 0.12 | 0.13 | 0.15 | 0.2 | 0.09 | 0.09 | 0.07 | 0.09 | 0.11 | 0.14 | 0.31 | 0.08 | 0.07 | 0.07 | 0.07 | 0.08 | 0.11 | 0.22 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.03 | 0.01 | 0.02 | 0.02 | 0.02 | 0.03 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.14 | 0 | 1.14 | 0 | 1.14 | 0 | 1 | 0 | 0.73 | 0.04 | 0.36 | 0.13 | |
►qmcplusplus::DiracMatrix<double, double>::invert_transpose(qmcplusplus::Matrix<double, std::allocator<double> > const&, qmcplusplus::Matrix<double, std::allocator<double> >&, double&, double&) | exec | 0.13 | 0.11 | 0.14 | 0.11 | 0.1 | 0.11 | 0.13 | 0.1 | 0.11 | 0.1 | 0.11 | 0.12 | 0.13 | 0.29 | 0.08 | 0.07 | 0.09 | 0.07 | 0.07 | 0.08 | 0.15 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.03 | 0.06 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.14 | 0 | 0.89 | 0.02 | 1.14 | 0 | 1.14 | 0 | 1 | 0 | 0.53 | 0.06 |
○Loop 827 - DiracMatrix.h:31-35 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 830 - DiracMatrix.h:112-113 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 828 - DiracMatrix.h:31-35 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 829 - DiracMatrix.h:112-113 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 832 - inner_product.hpp:210-212 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 833 - inner_product.hpp:210-212 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 834 - inner_product.hpp:210-212 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 9 | 8 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 835 - inner_product.hpp:211-212 - exec | 0.13 | 0.11 | 0.14 | 0.11 | 0.1 | 0.11 | 0.13 | 0.1 | 0.11 | 0.1 | 0.11 | 0.12 | 0.13 | 0.29 | 0.08 | 0.07 | 0.09 | 0.06 | 0.07 | 0.08 | 0.15 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.03 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.14 | 0 | 0.89 | 0.02 | 1.33 | 0 | 1.14 | 0 | 1 | 0 | 0.53 | 0.06 | |||||||||
○Loop 831 - inner_product.hpp:211-212 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○uct_rc_mlx5_iface_progress_cyclic | libuct_ib.so.0.0.0 | 0.12 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0.15 | 0.07 | 0.07 | 0.07 | 0.01 | 0.01 | 0.14 | 0.08 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 4 | 0 | 8 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○__pthread_mutex_unlock_usercnt | libpthread-2.28.so | 0.12 | 0.06 | 0.01 | 0.01 | 0 | 0 | 0 | 0.15 | 0.13 | 0.05 | 0.07 | 0.02 | 0.02 | 0.1 | 0.08 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | Pthread (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 2.67 | 0 | 8 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○unknown_function | [vdso] | 0.1 | 0.04 | 0.02 | 0.01 | 0 | 0 | 0 | 0.07 | 0.06 | 0.05 | 0.06 | 0.04 | 0.09 | 0.08 | 0.06 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.02 | 0.03 | 0.01 | 0.03 | 0.01 | 0.03 | 0.03 | 0.01 | 0.02 | 0.00 | 0.02 | 0.00 | 0.02 | 0.03 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 3 | 0 | 6 | 0 | 6 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
►miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evaluateLog(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<... | exec | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.13 | 0.06 | 0.06 | 0.06 | 0.06 | 0.07 | 0.09 | 0.23 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.08 | 0.15 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.04 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 5.07 | 10.05 | 20.00 | 39.95 | 80.05 | 119.65 | 112.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.75 | 0.03 | 0.4 | 0.08 |
►Loop 847 - inner_product.hpp:82-155 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 851 - inner_product.hpp:155-155 - exec [...] | 0.07 | 0.07 | 0.07 | 0.07 | 0.07 | 0.08 | 0.1 | 0.04 | 0.05 | 0.05 | 0.05 | 0.06 | 0.08 | 0.2 | 0.04 | 0.04 | 0.04 | 0.05 | 0.05 | 0.06 | 0.12 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.03 | 5.70 | 11.38 | 22.65 | 36.33 | 72.93 | 121.05 | 106.24 | 1 | 0 | 1 | 0 | 1 | 0 | 0.8 | 0.01 | 0.8 | 0.01 | 0.67 | 0.03 | 0.33 | 0.07 | |||||||||
○Loop 849 - inner_product.hpp:82-83 - exec | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.02 | 0.03 | 0.02 | 0.02 | 0.03 | 0.04 | 0.03 | 0.04 | 0.07 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 3.80 | 7.40 | 14.70 | 29.03 | 115.65 | 115.48 | 134.99 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 2 | 0 | 1 | 0 | 0.67 | 0.01 | |||||||||
○Loop 850 - inner_product.hpp:155-155 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 848 - inner_product.hpp:82-83 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○mkl_blas_avx512_dgemm_kernel_nocopy_NN_b0 | libmkl_avx512.so.2 | 0.1 | 0.1 | 0.13 | 0.14 | 0.12 | 0.13 | 0.14 | 0.06 | 0.08 | 0.11 | 0.15 | 0.11 | 0.15 | 0.23 | 0.06 | 0.06 | 0.08 | 0.09 | 0.08 | 0.1 | 0.16 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.03 | 0.02 | 0.04 | 0.03 | 0.03 | 0.03 | 0.01 | 0.02 | 0.01 | 0.02 | 0.02 | 0.02 | 0.03 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 99.47 | 201.87 | 308.21 | 547.17 | 1224.94 | 1957.86 | 2071.61 | 1 | 0 | 1 | 0 | 0.75 | 0.03 | 0.67 | 0.05 | 0.75 | 0.03 | 0.6 | 0.05 | 0.38 | 0.09 | |
►miniqmcreference::OneBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | exec | 0.08 | 0.06 | 0.06 | 0.07 | 0.07 | 0.06 | 0.06 | 0.05 | 0.05 | 0.05 | 0.08 | 0.06 | 0.08 | 0.11 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.06 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 6.08 | 14.26 | 28.08 | 57.21 | 125.22 | 249.98 | 290.09 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.25 | 0 | 1.25 | 0 | 1.25 | 0 | 1.25 | 0 | 1.25 | 0 | 0.83 | 0.01 |
○Loop 184 - OneBodyJastrowRef.h:192-193 - exec | 0.02 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.03 | 0.01 | 0.01 | 0.02 | 0.03 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 7.20 | 0.00 | 21.60 | 0.00 | 99.30 | 193.61 | 346.86 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 189 - OneBodyJastrowRef.h:186-187 - exec | 0.02 | 0.04 | 0.02 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.04 | 0.02 | 0.04 | 0.03 | 0.04 | 0.04 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 4.60 | 8.60 | 33.60 | 34.00 | 71.80 | 140.20 | 241.31 | 1 | 0 | 1 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 181 - OneBodyJastrowRef.h:192-193 - exec | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.02 | 0.03 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 6.00 | 12.45 | 0.00 | 46.60 | 104.40 | 209.71 | 350.91 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 185 - OneBodyJastrowRef.h:192-193 - exec | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.04 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 4.40 | 0.00 | 0.00 | 36.50 | 92.50 | 184.16 | 334.66 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 182 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 179 - stl_numeric.h:126-127 - exec [...] | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 183 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 178 - OneBodyJastrowRef.h:0-0 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 186 - OneBodyJastrowRef.h:186-194 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 187 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 180 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 188 - OneBodyJastrowRef.h:186-187 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○mkl_blas_avx512_dgemm_dcopy_right8_ea | libmkl_avx512.so.2 | 0.07 | 0.04 | 0.03 | 0.05 | 0.06 | 0.09 | 0.1 | 0.04 | 0.03 | 0.03 | 0.05 | 0.06 | 0.13 | 0.18 | 0.04 | 0.03 | 0.02 | 0.03 | 0.04 | 0.06 | 0.11 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.03 | 0.02 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.02 | 0.03 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1.33 | 0 | 2 | 0 | 1.33 | 0 | 1 | 0 | 0.67 | 0.03 | 0.36 | 0.06 | |
○MPIDI_Progress_test | libmpi.so.12.0.0 | 0.06 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0.08 | 0.05 | 0.02 | 0.03 | 0 | 0 | 0.08 | 0.04 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 4 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
►qmcplusplus::NonLocalPP<double>::evaluate(qmcplusplus::ParticleSet const&, qmcplusplus::WaveFunction&) | exec | 0.06 | 0.04 | 0.05 | 0.05 | 0.05 | 0.06 | 0.07 | 0.04 | 0.04 | 0.04 | 0.06 | 0.07 | 0.09 | 0.13 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.08 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.02 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.11 | 0.15 | 0.25 | 0.53 | 1.15 | 1.40 | 5.08 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.33 | 0 | 1.33 | 0 | 1.33 | 0 | 1.33 | 0 | 0.8 | 0.01 | 0.5 | 0.04 |
○Loop 52 - NonLocalPP.hpp:110-111 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 53 - stl_uninitialized.h:526-526 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 49 - NonLocalPP.hpp:122-135 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 4 | 26 | 67 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 50 - NonLocalPP.hpp:126-135 - exec [...] | 0.05 | 0.04 | 0.04 | 0.05 | 0.05 | 0.05 | 0.05 | 0.04 | 0.04 | 0.04 | 0.05 | 0.06 | 0.08 | 0.11 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.04 | 0.06 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.15 | 0.13 | 0.17 | 0.25 | 0.58 | 1.05 | 3.36 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.75 | 0.01 | 0.5 | 0.03 | |||||||||
○Loop 51 - NonLocalPP.hpp:131-132 - exec [...] | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 1 | 1 | 6 | 13 | 28 | 54 | 108 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.80 | 20.45 | |||||||||||||||||||||||
○Loop 55 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 54 - stl_uninitialized.h:526-526 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 56 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○mkl_blas_avx512_dtrsm_kernel_ll_0 | libmkl_avx512.so.2 | 0.06 | 0.05 | 0.05 | 0.05 | 0.06 | 0.06 | 0.07 | 0.04 | 0.04 | 0.04 | 0.04 | 0.06 | 0.06 | 0.12 | 0.04 | 0.03 | 0.03 | 0.03 | 0.04 | 0.04 | 0.08 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 43.80 | 116.80 | 233.61 | 467.21 | 700.62 | 1400.94 | 1236.59 | 1 | 0 | 1.33 | 0 | 1.33 | 0 | 1.33 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.04 | |
►qmcplusplus::TimerType<std::chrono::_V2::system_clock>::stop() | exec | 0.05 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.08 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 111 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.23 | 1.13 | 3.00 | 5.90 | 14.10 | 26.20 | 18.15 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1.5 | 0 | 3 | 0 | 3 | 0 | 3 | 0 | 3 | 0 | 1.5 | 0 |
○Loop 1439 - NewTimer.cpp:99-100 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 6 | 6 | 20 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1440 - NewTimer.cpp:99-100 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○__kmp_api_omp_get_level | libiomp5.so | 0.04 | 0.06 | 0.05 | 0.04 | 0.05 | 0.04 | 0.03 | 0.03 | 0.05 | 0.04 | 0.06 | 0.07 | 0.08 | 0.07 | 0.03 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 111 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.18 | 0.18 | 0.77 | 1.30 | 2.18 | 5.22 | 7.30 | 1 | 0 | 0.75 | 0.01 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○__kmp_get_global_thread_id_reg | libiomp5.so | 0.04 | 0.03 | 0.03 | 0.04 | 0.03 | 0.02 | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 0.04 | 0.04 | 0.06 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.28 | 0.65 | 2.05 | 2.25 | 4.55 | 12.58 | 11.05 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.67 | 0.01 | |
►miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::resize(int, int) | exec | 0.04 | 0.04 | 0.03 | 0.04 | 0.04 | 0.03 | 0.02 | 0.03 | 0.04 | 0.03 | 0.04 | 0.04 | 0.04 | 0.05 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.67 | 0.01 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○Loop 876 - stl_algobase.h:740-742 - exec | 0.04 | 0.04 | 0.03 | 0.04 | 0.04 | 0.03 | 0.02 | 0.03 | 0.04 | 0.03 | 0.04 | 0.04 | 0.04 | 0.05 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.67 | 0.01 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○Loop 871 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 875 - stl_algobase.h:741-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 870 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 872 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 874 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 873 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 869 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○ucp_worker_progress | libucp.so.0.0.0 | 0.04 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0.04 | 0.02 | 0.02 | 0 | 0 | 0 | 0.03 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | NA | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
►std::map<qmcplusplus::StackKeyParam<2>, double, std::less<qmcplusplus::StackKeyParam<2> >, std::allocator<std::pair<qmcplusplus::StackKeyParam<2> const, double> > >::operator[](qmcplusplus::StackKeyParam<2> c... | exec | 0.04 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.04 | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.08 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.02 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02 | 0.00 | 0.01 | 0.00 | 0.01 | 0.02 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.03 | 0.20 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 3 | 0 | 3 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○Loop 1441 - NewTimer.h:119-121 - exec [...] | 0.04 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.04 | 0.03 | 0.03 | 0.02 | 0.03 | 0.04 | 0.08 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.02 | 0.03 | 0.01 | 0.00 | 0.00 | 0.00 | 0.02 | 0.01 | 0.02 | 0.00 | 0.00 | 0.00 | 0.00 | 0.02 | 0.03 | 0.20 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 3 | 0 | 3 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |||||||||
○MPIDI_OFI_progress | libmpi.so.12.0.0 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0.04 | 0.04 | 0.03 | 0.02 | 0 | 0.01 | 0.05 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○__libm_exp_z0 | exec | 0.03 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.02 | 0.03 | 0.03 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 1.15 | 3.40 | 5.75 | 11.10 | 27.10 | 48.80 | 89.30 | 1 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | |
○_dl_update_slotinfo | ld-2.28.so | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 0.04 | 0.06 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | OMP (%): 57.15 System (%): 42.85 | System (%): 58.34 OMP (%): 41.66 | System (%): 52.18 OMP (%): 47.82 | System (%): 57.15 OMP (%): 42.85 | System (%): 51.77 OMP (%): 48.23 | System (%): 58.72 OMP (%): 41.28 | System (%): 59.36 OMP (%): 40.64 | 0.28 | 0.63 | 2.10 | 4.65 | 7.70 | 7.03 | 13.13 | 1 | 0 | 1 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | |
○__dynamic_cast | libstdc++.so.6.0.25 | 0.03 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 0.02 | 0.03 | 0.03 | 0.03 | 0.05 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 30 | 63 | 111 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.08 | 0.30 | 1.10 | 1.35 | 3.25 | 6.35 | 2.23 | 1 | 0 | 1 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 1 | 0 | |
○inflate_fast | libmpi.so.12.0.0 | 0.03 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
►qmcplusplus::Vector<double, std::allocator<double> >::resize(unsigned long, double) | exec | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.01 | 0.03 | 0.04 | 0.03 | 0.03 | 0.04 | 0.04 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 63 | 112 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 |
○Loop 230 - stl_algobase.h:752-754 - exec | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.02 | 0.01 | 0.03 | 0.04 | 0.03 | 0.03 | 0.04 | 0.04 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 2 | 4 | 8 | 16 | 32 | 63 | 112 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | |||||||||
○Loop 229 - stl_algobase.h:752-754 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 228 - stl_algobase.h:752-754 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 227 - stl_algobase.h:752-754 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○mkl_blas_avx512_dgemm_kernel_nocopy_NN_b1 | libmkl_avx512.so.2 | 0.02 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.01 | 0.05 | 0.03 | 0.03 | 0.03 | 0.04 | 0.05 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.03 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.00 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 138.40 | 138.40 | 276.81 | 553.62 | 1107.23 | 2214.07 | 2413.41 | 1 | 0 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.5 | 0.01 | 0.33 | 0.02 | |
○__libc_disable_asynccancel | libc-2.28.so | 0.02 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0.02 | 0 | 0.03 | 0.07 | 0.01 | 0.08 | 0.16 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | System (%): 100.00 | NA | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○ofi_cq_readfrom | libmlx-fi.so | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0 | 0.01 | 0 | 0 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | NA | NA | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
►qmcplusplus::TimerType<std::chrono::_V2::system_clock>::start() | exec | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.02 | 0.03 | 0.03 | 0.04 | 0.04 | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 2 | 4 | 7 | 15 | 31 | 60 | 112 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.05 | 0.05 | 0.25 | 0.45 | 1.35 | 0.75 | 1.18 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 | 0.5 | 0.01 |
○Loop 1437 - NewTimer.cpp:53-54 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 3 | 6 | 7 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 1438 - NewTimer.cpp:53-54 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►miniqmcreference::OneBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::evaluateGL(qmcplusplus::ParticleSet&, qmcplusplus::ParticleAttrib<qmcplusplus::TinyVector<double, 3u>, std::allocator<qmcplusplus::TinyVector<double, ... | exec | 0.02 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 6.45 | 12.40 | 24.95 | 50.45 | 101.20 | 204.81 | 178.58 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 |
○Loop 205 - OneBodyJastrowRef.h:171-172 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 4 | 5 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 204 - stl_numeric.h:126-127 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 7 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 206 - OneBodyJastrowRef.h:171-172 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 210 - OneBodyJastrowRef.h:109-194 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 2 | 3 | 7 | 15 | 28 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 215 - OneBodyJastrowRef.h:192-193 - exec | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.25 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 217 - OneBodyJastrowRef.h:186-194 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 218 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 211 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 213 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 214 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 221 - OneBodyJastrowRef.h:0-0 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 216 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 2 | 4 | 8 | 16 | 31 | 64 | 112 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 72.65 | |||||||||||||||||||||||
○Loop 219 - OneBodyJastrowRef.h:186-187 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 212 - OneBodyJastrowRef.h:192-193 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 222 - stl_numeric.h:126-127 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 28 | 53 | 96 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 220 - OneBodyJastrowRef.h:186-187 - exec | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 202 - OneBodyJastrowRef.h:171-172 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 207 - OneBodyJastrowRef.h:169-169 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 5 | 16 | 23 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 209 - OneBodyJastrowRef.h:169-169 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 208 - TinyVectorOps.h:49-49 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 203 - OneBodyJastrowRef.h:0-0 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○MPIDI_OFI_get_buffered | libmpi.so.12.0.0 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.03 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | NA | NA | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○MPIDI_SHMI_progress | libmpi.so.12.0.0 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.03 | 0 | 0.01 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | NA | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○mlx_ep_progress | libmlx-fi.so | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○ofi_cq_progress | libmlx-fi.so | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0.02 | 0 | 0 | 0.01 | 0 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | NA | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○mkl_blas_avx512_dgemm_dcopy_down24_ea | libmkl_avx512.so.2 | 0.02 | 0.02 | 0.04 | 0.04 | 0.04 | 0.03 | 0.04 | 0.01 | 0.02 | 0.04 | 0.03 | 0.05 | 0.04 | 0.08 | 0.01 | 0.02 | 0.03 | 0.02 | 0.02 | 0.02 | 0.04 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 0.80 | 0.80 | 1.07 | 2.40 | 4.40 | 10.40 | 8.60 | 1 | 0 | 0.5 | 0.01 | 0.33 | 0.03 | 0.5 | 0.02 | 0.5 | 0.02 | 0.5 | 0.01 | 0.25 | 0.03 | |
►qmcplusplus::WaveFunction::ratioGrad(qmcplusplus::ParticleSet&, int, qmcplusplus::TinyVector<double, 3u>&) | exec | 0.02 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.01 | 2 | 1 | 3 | 9 | 17 | 30 | 70 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | ||||||||||||||
○Loop 75 - WaveFunction.cpp:198-201 - exec [...] | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 3 | 6 | 11 | 23 | 53 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○.plt.sec@start | libstdc++.so.6.0.25 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 3 | 1 | 7 | 11 | 25 | 32 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||||||||
○MPID_Progress_wait | libmpi.so.12.0.0 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0.03 | 0.01 | 0.01 | 0 | 0 | 0 | 0.02 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | NA | MPI (%): 100.00 | NA | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○do_lookup_x | ld-2.28.so | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 0 | 2 | 2 | 2 | 3 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | System (%): 100.00 | System (%): 100.00 | NA | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○ofi_cq_read | libmlx-fi.so | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | NA | NA | NA | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
►main.extracted.110 | exec | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.01 | 0.03 | 0.04 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 1 | 4 | 6 | 13 | 25 | 51 | 109 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.05 | 0.05 | 0.25 | 0.40 | 0.75 | 0.95 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 |
►Loop 24 - new_allocator.h:101-125 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 32 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 34 - stl_algobase.h:740-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 5 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 33 - stl_algobase.h:741-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►Loop 27 - miniqmc.cpp:425-461 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 28 - miniqmc.cpp:429-458 - exec [...] | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.01 | 0.03 | 0.04 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 1 | 4 | 6 | 12 | 22 | 51 | 107 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.05 | 0.05 | 0.10 | 0.05 | 0.20 | 0.13 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 | |||||||||
○Loop 30 - StdRandom.h:102-103 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 29 - RandomGenerator.h:51-55 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 5 | 10 | 30 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 26 - NonLocalPP.hpp:110-111 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 31 - stl_algobase.h:741-742 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 25 - Mallocator.hpp:69-69 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○MPIDU_Init_shm_barrier | libmpi.so.12.0.0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 0.01 | 0.00 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○ofi_mutex_lock_noop | libmlx-fi.so | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | NA | Others (%): 100.00 | NA | NA | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○__cxxabiv1::__vmi_class_type_info::__do_dyncast(long, __cxxabiv1::__class_type_info::__sub_kind, __cxxabiv1::__class_type_info const*, void const*, __cxxabiv1::__class_type_info const*, void const*, __cxxabiv1::__class_type_info::__dyncast_result&) co... | libstdc++.so.6.0.25 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.04 | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 3 | 8 | 16 | 32 | 63 | 112 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.05 | 0.45 | 1.10 | 2.35 | 5.00 | 8.40 | 3.90 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 | |
○__tls_get_addr | ld-2.28.so | 0.01 | 0.02 | 0.02 | 0.01 | 0.02 | 0.02 | 0.02 | 0.01 | 0.03 | 0.04 | 0.03 | 0.02 | 0.04 | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 8 | 15 | 32 | 64 | 112 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | OMP (%): 50.00 System (%): 50.00 | System (%): 100.00 | System (%): 94.44 OMP (%): 5.56 | System (%): 80.77 OMP (%): 19.23 | System (%): 87.50 OMP (%): 12.50 | System (%): 88.62 OMP (%): 11.38 | System (%): 92.76 OMP (%): 7.24 | 0.65 | 1.35 | 1.75 | 2.60 | 7.20 | 18.45 | 14.98 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 | |
○qmcplusplus::SoaDistanceTableABOMPTarget<double, 3u, 40>::move(qmcplusplus::ParticleSet const&, qmcplusplus::TinyVector<double, 3u> const&, int, bool) | exec | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.02 | 0.03 | 0.03 | 0.04 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 2 | 4 | 5 | 14 | 30 | 61 | 109 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.40 | 0.80 | 1.60 | 3.00 | 8.85 | 20.40 | 19.10 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0.5 | 0.01 |
►miniqmcreference::OneBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::computeU3(qmcplusplus::ParticleSet&, int, double const*) | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.05 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 1 | 3 | 7 | 14 | 24 | 53 | 102 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.40 | 0.40 | 0.50 | 0.00 | 1.80 | 4.10 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○Loop 244 - OneBodyJastrowRef.h:214-219 - exec [...] | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.04 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 1 | 3 | 7 | 12 | 16 | 44 | 94 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.05 | 0.00 | 0.30 | 0.95 | 1 | 0 | 0 | 0.01 | 0 | 0.01 | 0 | 0.01 | 1 | 0 | 0 | 0.01 | 0 | 0.01 | |||||||||
○Loop 243 - OneBodyJastrowRef.h:231-237 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
►std::map<qmcplusplus::StackKeyParam<2>, long, std::less<qmcplusplus::StackKeyParam<2> >, std::allocator<std::pair<qmcplusplus::StackKeyParam<2> const, long> > >::operator[](qmcplusplus::StackKeyParam<2> const... | exec | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 0.02 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | ||||||||||||||
○Loop 1442 - NewTimer.h:119-121 - exec [...] | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 1 | 2 | 2 | 1 | 2 | 0.02 | 0.01 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○ofi_mutex_unlock_noop | libmlx-fi.so | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | NA | Others (%): 100.00 | NA | NA | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
►qmcplusplus::ParticleSet::computeNewPosDistTables(int, qmcplusplus::TinyVector<double, 3u> const&, bool) | exec | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 2 | 4 | 11 | 10 | 28 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | ||||||||||||||
○Loop 1083 - ParticleSet.cpp:343-344 - exec [...] | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 2 | 3 | 6 | 8 | 21 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○__kmp_get_ancestor_thread_num | libiomp5.so | 0.01 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.01 | 2 | 0 | 2 | 6 | 14 | 31 | 70 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | OMP (%): 100.00 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○miniqmcreference::OneBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::evalGrad(qmcplusplus::ParticleSet&, int) | exec | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 2 | 2 | 1 | 9 | 13 | 42 | 94 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | ||||||||||||||
►qmcplusplus::WaveFunction::acceptMove(qmcplusplus::ParticleSet&, int) | exec | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.03 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 1 | 5 | 14 | 27 | 58 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | ||||||||||||||
○Loop 77 - WaveFunction.cpp:225-228 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 0 | 4 | 12 | 24 | 48 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○mkl_serv_domain_get_max_threads | libmkl_intel_thread.so.2 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 2 | 2 | 11 | 15 | 41 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○mkl_blas_avx512_xdgemv | libmkl_avx512.so.2 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0.02 | 0.03 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 1 | 3 | 6 | 11 | 28 | 60 | 107 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | Math (%): 100.00 | 0.00 | 0.00 | 1.60 | 2.50 | 8.80 | 23.25 | 20.15 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
►miniqmcreference::TwoBodyJastrowRef<qmcplusplus::BsplineFunctor<double> >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&) | exec | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.04 | 0.01 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 2 | 4 | 7 | 16 | 32 | 63 | 111 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.05 | 0.00 | 0.00 | 0.00 | 17.35 | 29.40 | 48.35 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
►Loop 300 - TwoBodyJastrowRef.h:107-132 - exec [...] | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0.01 | 0.02 | 0.02 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 2 | 2 | 3 | 12 | 28 | 57 | 103 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 13.65 | |||||||||||||||||||||||
○Loop 301 - TwoBodyJastrowRef.h:127-132 - exec [...] | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 | 7 | 15 | 32 | 58 | 97 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.60 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○__kmp_api_omp_in_parallel | libiomp5.so | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0.01 | 2 | 3 | 6 | 10 | 18 | 36 | 72 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○__libc_enable_asynccancel | libc-2.28.so | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.04 | 0.01 | 0.02 | 0.04 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | System (%): 100.00 | NA | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | System (%): 100.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | ||
○ucp_worker_progress@plt | libmlx-fi.so | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0.02 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | NA | Others (%): 100.00 | NA | NA | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○__libm_logl_ex | exec | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 0.01 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 1.20 | 2.80 | 0.00 | 10.40 | 20.30 | 42.90 | 71.20 | |||||||||||||||
○adler32_z | libmpi.so.12.0.0 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | MPI (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○miniqmcreference::DiracDeterminantRef<qmcplusplus::DelayedUpdate<double, double> >::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&) | exec | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 1 | 7 | 16 | 42 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | NA | NA | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | ||||||||||||||
►qmcplusplus::Vector<double, qmcplusplus::OMPallocator<double, qmcplusplus::Mallocator<double, 32ul> > >::resize(unsigned long, double) | exec | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 5 | 11 | 24 | 35 | 66 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | ||||||||||||||
○Loop 924 - stl_algobase.h:752-754 - exec | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 5 | 11 | 23 | 33 | 64 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 921 - stl_algobase.h:752-754 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 922 - stl_algobase.h:752-754 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 923 - stl_algobase.h:752-754 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○ucs_async_thread_func | libucs.so.0.0.0 | 0.01 | 0 | 0.01 | 0.01 | 0 | 0 | 0 | 0.01 | 0 | 0.02 | 0.05 | 0.01 | 0.07 | 0.08 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○uct_ud_mlx5_iface_progress | libuct_ib.so.0.0.0 | 0 | 0.06 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0.15 | 0 | 0.13 | 0.04 | 0 | 0 | 0 | 0.04 | 0 | 0.01 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | NA | Others (%): 100.00 | NA | Others (%): 100.00 | Others (%): 100.00 | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○std::chrono::_V2::system_clock::now() | libstdc++.so.6.0.25 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 0 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 1 | 0 | 1 | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | NA | Others (%): 100.00 | Others (%): 100.00 | Others (%): 100.00 | NA | Others (%): 100.00 | Others (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 0 | 0.01 | 0.01 | 0.02 | 0.03 | 0.03 | 0.02 | 0 | 0.01 | 0.01 | 0.04 | 0.04 | 0.05 | 0.08 | 0 | 0 | 0.01 | 0.01 | 0.02 | 0.02 | 0.02 | 0 | 1 | 6 | 15 | 30 | 62 | 107 | 0.00 | 0.00 | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○qmcplusplus::SoaDistanceTableABOMPTarget<double, 3u, 40>::update(int) | exec | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0 | 0.01 | 0.02 | 0.03 | 0.04 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 1 | 4 | 9 | 16 | 40 | 104 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | NA | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libiomp5.so | 0 | 0.76 | 1.28 | 1.63 | 1.76 | 1.89 | 1.43 | 0 | 0.67 | 1.15 | 1.41 | 1.73 | 2.39 | 3.8 | 0 | 0.47 | 0.79 | 0.99 | 1.11 | 1.41 | 1.62 | 0 | 4 | 8 | 16 | 32 | 64 | 112 | 0.00 | 0.36 | 0.63 | 0.47 | 0.44 | 0.49 | 0.51 | 0.00 | 0.22 | 0.39 | 0.28 | 0.28 | 0.36 | 0.57 | NA | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | OMP (%): 100.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | |
○.plt.sec@start | libuct_ib.so.0.0.0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | NA | System (%): 100.00 | NA | System (%): 100.00 | NA | NA | NA | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||
►qmcplusplus::WaveFunction::evaluateRatios(qmcplusplus::VirtualParticleSet&, std::vector<double, std::allocator<double> >&) | exec | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.01 | 0 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.05 | 0 | 0.01 | 0 | 0 | 0 | 0.01 | 0.01 | 0 | 3 | 3 | 10 | 20 | 50 | 102 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.01 | NA | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | Exe (%): 100.00 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0.90 | 0.50 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.1.0 (2024.1.0.20240308) --driver-mode=g++ --intel -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/build/miniqmc/src -I /scratch_na/users/xoserete/qaas_runs/171-284-5201/intel/miniqmc/b... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
►Loop 80 - WaveFunction.cpp:263-274 - exec [...] | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0.01 | 0.01 | 0.02 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 3 | 9 | 30 | 61 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 81 - WaveFunction.cpp:273-274 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 83 - WaveFunction.cpp:273-274 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||||||||||||||
○Loop 82 - WaveFunction.cpp:273-274 - exec | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 4 | 9 | 14 | 36 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |