* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117341)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117346)Wed Mar 27 17:48:38 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: idp10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-362.13.1.el9_3.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fno-alias -qopt-report=3 -DDO_MPI -O3 -xSAPPHIRERAPIDS -mprefer-vector-width=512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=icx -fiopenmp'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (1 threads)
Double Precision: true
Run Date/Time: 2024-03-27, 17:48:38
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303588, atom count : 4000000
Wed Mar 27 17:48:41 2024: Initialization Finished
Wed Mar 27 17:48:41 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303588 -1.243619295188 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650499 -1.233157709948 0.067098059449 519.0938 1.1309 4000000
20 20.00 -1.166048438417 -1.208183014318 0.042134575902 325.9677 1.3168 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 1.3528 4000000
40 40.00 -1.166042093135 -1.183625399859 0.017583306724 136.0305 1.3609 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 1.3625 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 1.3622 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 1.3615 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 1.3594 4000000
90 90.00 -1.166048006780 -1.203820491598 0.037772484818 292.2210 1.3578 4000000
100 100.00 -1.166049793505 -1.206862845061 0.040813051556 315.7439 1.3556 4000000
Wed Mar 27 17:53:07 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303588
Final energy : -1.166049793505
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 269.0192 269.0192 100.98
loop 1 266.4079 266.4079 100.00
timestep 10 26.6407 266.4075 100.00
position 100 0.0173 1.7273 0.65
velocity 200 0.0151 3.0151 1.13
redistribute 101 0.0885 8.9394 3.36
atomHalo 101 0.0207 2.0868 0.78
force 101 2.5217 254.6927 95.60
commHalo 303 0.0019 0.5788 0.22
commReduce 39 0.0001 0.0022 0.00
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 1: 269.0177 0: 269.0192 269.0184 0.0008
loop 0: 266.4079 1: 266.4079 266.4079 0.0000
timestep 0: 266.4075 1: 266.4076 266.4075 0.0000
position 1: 1.7237 0: 1.7273 1.7255 0.0018
velocity 1: 3.0104 0: 3.0151 3.0128 0.0023
redistribute 0: 8.9394 1: 9.2879 9.1137 0.1742
atomHalo 0: 2.0868 1: 2.4222 2.2545 0.1677
force 1: 254.3082 0: 254.6927 254.5005 0.1922
commHalo 0: 0.5788 1: 0.8798 0.7293 0.1505
commReduce 0: 0.0022 1: 0.0450 0.0236 0.0214
---------------------------------------------------
Average atom update rate: 1.33 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.67 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 1.50 atoms/us
---------------------------------------------------
Wed Mar 27 17:53:07 2024: CoMD Ending
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117346)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117341)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_0 #
#############################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117426)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117432)Wed Mar 27 17:53:18 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: idp10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-362.13.1.el9_3.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fno-alias -qopt-report=3 -DDO_MPI -O3 -xSAPPHIRERAPIDS -mprefer-vector-width=512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=icx -fiopenmp'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (2 threads)
Double Precision: true
Run Date/Time: 2024-03-27, 17:53:18
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063304092, atom count : 4000000
Wed Mar 27 17:53:19 2024: Initialization Finished
Wed Mar 27 17:53:19 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063304092 -1.243619295692 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.5826 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.6717 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.6890 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.6929 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.6934 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.6934 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.6924 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.6914 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.6908 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.6895 4000000
Wed Mar 27 17:55:35 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063304092
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 137.2309 137.2309 101.10
loop 1 135.7442 135.7442 100.00
timestep 10 13.5744 135.7437 100.00
position 100 0.0086 0.8650 0.64
velocity 200 0.0077 1.5355 1.13
redistribute 101 0.0660 6.6664 4.91
atomHalo 101 0.0221 2.2349 1.65
force 101 1.2644 127.7039 94.08
commHalo 303 0.0020 0.5994 0.44
commReduce 39 0.0001 0.0020 0.00
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 1: 137.2306 0: 137.2309 137.2308 0.0001
loop 0: 135.7442 1: 135.7442 135.7442 0.0000
timestep 0: 135.7437 1: 135.7438 135.7438 0.0001
position 1: 0.8645 0: 0.8650 0.8647 0.0002
velocity 0: 1.5355 1: 1.5477 1.5416 0.0061
redistribute 0: 6.6664 1: 6.7524 6.7094 0.0430
atomHalo 0: 2.2349 1: 2.3128 2.2738 0.0390
force 1: 127.6001 0: 127.7039 127.6520 0.0519
commHalo 0: 0.5994 1: 0.8079 0.7037 0.1043
commReduce 0: 0.0020 1: 0.0086 0.0053 0.0033
---------------------------------------------------
Average atom update rate: 0.68 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.34 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 2.95 atoms/us
---------------------------------------------------
Wed Mar 27 17:55:35 2024: CoMD Ending
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117432)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117426)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_1 #
#############################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117507)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117512)Wed Mar 27 17:55:45 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: idp10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-362.13.1.el9_3.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fno-alias -qopt-report=3 -DDO_MPI -O3 -xSAPPHIRERAPIDS -mprefer-vector-width=512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=icx -fiopenmp'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (4 threads)
Double Precision: true
Run Date/Time: 2024-03-27, 17:55:45
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303067, atom count : 4000000
Wed Mar 27 17:55:46 2024: Initialization Finished
Wed Mar 27 17:55:46 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303067 -1.243619294667 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650499 -1.233157709949 0.067098059449 519.0938 0.3044 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.3481 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.3550 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.3576 4000000
50 50.00 -1.166051684893 -1.193713710257 0.027662025365 214.0030 0.3572 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.3572 4000000
70 70.00 -1.166052143011 -1.204911990845 0.038859847833 300.6332 0.3571 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.3557 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.3554 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.3548 4000000
Wed Mar 27 17:56:56 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303067
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 70.9701 70.9701 101.31
loop 1 70.0499 70.0499 100.00
timestep 10 7.0049 70.0493 100.00
position 100 0.0043 0.4347 0.62
velocity 200 0.0039 0.7713 1.10
redistribute 101 0.0531 5.3604 7.65
atomHalo 101 0.0229 2.3079 3.29
force 101 0.6340 64.0386 91.42
commHalo 303 0.0024 0.7138 1.02
commReduce 39 0.0001 0.0036 0.01
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 1: 70.9694 0: 70.9701 70.9697 0.0003
loop 0: 70.0499 0: 70.0499 70.0499 0.0000
timestep 0: 70.0493 1: 70.0495 70.0494 0.0001
position 1: 0.4342 0: 0.4347 0.4344 0.0003
velocity 0: 0.7713 1: 0.7738 0.7726 0.0012
redistribute 0: 5.3604 1: 5.4134 5.3869 0.0265
atomHalo 0: 2.3079 1: 2.3526 2.3302 0.0223
force 1: 63.9572 0: 64.0386 63.9979 0.0407
commHalo 0: 0.7138 1: 0.7795 0.7466 0.0328
commReduce 0: 0.0036 1: 0.0307 0.0171 0.0135
---------------------------------------------------
Average atom update rate: 0.35 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.18 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 5.71 atoms/us
---------------------------------------------------
Wed Mar 27 17:56:56 2024: CoMD Ending
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117512)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117507)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_2 #
#############################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117592)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117597)Wed Mar 27 17:57:06 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: idp10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-362.13.1.el9_3.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fno-alias -qopt-report=3 -DDO_MPI -O3 -xSAPPHIRERAPIDS -mprefer-vector-width=512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=icx -fiopenmp'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (8 threads)
Double Precision: true
Run Date/Time: 2024-03-27, 17:57:06
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303663, atom count : 4000000
Wed Mar 27 17:57:07 2024: Initialization Finished
Wed Mar 27 17:57:07 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303663 -1.243619295263 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.1692 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.1871 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.1902 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.1904 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.1910 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.1909 4000000
70 70.00 -1.166052143011 -1.204911990845 0.038859847833 300.6332 0.1908 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.1902 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.1898 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.1897 4000000
Wed Mar 27 17:57:44 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303663
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 38.2127 38.2127 101.66
loop 1 37.5887 37.5887 100.00
timestep 10 3.7588 37.5882 100.00
position 100 0.0026 0.2600 0.69
velocity 200 0.0025 0.4915 1.31
redistribute 101 0.0475 4.7980 12.76
atomHalo 101 0.0232 2.3388 6.22
force 101 0.3201 32.3325 86.02
commHalo 303 0.0023 0.6905 1.84
commReduce 39 0.0001 0.0036 0.01
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 1: 38.2122 0: 38.2127 38.2124 0.0003
loop 1: 37.5887 0: 37.5887 37.5887 0.0000
timestep 0: 37.5882 1: 37.5884 37.5883 0.0001
position 0: 0.2600 1: 0.2616 0.2608 0.0008
velocity 0: 0.4915 1: 0.5035 0.4975 0.0060
redistribute 0: 4.7980 1: 4.8060 4.8020 0.0040
atomHalo 1: 2.3290 0: 2.3388 2.3339 0.0049
force 1: 32.2985 0: 32.3325 32.3155 0.0170
commHalo 0: 0.6905 1: 0.8353 0.7629 0.0724
commReduce 0: 0.0036 1: 0.0155 0.0096 0.0059
---------------------------------------------------
Average atom update rate: 0.19 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.09 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 10.64 atoms/us
---------------------------------------------------
Wed Mar 27 17:57:44 2024: CoMD Ending
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117592)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117597)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_3 #
#############################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117671)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117676)Wed Mar 27 17:57:53 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: idp10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-362.13.1.el9_3.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fno-alias -qopt-report=3 -DDO_MPI -O3 -xSAPPHIRERAPIDS -mprefer-vector-width=512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=icx -fiopenmp'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (16 threads)
Double Precision: true
Run Date/Time: 2024-03-27, 17:57:53
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303487, atom count : 4000000
Wed Mar 27 17:57:54 2024: Initialization Finished
Wed Mar 27 17:57:54 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303487 -1.243619295087 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.1001 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.1073 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.1064 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.1072 4000000
50 50.00 -1.166051684893 -1.193713710257 0.027662025365 214.0030 0.1072 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.1071 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.1070 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.1070 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.1068 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.1065 4000000
Wed Mar 27 17:58:15 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303487
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 21.7561 21.7561 102.38
loop 1 21.2511 21.2511 100.00
timestep 10 2.1251 21.2506 100.00
position 100 0.0018 0.1837 0.86
velocity 200 0.0019 0.3816 1.80
redistribute 101 0.0438 4.4236 20.82
atomHalo 101 0.0217 2.1896 10.30
force 101 0.1628 16.4416 77.37
commHalo 303 0.0017 0.5224 2.46
commReduce 39 0.0001 0.0022 0.01
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 1: 21.7556 0: 21.7561 21.7559 0.0002
loop 0: 21.2511 0: 21.2511 21.2511 0.0000
timestep 0: 21.2506 1: 21.2509 21.2507 0.0001
position 0: 0.1837 1: 0.1848 0.1843 0.0005
velocity 0: 0.3816 1: 0.3843 0.3829 0.0014
redistribute 0: 4.4236 1: 4.5064 4.4650 0.0414
atomHalo 0: 2.1896 1: 2.2891 2.2393 0.0498
force 1: 16.3406 0: 16.4416 16.3911 0.0505
commHalo 0: 0.5224 1: 0.7559 0.6392 0.1168
commReduce 0: 0.0022 1: 0.0183 0.0102 0.0081
---------------------------------------------------
Average atom update rate: 0.11 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.05 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 18.82 atoms/us
---------------------------------------------------
Wed Mar 27 17:58:15 2024: CoMD Ending
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117676)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117671)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_4 #
#############################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117804)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117809)Wed Mar 27 17:58:26 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: idp10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-362.13.1.el9_3.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fno-alias -qopt-report=3 -DDO_MPI -O3 -xSAPPHIRERAPIDS -mprefer-vector-width=512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=icx -fiopenmp'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (32 threads)
Double Precision: true
Run Date/Time: 2024-03-27, 17:58:26
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303522, atom count : 4000000
Wed Mar 27 17:58:26 2024: Initialization Finished
Wed Mar 27 17:58:26 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303522 -1.243619295122 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.0798 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.0784 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.0790 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.0791 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.0774 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.0774 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.0775 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.0774 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.0773 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.0773 4000000
Wed Mar 27 17:58:42 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303522
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 16.0699 16.0699 102.93
loop 1 15.6131 15.6131 100.00
timestep 10 1.5613 15.6125 100.00
position 100 0.0018 0.1751 1.12
velocity 200 0.0018 0.3521 2.26
redistribute 101 0.0475 4.7972 30.73
atomHalo 101 0.0252 2.5461 16.31
force 101 0.1031 10.4132 66.70
commHalo 303 0.0028 0.8565 5.49
commReduce 39 0.0002 0.0078 0.05
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 1: 16.0697 0: 16.0699 16.0698 0.0001
loop 0: 15.6131 1: 15.6131 15.6131 0.0000
timestep 0: 15.6125 1: 15.6128 15.6127 0.0002
position 0: 0.1751 1: 0.1972 0.1861 0.0111
velocity 1: 0.3355 0: 0.3521 0.3438 0.0083
redistribute 0: 4.7972 1: 4.8030 4.8001 0.0029
atomHalo 1: 2.4746 0: 2.5461 2.5103 0.0357
force 1: 10.3950 0: 10.4132 10.4041 0.0091
commHalo 1: 0.7966 0: 0.8565 0.8265 0.0299
commReduce 0: 0.0078 1: 0.0161 0.0120 0.0042
---------------------------------------------------
Average atom update rate: 0.08 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.04 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 25.62 atoms/us
---------------------------------------------------
Wed Mar 27 17:58:42 2024: CoMD Ending
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117809)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117804)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_5 #
#############################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117975)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 117980)Wed Mar 27 17:58:51 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: idp10.benchmarkcenter.megware.com
kernel name: 'Linux'
kernel release: '5.14.0-362.13.1.el9_3.x86_64'
processor: 'x86_64'
Build:
CC: '/cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fno-alias -qopt-report=3 -DDO_MPI -O3 -xSAPPHIRERAPIDS -mprefer-vector-width=512 -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=icx -fiopenmp'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (48 threads)
Double Precision: true
Run Date/Time: 2024-03-27, 17:58:51
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303481, atom count : 4000000
Wed Mar 27 17:58:51 2024: Initialization Finished
Wed Mar 27 17:58:51 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303481 -1.243619295081 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.0740 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.0733 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.0733 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.0734 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.0725 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.0707 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.0707 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.0706 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.0704 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.0708 4000000
Wed Mar 27 17:59:05 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303481
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 14.8464 14.8464 103.12
loop 1 14.3966 14.3966 100.00
timestep 10 1.4396 14.3960 100.00
position 100 0.0036 0.3640 2.53
velocity 200 0.0029 0.5880 4.08
redistribute 101 0.0527 5.3257 36.99
atomHalo 101 0.0260 2.6280 18.25
force 101 0.0813 8.2077 57.01
commHalo 303 0.0033 0.9854 6.84
commReduce 39 0.0004 0.0168 0.12
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 1: 14.8458 0: 14.8464 14.8461 0.0003
loop 0: 14.3966 1: 14.3966 14.3966 0.0000
timestep 0: 14.3960 1: 14.3963 14.3961 0.0002
position 1: 0.2756 0: 0.3640 0.3198 0.0442
velocity 1: 0.4732 0: 0.5880 0.5306 0.0574
redistribute 0: 5.3257 1: 5.4975 5.4116 0.0859
atomHalo 0: 2.6280 1: 2.9073 2.7677 0.1397
force 0: 8.2077 1: 8.2290 8.2183 0.0106
commHalo 0: 0.9854 1: 1.1134 1.0494 0.0640
commReduce 0: 0.0168 1: 0.0343 0.0255 0.0087
---------------------------------------------------
Average atom update rate: 0.07 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.04 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 27.79 atoms/us
---------------------------------------------------
Wed Mar 27 17:59:05 2024: CoMD Ending
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117980)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 117975)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/CoMD_DDR/intel/CoMD/run/oneview_runs/compilers/icx_2/oneview_results_scal/tools/lprof_npsu_run_6 #
#############################################################################################################################################################################################