* Info: Selecting the 'perf-high-ppn' engine for node o405
* Info: Process launched (host o405, process 2502812)
* Info: Process launched (host o405, process 2502815)Thu Apr 18 02:55:27 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: o405
kernel name: 'Linux'
kernel release: '4.18.0-477.27.1.el8_8.x86_64'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.8.0/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O2 -march=sapphirerapids -mprefer-vector-width=512 -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (1 threads)
Double Precision: true
Run Date/Time: 2024-04-18, 02:55:27
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303588, atom count : 4000000
Thu Apr 18 02:55:32 2024: Initialization Finished
Thu Apr 18 02:55:32 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303588 -1.243619295188 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650499 -1.233157709948 0.067098059449 519.0938 1.9950 4000000
20 20.00 -1.166048438417 -1.208183014318 0.042134575902 325.9677 2.4643 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 2.5731 4000000
40 40.00 -1.166042093135 -1.183625399859 0.017583306724 136.0305 2.5973 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 2.6038 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 2.6027 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 2.5989 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 2.5931 4000000
90 90.00 -1.166048006780 -1.203820491598 0.037772484818 292.2210 2.5879 4000000
100 100.00 -1.166049793505 -1.206862845061 0.040813051556 315.7439 2.5820 4000000
Thu Apr 18 03:03:56 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303588
Final energy : -1.166049793505
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 508.8474 508.8474 100.97
loop 1 503.9610 503.9610 100.00
timestep 10 50.3960 503.9597 100.00
position 100 0.0174 1.7406 0.35
velocity 200 0.0167 3.3386 0.66
redistribute 101 0.1319 13.3245 2.64
atomHalo 101 0.0285 2.8744 0.57
force 101 4.8419 489.0307 97.04
commHalo 303 0.0028 0.8626 0.17
commReduce 39 0.0023 0.0896 0.02
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 508.8474 1: 508.8639 508.8556 0.0082
loop 1: 503.9609 0: 503.9610 503.9610 0.0000
timestep 0: 503.9597 1: 503.9599 503.9598 0.0001
position 0: 1.7406 1: 1.7436 1.7421 0.0015
velocity 1: 3.3279 0: 3.3386 3.3333 0.0054
redistribute 0: 13.3245 1: 13.6263 13.4754 0.1509
atomHalo 0: 2.8744 1: 3.2908 3.0826 0.2082
force 1: 488.7020 0: 489.0307 488.8664 0.1643
commHalo 0: 0.8626 1: 1.2811 1.0718 0.2093
commReduce 1: 0.0533 0: 0.0896 0.0715 0.0182
---------------------------------------------------
Average atom update rate: 2.52 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 1.26 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 0.79 atoms/us
---------------------------------------------------
Thu Apr 18 03:03:56 2024: CoMD Ending
* Info: Process finished (host o405, process 2502812)
* Info: Process finished (host o405, process 2502815)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_0 #
######################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o405
* Info: Process launched (host o405, process 2502951)
* Info: Process launched (host o405, process 2502953)Thu Apr 18 03:04:08 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: o405
kernel name: 'Linux'
kernel release: '4.18.0-477.27.1.el8_8.x86_64'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.8.0/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O2 -march=sapphirerapids -mprefer-vector-width=512 -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (2 threads)
Double Precision: true
Run Date/Time: 2024-04-18, 03:04:08
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063304092, atom count : 4000000
Thu Apr 18 03:04:11 2024: Initialization Finished
Thu Apr 18 03:04:11 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063304092 -1.243619295692 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 1.0325 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 1.2658 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 1.3203 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 1.3341 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 1.3363 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 1.3348 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 1.3332 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 1.3301 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 1.3283 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 1.3243 4000000
Thu Apr 18 03:08:30 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063304092
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 261.5800 261.5800 101.08
loop 1 258.7926 258.7926 100.00
timestep 10 25.8791 258.7913 100.00
position 100 0.0102 1.0217 0.39
velocity 200 0.0098 1.9543 0.76
redistribute 101 0.1077 10.8811 4.20
atomHalo 101 0.0349 3.5245 1.36
force 101 2.4425 246.6896 95.32
commHalo 303 0.0040 1.2213 0.47
commReduce 39 0.0025 0.0986 0.04
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 261.5800 1: 261.5811 261.5806 0.0006
loop 0: 258.7926 1: 258.7926 258.7926 0.0000
timestep 0: 258.7913 1: 258.7916 258.7914 0.0001
position 1: 1.0181 0: 1.0217 1.0199 0.0018
velocity 1: 1.9476 0: 1.9543 1.9509 0.0033
redistribute 0: 10.8811 1: 10.9569 10.9190 0.0379
atomHalo 1: 3.4794 0: 3.5245 3.5019 0.0226
force 1: 246.6322 0: 246.6896 246.6609 0.0287
commHalo 1: 1.0152 0: 1.2213 1.1183 0.1030
commReduce 1: 0.0237 0: 0.0986 0.0612 0.0375
---------------------------------------------------
Average atom update rate: 1.29 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.65 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 1.55 atoms/us
---------------------------------------------------
Thu Apr 18 03:08:30 2024: CoMD Ending
* Info: Process finished (host o405, process 2502951)
* Info: Process finished (host o405, process 2502953)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_1 #
######################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o405
* Info: Process launched (host o405, process 2503066)
* Info: Process launched (host o405, process 2503068)Thu Apr 18 03:08:43 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: o405
kernel name: 'Linux'
kernel release: '4.18.0-477.27.1.el8_8.x86_64'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.8.0/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O2 -march=sapphirerapids -mprefer-vector-width=512 -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (4 threads)
Double Precision: true
Run Date/Time: 2024-04-18, 03:08:43
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303067, atom count : 4000000
Thu Apr 18 03:08:44 2024: Initialization Finished
Thu Apr 18 03:08:44 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303067 -1.243619294667 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650499 -1.233157709949 0.067098059449 519.0938 0.5420 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.6589 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.6851 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.6900 4000000
50 50.00 -1.166051684893 -1.193713710257 0.027662025365 214.0030 0.6930 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.6933 4000000
70 70.00 -1.166052143011 -1.204911990845 0.038859847833 300.6332 0.6923 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.6902 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.6876 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.6879 4000000
Thu Apr 18 03:10:59 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303067
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 136.1376 136.1376 101.29
loop 1 134.4060 134.4060 100.00
timestep 10 13.4405 134.4046 100.00
position 100 0.0063 0.6258 0.47
velocity 200 0.0060 1.1949 0.89
redistribute 101 0.0945 9.5482 7.10
atomHalo 101 0.0375 3.7873 2.82
force 101 1.2272 123.9424 92.21
commHalo 303 0.0038 1.1484 0.85
commReduce 39 0.0008 0.0314 0.02
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 136.1376 1: 136.1387 136.1381 0.0005
loop 0: 134.4060 1: 134.4060 134.4060 0.0000
timestep 0: 134.4046 1: 134.4049 134.4048 0.0001
position 0: 0.6258 1: 0.6266 0.6262 0.0004
velocity 0: 1.1949 1: 1.1955 1.1952 0.0003
redistribute 0: 9.5482 1: 9.6497 9.5990 0.0507
atomHalo 0: 3.7873 1: 3.9087 3.8480 0.0607
force 1: 123.8202 0: 123.9424 123.8813 0.0611
commHalo 0: 1.1484 1: 1.2473 1.1978 0.0494
commReduce 0: 0.0314 1: 0.0477 0.0396 0.0082
---------------------------------------------------
Average atom update rate: 0.67 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.34 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 2.98 atoms/us
---------------------------------------------------
Thu Apr 18 03:10:59 2024: CoMD Ending
* Info: Process finished (host o405, process 2503066)
* Info: Process finished (host o405, process 2503068)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_2 #
######################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o405
* Info: Process launched (host o405, process 2503198)
* Info: Process launched (host o405, process 2503200)Thu Apr 18 03:11:12 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: o405
kernel name: 'Linux'
kernel release: '4.18.0-477.27.1.el8_8.x86_64'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.8.0/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O2 -march=sapphirerapids -mprefer-vector-width=512 -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (8 threads)
Double Precision: true
Run Date/Time: 2024-04-18, 03:11:12
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303663, atom count : 4000000
Thu Apr 18 03:11:13 2024: Initialization Finished
Thu Apr 18 03:11:13 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303663 -1.243619295263 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.2979 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.3578 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.3623 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.3653 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.3661 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.3660 4000000
70 70.00 -1.166052143011 -1.204911990845 0.038859847833 300.6332 0.3656 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.3647 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.3640 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.3630 4000000
Thu Apr 18 03:12:25 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303663
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 72.6737 72.6737 101.70
loop 1 71.4557 71.4557 100.00
timestep 10 7.1455 71.4546 100.00
position 100 0.0049 0.4867 0.68
velocity 200 0.0047 0.9440 1.32
redistribute 101 0.0749 7.5677 10.59
atomHalo 101 0.0303 3.0653 4.29
force 101 0.6232 62.9467 88.09
commHalo 303 0.0031 0.9535 1.33
commReduce 39 0.0005 0.0185 0.03
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 72.6737 1: 72.6743 72.6740 0.0003
loop 1: 71.4557 0: 71.4557 71.4557 0.0000
timestep 0: 71.4546 1: 71.4548 71.4547 0.0001
position 1: 0.4843 0: 0.4867 0.4855 0.0012
velocity 1: 0.9356 0: 0.9440 0.9398 0.0042
redistribute 0: 7.5677 1: 7.6683 7.6180 0.0503
atomHalo 0: 3.0653 1: 3.1169 3.0911 0.0258
force 1: 62.8672 0: 62.9467 62.9070 0.0397
commHalo 1: 0.9280 0: 0.9535 0.9407 0.0128
commReduce 1: 0.0153 0: 0.0185 0.0169 0.0016
---------------------------------------------------
Average atom update rate: 0.36 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.18 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 5.60 atoms/us
---------------------------------------------------
Thu Apr 18 03:12:25 2024: CoMD Ending
* Info: Process finished (host o405, process 2503200)
* Info: Process finished (host o405, process 2503198)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_3 #
######################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o405
* Info: Process launched (host o405, process 2503348)
* Info: Process launched (host o405, process 2503350)Thu Apr 18 03:12:37 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: o405
kernel name: 'Linux'
kernel release: '4.18.0-477.27.1.el8_8.x86_64'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.8.0/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O2 -march=sapphirerapids -mprefer-vector-width=512 -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (16 threads)
Double Precision: true
Run Date/Time: 2024-04-18, 03:12:37
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303487, atom count : 4000000
Thu Apr 18 03:12:38 2024: Initialization Finished
Thu Apr 18 03:12:38 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303487 -1.243619295087 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.1775 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.2078 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.2073 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.2063 4000000
50 50.00 -1.166051684893 -1.193713710257 0.027662025365 214.0030 0.2066 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.2065 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.2063 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.2058 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.2056 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.2053 4000000
Thu Apr 18 03:13:18 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303487
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 41.6661 41.6661 102.37
loop 1 40.7019 40.7019 100.00
timestep 10 4.0701 40.7008 100.00
position 100 0.0049 0.4857 1.19
velocity 200 0.0046 0.9188 2.26
redistribute 101 0.0728 7.3500 18.06
atomHalo 101 0.0308 3.1153 7.65
force 101 0.3191 32.2266 79.18
commHalo 303 0.0030 0.9111 2.24
commReduce 39 0.0019 0.0725 0.18
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 41.6661 1: 41.6661 41.6661 0.0000
loop 0: 40.7019 0: 40.7019 40.7019 0.0000
timestep 0: 40.7008 1: 40.7010 40.7009 0.0001
position 1: 0.4850 0: 0.4857 0.4854 0.0003
velocity 1: 0.9148 0: 0.9188 0.9168 0.0020
redistribute 0: 7.3500 1: 7.4187 7.3844 0.0344
atomHalo 0: 3.1153 1: 3.1725 3.1439 0.0286
force 1: 32.1645 0: 32.2266 32.1955 0.0310
commHalo 0: 0.9111 1: 0.9654 0.9383 0.0272
commReduce 1: 0.0074 0: 0.0725 0.0399 0.0325
---------------------------------------------------
Average atom update rate: 0.20 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.10 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 9.83 atoms/us
---------------------------------------------------
Thu Apr 18 03:13:18 2024: CoMD Ending
* Info: Process finished (host o405, process 2503350)
* Info: Process finished (host o405, process 2503348)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_4 #
######################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o405
* Info: Process launched (host o405, process 2503565)
* Info: Process launched (host o405, process 2503567)Thu Apr 18 03:13:30 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: o405
kernel name: 'Linux'
kernel release: '4.18.0-477.27.1.el8_8.x86_64'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.8.0/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O2 -march=sapphirerapids -mprefer-vector-width=512 -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (32 threads)
Double Precision: true
Run Date/Time: 2024-04-18, 03:13:30
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303522, atom count : 4000000
Thu Apr 18 03:13:31 2024: Initialization Finished
Thu Apr 18 03:13:31 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303522 -1.243619295122 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.1253 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.1421 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.1426 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.1428 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.1431 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.1435 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.1435 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.1432 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.1430 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.1427 4000000
Thu Apr 18 03:13:59 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303522
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 29.1259 29.1259 103.14
loop 1 28.2404 28.2404 100.00
timestep 10 2.8239 28.2392 100.00
position 100 0.0053 0.5348 1.89
velocity 200 0.0047 0.9475 3.36
redistribute 101 0.0879 8.8819 31.45
atomHalo 101 0.0391 3.9496 13.99
force 101 0.1789 18.0645 63.97
commHalo 303 0.0031 0.9281 3.29
commReduce 39 0.0001 0.0040 0.01
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 29.1259 1: 29.1268 29.1264 0.0004
loop 1: 28.2404 0: 28.2404 28.2404 0.0000
timestep 0: 28.2392 1: 28.2394 28.2393 0.0001
position 1: 0.5336 0: 0.5348 0.5342 0.0006
velocity 1: 0.9274 0: 0.9475 0.9375 0.0101
redistribute 0: 8.8819 1: 8.9265 8.9042 0.0223
atomHalo 0: 3.9496 1: 4.0149 3.9822 0.0327
force 1: 18.0366 0: 18.0645 18.0505 0.0139
commHalo 0: 0.9281 1: 1.0166 0.9724 0.0443
commReduce 0: 0.0040 1: 0.0614 0.0327 0.0287
---------------------------------------------------
Average atom update rate: 0.14 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.07 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 14.16 atoms/us
---------------------------------------------------
Thu Apr 18 03:13:59 2024: CoMD Ending
* Info: Process finished (host o405, process 2503567)
* Info: Process finished (host o405, process 2503565)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_5 #
######################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o405
* Info: Process launched (host o405, process 2503907)
* Info: Process launched (host o405, process 2503909)Thu Apr 18 03:14:13 2024: Starting Initialization
Mini-Application Name : CoMD-openmp-mpi
Mini-Application Version : 1.1
Platform:
hostname: o405
kernel name: 'Linux'
kernel release: '4.18.0-477.27.1.el8_8.x86_64'
processor: 'x86_64'
Build:
CC: '/opt/intel/oneapi/mpi/2021.8.0/bin/mpiicc'
compiler version: 'unknown'
CFLAGS: '-O3 -march=native -fsave-optimization-record -DDO_MPI -O2 -march=sapphirerapids -mprefer-vector-width=512 -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cc=gcc -fopenmp -funroll-loops'
LDFLAGS: ' '
using MPI: true
Threading: OpenMP (56 threads)
Double Precision: true
Run Date/Time: 2024-04-18, 03:14:13
Command Line Parameters:
doeam: 0
potDir: pots
potName: Cu_u6.eam
potType: funcfl
nx: 100
ny: 100
nz: 100
xproc: 2
yproc: 1
zproc: 1
Lattice constant: -1 Angstroms
nSteps: 100
printRate: 10
Time step: 1 fs
Initial Temperature: 600 K
Initial Delta: 0 Angstroms
Simulation data:
Total atoms : 4000000
Min global bounds : [ 0.0000000000, 0.0000000000, 0.0000000000 ]
Max global bounds : [ 361.5000000000, 361.5000000000, 361.5000000000 ]
Decomposition data:
Processors : 2, 1, 1
Local boxes : 31, 62, 62 = 119164
Box size : [ 5.8306451613, 5.8306451613, 5.8306451613 ]
Box factor : [ 1.0074548875, 1.0074548875, 1.0074548875 ]
Max Link Cell Occupancy: 32 of 64
Potential data:
Potential type : Lennard-Jones
Species name : Cu
Atomic number : 29
Mass : 63.55 amu
Lattice Type : FCC
Lattice spacing : 3.615 Angstroms
Cutoff : 5.7875 Angstroms
Epsilon : 0.167 eV
Sigma : 2.315 Angstroms
Initial energy : -1.166063303464, atom count : 4000000
Thu Apr 18 03:14:14 2024: Initialization Finished
Thu Apr 18 03:14:14 2024: Starting simulation
# Performance
# Loop Time(fs) Total Energy Potential Energy Kinetic Energy Temperature (us/atom) # Atoms
0 0.00 -1.166063303464 -1.243619295064 0.077555991600 600.0000 0.0000 4000000
10 10.00 -1.166059650500 -1.233157709949 0.067098059449 519.0938 0.1020 4000000
20 20.00 -1.166048438416 -1.208183014318 0.042134575902 325.9677 0.1096 4000000
30 30.00 -1.166037590737 -1.186586197151 0.020548606414 158.9711 0.1088 4000000
40 40.00 -1.166042093134 -1.183625399859 0.017583306724 136.0305 0.1087 4000000
50 50.00 -1.166051684893 -1.193713710258 0.027662025365 214.0030 0.1092 4000000
60 60.00 -1.166054646931 -1.202662201513 0.036607554582 283.2087 0.1086 4000000
70 70.00 -1.166052143011 -1.204911990844 0.038859847833 300.6332 0.1094 4000000
80 80.00 -1.166048803912 -1.203635015020 0.037586211108 290.7799 0.1085 4000000
90 90.00 -1.166048006780 -1.203820491599 0.037772484818 292.2210 0.1095 4000000
100 100.00 -1.166049793504 -1.206862845060 0.040813051556 315.7439 0.1097 4000000
Thu Apr 18 03:14:36 2024: Ending simulation
Simulation Validation:
Initial energy : -1.166063303464
Final energy : -1.166049793504
eFinal/eInitial : 0.999988
Final atom count : 4000000, no atoms lost
Timings for Rank 0
Timer # Calls Avg/Call (s) Total (s) % Loop
___________________________________________________________________
total 1 22.5357 22.5357 103.95
loop 1 21.6790 21.6790 100.00
timestep 10 2.1678 21.6776 99.99
position 100 0.0056 0.5634 2.60
velocity 200 0.0048 0.9696 4.47
redistribute 101 0.0857 8.6545 39.92
atomHalo 101 0.0447 4.5133 20.82
force 101 0.1140 11.5134 53.11
commHalo 303 0.0083 2.5097 11.58
commReduce 39 0.0035 0.1381 0.64
Timing Statistics Across 2 Ranks:
Timer Rank: Min(s) Rank: Max(s) Avg(s) Stdev(s)
_____________________________________________________________________________
total 0: 22.5357 1: 22.5364 22.5360 0.0004
loop 0: 21.6790 1: 21.6790 21.6790 0.0000
timestep 0: 21.6776 1: 21.6780 21.6778 0.0002
position 1: 0.5609 0: 0.5634 0.5621 0.0013
velocity 1: 0.9694 0: 0.9696 0.9695 0.0001
redistribute 0: 8.6545 1: 8.7705 8.7125 0.0580
atomHalo 1: 3.8685 0: 4.5133 4.1909 0.3224
force 0: 11.5134 1: 11.5203 11.5168 0.0035
commHalo 1: 0.8602 0: 2.5097 1.6849 0.8248
commReduce 1: 0.0011 0: 0.1381 0.0696 0.0685
---------------------------------------------------
Average atom update rate: 0.11 us/atom/task
---------------------------------------------------
---------------------------------------------------
Average all atom update rate: 0.05 us/atom
---------------------------------------------------
---------------------------------------------------
Average atom rate: 18.45 atoms/us
---------------------------------------------------
Thu Apr 18 03:14:36 2024: CoMD Ending
* Info: Process finished (host o405, process 2503907)
* Info: Process finished (host o405, process 2503909)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6
To display your profiling results:
######################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
######################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-172-4338/intel/CoMD/run/oneview_runs/compilers/gcc_14/oneview_results_scal/tools/lprof_npsu_run_6 #
######################################################################################################################################################################################################