* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115663)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115668)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 1 threads on rank 0
0-> 0
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.04195
LPlusTimes 10 22.15931
LTimes 10 23.37315
Population 10 8.55787
Scattering 10 1160.24743
Solve 1 1237.90936
Source 10 0.11748
SweepSolver 10 19.66133
SweepSubdomain 160 18.26552
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.041948,22.159314,23.373153,8.557865,1160.247426,1237.909362,0.117478,19.661333,18.265518
Figures of Merit
================
Throughput: 3.252687e+06 [unknowns/(second/iteration)]
Grind time : 3.074381e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 92.90071 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115668)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115663)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_0 #
#################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115805)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115810)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 2 threads on rank 0
0-> 0 1-> 24
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.04259
LPlusTimes 10 11.17036
LTimes 10 12.52751
Population 10 5.06895
Scattering 10 579.54534
Solve 1 625.56195
Source 10 0.06132
SweepSolver 10 13.16952
SweepSubdomain 160 11.24114
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.042591,11.170359,12.527508,5.068955,579.545337,625.561952,0.061316,13.169523,11.241136
Figures of Merit
================
Throughput: 6.436664e+06 [unknowns/(second/iteration)]
Grind time : 1.553600e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 85.35720 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115810)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115805)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_1 #
#################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115916)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 115921)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 4 threads on rank 0
0-> 0 1-> 12 2-> 24 3-> 36
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.04238
LPlusTimes 10 5.96350
LTimes 10 6.35309
Population 10 2.41175
Scattering 10 290.57646
Solve 1 317.08123
Source 10 0.03232
SweepSolver 10 7.54883
SweepSubdomain 160 5.90563
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.042382,5.963496,6.353088,2.411746,290.576460,317.081233,0.032320,7.548835,5.905629
Figures of Merit
================
Throughput: 1.269874e+07 [unknowns/(second/iteration)]
Grind time : 7.874798e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 78.23233 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115921)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 115916)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_2 #
#################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116018)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116024)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 8 threads on rank 0
0-> 0 1-> 6 2-> 12 3-> 18 4-> 24 5-> 30 6-> 36 7-> 42
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.04076
LPlusTimes 10 3.44268
LTimes 10 3.51172
Population 10 1.25713
Scattering 10 147.00756
Solve 1 164.11541
Source 10 0.01930
SweepSolver 10 4.79181
SweepSubdomain 160 3.21864
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.040759,3.442679,3.511724,1.257130,147.007564,164.115412,0.019302,4.791809,3.218641
Figures of Merit
================
Throughput: 2.453476e+07 [unknowns/(second/iteration)]
Grind time : 4.075850e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 67.16964 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116024)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116018)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_3 #
#################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116138)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116143)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 16 threads on rank 0
0-> 0 1-> 3 2-> 6 3-> 9 4-> 12 5-> 15 6-> 18 7-> 21
8-> 24 9-> 27 10-> 30 11-> 33 12-> 36 13-> 39 14-> 42 15-> 45
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.03792
LPlusTimes 10 1.94862
LTimes 10 2.19973
Population 10 0.25538
Scattering 10 77.30844
Solve 1 91.72583
Source 10 0.00859
SweepSolver 10 5.91798
SweepSubdomain 160 1.66461
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.037920,1.948616,2.199732,0.255380,77.308443,91.725827,0.008591,5.917984,1.664609
Figures of Merit
================
Throughput: 4.389747e+07 [unknowns/(second/iteration)]
Grind time : 2.278036e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 28.12797 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116143)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116138)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_4 #
#################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116268)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116273)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 32 threads on rank 0
0-> 0 1-> 97 2-> 3 3->100 4-> 6 5->103 6-> 9 7->106
8-> 12 9->109 10-> 15 11->112 12-> 18 13->115 14-> 21 15->118
16-> 24 17->121 18-> 27 19->124 20-> 30 21->127 22-> 33 23->130
24-> 36 25->133 26-> 39 27->136 28-> 42 29->139 30-> 45 31->142
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.04093
LPlusTimes 10 1.59042
LTimes 10 1.66630
Population 10 0.13222
Scattering 10 47.18606
Solve 1 60.46603
Source 10 0.00631
SweepSolver 10 5.62719
SweepSubdomain 160 0.98740
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.040928,1.590425,1.666296,0.132224,47.186056,60.466029,0.006309,5.627190,0.987396
Figures of Merit
================
Throughput: 6.659164e+07 [unknowns/(second/iteration)]
Grind time : 1.501690e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 17.54687 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116273)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116268)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_5 #
#################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node idp09.benchmarkcenter.megware.com
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116464)
* Warning: Found no event able to derive walltime: prepending ref-cycles
* Info: Process launched (host idp09.benchmarkcenter.megware.com, process 116469)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /cluster/intel/oneapi/2024.0.0/mpi/2021.11/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -xSAPPHIRERAPIDS -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=icpx -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 48 threads on rank 0
0-> 0 1-> 1 2-> 2 3-> 3 4-> 4 5-> 5 6-> 6 7-> 7
8-> 8 9-> 9 10-> 10 11-> 11 12-> 12 13-> 13 14-> 14 15-> 15
16-> 16 17-> 17 18-> 18 19-> 19 20-> 20 21-> 21 22-> 22 23-> 23
24-> 24 25-> 25 26-> 26 27-> 27 28-> 28 29-> 29 30-> 30 31-> 31
32-> 32 33-> 33 34-> 34 35-> 35 36-> 36 37-> 37 38-> 38 39-> 39
40-> 40 41-> 41 42-> 42 43-> 43 44-> 44 45-> 45 46-> 46 47-> 47
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.04013
LPlusTimes 10 2.89191
LTimes 10 1.69540
Population 10 0.12031
Scattering 10 38.12629
Solve 1 61.28889
Source 10 0.00472
SweepSolver 10 13.83383
SweepSubdomain 160 0.79122
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.040127,2.891908,1.695405,0.120311,38.126292,61.288892,0.004720,13.833830,0.791221
Figures of Merit
================
Throughput: 6.569758e+07 [unknowns/(second/iteration)]
Grind time : 1.522126e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 5.71946 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116469)
* Info: Process finished (host idp09.benchmarkcenter.megware.com, process 116464)
Your experiment path is /home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6
To display your profiling results:
#################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/eoseret/qaas_runs_CPU_9468/Kripke_DDR/intel/Kripke/run/oneview_runs/compilers/icx_5/oneview_results_scal/tools/lprof_npsu_run_6 #
#################################################################################################################################################################################################