* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 570752)
* Info: Process launched (host o401, process 570754)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 1 threads on rank 0
0-> 0
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.09923
LPlusTimes 10 54.17725
LTimes 10 55.50005
Population 10 4.30307
Scattering 10 1633.91200
Solve 1 1799.61574
Source 10 0.01678
SweepSolver 10 46.31607
SweepSubdomain 160 34.49251
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.099231,54.177254,55.500052,4.303067,1633.912002,1799.615744,0.016775,46.316066,34.492513
Figures of Merit
================
Throughput: 2.237440e+06 [unknowns/(second/iteration)]
Grind time : 4.469394e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 74.47203 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host o401, process 570752)
* Info: Process finished (host o401, process 570754)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0
To display your profiling results:
########################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
########################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_0 #
########################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 570963)
* Info: Process launched (host o401, process 570965)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 2 threads on rank 0
0-> 0 1-> 28
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.09432
LPlusTimes 10 28.11304
LTimes 10 30.68680
Population 10 2.32574
Scattering 10 819.95254
Solve 1 916.32075
Source 10 0.00921
SweepSolver 10 29.87011
SweepSubdomain 160 21.30041
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.094320,28.113039,30.686803,2.325737,819.952537,916.320746,0.009213,29.870110,21.300414
Figures of Merit
================
Throughput: 4.394238e+06 [unknowns/(second/iteration)]
Grind time : 2.275707e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 71.31013 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host o401, process 570963)
* Info: Process finished (host o401, process 570965)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1
To display your profiling results:
########################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
########################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_1 #
########################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 571123)
* Info: Process launched (host o401, process 571125)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 4 threads on rank 0
0-> 0 1-> 14 2-> 28 3-> 42
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.10382
LPlusTimes 10 13.96254
LTimes 10 16.94341
Population 10 1.11011
Scattering 10 415.74811
Solve 1 471.36520
Source 10 0.00542
SweepSolver 10 18.23974
SweepSubdomain 160 11.02036
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.103824,13.962536,16.943412,1.110111,415.748112,471.365196,0.005420,18.239741,11.020357
Figures of Merit
================
Throughput: 8.542277e+06 [unknowns/(second/iteration)]
Grind time : 1.170648e-07 [(seconds/iteration)/unknowns]
Sweep efficiency : 60.41948 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host o401, process 571125)
* Info: Process finished (host o401, process 571123)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2
To display your profiling results:
########################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
########################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_2 #
########################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 571266)
* Info: Process launched (host o401, process 571268)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 8 threads on rank 0
0-> 0 1-> 7 2-> 14 3-> 21 4-> 28 5-> 35 6-> 42 7-> 49
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.09190
LPlusTimes 10 8.19369
LTimes 10 9.75829
Population 10 1.13048
Scattering 10 218.09247
Solve 1 250.24649
Source 10 0.00354
SweepSolver 10 7.73954
SweepSubdomain 160 5.53984
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091896,8.193685,9.758293,1.130478,218.092475,250.246486,0.003537,7.739544,5.539837
Figures of Merit
================
Throughput: 1.609026e+07 [unknowns/(second/iteration)]
Grind time : 6.214939e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 71.57834 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host o401, process 571266)
* Info: Process finished (host o401, process 571268)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3
To display your profiling results:
########################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
########################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_3 #
########################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 571426)
* Info: Process launched (host o401, process 571428)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 16 threads on rank 0
0-> 0 1->115 2-> 7 3->122 4-> 14 5->129 6-> 21 7->136
8-> 28 9->143 10-> 35 11->150 12-> 42 13->157 14-> 49 15->164
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.09165
LPlusTimes 10 4.72703
LTimes 10 6.18573
Population 10 0.40650
Scattering 10 115.15252
Solve 1 136.49909
Source 10 0.00280
SweepSolver 10 4.68193
SweepSubdomain 160 2.85956
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091647,4.727033,6.185730,0.406503,115.152518,136.499087,0.002797,4.681925,2.859563
Figures of Merit
================
Throughput: 2.949860e+07 [unknowns/(second/iteration)]
Grind time : 3.389991e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 61.07665 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host o401, process 571428)
* Info: Process finished (host o401, process 571426)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4
To display your profiling results:
########################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
########################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_4 #
########################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 571647)
* Info: Process launched (host o401, process 571649)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 32 threads on rank 0
0-> 0 1-> 2 2-> 4 3-> 6 4-> 8 5-> 10 6-> 12 7-> 14
8-> 16 9-> 18 10-> 20 11-> 22 12-> 24 13-> 26 14-> 28 15-> 30
16-> 32 17->145 18-> 35 19->148 20-> 38 21->151 22-> 41 23->154
24-> 44 25->157 26-> 47 27->160 28-> 50 29->163 30-> 53 31->166
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.09175
LPlusTimes 10 3.91778
LTimes 10 4.82476
Population 10 0.38046
Scattering 10 61.00209
Solve 1 78.60612
Source 10 0.00243
SweepSolver 10 3.13457
SweepSubdomain 160 1.61753
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091754,3.917781,4.824755,0.380461,61.002094,78.606120,0.002432,3.134565,1.617531
Figures of Merit
================
Throughput: 5.122415e+07 [unknowns/(second/iteration)]
Grind time : 1.952204e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 51.60304 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host o401, process 571649)
* Info: Process finished (host o401, process 571647)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5
To display your profiling results:
########################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
########################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_5 #
########################################################################################################################################################################################################
* Info: Selecting the 'perf-high-ppn' engine for node o401
* Info: Process launched (host o401, process 571998)
* Info: Process launched (host o401, process 572000)
_ __ _ _
| |/ / (_) | |
| ' / _ __ _ _ __ | | __ ___
| < | '__|| || '_ \ | |/ // _ \
| . \ | | | || |_) || <| __/
|_|\_\|_| |_|| .__/ |_|\_\\___|
| |
|_| Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /opt/intel/oneapi/mpi/2021.12/bin/mpiicpc
Compiler Flags: "-O3 -march=native -O2 -march=haswell -fno-tree-vectorize -fno-openmp-simd -flto -funroll-loops -ffast-math -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -cxx=g++ -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 56 threads on rank 0
0-> 0 1-> 1 2-> 2 3-> 3 4-> 4 5-> 5 6-> 6 7-> 7
8-> 8 9-> 9 10-> 10 11-> 11 12-> 12 13-> 13 14-> 14 15-> 15
16-> 16 17-> 17 18-> 18 19-> 19 20-> 20 21-> 21 22-> 22 23-> 23
24-> 24 25-> 25 26-> 26 27-> 27 28-> 28 29-> 29 30-> 30 31-> 31
32-> 32 33-> 33 34-> 34 35-> 35 36-> 36 37-> 37 38-> 38 39-> 39
40-> 40 41-> 41 42-> 42 43-> 43 44-> 44 45-> 45 46-> 46 47-> 47
48-> 48 49-> 49 50-> 50 51-> 51 52-> 52 53-> 53 54-> 54 55-> 55
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space: Procs: Subdomains (local/global):
--------------------- ---------- --------------------------
(P) Energy: 1 2 / 2
(Q) Direction: 1 8 / 8
(R) Space: 2 1 / 2
(Rx,Ry,Rz) R in XYZ: 2x1x1 1x1x1 / 2x1x1
(PQR) TOTAL: 2 16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable Num Elements Megabytes
-------------- ------------ ---------
data/sigs 15728640 120.000
dx 16 0.000
dy 16 0.000
dz 16 0.000
ell 2400 0.018
ell_plus 2400 0.018
i_plane 25165824 192.000
j_plane 25165824 192.000
k_plane 25165824 192.000
mixelem_to_fraction 4352 0.033
phi 104857600 800.000
phi_out 104857600 800.000
psi 402653184 3072.000
quadrature/w 96 0.001
quadrature/xcos 96 0.001
quadrature/ycos 96 0.001
quadrature/zcos 96 0.001
rhs 402653184 3072.000
sigt_zonal 4194304 32.000
volume 4096 0.031
-------- ------------ ---------
TOTAL 1110455664 8472.104
Generation Complete!
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer Count Seconds
---------------- ------------ ------------
Generate 1 0.09159
LPlusTimes 10 5.14745
LTimes 10 4.77127
Population 10 0.78831
Scattering 10 49.91892
Solve 1 69.28979
Source 10 0.00218
SweepSolver 10 3.32337
SweepSubdomain 160 1.69590
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.091585,5.147448,4.771266,0.788306,49.918920,69.289795,0.002179,3.323375,1.695904
Figures of Merit
================
Throughput: 5.811147e+07 [unknowns/(second/iteration)]
Grind time : 1.720831e-08 [(seconds/iteration)/unknowns]
Sweep efficiency : 51.02958 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
* Info: Process finished (host o401, process 572000)
* Info: Process finished (host o401, process 571998)
Your experiment path is /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6
To display your profiling results:
########################################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
########################################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-node | maqao lprof -df -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-process | maqao lprof -df -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
# Functions | Per-thread | maqao lprof -df -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Cluster-wide | maqao lprof -dl xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-node | maqao lprof -dl -dn xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-process | maqao lprof -dl -dp xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_results_scal/tools/lprof_npsu_run_6 #
########################################################################################################################################################################################################