* Info: Detected 2 Lprof instances in ip-172-31-68-94: processes-per-node/ppn set accordingly.
If this is incorrect, rerun with an explicit value for this setting
* Warning: perf-events measurements are not allowed on node ip-172-31-68-94: selecting the no-perf engine. Try:
sudo sysctl -w kernel.perf_event_paranoid=1 (*)
To persist across reboots:
sudo sh -c 'echo kernel.perf_event_paranoid=1 >> /etc/sysctl.d/local.conf' (*)
(*) requires sudo permissions. If missing, contact administrators.
=1 allows both kernel+user-space measurements (=2: only user-space)
* Warning: The 'no-perf' engine is feature-limited and suffers higher overhead than other engines. It should be used only when perf-events are not available on the running Linux kernel - for instance with WSL1 (Windows Subsystem for Linux version 1) - or when the paranoid level (as displayed by 'sysctl kernel.perf_event_paranoid') cannot be lowered to 2 or less.
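The warnings above tie engine selection to the value of kernel.perf_event_paranoid. As a minimal sketch (not part of MAQAO), the check can be reproduced before launching a run. The helper names are hypothetical, and the threshold of 2 is taken from the warning text ("cannot be lowered to 2 or less"); the profiler's actual decision logic may differ.

```python
from pathlib import Path

# Standard Linux location of the sysctl value named in the warning.
PARANOID_PATH = Path("/proc/sys/kernel/perf_event_paranoid")

def read_paranoid_level(path=PARANOID_PATH):
    """Return the current paranoid level, or None if it cannot be read."""
    try:
        return int(path.read_text().strip())
    except (OSError, ValueError):
        return None

def perf_events_usable(level, max_level=2):
    """Assumption from the log text: perf-events need a level of 2 or less."""
    return level is not None and level <= max_level

if __name__ == "__main__":
    level = read_paranoid_level()
    print("perf_event_paranoid =", level)
    print("perf-events usable:", perf_events_usable(level))
```

On a system where the file is missing (for example a non-Linux host), read_paranoid_level returns None and the check reports perf-events as unusable.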
* Info: Process launched (host ip-172-31-68-94, process 362040)
* Info: Process launched (host ip-172-31-68-94, process 362042)
   _  __       _         _
  | |/ /      (_)       | |
  | ' /  _ __  _  _ __  | | __ ___
  |  <  | '__|| || '_ \ | |/ // _ \
  | . \ | |   | || |_) ||   <|  __/
  |_|\_\|_|   |_|| .__/ |_|\_\\___|
                 | |
                 |_|        Version 1.2.4
LLNL-CODE-775068
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC
Kripke is released under the BSD 3-Clause License, please see the
LICENSE file for the full license
This work was produced under the auspices of the U.S. Department of
Energy by Lawrence Livermore National Laboratory under Contract
DE-AC52-07NA27344.
Author: Adam J. Kunen
Compilation Options:
Architecture: OpenMP
Compiler: /home/kcamus/openmpi/openmpi-5.0.0/_install/bin/mpic++
Compiler Flags: "-O3 -march=native -g -fno-omit-frame-pointer -fcf-protection=none -no-pie -Wall -Wextra "
Linker Flags: " "
CHAI Enabled: No
CUDA Enabled: No
MPI Enabled: Yes
OpenMP Enabled: Yes
Caliper Enabled: No
OpenMP Thread->Core mapping for 96 threads on rank 0
0-> 0 1-> 1 2-> 2 3-> 3 4-> 4 5-> 5 6-> 6 7-> 7
8-> 8 9-> 9 10-> 10 11-> 11 12-> 12 13-> 13 14-> 14 15-> 15
16-> 16 17-> 17 18-> 18 19-> 19 20-> 20 21-> 21 22-> 22 23-> 23
24-> 24 25-> 25 26-> 26 27-> 27 28-> 28 29-> 29 30-> 30 31-> 31
32-> 32 33-> 33 34-> 34 35-> 35 36-> 36 37-> 37 38-> 38 39-> 39
40-> 40 41-> 41 42-> 42 43-> 43 44-> 44 45-> 45 46-> 46 47-> 47
48-> 48 49-> 49 50-> 50 51-> 51 52-> 52 53-> 53 54-> 54 55-> 55
56-> 56 57-> 57 58-> 58 59-> 59 60-> 60 61-> 61 62-> 62 63-> 63
64-> 64 65-> 65 66-> 66 67-> 67 68-> 68 69-> 69 70-> 70 71-> 71
72-> 72 73-> 73 74-> 74 75-> 75 76-> 76 77-> 77 78-> 78 79-> 79
80-> 80 81-> 81 82-> 82 83-> 83 84-> 84 85-> 85 86-> 86 87-> 87
88-> 88 89-> 89 90-> 90 91-> 91 92-> 92 93-> 93 94-> 94 95-> 95
Input Parameters
================
Problem Size:
Zones: 16 x 16 x 16 (4096 total)
Groups: 1024
Legendre Order: 4
Quadrature Set: Dummy S2 with 96 points
Physical Properties:
Total X-Sec: sigt=[0.100000, 0.000100, 0.100000]
Scattering X-Sec: sigs=[0.050000, 0.000050, 0.050000]
Solver Options:
Number iterations: 10
MPI Decomposition Options:
Total MPI tasks: 2
Spatial decomp: 2 x 1 x 1 MPI tasks
Block solve method: Sweep
Per-Task Options:
DirSets/Directions: 8 sets, 12 directions/set
GroupSet/Groups: 2 sets, 512 groups/set
Zone Sets: 1 x 1 x 1
Architecture: OpenMP
Data Layout: DGZ
Generating Problem
==================
Decomposition Space:   Procs:      Subdomains (local/global):
---------------------  ----------  --------------------------
(P) Energy:            1           2 / 2
(Q) Direction:         1           8 / 8
(R) Space:             2           1 / 2
(Rx,Ry,Rz) R in XYZ:   2x1x1       1x1x1 / 2x1x1
(PQR) TOTAL:           2           16 / 32
Material Volumes=[8.789062e+03, 1.177734e+05, 2.753438e+06]
Memory breakdown of Field variables:
Field Variable         Num Elements     Megabytes
--------------         ------------     ---------
data/sigs                  15728640       120.000
dx                               16         0.000
dy                               16         0.000
dz                               16         0.000
ell                            2400         0.018
ell_plus                       2400         0.018
i_plane                    25165824       192.000
j_plane                    25165824       192.000
k_plane                    25165824       192.000
mixelem_to_fraction            4352         0.033
phi                       104857600       800.000
phi_out                   104857600       800.000
psi                       402653184      3072.000
quadrature/w                     96         0.001
quadrature/xcos                  96         0.001
quadrature/ycos                  96         0.001
quadrature/zcos                  96         0.001
rhs                       402653184      3072.000
sigt_zonal                  4194304        32.000
volume                         4096         0.031
--------               ------------     ---------
TOTAL                    1110455664      8472.104
Generation Complete!
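The larger entries in the memory breakdown above appear to follow directly from the input parameters. The sketch below recomputes the psi, phi and sigt_zonal rows. It assumes 8-byte elements, 1 MB = 2**20 bytes, and (L + 1)^2 moments for Legendre order L. These assumptions are inferred from the numbers in this log rather than taken from Kripke's source.

```python
# Values copied from the "Input Parameters" section of the log.
zones = 16 * 16 * 16       # 4096 zones
groups = 1024
directions = 96
legendre_order = 4

# Assumption: the number of moments is (L + 1)^2 for Legendre order L.
moments = (legendre_order + 1) ** 2

# Assumption: each element occupies 8 bytes and 1 MB = 2**20 bytes.
BYTES_PER_ELEMENT = 8

def megabytes(num_elements):
    """Convert an element count to megabytes under the assumptions above."""
    return num_elements * BYTES_PER_ELEMENT / 2**20

psi_elements = zones * groups * directions   # compare with the psi row
phi_elements = zones * groups * moments      # compare with the phi row
sigt_zonal_elements = zones * groups         # compare with the sigt_zonal row

for name, count in [("psi", psi_elements),
                    ("phi", phi_elements),
                    ("sigt_zonal", sigt_zonal_elements)]:
    print(f"{name:12s} {count:12d} {megabytes(count):10.3f}")
```

Under these assumptions the three recomputed rows agree with the table: psi and rhs dominate the footprint at 3072 MB each.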
Steady State Solve
==================
iter 0: particle count=1.197998e+09, change=1.000000e+00
iter 1: particle count=1.801368e+09, change=3.349511e-01
iter 2: particle count=2.102278e+09, change=1.431351e-01
iter 3: particle count=2.251810e+09, change=6.640521e-02
iter 4: particle count=2.325888e+09, change=3.184924e-02
iter 5: particle count=2.362467e+09, change=1.548355e-02
iter 6: particle count=2.380471e+09, change=7.563193e-03
iter 7: particle count=2.389305e+09, change=3.697158e-03
iter 8: particle count=2.393627e+09, change=1.805479e-03
iter 9: particle count=2.395735e+09, change=8.801810e-04
Solver terminated
Timers
======
Timer                    Count       Seconds
----------------  ------------  ------------
Generate                     1       0.08356
LPlusTimes                  10       1.92212
LTimes                      10       1.94114
Population                  10       0.13178
Scattering                  10      22.21533
Solve                        1      29.75779
Source                      10       0.00229
SweepSolver                 10       1.43724
SweepSubdomain             160       0.45167
TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,Solve,Source,SweepSolver,SweepSubdomain
TIMER_DATA:0.083556,1.922119,1.941140,0.131777,22.215331,29.757790,0.002289,1.437244,0.451667
Figures of Merit
================
Throughput: 1.353102e+08 [unknowns/(second/iteration)]
Grind time : 7.390427e-09 [(seconds/iteration)/unknowns]
Sweep efficiency : 31.42591 [100.0 * SweepSubdomain time / SweepSolver time]
Number of unknowns: 402653184
END
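The TIMER_NAMES and TIMER_DATA lines above are machine-readable, and the figures of merit can be cross-checked from them. The sketch below parses both lines and recomputes the three values. The sweep-efficiency formula is the one printed in the log; the throughput formula (unknowns divided by Solve time per iteration) is an assumption that happens to reproduce the reported number. The helper name parse_timers is hypothetical.

```python
# Machine-readable timer lines copied from the log above.
names_line = ("TIMER_NAMES:Generate,LPlusTimes,LTimes,Population,Scattering,"
              "Solve,Source,SweepSolver,SweepSubdomain")
data_line = ("TIMER_DATA:0.083556,1.922119,1.941140,0.131777,22.215331,"
             "29.757790,0.002289,1.437244,0.451667")

def parse_timers(names_line, data_line):
    """Pair each timer name with its value in seconds."""
    names = names_line.split(":", 1)[1].split(",")
    values = [float(v) for v in data_line.split(":", 1)[1].split(",")]
    return dict(zip(names, values))

timers = parse_timers(names_line, data_line)

unknowns = 402653184   # "Number of unknowns" in the log
iterations = 10        # "Number iterations" in the log

# Assumption: throughput = unknowns / (Solve seconds per iteration).
throughput = unknowns / (timers["Solve"] / iterations)
grind_time = 1.0 / throughput
# Formula as printed in the log: 100 * SweepSubdomain time / SweepSolver time.
sweep_efficiency = 100.0 * timers["SweepSubdomain"] / timers["SweepSolver"]

print(f"Throughput:       {throughput:.6e}")
print(f"Grind time:       {grind_time:.6e}")
print(f"Sweep efficiency: {sweep_efficiency:.5f}")

# Share of the Solve time spent in each timed region, largest first.
for name, seconds in sorted(timers.items(), key=lambda kv: -kv[1]):
    print(f"{name:16s} {100.0 * seconds / timers['Solve']:6.2f}% of Solve")
```

The final loop makes the dominant cost visible at a glance: Scattering accounts for roughly three quarters of the Solve time in this run.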
* Info: Process finished (host ip-172-31-68-94, process 362040)
* Warning: Restricted access to kernel symbols:
to see kernel functions in profiling results, reprofile as root
or execute sudo sysctl -w kernel.kptr_restrict=0.
To make kptr_restrict=0 persist across reboots:
sudo sh -c "echo kernel.kptr_restrict=0 >> /etc/sysctl.d/local.conf"
* Info: Process finished (host ip-172-31-68-94, process 362042)
Your experiment path is /home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0
To display your profiling results:
#############################################################################################################################################################################################
# LEVEL | REPORT | COMMAND #
#############################################################################################################################################################################################
# Functions | Cluster-wide | maqao lprof -df xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
# Functions | Per-node | maqao lprof -df -dn xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
# Functions | Per-process | maqao lprof -df -dp xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
# Functions | Per-thread | maqao lprof -df -dt xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
# Loops | Cluster-wide | maqao lprof -dl xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
# Loops | Per-node | maqao lprof -dl -dn xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
# Loops | Per-process | maqao lprof -dl -dp xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
# Loops | Per-thread | maqao lprof -dl -dt xp=/home/kcamus/qaas_runs/170-247-8274/intel/Kripke/run/oneview_runs/defaults/orig/oneview_results_1702478619/tools/lprof_npsu_run_0 #
#############################################################################################################################################################################################