options

Executable Output


* [MAQAO] Info: Detected 1 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =      1 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     26.8151
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.8556

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5834945198
 --  (4) Integer space for factors (estimated)  =        24677058
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73417
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =               0
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Total space in MBytes, IC factorization      (INFOG(17)):           69128
    Total space in MBytes,  OOC factorization    (INFOG(27)):           23939

 Elapsed time in analysis driver=      28.7580
 Analysis time by clock_gettime(): 28.757 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =      1 and #OMP =      2

 Elapsed time in save structure driver=       0.0003
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =      1 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0



****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               1
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5834945198
 INFOG(4) Integer space for factors (estim.)=        24677058
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73417
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

* [MAQAO] Info: STARTING COUNTERS (igk-0805)

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4220
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Effective size of S     (based on INFO(39))=           4152258345

 Redistrib: total data local/sent           =               0               0
 Elapsed time to reformat/distribute matrix =      0.7685


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =       230000
 Size of async. emission buffer (bytes).. =       497458
 Small emission buffer (bytes) .......... =           20

 ** Memory allocated, total in Mbytes           (INFOG(19)):       69127
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       61462

 Flops under L0 layer                       =   9.088D+12
 Elapsed time under L0                      =     90.7857

 Elapsed time for factorization                     =    532.1767

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.737D+10
 ------ (3) Operations in node elimination          = 4.267D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834945198
 INFOG (10) Integer space for factors               =        24677058
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834945198
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               0

 Elapsed time in factorization driver               =    533.3952
 Factorization time by clock_gettime(): 533.3771 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      1 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_0  #
##########################################################################################################################################################


* [MAQAO] Info: Detected 2 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =      2 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     26.9604
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.8583
 A root of estimated size         2246  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5834945198
 --  (4) Integer space for factors (estimated)  =        24721137
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73417
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =               5
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):           36346
    Total space in MBytes, IC factorization      (INFOG(17)):           72493
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):           12283
    Total space in MBytes,  OOC factorization    (INFOG(27)):           24178

 Elapsed time in analysis driver=      29.1549
 Analysis time by clock_gettime(): 29.154 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =      2 and #OMP =      2

 Elapsed time in save structure driver=       0.0004
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =      2 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0



****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               2
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5834945198
 INFOG(4) Integer space for factors (estim.)=        24721137
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73417
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4194
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Average Effective size of S     (based on INFO(39))=           2284398105
 Elapsed time to reformat/distribute matrix =      0.8982


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =     12499728
 Size of async. emission buffer (bytes).. =     37624178
 Small emission buffer (bytes) .......... =          260
 ** Memory allocated, max in Mbytes             (INFOG(18)):       36346

 ** Memory allocated, total in Mbytes           (INFOG(19)):       72493
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):       31785
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       62770

 Flops under L0 layer  (avg/max across MPI) =   4.493D+12   4.498D+12
 Elapsed time under L0 (avg/max across MPI) =     46.2583     47.0539

 Elapsed time to process root node                  =      0.0927

 Elapsed time for factorization                     =    277.4659

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.737D+10
 ------ (3) Operations in node elimination          = 4.267D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834945198
 INFOG (10) Integer space for factors               =        24721207
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834945198
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               0

 Elapsed time in factorization driver               =    278.8144
 Factorization time by clock_gettime(): 278.8050 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      2 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_1  #
##########################################################################################################################################################


* [MAQAO] Info: Detected 4 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =      4 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     26.9296
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.8577
 A root of estimated size         2246  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5835002122
 --  (4) Integer space for factors (estimated)  =        24875262
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73417
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =              12
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):           22258
    Total space in MBytes, IC factorization      (INFOG(17)):           76423
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):            9442
    Total space in MBytes,  OOC factorization    (INFOG(27)):           33349

 Elapsed time in analysis driver=      29.1503
 Analysis time by clock_gettime(): 29.150 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =      4 and #OMP =      2

 Elapsed time in save structure driver=       0.0004
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =      4 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********


* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               4
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5835002122
 INFOG(4) Integer space for factors (estim.)=        24875262
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73417
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4189
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Average Effective size of S     (based on INFO(39))=           1255422207
 Elapsed time to reformat/distribute matrix =      0.7897


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =     16407500
 Size of async. emission buffer (bytes).. =     49386576
 Small emission buffer (bytes) .......... =          704
 ** Memory allocated, max in Mbytes             (INFOG(18)):       22258

 ** Memory allocated, total in Mbytes           (INFOG(19)):       76247
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):       18833
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       66508

 Flops under L0 layer  (avg/max across MPI) =   2.256D+12   2.694D+12
 Elapsed time under L0 (avg/max across MPI) =     23.2215     25.3751

 Elapsed time to process root node                  =      0.0718

 Elapsed time for factorization                     =    186.3729

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.737D+10
 ------ (3) Operations in node elimination          = 4.267D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834945198
 INFOG (10) Integer space for factors               =        24875740
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834945198
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               2

 Elapsed time in factorization driver               =    187.6115
 Factorization time by clock_gettime(): 187.6210 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      4 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_2  #
##########################################################################################################################################################


* [MAQAO] Info: Detected 8 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =      8 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     27.0658
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.8733
 A root of estimated size         2246  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5835120104
 --  (4) Integer space for factors (estimated)  =        25260474
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73421
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =              32
 Number of split nodes                          =               4
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):           12885
    Total space in MBytes, IC factorization      (INFOG(17)):           84710
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):            5657
    Total space in MBytes,  OOC factorization    (INFOG(27)):           41019

 Elapsed time in analysis driver=      29.3505
 Analysis time by clock_gettime(): 29.350 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =      8 and #OMP =      2

 Elapsed time in save structure driver=       0.0004
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =      8 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               8
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5835120104
 INFOG(4) Integer space for factors (estim.)=        25260474
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73421
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4231
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Average Effective size of S     (based on INFO(39))=            769763472
 Elapsed time to reformat/distribute matrix =      0.8103


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =     17347740
 Size of async. emission buffer (bytes).. =     52216698
 Small emission buffer (bytes) .......... =         2192
 ** Memory allocated, max in Mbytes             (INFOG(18)):       12884

 ** Memory allocated, total in Mbytes           (INFOG(19)):       84706
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):       10983
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       72047

 Flops under L0 layer  (avg/max across MPI) =   9.795D+11   1.339D+12
 Elapsed time under L0 (avg/max across MPI) =     10.4781     12.8747

 Elapsed time to process root node                  =      0.0495

 Elapsed time for factorization                     =    125.0952

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.745D+10
 ------ (3) Operations in node elimination          = 4.267D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834883650
 INFOG (10) Integer space for factors               =        25261962
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834883650
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =              12

 Elapsed time in factorization driver               =    126.3883
 Factorization time by clock_gettime(): 126.3843 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      8 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_3  #
##########################################################################################################################################################


* [MAQAO] Info: Detected 16 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =     16 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     26.9826
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.8584
 A root of estimated size         2246  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5836407641
 --  (4) Integer space for factors (estimated)  =        25834265
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73427
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =              64
 Number of split nodes                          =              10
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):            7898
    Total space in MBytes, IC factorization      (INFOG(17)):           98243
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):            4863
    Total space in MBytes,  OOC factorization    (INFOG(27)):           56968

 Elapsed time in analysis driver=      29.2340
 Analysis time by clock_gettime(): 29.233 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =     16 and #OMP =      2
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

 Elapsed time in save structure driver=       0.0006
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =     16 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0


* [MAQAO] Info: STARTING COUNTERS (igk-0805)


****** FACTORIZATION STEP ********

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              16
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5836407641
 INFOG(4) Integer space for factors (estim.)=        25834265
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73427
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4284
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Average Effective size of S     (based on INFO(39))=            497836173
 Elapsed time to reformat/distribute matrix =      0.7825


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =     17347832
 Size of async. emission buffer (bytes).. =     52216975
 Small emission buffer (bytes) .......... =         6928
 ** Memory allocated, max in Mbytes             (INFOG(18)):        7897

 ** Memory allocated, total in Mbytes           (INFOG(19)):       98241
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):        5940
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       75715

 Flops under L0 layer  (avg/max across MPI) =   4.000D+11   5.108D+11
 Elapsed time under L0 (avg/max across MPI) =      4.7101      5.6842

 Elapsed time to process root node                  =      0.0435

 Elapsed time for factorization                     =     73.6564

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.780D+10
 ------ (3) Operations in node elimination          = 4.267D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834781770
 INFOG (10) Integer space for factors               =        25836857
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834781770
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =              15

 Elapsed time in factorization driver               =     74.9098
 Factorization time by clock_gettime(): 74.9139 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     16 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_4  #
##########################################################################################################################################################


* [MAQAO] Info: Detected 32 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =     32 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     27.6050
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.9506
 A root of estimated size         2246  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5842336943
 --  (4) Integer space for factors (estimated)  =        27117628
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73440
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =             114
 Number of split nodes                          =              23
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):            3790
    Total space in MBytes, IC factorization      (INFOG(17)):           99780
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):            2220
    Total space in MBytes,  OOC factorization    (INFOG(27)):           57395

 Elapsed time in analysis driver=      30.1142
 Analysis time by clock_gettime(): 30.114 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =     32 and #OMP =      2
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

 Elapsed time in save structure driver=       0.0010
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =     32 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              32
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5842336943
 INFOG(4) Integer space for factors (estim.)=        27117628
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73440
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4195
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Average Effective size of S     (based on INFO(39))=            261890559
 Elapsed time to reformat/distribute matrix =      0.9061


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =     17348028
 Size of async. emission buffer (bytes).. =     52217557
 Small emission buffer (bytes) .......... =        23800
 ** Memory allocated, max in Mbytes             (INFOG(18)):        3789

 ** Memory allocated, total in Mbytes           (INFOG(19)):       99764
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):        2930
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       76231

 Flops under L0 layer  (avg/max across MPI) =   1.212D+11   2.299D+11
 Elapsed time under L0 (avg/max across MPI) =      1.9627      2.9688

 Elapsed time to process root node                  =      0.0484

 Elapsed time for factorization                     =     46.8079

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.890D+10
 ------ (3) Operations in node elimination          = 4.267D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834607066
 INFOG (10) Integer space for factors               =        27118484
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834607066
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =              66

 Elapsed time in factorization driver               =     48.1828
 Factorization time by clock_gettime(): 48.1962 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     32 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_5  #
##########################################################################################################################################################


* [MAQAO] Info: Detected 64 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =     64 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     27.6822
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.9496
 A root of estimated size         2246  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5861381850
 --  (4) Integer space for factors (estimated)  =        29371244
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73463
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =             191
 Number of split nodes                          =              46
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):            1935
    Total space in MBytes, IC factorization      (INFOG(17)):          104095
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):            1366
    Total space in MBytes,  OOC factorization    (INFOG(27)):           65153

 Elapsed time in analysis driver=      30.2426
 Analysis time by clock_gettime(): 30.243 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =     64 and #OMP =      2

 Elapsed time in save structure driver=       0.0014
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =     64 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              64
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5861381850
 INFOG(4) Integer space for factors (estim.)=        29371244
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73463
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4296
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Average Effective size of S     (based on INFO(39))=            146024416
 Elapsed time to reformat/distribute matrix =      1.0952


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =     11053680
 Size of async. emission buffer (bytes).. =     33271576
 Small emission buffer (bytes) .......... =        87804
 ** Memory allocated, max in Mbytes             (INFOG(18)):        1934

 ** Memory allocated, total in Mbytes           (INFOG(19)):      104080
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):        1525
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       78276

 Flops under L0 layer  (avg/max across MPI) =   3.027D+10   6.404D+10
 Elapsed time under L0 (avg/max across MPI) =      1.0011      1.4745

 Elapsed time to process root node                  =      0.0573

 Elapsed time for factorization                     =     38.4197

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 2.184D+10
 ------ (3) Operations in node elimination          = 4.266D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834325164
 INFOG (10) Integer space for factors               =        29347872
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834325164
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =             187
 Extra copies due to In-Place stacking              =         6388623

 Elapsed time in factorization driver               =     40.0007
 Factorization time by clock_gettime(): 40.0144 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     64 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_6  #
##########################################################################################################################################################


* [MAQAO] Info: Detected 86 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1652680       34753577
      executing #MPI =     86 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L U Solver for unsymmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1652680
 ... Structural symmetry (in percent)=   69
 Average density of rows/columns =   26
 ... No column permutation
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =     27.5541
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.9508
 A root of estimated size         2246  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =      5834945198
 --  (3) Real space for factors    (estimated)  =      5896259226
 --  (4) Integer space for factors (estimated)  =        30604999
 --  (5) Maximum frontal size      (estimated)  =           17957
 --  (6) Number of nodes in the tree            =           73476
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =             228
 Number of split nodes                          =              59
 RINFOG(1) Operations during elimination (estim)= 4.267D+13

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):            1545
    Total space in MBytes, IC factorization      (INFOG(17)):          108305
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):            1139
    Total space in MBytes,  OOC factorization    (INFOG(27)):           72302

 Elapsed time in analysis driver=      30.1272
 Analysis time by clock_gettime(): 30.128 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1652680       34753577
      executing #MPI =     86 and #OMP =      2

 Elapsed time in save structure driver=       0.0014
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1652680       34753577
      executing #MPI =     86 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              86
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=      5896259226
 INFOG(4) Integer space for factors (estim.)=        30604999
 Maximum frontal size (estimated)           =           17957
 Number of nodes in the tree                =           73476
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.4247
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.48D+00

 Average Effective size of S     (based on INFO(39))=            116532406
 Elapsed time to reformat/distribute matrix =      1.1471


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =      8919844
 Size of async. emission buffer (bytes).. =     26848722
 Small emission buffer (bytes) .......... =       155248
 ** Memory allocated, max in Mbytes             (INFOG(18)):        1545

 ** Memory allocated, total in Mbytes           (INFOG(19)):      108293
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):        1196
 ** Memory effectively used, total in Mbytes    (INFOG(22)):       80033

 Flops under L0 layer  (avg/max across MPI) =   1.623D+10   3.546D+10
 Elapsed time under L0 (avg/max across MPI) =      0.7999      1.2722

 Elapsed time to process root node                  =      0.0558

 Elapsed time for factorization                     =     39.1043

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 2.362D+10
 ------ (3) Operations in node elimination          = 4.265D+13
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =      5834174840
 INFOG (10) Integer space for factors               =        30466746
 INFOG (11) Maximum front size                      =           17957
 INFOG (29) Number of entries in factors            =      5834174840
 INFOG (12) Number of off diagonal pivots           =               0
 INFOG (13) Number of delayed pivots                =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 2.946D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 2.946D-03
 RINFOG(21) Largest pivot in absolute value         = 3.637D+00
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =             248
 Extra copies due to In-Place stacking              =         7438330

 Elapsed time in factorization driver               =     40.7384
 Factorization time by clock_gettime(): 40.7559 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     86 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7

To display your profiling results:
##########################################################################################################################################################
#    LEVEL    |     REPORT     |                                                         COMMAND                                                         #
##########################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_ss_allowextra_scala_kptr_probe/tools/lprof_run_7  #
##########################################################################################################################################################

×