options

Executable Output


* [MAQAO] Info: Detected 1 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =      1 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.7581
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2509

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       133971025
 --  (4) Integer space for factors (estimated)  =        11630484
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105704
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =               0
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 5.014D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Total space in MBytes, IC factorization      (INFOG(17)):            1682
    Total space in MBytes,  OOC factorization    (INFOG(27)):             532

 Elapsed time in analysis driver=      10.5192
 Analysis time by clock_gettime(): 10.519 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =      1 and #OMP =      2

 Elapsed time in save structure driver=       0.0002
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =      1 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0



****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               1
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       133971025
 INFOG(4) Integer space for factors (estim.)=        11630484
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105704
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

* [MAQAO] Info: STARTING COUNTERS (igk-0805)

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0718
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Effective size of S     (based on INFO(39))=             86359241

 Redistrib: total data local/sent           =               0               0
 Elapsed time to reformat/distribute matrix =      0.0887


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =       230000
 Size of async. emission buffer (bytes).. =       230012
 Small emission buffer (bytes) .......... =           20

 ** Memory allocated, total in Mbytes           (INFOG(19)):        1682
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        1522

 Flops under L0 layer                       =   4.182D+09
 Elapsed time under L0                      =      1.3515

 Elapsed time for factorization                     =      3.7796

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.569D+08
 ------ (3) Operations in node elimination          = 5.014D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       133971025
 INFOG (10) Integer space for factors               =        11630484
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               0

 Elapsed time in factorization driver               =      3.9526
 Factorization time by clock_gettime(): 3.9526 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      1 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_0  #
#################################################################################################################################################################


* [MAQAO] Info: Detected 2 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =      2 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.6876
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2464
 A root of estimated size         1800  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       135429025
 --  (4) Integer space for factors (estimated)  =        11632304
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105704
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =               1
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 5.208D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):             906
    Total space in MBytes, IC factorization      (INFOG(17)):            1804
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):             344
    Total space in MBytes,  OOC factorization    (INFOG(27)):             677

 Elapsed time in analysis driver=      10.6318
 Analysis time by clock_gettime(): 10.632 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =      2 and #OMP =      2

 Elapsed time in save structure driver=       0.0003
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =      2 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0



****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               2
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       135429025
 INFOG(4) Integer space for factors (estim.)=        11632304
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105704

* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0717
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Average Effective size of S     (based on INFO(39))=             47025740
 Elapsed time to reformat/distribute matrix =      0.1133


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =      1448404
 Size of async. emission buffer (bytes).. =      5808091
 Small emission buffer (bytes) .......... =          212
 ** Memory allocated, max in Mbytes             (INFOG(18)):         906

 ** Memory allocated, total in Mbytes           (INFOG(19)):        1804
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):         809
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        1618

 Flops under L0 layer  (avg/max across MPI) =   2.088D+09   2.097D+09
 Elapsed time under L0 (avg/max across MPI) =      0.6762      0.6933

 Elapsed time to process root node                  =      0.0681

 Elapsed time for factorization                     =      1.9805

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.570D+08
 ------ (3) Operations in node elimination          = 5.208D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       135429025
 INFOG (10) Integer space for factors               =        11632318
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               0

 Elapsed time in factorization driver               =      2.1805
 Factorization time by clock_gettime(): 2.1806 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      2 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_1  #
#################################################################################################################################################################


* [MAQAO] Info: Detected 4 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =      4 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.7388
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2446
 A root of estimated size         1800  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       135426089
 --  (4) Integer space for factors (estimated)  =        11644980
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105704
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =               4
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 5.208D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):             517
    Total space in MBytes, IC factorization      (INFOG(17)):            1922
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):             219
    Total space in MBytes,  OOC factorization    (INFOG(27)):             844

 Elapsed time in analysis driver=      10.7034
 Analysis time by clock_gettime(): 10.703 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =      4 and #OMP =      2
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

 Elapsed time in save structure driver=       0.0004
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =      4 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0



****** FACTORIZATION STEP ********

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               4
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       135426089
 INFOG(4) Integer space for factors (estim.)=        11644980
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105704
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

* [MAQAO] Info: STARTING COUNTERS (igk-0805)

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0724
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Average Effective size of S     (based on INFO(39))=             24435497
 Elapsed time to reformat/distribute matrix =      0.0976


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =       996472
 Size of async. emission buffer (bytes).. =      3995845
 Small emission buffer (bytes) .......... =          704
 ** Memory allocated, max in Mbytes             (INFOG(18)):         517

 ** Memory allocated, total in Mbytes           (INFOG(19)):        1915
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):         462
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        1715

 Flops under L0 layer  (avg/max across MPI) =   1.084D+09   1.470D+09
 Elapsed time under L0 (avg/max across MPI) =      0.3459      0.3596

 Elapsed time to process root node                  =      0.0572

 Elapsed time for factorization                     =      1.1787

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.574D+08
 ------ (3) Operations in node elimination          = 5.208D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       135429025
 INFOG (10) Integer space for factors               =        11642651
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               0

 Elapsed time in factorization driver               =      1.3592
 Factorization time by clock_gettime(): 1.3675 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      4 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_2  #
#################################################################################################################################################################


* [MAQAO] Info: Detected 8 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =      8 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.8071
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2545
 A root of estimated size         1800  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       135424651
 --  (4) Integer space for factors (estimated)  =        11656815
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105704
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =               8
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 5.208D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):             413
    Total space in MBytes, IC factorization      (INFOG(17)):            2260
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):             166
    Total space in MBytes,  OOC factorization    (INFOG(27)):            1153

 Elapsed time in analysis driver=      10.8473
 Analysis time by clock_gettime(): 10.847 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =      8 and #OMP =      2
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

 Elapsed time in save structure driver=       0.0004
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =      8 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0


* [MAQAO] Info: STARTING COUNTERS (igk-0805)


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =               8
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       135424651
 INFOG(4) Integer space for factors (estim.)=        11656815
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105704
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0742
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Average Effective size of S     (based on INFO(39))=             14231871
 Elapsed time to reformat/distribute matrix =      0.0976


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =      1397848
 Size of async. emission buffer (bytes).. =      5605370
 Small emission buffer (bytes) .......... =         2000
 ** Memory allocated, max in Mbytes             (INFOG(18)):         413

 ** Memory allocated, total in Mbytes           (INFOG(19)):        2234
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):         358
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        1969

 Flops under L0 layer  (avg/max across MPI) =   8.017D+08   2.058D+09
 Elapsed time under L0 (avg/max across MPI) =      0.1735      0.2050

 Elapsed time to process root node                  =      0.0387

 Elapsed time for factorization                     =      0.8909

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.579D+08
 ------ (3) Operations in node elimination          = 5.208D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       135429025
 INFOG (10) Integer space for factors               =        11653024
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               0

 Elapsed time in factorization driver               =      1.0889
 Factorization time by clock_gettime(): 1.0922 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =      8 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_3  #
#################################################################################################################################################################


* [MAQAO] Info: Detected 16 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =     16 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.7050
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2458
 A root of estimated size         1800  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       135425397
 --  (4) Integer space for factors (estimated)  =        11680734
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105704
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =              18
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 5.208D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):             237
    Total space in MBytes, IC factorization      (INFOG(17)):            2754
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):             136
    Total space in MBytes,  OOC factorization    (INFOG(27)):            1742

 Elapsed time in analysis driver=      10.6851
 Analysis time by clock_gettime(): 10.685 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =     16 and #OMP =      2
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

 Elapsed time in save structure driver=       0.0006
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =     16 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0


* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              16
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       135425397
 INFOG(4) Integer space for factors (estim.)=        11680734
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105704
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0760
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Average Effective size of S     (based on INFO(39))=              8127468
 Elapsed time to reformat/distribute matrix =      0.1096


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =      1407380
 Size of async. emission buffer (bytes).. =      5643590
 Small emission buffer (bytes) .......... =         6552
 ** Memory allocated, max in Mbytes             (INFOG(18)):         237

 ** Memory allocated, total in Mbytes           (INFOG(19)):        2749
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):         207
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        2435

 Flops under L0 layer  (avg/max across MPI) =   3.237D+08   5.839D+08
 Elapsed time under L0 (avg/max across MPI) =      0.0943      0.1209

 Elapsed time to process root node                  =      0.0368

 Elapsed time for factorization                     =      0.5000

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.574D+08
 ------ (3) Operations in node elimination          = 5.208D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       135429025
 INFOG (10) Integer space for factors               =        11673289
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               0

 Elapsed time in factorization driver               =      0.7045
 Factorization time by clock_gettime(): 0.7059 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     16 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_4  #
#################################################################################################################################################################


* [MAQAO] Info: Detected 32 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =     32 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.9837
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2724
 A root of estimated size         1800  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       135389807
 --  (4) Integer space for factors (estimated)  =        11777155
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105704
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =              37
 Number of split nodes                          =               0
 RINFOG(1) Operations during elimination (estim)= 5.208D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):             146
    Total space in MBytes, IC factorization      (INFOG(17)):            3490
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):             111
    Total space in MBytes,  OOC factorization    (INFOG(27)):            2568

 Elapsed time in analysis driver=      11.0977
 Analysis time by clock_gettime(): 11.098 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =     32 and #OMP =      2

 Elapsed time in save structure driver=       0.0010
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =     32 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              32
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       135389807
 INFOG(4) Integer space for factors (estim.)=        11777155
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105704
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0833
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Average Effective size of S     (based on INFO(39))=              3938083
 Elapsed time to reformat/distribute matrix =      0.1167


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =      1110616
 Size of async. emission buffer (bytes).. =      4453564
 Small emission buffer (bytes) .......... =        23316
 ** Memory allocated, max in Mbytes             (INFOG(18)):         148

 ** Memory allocated, total in Mbytes           (INFOG(19)):        3439
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):         139
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        3134

 Flops under L0 layer  (avg/max across MPI) =   1.628D+08   3.847D+08
 Elapsed time under L0 (avg/max across MPI) =      0.0592      0.0763

 Elapsed time to process root node                  =      0.0362

 Elapsed time for factorization                     =      0.3102

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.572D+08
 ------ (3) Operations in node elimination          = 5.208D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       135429025
 INFOG (10) Integer space for factors               =        11745001
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =               9

 Elapsed time in factorization driver               =      0.5293
 Factorization time by clock_gettime(): 0.5468 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     32 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_5  #
#################################################################################################################################################################


* [MAQAO] Info: Detected 64 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =     64 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.9860
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2750
 A root of estimated size         1800  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       135362831
 --  (4) Integer space for factors (estimated)  =        11930065
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105705
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =              72
 Number of split nodes                          =               1
 RINFOG(1) Operations during elimination (estim)= 5.208D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):             152
    Total space in MBytes, IC factorization      (INFOG(17)):            5160
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):             139
    Total space in MBytes,  OOC factorization    (INFOG(27)):            4409

 Elapsed time in analysis driver=      11.1350
 Analysis time by clock_gettime(): 11.136 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =     64 and #OMP =      2
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

 Elapsed time in save structure driver=       0.0012
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =     64 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              64
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       135362831
 INFOG(4) Integer space for factors (estim.)=        11930065
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105705
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0783
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Average Effective size of S     (based on INFO(39))=              2148986
 Elapsed time to reformat/distribute matrix =      0.1298


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =      1111056
 Size of async. emission buffer (bytes).. =      4455335
 Small emission buffer (bytes) .......... =        87504
 ** Memory allocated, max in Mbytes             (INFOG(18)):         153

 ** Memory allocated, total in Mbytes           (INFOG(19)):        5022
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):         151
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        4690

 Flops under L0 layer  (avg/max across MPI) =   6.761D+07   1.443D+08
 Elapsed time under L0 (avg/max across MPI) =      0.0433      0.0521

 Elapsed time to process root node                  =      0.0434

 Elapsed time for factorization                     =      0.2514

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.581D+08
 ------ (3) Operations in node elimination          = 5.208D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       135429025
 INFOG (10) Integer space for factors               =        11867015
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =              30

 Elapsed time in factorization driver               =      0.4859
 Factorization time by clock_gettime(): 0.5060 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     64 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_6  #
#################################################################################################################################################################


* [MAQAO] Info: Detected 86 Lprof instances in igk-0805. 
If this is incorrect, rerun with number-processes-per-node=X

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   1     1585478        4623152
      executing #MPI =     86 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 =================================================
 MUMPS compiled with option -Dmetis
 MUMPS compiled with option -Dpord
 MUMPS compiled with option -Dptscotch
 MUMPS compiled with option -Dscotch
 =================================================
L D L^T Solver for general symmetric matrices
Type of parallelism: Working host

 ****** ANALYSIS STEP ********

 Processing a graph of size:   1585478
 Average density of rows/columns =    4
 Ordering based on METIS
 ELAPSED TIME SPENT IN METIS reordering  =      9.9185
 SYMBOLIC based on column counts 
 ELAPSED TIME IN symbolic factorization  =      0.2768
 A root of estimated size         1800  has been selected for Scalapack.

Leaving analysis phase with  ...
 INFOG(1)                                       =               0
 INFOG(2)                                       =               0
 -- (20) Number of entries in factors (estim.)  =       120195786
 --  (3) Real space for factors    (estimated)  =       135324459
 --  (4) Integer space for factors (estimated)  =        12065936
 --  (5) Maximum frontal size      (estimated)  =            2159
 --  (6) Number of nodes in the tree            =          105708
 -- (32) Type of analysis effectively used      =               1
 --  (7) Ordering option effectively used       =               5
 ICNTL (6) Maximum transversal option           =               0
 ICNTL (7) Pivot order option                   =               7
 ICNTL(12) Ordering symmetric indef. matrices   =               1
 ICNTL(13) Parallelism/splitting of root node   =               0
 ICNTL(14) Percentage of memory relaxation      =              30
 ICNTL(15) Analysis by block effectively used   =               0
 ICNTL(18) Distributed input matrix (on if >0)  =               0
 ICNTL(32) Forward elimination during facto.    =               0
 ICNTL(35) BLR activation                       =               0
 ICNTL(48) Tree based multithreading (effective)=               1
 ICNTL(58) Symbolic factorization option        =               2
 Number of level 2 nodes                        =             111
 Number of split nodes                          =               4
 RINFOG(1) Operations during elimination (estim)= 5.208D+10

 MEMORY ESTIMATIONS ... 
 Estimations with standard Full-Rank (FR) factorization:
    Maximum estim. space in Mbytes, IC facto.    (INFOG(16)):             120
    Total space in MBytes, IC factorization      (INFOG(17)):            6288
    Maximum estim. space in Mbytes, OOC facto.   (INFOG(26)):             111
    Total space in MBytes,  OOC factorization    (INFOG(27)):            5682

 Elapsed time in analysis driver=      11.0986
 Analysis time by clock_gettime(): 11.100 s

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   7     1585478        4623152
      executing #MPI =     86 and #OMP =      2

 Elapsed time in save structure driver=       0.0017
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
 On return from DMUMPS, INFOG(1)=             -71
 On return from DMUMPS, INFOG(2)=               0
PRE FACTO START LPROF----------------------

Entering DMUMPS 5.8.2 from C interface with JOB, N, NNZ =   2     1585478        4623152
      executing #MPI =     86 and #OMP =      2

Advanced settings:
   KEEP(370) Static mapping              =         1
   KEEP(371) Advanced optimizations      =         0

 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------

* [MAQAO] Info: STARTING COUNTERS (igk-0805)
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------
 ** ERROR RETURN ** FROM DMUMPS INFO(1)=  -71
 ** INFO(2)=               0
PRE FACTO START LPROF----------------------


****** FACTORIZATION STEP ********

 GLOBAL STATISTICS PRIOR NUMERICAL FACTORIZATION ...
 Number of working processes                =              86
 ICNTL(22) Out-of-core option               =               0
 ICNTL(35) BLR activation (eff. choice)     =               0
 ICNTL(37) BLR CB compression (eff. choice) =               0
 ICNTL(49) Compact workarray S (end facto.) =               0
 ICNTL(56) Effective value during facto.    =               0
 ICNTL(14) Memory relaxation                =              30
 INFOG(3) Real space for factors (estimated)=       135324459
 INFOG(4) Integer space for factors (estim.)=        12065936
 Maximum frontal size (estimated)           =            2159
 Number of nodes in the tree                =          105708
 ICNTL(23) Memory allowed (value on host)   =               0
           Sum over all procs               =               0
 Memory provided by user, sum of LWK_USER   =               0
 Effective threshold for pivoting, CNTL(1)  =      0.1000D-01

 Statistics on the scaling phase
 Elapsed time for scaling                   =      0.0792
 Max difference from 1 after scaling the entries for ONE-NORM (option 7/8)   = 0.37D-01

 Average Effective size of S     (based on INFO(39))=              1637286
 Elapsed time to reformat/distribute matrix =      0.1361


 Allocated buffers
 ------------------
 Size of reception buffer in bytes ...... =      1073908
 Size of async. emission buffer (bytes).. =      4306365
 Small emission buffer (bytes) .......... =       155692
 ** Memory allocated, max in Mbytes             (INFOG(18)):         123

 ** Memory allocated, total in Mbytes           (INFOG(19)):        6057
 ** Memory effectively used, max in  Mbytes     (INFOG(21)):         120
 ** Memory effectively used, total in Mbytes    (INFOG(22)):        5748

 Flops under L0 layer  (avg/max across MPI) =   4.851D+07   8.487D+07
 Elapsed time under L0 (avg/max across MPI) =      0.0466      0.0539

 Elapsed time to process root node                  =      0.0436

 Elapsed time for factorization                     =      0.2274

Leaving factorization with ...
 RINFOG (2) Operations in node assembly             = 1.616D+08
 ------ (3) Operations in node elimination          = 5.208D+10
 ICNTL  (8) Scaling effectively used                =               7
 INFOG  (9) Real space for factors                  =       135418141
 INFOG (10) Integer space for factors               =        11973412
 INFOG (11) Maximum front size                      =            2159
 INFOG (29) Number of entries in factors            =       120195786
 INFOG (12) Number of negative pivots               =               0
 INFOG (13) Number of delayed pivots                =               0
 Number of 2x2 pivots in type 1 nodes               =               0
 Number of 2x2 pivots in type 2 nodes               =               0
 RINFOG(19) Smallest pivot WITH perturbed pivots    = 6.804D-03
 RINFOG(20) Smallest pivot WITHOUT perturbed pivots = 6.804D-03
 RINFOG(21) Largest pivot in absolute value         = 9.942D-01
 INFOG (24) Effective value of ICNTL(12)            =               1
 INFOG (14) Number of memory compress               =              69

 Elapsed time in factorization driver               =      0.4773
 Factorization time by clock_gettime(): 0.5049 s

Entering DMUMPS 5.8.2 from C interface with JOB =  -2
      executing #MPI =     86 and #OMP =      2


Your experiment path is /home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7

To display your profiling results:
#################################################################################################################################################################
#    LEVEL    |     REPORT     |                                                            COMMAND                                                             #
#################################################################################################################################################################
#  Functions  |  Cluster-wide  |  maqao lprof -df xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7      #
#  Functions  |  Per-node      |  maqao lprof -df -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Functions  |  Per-process   |  maqao lprof -df -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Functions  |  Per-thread    |  maqao lprof -df -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Loops      |  Cluster-wide  |  maqao lprof -dl xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7      #
#  Loops      |  Per-node      |  maqao lprof -dl -dn xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Loops      |  Per-process   |  maqao lprof -dl -dp xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#  Loops      |  Per-thread    |  maqao lprof -dl -dt xp=/home/mlkaps_org/kevin/matrices/test_m1-86_o2_G3circuit_allowextra_scala_kptr_probe/tools/lprof_run_7  #
#################################################################################################################################################################

×