Run | Configuration |
---|---|
Run 2x1 | Number processes: 2; Number nodes: 1; Number processes per node: 2; Run Command: <executable> --groups 1024 --procs 2,1,1; MPI Command: mpirun -np <number_processes>; Dataset: (empty); Run Directory: /scratch_na/users/xoserete/qaas_runs/171-291-2973/intel/Kripke/run/oneview_runs/compilers/gcc_16/oneview_run_1712922456; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread; OMP_NUM_THREADS: 1 |
Run 2x2 | OMP_NUM_THREADS: 2; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x4 | OMP_NUM_THREADS: 4; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x8 | OMP_NUM_THREADS: 8; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x16 | OMP_NUM_THREADS: 16; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x32 | OMP_NUM_THREADS: 32; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x56 | OMP_NUM_THREADS: 56; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
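
The run settings above can be summarized as a small Python sketch that rebuilds the environment and launch line for each run. It only assembles the values; it does not launch anything. The function names (`build_env`, `build_command`, `describe_runs`) are hypothetical, and the sketch assumes that runs 2x2 through 2x56 reuse the process count and command listed for run 2x1, since their rows only list environment variables.

```python
# Thread counts taken from the run labels 2x1 ... 2x56 in the table above.
THREAD_COUNTS = [1, 2, 4, 8, 16, 32, 56]
# "Number processes: 2" from the 2x1 row; assumed unchanged for the other runs.
NUM_PROCESSES = 2


def build_env(omp_threads):
    """Return the environment variables listed for one run."""
    return {
        "OMP_NUM_THREADS": str(omp_threads),
        "I_MPI_PIN_DOMAIN": "auto:scatter",
        "OMP_PLACES": "threads",
        "OMP_PROC_BIND": "spread",
    }


def build_command(executable="<executable>"):
    """Assemble the MPI launch line; the executable stays a placeholder."""
    return ["mpirun", "-np", str(NUM_PROCESSES), executable,
            "--groups", "1024", "--procs", "2,1,1"]


def describe_runs(executable="<executable>"):
    """Return (label, env, command) for every run in the table."""
    runs = []
    for threads in THREAD_COUNTS:
        label = f"{NUM_PROCESSES}x{threads}"
        runs.append((label, build_env(threads), build_command(executable)))
    return runs


if __name__ == "__main__":
    for label, env, command in describe_runs():
        settings = " ".join(f"{key}={value}" for key, value in env.items())
        print(f"Run {label}: {settings} {' '.join(command)}")
```

The labels read as processes x threads per process, which agrees with the Nb Threads columns in the loop table (2, 4, 8, ..., 112).
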
Loop id | Source Location | Source Function | Level | Coverage 2x1 (%) | Coverage 2x2 (%) | Coverage 2x4 (%) | Coverage 2x8 (%) | Coverage 2x16 (%) | Coverage 2x32 (%) | Coverage 2x56 (%) | Max Time Over Threads 2x1 (s) | Max Time Over Threads 2x2 (s) | Max Time Over Threads 2x4 (s) | Max Time Over Threads 2x8 (s) | Max Time Over Threads 2x16 (s) | Max Time Over Threads 2x32 (s) | Max Time Over Threads 2x56 (s) | Time w.r.t. Wall Time 2x1 (s) | Time w.r.t. Wall Time 2x2 (s) | Time w.r.t. Wall Time 2x4 (s) | Time w.r.t. Wall Time 2x8 (s) | Time w.r.t. Wall Time 2x16 (s) | Time w.r.t. Wall Time 2x32 (s) | Time w.r.t. Wall Time 2x56 (s) | Nb Threads 2x1 | Nb Threads 2x2 | Nb Threads 2x4 | Nb Threads 2x8 | Nb Threads 2x16 | Nb Threads 2x32 | Nb Threads 2x56 | GFLOPS 2x1 | GFLOPS 2x2 | GFLOPS 2x4 | GFLOPS 2x8 | GFLOPS 2x16 | GFLOPS 2x32 | GFLOPS 2x56 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing 2x1 | Speedup If Perfect Load Balancing 2x2 | Speedup If Perfect Load Balancing 2x4 | Speedup If Perfect Load Balancing 2x8 | Speedup If Perfect Load Balancing 2x16 | Speedup If Perfect Load Balancing 2x32 | Speedup If Perfect Load Balancing 2x56 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (2x1) Efficiency | (2x1) Potential Speed-Up (%) | (2x2) Efficiency | (2x2) Potential Speed-Up (%) | (2x4) Efficiency | (2x4) Potential Speed-Up (%) | (2x8) Efficiency | (2x8) Potential Speed-Up (%) | (2x16) Efficiency | (2x16) Potential Speed-Up (%) | (2x32) Efficiency | (2x32) Potential Speed-Up (%) | (2x56) Efficiency | (2x56) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1082 | libkripke.so - forall.hpp:59-59 [...] | void RAJA::internal::StatementExecutor<RAJA::statement::Collapse<RAJA::omp_parallel_collapse_exec, camp::int_seq<long, 0l, 1l>, RAJA::statement::For<2l, RAJA::policy::loop::loop_exec, RAJA::statement::For<3l, RAJA::policy::loop::loop_exec... | InBetween | 90.91 | 90.21 | 89.74 | 89.03 | 87.52 | 82.97 | 77.95 | 1626.68 | 816.75 | 412.26 | 211.79 | 110.98 | 58.96 | 48.75 | 1623.87 | 818.72 | 416 | 214.94 | 112.5 | 61.03 | 50.77 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 2.72 | 5.40 | 10.63 | 20.58 | 39.34 | 72.54 | 87.18 | 6.67 | 13.33 | 2.2 | 2.43 | 8 | 1 | 1 | 1 | 1.01 | 1.02 | 1.03 | 1.1 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0.75 | 0.98 | 2.16 | 0.94 | 4.95 | 0.9 | 8.56 | 0.83 | 13.98 | 0.57 | 33.43 |
1020 | libkripke.so - forall.hpp:59-59 [...] | void RAJA::internal::StatementExecutor<RAJA::statement::Collapse<RAJA::omp_parallel_collapse_exec, camp::int_seq<long, 0l, 2l>, RAJA::statement::For<1l, RAJA::policy::loop::loop_exec, RAJA::statement::For<3l, RAJA::policy::loop::loop_exec... | Innermost | 3.08 | 3.36 | 3.5 | 3.87 | 4.75 | 6.5 | 6.44 | 55.16 | 31.53 | 16.7 | 9.61 | 6.1 | 4.75 | 4.71 | 55 | 30.48 | 16.22 | 9.34 | 6.1 | 4.78 | 4.2 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 3.66 | 6.60 | 12.41 | 21.55 | 33.00 | 42.10 | 47.91 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1.04 | 1.04 | 1.05 | 1.04 | 1.06 | 1.28 | 1 | 2 | 0 | 0 | 0 | 1 | 0 | 0.9 | 0.33 | 0.85 | 0.53 | 0.74 | 1.02 | 0.56 | 2.07 | 0.36 | 4.16 | 0.23 | 4.93 |
903 | libkripke.so - forall.hpp:59-59 [...] | void RAJA::internal::StatementExecutor<RAJA::statement::Collapse<RAJA::omp_parallel_collapse_exec, camp::int_seq<long, 0l, 2l>, RAJA::statement::For<1l, RAJA::policy::loop::loop_exec, RAJA::statement::For<3l, RAJA::policy::loop::loop_exec... | Innermost | 3.01 | 3.06 | 3.16 | 3.36 | 3.5 | 5.18 | 6.87 | 53.94 | 27.73 | 15.16 | 8.07 | 4.58 | 3.82 | 5.07 | 53.74 | 27.81 | 14.66 | 8.11 | 4.5 | 3.81 | 4.47 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 3.75 | 7.24 | 13.73 | 24.83 | 44.74 | 52.84 | 45.02 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1 | 1.05 | 1.02 | 1.06 | 1.07 | 1.3 | 1 | 2 | 0 | 0 | 0 | 1 | 0 | 0.97 | 0.1 | 0.92 | 0.26 | 0.83 | 0.58 | 0.75 | 0.89 | 0.44 | 2.9 | 0.21 | 5.4 |
1036 | libkripke.so - forall.hpp:59-59 [...] | void RAJA::internal::StatementExecutor<RAJA::statement::Collapse<RAJA::omp_parallel_collapse_exec, camp::int_seq<long, 0l, 1l>, RAJA::statement::For<2l, RAJA::policy::loop::loop_exec, RAJA::statement::Lambda<0l> > > >::exec<... | Innermost | 0.24 | 0.24 | 0.24 | 0.23 | 0.23 | 0.36 | 0.4 | 4.28 | 2.21 | 1.12 | 0.57 | 0.3 | 0.28 | 0.29 | 4.27 | 2.2 | 1.11 | 0.57 | 0.3 | 0.27 | 0.26 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 2.83 | 5.49 | 10.88 | 21.17 | 40.27 | 44.79 | 46.52 | 0 | 12.5 | 1 | 1.95 | 8 | 1 | 1.01 | 1.02 | 1.04 | 1.03 | 1.12 | 1.26 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 0.97 | 0.01 | 0.96 | 0.01 | 0.94 | 0.01 | 0.89 | 0.03 | 0.49 | 0.18 | 0.29 | 0.28 |
1083 | libkripke.so - forall.hpp:59-59 [...] | void RAJA::internal::StatementExecutor<RAJA::statement::Collapse<RAJA::omp_parallel_collapse_exec, camp::int_seq<long, 0l, 1l>, RAJA::statement::For<2l, RAJA::policy::loop::loop_exec, RAJA::statement::For<3l, RAJA::policy::loop::loop_exec... | InBetween | 0.21 | 0.12 | 0.13 | 0.1 | 0.08 | 0.05 | 0.05 | 3.83 | 1.12 | 0.89 | 0.33 | 0.16 | 0.09 | 0.1 | 3.7 | 1.09 | 0.58 | 0.24 | 0.1 | 0.04 | 0.03 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 3.13 | 6.11 | 12.06 | 22.26 | 37.25 | 55.26 | 100.54 | 0 | 12.5 | 1 | 1 | 8 | 1.04 | 1.04 | 1.56 | 1.43 | 1.6 | 3 | 3.33 | NA | NA | NA | NA | NA | 1 | 0 | 1.7 | 0 | 1.59 | 0 | 1.93 | 0 | 2.31 | 0 | 2.89 | 0 | 2.2 | 0 |
1120 | libkripke.so - forall.hpp:59-59 [...] | void RAJA::internal::StatementExecutor<RAJA::statement::Collapse<RAJA::omp_parallel_collapse_exec, camp::int_seq<long, 0l, 1l>, RAJA::statement::For<2l, RAJA::policy::loop::loop_exec, RAJA::statement::For<3l, RAJA::policy::loop::loop_exec... | Innermost | 0.12 | 0.15 | 0.15 | 0.15 | 0.14 | 0.14 | 0.11 | 2.11 | 1.48 | 0.76 | 0.4 | 0.23 | 0.18 | 0.1 | 2.11 | 1.37 | 0.7 | 0.36 | 0.18 | 0.1 | 0.07 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 4.07 | 6.25 | 12.25 | 24.10 | 47.99 | 88.34 | 124.53 | 0 | 12.5 | 1 | 1 | 8 | 1 | 1.09 | 1.1 | 1.14 | 1.35 | 2 | 1.67 | NA | NA | NA | NA | NA | 1 | 0 | 0.77 | 0.03 | 0.75 | 0.04 | 0.73 | 0.04 | 0.73 | 0.04 | 0.66 | 0.05 | 0.54 | 0.05 |
1119 | libkripke.so - forall.hpp:59-59 [...] | void RAJA::internal::StatementExecutor<RAJA::statement::Collapse<RAJA::omp_parallel_collapse_exec, camp::int_seq<long, 0l, 1l>, RAJA::statement::For<2l, RAJA::policy::loop::loop_exec, RAJA::statement::For<3l, RAJA::policy::loop::loop_exec... | InBetween | 0.01 | 0.02 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.22 | 0.15 | 0.09 | 0.05 | 0.03 | 0.02 | 0.02 | 0.22 | 0.14 | 0.07 | 0.04 | 0.02 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 112 | 2.47 | 3.38 | 6.49 | 11.01 | 24.50 | 48.65 | 43.40 | 0 | 11.67 | 1 | 1 | 11.82 | 1.05 | 1.07 | 1.29 | 1.67 | 1.5 | 2 | 2 | NA | NA | NA | NA | NA | 1 | 0 | 0.79 | 0 | 0.79 | 0 | 0.69 | 0 | 0.69 | 0 | 0.69 | 0 | 0.39 | 0.01 |
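
The Efficiency columns appear consistent with a ratio of wall times scaled by the thread multiplier, that is, T(2x1) / (N × T(2xN)). This is an inference from the numbers in the table, not a documented formula of the reporting tool. The Python sketch below checks it for loop 1082, using the "Time w.r.t. Wall Time" values copied from that row; the function name `parallel_efficiency` is hypothetical.

```python
# "Time w.r.t. Wall Time" (s) for loop 1082, keyed by OpenMP threads per process.
WALL_TIME_S = {
    1: 1623.87,
    2: 818.72,
    4: 416.0,
    8: 214.94,
    16: 112.5,
    32: 61.03,
    56: 50.77,
}


def parallel_efficiency(wall_time_s, base_threads=1):
    """Efficiency relative to the base run: T_base / (scale * T_n).

    `scale` is the thread count divided by the base thread count.
    Assumption: this matches the report's Efficiency column; it is
    inferred from the data, not taken from the tool's documentation.
    """
    base_time = wall_time_s[base_threads]
    result = {}
    for threads, time_s in wall_time_s.items():
        scale = threads / base_threads
        result[threads] = base_time / (scale * time_s)
    return result


if __name__ == "__main__":
    for threads, value in parallel_efficiency(WALL_TIME_S).items():
        print(f"2x{threads}: efficiency = {value:.2f}")
```

For loop 1082 this reproduces the reported values 1, 0.99, 0.98, 0.94, 0.9, 0.83 and 0.57. The same ratio also yields the values above 1 reported for loop 1083 (for example 3.7 / (2 × 1.09) ≈ 1.7), which suggests that loop's 2x1 wall time is disproportionately long compared with the scaled runs.
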