Run | Configuration |
---|---|
Run 2x1 | Number processes: 2; Number nodes: 1; Number processes per node: 2; Run Command: `<executable> --groups 1024 --procs 2,1,1`; MPI Command: `mpirun -np <number_processes>`; Dataset: ; Run Directory: /beegfs/hackathon/users/eoseret/qaas_runs/170-850-6313/intel/Kripke/run/oneview_runs/compilers/gcc_13/oneview_run_1708510989; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread; OMP_NUM_THREADS: 1 |
Run 2x2 | OMP_NUM_THREADS: 2; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x4 | OMP_NUM_THREADS: 4; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x8 | OMP_NUM_THREADS: 8; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x16 | OMP_NUM_THREADS: 16; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x32 | OMP_NUM_THREADS: 32; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x64 | OMP_NUM_THREADS: 64; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |
Run 2x96 | OMP_NUM_THREADS: 96; I_MPI_PIN_DOMAIN: auto:scatter; OMP_PLACES: threads; OMP_PROC_BIND: spread |

Loop id | Source Location | Source Function | Level | Coverage 2x1 (%) | Coverage 2x2 (%) | Coverage 2x4 (%) | Coverage 2x8 (%) | Coverage 2x16 (%) | Coverage 2x32 (%) | Coverage 2x64 (%) | Coverage 2x96 (%) | Max Time Over Threads 2x1 (s) | Max Time Over Threads 2x2 (s) | Max Time Over Threads 2x4 (s) | Max Time Over Threads 2x8 (s) | Max Time Over Threads 2x16 (s) | Max Time Over Threads 2x32 (s) | Max Time Over Threads 2x64 (s) | Max Time Over Threads 2x96 (s) | Time w.r.t. Wall Time 2x1 (s) | Time w.r.t. Wall Time 2x2 (s) | Time w.r.t. Wall Time 2x4 (s) | Time w.r.t. Wall Time 2x8 (s) | Time w.r.t. Wall Time 2x16 (s) | Time w.r.t. Wall Time 2x32 (s) | Time w.r.t. Wall Time 2x64 (s) | Time w.r.t. Wall Time 2x96 (s) | Nb Threads 2x1 | Nb Threads 2x2 | Nb Threads 2x4 | Nb Threads 2x8 | Nb Threads 2x16 | Nb Threads 2x32 | Nb Threads 2x64 | Nb Threads 2x96 | GFLOPS 2x1 | GFLOPS 2x2 | GFLOPS 2x4 | GFLOPS 2x8 | GFLOPS 2x16 | GFLOPS 2x32 | GFLOPS 2x64 | GFLOPS 2x96 | Vectorization Ratio (%) | Vector Length Use (%) | Speedup If No Scalar Integer | Speedup If FP Vectorized | Speedup If Fully Vectorized | Speedup If Perfect Load Balancing 2x1 | Speedup If Perfect Load Balancing 2x2 | Speedup If Perfect Load Balancing 2x4 | Speedup If Perfect Load Balancing 2x8 | Speedup If Perfect Load Balancing 2x16 | Speedup If Perfect Load Balancing 2x32 | Speedup If Perfect Load Balancing 2x64 | Speedup If Perfect Load Balancing 2x96 | Stride 0 | Stride 1 | Stride n | Stride Unknown | Stride Indirect | (2x1) Efficiency | (2x1) Potential Speed-Up (%) | (2x2) Efficiency | (2x2) Potential Speed-Up (%) | (2x4) Efficiency | (2x4) Potential Speed-Up (%) | (2x8) Efficiency | (2x8) Potential Speed-Up (%) | (2x16) Efficiency | (2x16) Potential Speed-Up (%) | (2x32) Efficiency | (2x32) Potential Speed-Up (%) | (2x64) Efficiency | (2x64) Potential Speed-Up (%) | (2x96) Efficiency | (2x96) Potential Speed-Up (%) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1084 | libkripke.so - forall.hpp:59-59 [...] | _ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS... | InBetween | 90.79 | 90.27 | 89.93 | 89.46 | 87.87 | 82.11 | 61.44 | 57.46 | 922.41 | 455.5 | 231.63 | 114.74 | 57.58 | 28.74 | 14.75 | 11.05 | 916.65 | 456.6 | 229.69 | 114.31 | 57.92 | 29.51 | 16.57 | 12.81 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 7.17 | 14.39 | 28.61 | 57.49 | 113.47 | 222.71 | 396.63 | 513.06 | 6.67 | 13.33 | 2.19 | 1.98 | 6.2 | 1.01 | 1.01 | 1.02 | 1.02 | 1.03 | 1.03 | 1.03 | 1.03 | NA | NA | NA | NA | NA | 1 | 0 | 1 | 0 | 1 | 0.21 | 1 | 0 | 0.99 | 0.95 | 0.97 | 2.41 | 0.86 | 8.33 | 0.75 | 14.63 |
902 | libkripke.so - forall.hpp:59-59 [...] | _ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl2EEEEJNS2_3ForILl1ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS... | Innermost | 2.58 | 3.24 | 3.37 | 3.34 | 3.49 | 5.49 | 14.82 | 14.93 | 26.28 | 18.19 | 8.91 | 4.54 | 2.47 | 2.53 | 6.17 | 4.93 | 26.06 | 16.38 | 8.61 | 4.27 | 2.3 | 1.97 | 4 | 3.33 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.43 | 24.55 | 46.72 | 94.18 | 174.85 | 204.13 | 100.48 | 120.86 | 0 | 12.5 | 1 | 1 | 4.57 | 1.01 | 1.12 | 1.05 | 1.08 | 1.11 | 1.35 | 1.79 | 1.76 | 1 | 2 | 0 | 0 | 0 | 1 | 0 | 0.8 | 0.66 | 0.76 | 0.82 | 0.76 | 0.79 | 0.71 | 1.02 | 0.41 | 3.22 | 0.1 | 13.31 | 0.08 | 13.71 |
1022 | libkripke.so - forall.hpp:59-59 [...] | _ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl2EEEEJNS2_3ForILl1ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS... | Innermost | 2.51 | 2.94 | 3.06 | 3.22 | 4.01 | 5.85 | 12.85 | 15.09 | 25.32 | 16.17 | 8.58 | 4.47 | 2.97 | 2.65 | 4.86 | 4.8 | 25.31 | 14.87 | 7.82 | 4.11 | 2.64 | 2.1 | 3.47 | 3.36 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 15.89 | 27.04 | 51.41 | 97.81 | 152.27 | 191.45 | 115.86 | 119.36 | 0 | 12.5 | 1 | 1 | 4.57 | 1 | 1.1 | 1.11 | 1.1 | 1.16 | 1.32 | 1.63 | 1.7 | 1 | 2 | 0 | 0 | 0 | 1 | 0 | 0.85 | 0.44 | 0.81 | 0.58 | 0.77 | 0.74 | 0.6 | 1.61 | 0.38 | 3.65 | 0.11 | 11.39 | 0.08 | 13.91 |
1038 | libkripke.so - forall.hpp:59-59 [...] | _ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS2_6LambdaILl0EJEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSF_EEENS5_5tupleIJNS_4impl4SpanINS_9... | Innermost | 0.22 | 0.22 | 0.22 | 0.22 | 0.22 | 0.22 | 0.37 | 0.47 | 2.21 | 1.11 | 0.56 | 0.28 | 0.15 | 0.1 | 0.18 | 0.17 | 2.22 | 1.12 | 0.56 | 0.28 | 0.14 | 0.08 | 0.1 | 0.1 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 5.44 | 10.78 | 21.57 | 43.20 | 86.51 | 150.88 | 119.86 | 125.53 | 0 | 12.5 | 1 | 1.95 | 8 | 1 | 1 | 1.02 | 1 | 1.07 | 1.43 | 2 | 1.89 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 0.99 | 0 | 0.99 | 0 | 0.99 | 0 | 0.99 | 0 | 0.87 | 0.03 | 0.35 | 0.24 | 0.23 | 0.36 |
1121 | libkripke.so - forall.hpp:59-59 [...] | _ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS8_ILl4ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSH_E... | Innermost | 0.08 | 0.08 | 0.08 | 0.07 | 0.08 | 0.07 | 0.06 | 0.05 | 0.83 | 0.45 | 0.22 | 0.14 | 0.07 | 0.05 | 0.03 | 0.03 | 0.79 | 0.41 | 0.19 | 0.09 | 0.05 | 0.02 | 0.02 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 2.97 | 5.65 | 12.36 | 46.15 | 79.08 | 228.00 | 275.63 | 591.26 | 0 | 12.5 | 1.28 | 1 | 8 | 1.05 | 1.13 | 1.16 | 1.56 | 1.4 | 2.5 | 3 | 3 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0.96 | 0 | 1.04 | -0 | 1.1 | 0 | 0.99 | 0 | 1.23 | 0 | 0.62 | 0.02 | 0.82 | 0.01 |
1085 | libkripke.so - forall.hpp:59-59 [...] | _ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSG_EEENS5_5tupleIJNS... | InBetween | 0.07 | 0.07 | 0.07 | 0.07 | 0.07 | 0.07 | 0.05 | 0.05 | 1.14 | 0.64 | 0.32 | 0.17 | 0.12 | 0.07 | 0.05 | 0.03 | 0.69 | 0.35 | 0.17 | 0.09 | 0.05 | 0.02 | 0.01 | 0.01 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 192 | 9.58 | 15.67 | 34.29 | 63.00 | 109.80 | 270.88 | 553.38 | 518.13 | 0 | 12.5 | 1.92 | 1 | 6.57 | 1.65 | 1.83 | 1.88 | 1.89 | 2.4 | 3.5 | 5 | 3 | NA | NA | NA | NA | NA | 1 | 0 | 0.99 | 0 | 1.01 | -0 | 0.96 | 0 | 0.86 | 0.01 | 1.08 | 0 | 1.08 | -0 | 0.72 | 0.01 |
1122 | libkripke.so - forall.hpp:59-59 [...] | _ZN4RAJA8internal17StatementExecutorINS_9statement8CollapseINS_26omp_parallel_collapse_execEN4camp7int_seqIlJLl0ELl1EEEEJNS2_3ForILl2ENS_6policy4loop9loop_execEJNS8_ILl3ESB_JNS8_ILl4ESB_JNS2_6LambdaILl0EJEEEEEEEEEEEEEEEE4execIRNS0_8LoopDataINS5_4listIJSH_E... | InBetween | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.11 | 0.05 | 0.04 | 0.03 | 0.02 | 0.01 | 0.01 | 0.01 | 0.1 | 0.04 | 0.02 | 0.01 | 0.01 | 0 | 0 | 0 | 2 | 4 | 8 | 16 | 32 | 63 | 117 | 173 | 1.11 | 2.31 | 4.19 | 85.25 | 48.63 | 0.00 | 0.00 | 0.00 | 0 | 11.79 | 2.04 | 1 | 11 | 1.22 | 1.25 | 2 | 3 | 2 | 0 | 0 | 0 | NA | NA | NA | NA | NA | 1 | 0 | 1.25 | -0 | 1.25 | -0 | 1.25 | -0 | 0.63 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
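
The Efficiency and Potential Speed-Up columns are not defined in this table, but for loop 1084 the reported values appear consistent with a simple relation: efficiency is the wall-time speedup over the 2x1 run divided by the increase in threads per process, and potential speed-up is the run's coverage multiplied by (1 - efficiency). The sketch below recomputes both from the "Time w.r.t. Wall Time" and "Coverage" columns. The relation is inferred from the numbers, not taken from the tool's documentation, and the names `scaling_metrics` and `LOOP_1084` are hypothetical.

```python
# Values copied from the row for loop 1084 in the table above.
# Keys are OMP threads per process; the 2x1 run is the reference.
LOOP_1084 = {
    # threads: (wall_time_s, coverage_percent)
    1: (916.65, 90.79),
    16: (57.92, 87.87),
    32: (29.51, 82.11),
    64: (16.57, 61.44),
    96: (12.81, 57.46),
}


def scaling_metrics(wall_ref, wall_run, thread_ratio, coverage_run):
    """Return (efficiency, potential_speedup_percent).

    Assumed relation (inferred from the table, not documented here):
      efficiency = (wall_ref / wall_run) / thread_ratio
      potential  = coverage_run * (1 - efficiency)
    """
    speedup = wall_ref / wall_run
    efficiency = speedup / thread_ratio
    potential = coverage_run * (1.0 - efficiency)
    return efficiency, potential


if __name__ == "__main__":
    wall_ref, _ = LOOP_1084[1]
    for threads, (wall, coverage) in LOOP_1084.items():
        eff, pot = scaling_metrics(wall_ref, wall, threads, coverage)
        print(f"2x{threads}: efficiency={eff:.2f} potential={pot:.2f}%")
```

For runs where the computed efficiency is slightly above 1 (for example 2x2 and 2x8 on loop 1084), the table reports 0 or -0 for potential speed-up, which this sketch does not try to reproduce.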