Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 |
---|---|---|---|---|---|---|---|---|
Total Time (s) | 44.62 | 45.07 | 40.35 | 37.64 | 46.77 | 48.18 | 44.23 | 41.62 |
Profiled Time (s) | 42.73 | 43.16 | 38.88 | 36.29 | 45.50 | 46.81 | 42.95 | 40.50 |
GFLOPS | 2.514 | 1.892 | 3.636 | 2.825 | 2.401 | 1.792 | 2.426 | 2.547 |
Time in analyzed loops (%) | 99.6 | 99.4 | 99.5 | 99.5 | 99.6 | 99.5 | 99.5 | 99.6 |
Time in analyzed innermost loops (%) | 99.1 | 98.8 | 98.3 | 98.3 | 99.1 | 99.0 | 99.4 | 98.5 |
Time in user code (%) | 99.7 | 99.5 | 99.6 | 99.6 | 99.6 | 99.5 | 99.7 | 99.6 |
Compilation Options Score (%) | 0 | 100 | 100 | 0 | 0 | 100 | 100 | 0 |
Array Access Efficiency (%) | 80.5 | 81.2 | 80.1 | 82.9 | 81.5 | 82.8 | 81.7 | 83.3 |
Potential Speedups | | | | | | | | |
Iterations Count | Not Available | Not Available | 1.00 | 1.00 | Not Available | Not Available | 1.00 | 1.00 |
Perfect Flow Complexity | 1.00 | 1.14 | 1.00 | 1.00 | 1.00 | 1.12 | 1.09 | 1.00 |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
No Scalar Integer: Potential Speedup | 1.02 | 1.02 | 1.01 | 1.01 | 1.02 | 1.02 | 1.00 | 1.01 |
No Scalar Integer: Nb Loops to get 80% | 3 | 3 | 7 | 5 | 3 | 2 | 2 | 5 |
FP Vectorised: Potential Speedup | 1.13 | 1.28 | 1.00 | 1.00 | 1.14 | 1.27 | 1.14 | 1.00 |
FP Vectorised: Nb Loops to get 80% | 10 | 7 | 1 | 2 | 11 | 9 | 9 | 2 |
Fully Vectorised: Potential Speedup | 1.32 | 1.54 | 1.01 | 1.01 | 1.36 | 1.58 | 1.40 | 1.01 |
Fully Vectorised: Nb Loops to get 80% | 15 | 15 | 22 | 21 | 15 | 16 | 15 | 21 |
Only FP Arithmetic: Potential Speedup | 1.10 | 1.13 | 1.02 | 1.07 | 1.11 | 1.14 | 1.11 | 1.08 |
Only FP Arithmetic: Nb Loops to get 80% | 5 | 6 | 9 | 8 | 5 | 6 | 3 | 7 |
Data In L1 Cache: Potential Speedup | Not Available | Not Available | 1.00 | Not Available | Not Available | Not Available | 1.00 | Not Available |
Data In L1 Cache: Nb Loops to get 80% | Not Available | Not Available | 1 | Not Available | Not Available | Not Available | 1 | Not Available |
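
The Total Time row can be turned into relative speedups to make the runs easier to compare. The following is a minimal sketch, not part of the MAQAO output: it assumes that r0 and r4 (the two `orig` binaries listed in the run description table) are the baselines for r1-r3 and r5-r7 respectively, since each group was measured on a different machine. The names `total_time`, `baseline_of` and `speedup_vs_baseline` are illustrative.

```python
# Total Time (s) per run, copied from the metrics table above.
total_time = {
    "r0": 44.62, "r1": 45.07, "r2": 40.35, "r3": 37.64,
    "r4": 46.77, "r5": 48.18, "r6": 44.23, "r7": 41.62,
}

# Assumption: r0 and r4 are the baselines ("orig" binaries) of the two
# groups of runs, so r1-r3 are compared to r0 and r5-r7 to r4.
baseline_of = {
    "r1": "r0", "r2": "r0", "r3": "r0",
    "r5": "r4", "r6": "r4", "r7": "r4",
}


def speedup_vs_baseline(times, baselines):
    """Return {run: baseline_time / run_time}; values above 1 mean faster."""
    return {run: times[base] / times[run] for run, base in baselines.items()}


speedups = speedup_vs_baseline(total_time, baseline_of)
for run in sorted(speedups):
    print(f"{run} vs {baseline_of[run]}: {speedups[run]:.3f}x")
```

A value below 1 means the run is slower than its baseline, which is the case for both GCC builds (r1 and r5) under this grouping.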
Source Object | Issue |
---|---|
exec | |
update_halo_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
calc_dt_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
revert_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
advec_cell_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
flux_calc_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
viscosity_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
field_summary_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
advec_mom_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
accelerate_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
ideal_gas_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
PdV_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
reset_field_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
generate_chunk_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
initialise_chunk_kernel.f90 | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
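
The issue text above names three groups of recommended flags. As a rough illustration only, the sketch below checks a compilation-options string for them. It is not MAQAO's actual scoring logic: the function `missing_recommended_flags` is hypothetical, and treating `-march=` as the GNU counterpart of `-x(target)` is an assumption. The two sample strings are taken from the Compilation Options row of the run description table (the second is an excerpt of the r2 flags).

```python
def missing_recommended_flags(options):
    """Return the recommended flag groups that do not appear in `options`."""
    tokens = options.split()
    missing = []
    if "-g" not in tokens:
        missing.append("-g")
    if not any(t in ("-O2", "-O3", "-Ofast") for t in tokens):
        missing.append("-O2/-O3/-Ofast")
    # Assumption: -march= is accepted as the GNU equivalent of -x(target).
    if not any(t.startswith(("-x", "-march=")) for t in tokens):
        missing.append("-x(target)")
    return missing


# r0: only the compiler identification was recorded, no flags.
r0_options = "Intel(R) Fortran 23.0-1769"
# r2: excerpt of the recorded flags (include paths and output file omitted).
r2_excerpt = ("-O2 -xSAPPHIRERAPIDS -qopt-zmm-usage=high -g "
              "-fno-omit-frame-pointer -no-pie -qopenmp")

print("r0:", missing_recommended_flags(r0_options))
print("r2:", missing_recommended_flags(r2_excerpt))
```

Under these assumptions, the result is consistent with the Compilation Options Score row, where r0 scores 0 and r2 scores 100.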
Property | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 |
---|---|---|---|---|---|---|---|---|
Application | /scratch_na/users/xoserete/qaas_runs/170-499-6783/intel/CloverLeafFC/run/oneview_runs/defaults/orig/exec | /scratch_na/users/xoserete/qaas_runs/170-499-6783/intel/CloverLeafFC/run/binaries/gcc_16/exec | /scratch_na/users/xoserete/qaas_runs/170-499-6783/intel/CloverLeafFC/run/binaries/icc_6/exec | /scratch_na/users/xoserete/qaas_runs/170-499-6783/intel/CloverLeafFC/run/binaries/icx_2/exec | /scratch_na/users/xoserete/qaas_runs/170-489-0037/intel/CloverLeafFC/run/oneview_runs/defaults/orig/exec | /scratch_na/users/xoserete/qaas_runs/170-489-0037/intel/CloverLeafFC/run/binaries/gcc_4/exec | /scratch_na/users/xoserete/qaas_runs/170-489-0037/intel/CloverLeafFC/run/binaries/icc_11/exec | /scratch_na/users/xoserete/qaas_runs/170-489-0037/intel/CloverLeafFC/run/binaries/icx_2/exec |
Timestamp | 2024-01-11 19:18:03 | 2024-01-11 21:29:56 | 2024-01-11 21:25:50 | 2024-01-11 21:22:55 | 2024-01-10 13:39:15 | 2024-01-10 15:59:39 | 2024-01-10 15:55:19 | 2024-01-10 15:52:23 |
Experiment Type | MPI; | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Machine | o406 | same as r0 | same as r0 | same as r0 | o401 | same as r4 | same as r4 | same as r4 |
Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | SAPPHIRE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | Intel (R) Xeon (R) CPU Max 9480 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Cache Size | 115200 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of Cores | 56 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Maximal Frequency | 1.901 GHz | same as r0 | same as r0 | same as r0 | 1.9 GHz | same as r4 | same as r4 | same as r4 |
OS Version | Linux 4.18.0-372.26.1.el8_6.x86_64 #1 SMP Sat Aug 27 02:44:20 EDT 2022 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | SAPPHIRE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | exec: Intel(R) Fortran 23.0-1769 | exec: GNU GIMPLE 12.1.0 -march=haswell -g -g -O2 -O2 -fno-openacc -fno-pie -fcf-protection=none -fno-tree-vectorize -ffast-math -fno-omit-frame-pointer -fopenmp -funroll-loops -fltrans | exec: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/scratch_na/users/xoserete/qaas_runs/170-499-6783/intel/CloverLeafFC/build/CloverLeafFC/CloverLeaf_ref/kernels -I/opt/intel/oneapi/mpi/2021.8.0//include -I/opt/intel/oneapi/mpi/2021.8.0/include -O2 -xSAPPHIRERAPIDS -qopt-zmm-usage=high -g -fno-omit-frame-pointer -no-pie -qopenmp -c -o CMakeFiles/clover_leaf.dir/CloverLeaf_ref/kernels/advec_mom_kernel.f90.o | same as r0 | same as r0 | exec: GNU Fortran2008 12.1.0 -march=haswell -g -O3 -O3 -fallow-argument-mismatch -fno-tree-vectorize -fno-openmp-simd -ffast-math -fno-omit-frame-pointer -fopenmp -funroll-loops -fintrinsic-modules-path /software/compilers/gcc/gcc-12.1.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/12.1.0/finclude -fpre-include=/usr/include/finclude/math-vector-fortran.h | exec: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/scratch_na/users/xoserete/qaas_runs/170-489-0037/intel/CloverLeafFC/build/CloverLeafFC/CloverLeaf_ref/kernels -I/opt/intel/oneapi/mpi/2021.8.0//include -I/opt/intel/oneapi/mpi/2021.8.0/include -O3 -O3 -xCORE-AVX2 -flto -g -fno-omit-frame-pointer -no-pie -qopenmp -c -o CMakeFiles/clover_leaf.dir/CloverLeaf_ref/kernels/advec_mom_kernel.f90.o | same as r0 |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Driver | acpi-cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | userspace | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | never | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | on | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 56 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.18.0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | Build information not available | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | - | - | - | - | - | - | - | - |
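
Many cells in the table above read `same as rN` instead of repeating a value. When the table is processed by a script, these references have to be expanded first. The following is a minimal sketch of one way to do that; `resolve_same_as` is an illustrative helper, and it assumes that a reference always points to a column further to the left, as is the case in this report.

```python
def resolve_same_as(cells, run_ids):
    """Expand 'same as rN' cells into the value of the referenced run."""
    prefix = "same as "
    resolved = {}
    for run, cell in zip(run_ids, cells):
        if cell.startswith(prefix):
            # Assumption: the referenced run appears in an earlier column.
            resolved[run] = resolved[cell[len(prefix):]]
        else:
            resolved[run] = cell
    return resolved


runs = [f"r{i}" for i in range(8)]
# The Machine row, copied from the table above.
machine_row = ["o406", "same as r0", "same as r0", "same as r0",
               "o401", "same as r4", "same as r4", "same as r4"]

machines = resolve_same_as(machine_row, runs)
for run in runs:
    print(run, machines[run])
```

Applied to the Machine row, this makes explicit that r0-r3 ran on o406 and r4-r7 on o401.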