Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
---|---|---|---|---|---|---|---|---|---|
Total Time (s) | 48.45 | 51.36 | 51.06 | 45.39 | 90.09 | 83.07 | 80.18 | 86.01 | |
Profiled Time (s) | 37.17 | 36.28 | 36.23 | 34.44 | 78.05 | 73.91 | 68.35 | 77.68 | |
GFLOPS | 155.585 | 116.368 | 185.951 | 151.705 | 88.754 | 77.462 | 119.928 | 80.015 | |
Time in analyzed loops (%) | 89.6 | 89.7 | 90.3 | 89.6 | 95.6 | 96.4 | 95.6 | 95.7 | |
Time in analyzed innermost loops (%) | 89.5 | 89.7 | 90.1 | 89.4 | 95.5 | 96.4 | 95.4 | 95.5 | |
Time in user code (%) | 89.8 | 89.9 | 90.5 | 89.8 | 95.7 | 96.5 | 95.8 | 95.8 | |
Compilation Options Score (%) | 0 | 100 | 100 | 0 | 0 | 100 | 100 | 0 | |
Array Access Efficiency (%) | 82.9 | 84.3 | 81.5 | 83.8 | 86.7 | 89.5 | 83.5 | 87.8 | |
Potential Speedups | |||||||||
Iterations Count | 1.00 | Not Available | 1.00 | 1.00 | 1.00 | Not Available | 1.00 | 1.00 | |
Perfect Flow Complexity | 1.00 | 1.10 | 1.00 | 1.00 | 1.00 | 1.09 | 1.00 | 1.03 | |
Perfect OpenMP + MPI + Pthread | 1.08 | 1.05 | 1.09 | 1.08 | 1.04 | 1.02 | 1.04 | 1.04 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.15 | 1.11 | 1.16 | 1.14 | 1.07 | 1.04 | 1.07 | 1.06 | |
No Scalar Integer | Potential Speedup | 1.01 | 1.02 | 1.00 | 1.01 | 1.01 | 1.02 | 1.00 | 1.11 |
Nb Loops to get 80% | 3 | 2 | 3 | 3 | 3 | 3 | 3 | 9 | |
FP Vectorised | Potential Speedup | 1.12 | 1.21 | 1.00 | 1.00 | 1.12 | 1.11 | 1.00 | 1.05 |
Nb Loops to get 80% | 10 | 8 | 1 | 2 | 11 | 4 | 1 | 2 | |
Fully Vectorised | Potential Speedup | 1.28 | 1.44 | 1.00 | 1.00 | 1.31 | 1.12 | 1.00 | 1.06 |
Nb Loops to get 80% | 14 | 15 | 5 | 5 | 15 | 5 | 6 | 2 | |
Only FP Arithmetic | Potential Speedup | 1.08 | 1.11 | 1.02 | 1.05 | 1.07 | 1.15 | 1.02 | 1.17 |
Nb Loops to get 80% | 5 | 6 | 4 | 6 | 6 | 8 | 4 | 10 | |
Data In L1 Cache | Potential Speedup | Not Available | Not Available | 1.04 | 1.05 | Not Available | Not Available | 1.05 | 1.05 |
Nb Loops to get 80% | Not Available | Not Available | 1 | 1 | Not Available | Not Available | 1 | 1 |
Source Object | Issue |
---|---|
▼exec | |
▼update_halo_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼calc_dt_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼revert_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼ideal_gas_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼flux_calc_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼viscosity_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼field_summary_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼advec_mom_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼accelerate_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼advec_cell_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼generate_chunk_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼reset_field_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼PdV_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
▼initialise_chunk_kernel.f90 | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3/-Ofast, -x(target) |
r0 | r1 | r2 | r3 | r4 | r5 | r6 | r7 | |
---|---|---|---|---|---|---|---|---|
Application | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/oneview_runs/defaults/orig/exec | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/binaries/gcc_4/exec | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/binaries/icc_14/exec | /scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/run/binaries/icx_2/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/oneview_runs/defaults/orig/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/binaries/gcc_10/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/binaries/icc_10/exec | /scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/run/binaries/icx_14/exec |
Timestamp | 2024-01-10 10:05:34 | 2024-01-10 12:39:28 | 2024-01-10 12:35:12 | 2024-01-10 12:31:22 | 2024-01-11 18:24:39 | 2024-01-11 22:34:37 | 2024-01-11 22:29:03 | 2024-01-11 22:23:03 |
Experiment Type | MPI; OpenMP; | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Machine | o406 | same as r0 | same as r0 | same as r0 | o401 | same as r4 | same as r4 | same as r4 |
Architecture | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture | SAPPHIRE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Model Name | Intel (R) Xeon (R) CPU Max 9480 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Cache Size | 115200 KB | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of Cores | 56 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Maximal Frequency | 1.901 GHz | same as r0 | same as r0 | same as r0 | 1.9 GHz | same as r4 | same as r4 | same as r4 |
OS Version | Linux 4.18.0-372.26.1.el8_6.x86_64 #1 SMP Sat Aug 27 02:44:20 EDT 2022 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Architecture used during static analysis | x86_64 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Micro Architecture used during static analysis | SAPPHIRE_RAPIDS | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Compilation Options | exec: Intel(R) Fortran 23.0-1769 | exec: GNU Fortran2008 12.1.0 -march=haswell -g -O3 -O3 -fallow-argument-mismatch -fno-tree-vectorize -fno-openmp-simd -ffast-math -fno-omit-frame-pointer -fopenmp -funroll-loops -fintrinsic-modules-path /software/compilers/gcc/gcc-12.1.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/12.1.0/finclude -fpre-include=/usr/include/finclude/math-vector-fortran.h | exec: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/scratch_na/users/xoserete/qaas_runs/170-487-7187/intel/CloverLeafFC/build/CloverLeafFC/CloverLeaf_ref/kernels -I/opt/intel/oneapi/mpi/2021.8.0//include -I/opt/intel/oneapi/mpi/2021.8.0/include -O2 -xSAPPHIRERAPIDS -qopt-zmm-usage=high -flto -g -fno-omit-frame-pointer -no-pie -qopenmp -c -o CMakeFiles/clover_leaf.dir/CloverLeaf_ref/kernels/advec_mom_kernel.f90.o | same as r0 | same as r0 | exec: GNU GIMPLE 12.1.0 -mprefer-vector-width=512 -march=sapphirerapids -mprefer-vector-width=512 -g -g -O3 -O3 -O3 -O3 -fno-openacc -fno-pie -fcf-protection=none -ffast-math -fno-omit-frame-pointer -fopenmp -funroll-loops -fltrans | exec: Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.10.0 Build 20230609_000000 -I/scratch_na/users/xoserete/qaas_runs/170-499-3310/intel/CloverLeafFC/build/CloverLeafFC/CloverLeaf_ref/kernels -I/opt/intel/oneapi/mpi/2021.8.0//include -I/opt/intel/oneapi/mpi/2021.8.0/include -O3 -O3 -xSAPPHIRERAPIDS -qopt-zmm-usage=high -flto -g -fno-omit-frame-pointer -no-pie -qopenmp -c -o CMakeFiles/clover_leaf.dir/CloverLeaf_ref/kernels/advec_mom_kernel.f90.o | same as r0 |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 113 | 112 | same as r1 | same as r1 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Driver | acpi-cpufreq | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Frequency Governor | userspace | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Huge Pages | never | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Hyperthreading | on | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of sockets | 2 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Number of cores per socket | 56 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO version | 2.18.0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
MAQAO build | Build information not available | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 | same as r0 |
Comments | - | - | - | - | - | - | - | - |