Metric | | r0 | r1 | r2 |
---|---|---|---|---|
Total Time (s) | | 19.88 | 19.98 | 19.11 |
Profiled Time (s) | | 18.99 | 19.09 | 18.30 |
Time in analyzed loops (%) | | 99.4 | 99.4 | 99.2 |
Time in analyzed innermost loops (%) | | 80.0 | 80.4 | 63.0 |
Time in user code (%) | | 99.5 | 99.5 | 99.3 |
Compilation Options Score (%) | | 0 | 0 | 100 |
Array Access Efficiency (%) | | 67.5 | 67.3 | 71.9 |
Potential Speedups | | | | |
Iterations Count | | Not Available | Not Available | 1.19 |
Perfect Flow Complexity | | 1.09 | 1.09 | 1.09 |
Perfect OpenMP + MPI + Pthread | | 1.00 | 1.00 | 1.00 |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | | 1.00 | 1.00 | 1.00 |
No Scalar Integer | Potential Speedup | 1.14 | 1.14 | 1.32 |
No Scalar Integer | Nb Loops to get 80% | 10 | 10 | 17 |
FP Vectorised | Potential Speedup | 1.50 | 1.50 | 1.31 |
FP Vectorised | Nb Loops to get 80% | 4 | 5 | 5 |
Fully Vectorised | Potential Speedup | 4.94 | 4.94 | 4.34 |
Fully Vectorised | Nb Loops to get 80% | 41 | 41 | 33 |
Only FP Arithmetic | Potential Speedup | 1.34 | 1.35 | 1.52 |
Only FP Arithmetic | Nb Loops to get 80% | 29 | 28 | 25 |
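
The Total Time row can be turned into a relative speedup with simple arithmetic. The following is a minimal sketch, not part of the MAQAO output: it assumes r0 is the baseline run, and the function name `speedup_vs_baseline` is a hypothetical helper introduced only for illustration.

```python
# Total Time (s) values copied from the metrics table above.
total_time_s = {"r0": 19.88, "r1": 19.98, "r2": 19.11}


def speedup_vs_baseline(times, baseline="r0"):
    """Return baseline_time / run_time for every run.

    Values above 1 mean the run is faster than the baseline,
    values below 1 mean it is slower.
    """
    base = times[baseline]
    return {run: base / t for run, t in times.items()}


speedups = speedup_vs_baseline(total_time_s)
for run, s in speedups.items():
    print(f"{run}: {s:.3f}x relative to r0")
```

With these numbers, r2 comes out roughly 4% faster than r0 and r1 about 0.5% slower, which is small compared with the potential speedups listed in the table.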

Source Object | Issue |
---|---|
exec | |
IJVector_parcsr.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
par_coarsen.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
par_strength.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
random.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
par_lr_interp.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
vector.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
ams.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
csr_matvec.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
IJMatrix_parcsr.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
amg.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
csr_matop.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
par_coarse_parms.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
par_csr_matop.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
hypre_qsort.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
par_multi_interp.c | -g and -grecord-gcc-switches are missing for some functions (possibly ones added by the compiler); these flags are needed for more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
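
The issue messages above recommend -g, an -O2/-O3 optimization level, and -march=(target). As a rough illustration, the sketch below checks a recorded flag string for those three items. It is an assumption-laden example, not MAQAO's scoring method: the function `check_recommended_flags` is hypothetical, it does not look at -grecord-gcc-switches, and the flag string is the one this report records for `exec` in run r2.

```python
def check_recommended_flags(flags):
    """Report which of the recommended flags appear in a flag string.

    This is a simple token check for illustration only; it is not
    how MAQAO derives its Compilation Options Score.
    """
    tokens = flags.split()
    return {
        "-g": "-g" in tokens,
        "-O2/-O3": any(t in ("-O2", "-O3") for t in tokens),
        "-march=": any(t.startswith("-march=") for t in tokens),
    }


# Flags recorded for `exec` in run r2 (copied from the Compilation Options row).
r2_exec_flags = (
    "-march=znver4 -g -g -O3 -O3 -fno-openacc -fno-pie -fcf-protection=none "
    "-ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp "
    "-funroll-loops -fltrans"
)

result = check_recommended_flags(r2_exec_flags)
for flag, present in result.items():
    print(f"{flag}: {'present' if present else 'missing'}")
```

All three checks pass for the r2 string, which is consistent with r2 being the only run with a non-zero Compilation Options Score. The r0 and r1 entries record only a compiler version, so the same check would report every flag as missing for them.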

Parameter | r0 | r1 | r2 |
---|---|---|---|
Application | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/oneview_runs/orig/exec | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/binaries/clang_14/exec | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/binaries/gcc_12/exec |
Timestamp | 2023-10-19 11:48:15 | 2023-10-19 12:44:48 | 2023-10-19 12:45:31 |
Experiment Type | MPI | same as r0 | same as r0 |
Machine | ip-172-31-68-94 | same as r0 | same as r0 |
Architecture | x86_64 | same as r0 | same as r0 |
Micro Architecture | ZEN_V4 | same as r0 | same as r0 |
Model Name | AMD EPYC 9R14 96-Core Processor | same as r0 | same as r0 |
Cache Size | 1024 KB | same as r0 | same as r0 |
Number of Cores | 96 | same as r0 | same as r0 |
Maximal Frequency | 3.701953 GHz | same as r0 | same as r0 |
OS Version | Linux 6.2.0-1013-aws #13~22.04.1-Ubuntu SMP Fri Sep 8 17:29:56 UTC 2023 | same as r0 | same as r0 |
Architecture used during static analysis | x86_64 | same as r0 | same as r0 |
Micro Architecture used during static analysis | ZEN_V4 | same as r0 | same as r0 |
Compilation Options | exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | libparcsr_mv.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10); exec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10); libHYPRE_utilities.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10); libseq_mv.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10); libIJ_mv.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10); libparcsr_ls.so: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) | libparcsr_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans; exec: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fno-pie -fcf-protection=none -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans; libHYPRE_utilities.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans; libseq_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans; libIJ_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans; libparcsr_ls.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans |
Number of processes observed | 1 | same as r0 | same as r0 |
Number of threads observed | 1 | same as r0 | same as r0 |
Frequency Driver | acpi-cpufreq | same as r0 | same as r0 |
Frequency Governor | performance | same as r0 | same as r0 |
Huge Pages | madvise | same as r0 | same as r0 |
Hyperthreading | off | same as r0 | same as r0 |
Number of sockets | 2 | same as r0 | same as r0 |
Number of cores per socket | 96 | same as r0 | same as r0 |
MAQAO version | 2.18.0 | same as r0 | same as r0 |
MAQAO build | 44fc1f08bd133baf72fdfe51b209105f7e5da0e1::20231013-163433 | same as r0 | same as r0 |
Comments | - | - | - |