Help is available by moving the cursor above any symbol or by checking MAQAO website.
Metric | r0 | r1 | r2 | |
---|---|---|---|---|
Total Time (s) | 16.77 | 38.94 | 19.11 | |
Profiled Time (s) | 15.42 | 36.88 | 18.30 | |
Time in analyzed loops (%) | 99.2 | 99.5 | 99.2 | |
Time in analyzed innermost loops (%) | 74.8 | 25.7 | 63.0 | |
Time in user code (%) | 99.9 | 99.7 | 99.3 | |
Compilation Options Score (%) | 100 | 100 | 100 | |
Array Access Efficiency (%) | Not Available | 76.7 | 71.9 | |
Potential Speedups | ||||
Iterations Count | Not Available | 1.02 | 1.19 | |
Perfect Flow Complexity | 1.10 | 1.07 | 1.09 | |
Perfect OpenMP + MPI + Pthread | 1.00 | 1.00 | 1.00 | |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.00 | 1.00 | 1.00 | |
No Scalar Integer | Potential Speedup | 1.12 | 1.13 | 1.32 |
Nb Loops to get 80% | 12 | 10 | 17 | |
FP Vectorised | Potential Speedup | 1.01 | 1.05 | 1.31 |
Nb Loops to get 80% | 2 | 3 | 5 | |
Fully Vectorised | Potential Speedup | 1.56 | 1.60 | 4.34 |
Nb Loops to get 80% | 24 | 22 | 33 | |
Only FP Arithmetic | Potential Speedup | 1.41 | 3.19 | 1.52 |
Nb Loops to get 80% | 27 | 22 | 25 | |
Data In L1 Cache | Potential Speedup | Not Available | 1.00 | Not Available |
Nb Loops to get 80% | Not Available | 1 | Not Available |
Source Object | Issue |
---|---|
▼exec | |
▼IJVector_parcsr.c | |
○ | |
▼par_strength.c | |
○ | |
▼amg.c | |
○ | |
▼par_lr_interp.c | |
○ | |
▼csr_matrix.c | |
○ | |
▼random.c | |
○ | |
▼csr_matvec.c | |
○ | |
▼IJMatrix_parcsr.c | |
○ | |
▼par_coarsen.c | |
○ | |
▼csr_matop.c | |
○ | |
▼par_csr_matop.c | |
○ | |
▼vector.c | |
○ | |
▼ams.c | |
○ | |
▼par_multi_interp.c | |
○ |
r0 | r1 | r2 | |
---|---|---|---|
Application | /home/hbollore/qaas/qaas-runs/169-817-3176/intel/AMG/run/binaries/gcc_2/exec | /home/kcamus/qaas_runs/169-443-9681/intel/AMG/run/binaries/gcc_2/exec | /home/kcamus/qaas_runs/169-771-5789/intel/AMG/run/binaries/gcc_12/exec |
Timestamp | 2023-10-24 19:07:03 | 2023-09-11 18:50:08 | 2023-10-19 12:45:31 |
Experiment Type | MPI; | same as r0 | same as r0 |
Machine | ip-172-31-47-199 | skylake | ip-172-31-68-94 |
Architecture | aarch64 | x86_64 | same as r1 |
Micro Architecture | ARM_NEOVERSE_V1 | SKYLAKE | ZEN_V4 |
Model Name | Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz | AMD EPYC 9R14 96-Core Processor | |
Cache Size | 36608 KB | 1024 KB | |
Number of Cores | 26 | 96 | |
Maximal Frequency | 0 GHz | 2.1 GHz | 3.701953 GHz |
OS Version | Linux 5.15.0-1048-aws #53~20.04.1-Ubuntu SMP Wed Oct 4 16:51:38 UTC 2023 | Linux 6.4.1-arch2-1 #1 SMP PREEMPT_DYNAMIC Tue, 04 Jul 2023 08:39:40 +0000 | Linux 6.2.0-1013-aws #13~22.04.1-Ubuntu SMP Fri Sep 8 17:29:56 UTC 2023 |
Architecture used during static analysis | aarch64 | x86_64 | same as r1 |
Micro Architecture used during static analysis | ARM_NEOVERSE_V1 | SKYLAKE | ZEN_V4 |
Compilation Options | exec: GNU C17 11.1.0 -mlittle-endian -mabi=lp64 -mcpu=zeus+crypto+sha3+sm4+nodotprod+noprofile -g -g -Ofast -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fasynchronous-unwind-tables -fstack-protector-strong -fstack-clash-protection | exec: GNU C89 13.1.1 20230429 -march=skylake-avx512 -mprefer-vector-width=512 -g -O3 -std=gnu90 -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops | libparcsr_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans exec: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fno-pie -fcf-protection=none -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libHYPRE_utilities.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libseq_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libIJ_mv.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans libparcsr_ls.so: GNU GIMPLE 14.0.0 20231018 (experimental) -march=znver4 -g -g -O3 -O3 -fno-openacc -fcf-protection=none -fPIC -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fopenmp -funroll-loops -fltrans |
Number of processes observed | 1 | same as r0 | same as r0 |
Number of threads observed | 1 | same as r0 | same as r0 |
Frequency Driver | NA | intel_cpufreq | acpi-cpufreq |
Frequency Governor | NA | schedutil | performance |
Huge Pages | madvise | always | same as r0 |
Hyperthreading | off | same as r0 | same as r0 |
Number of sockets | 1 | 2 | same as r1 |
Number of cores per socket | 64 | 26 | 96 |
MAQAO version | 2.17.9 | 2.17.8 | 2.18.0 |
MAQAO build | 690431094d99a32cb85b834b2d457fa7bff1d94a::20230918-111356 | 70175eac56e139877d863e6478260132bd85e954::20230901-143618 | 44fc1f08bd133baf72fdfe51b209105f7e5da0e1::20231013-163433 |
Comments | - | - | - |