- r0: arm/maqao_2022-10-27_15-38-51_v2.15.9_specfem/
- r1: intel/maqao_2022-10-25_17-51-17/
Metric | r0 | r1 |
---|---|---|
Total Time (s) | 206.53 | 103.41 |
Profiled Time (s) | 204.69 | 98.44 |
Time in analyzed loops (%) | 93.2 | 84.8 |
Time in analyzed innermost loops (%) | 89.1 | 83.0 |
Time in user code (%) | 96.5 | 86.6 |
Compilation Options | 99.99 | OK |
Perfect Flow Complexity | Not Available | 1.00 |
Array Access Efficiency (%) | Not Available | 72.8 |
Perfect OpenMP + MPI + Pthread | 1.01 | 1.05 |
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.04 | 2.21 |
No Scalar Integer: Potential Speedup | Not Available | 1.07 |
No Scalar Integer: Nb Loops to get 80% | Not Available | 6 |
FP Vectorised: Potential Speedup | Not Available | 1.13 |
FP Vectorised: Nb Loops to get 80% | Not Available | 7 |
Fully Vectorised: Potential Speedup | Not Available | 2.48 |
Fully Vectorised: Nb Loops to get 80% | Not Available | 23 |
Only FP Arithmetic: Potential Speedup | Not Available | 1.49 |
Only FP Arithmetic: Nb Loops to get 80% | Not Available | 12 |

Property | r0 | r1 |
---|---|---|
Application | /home_nfs/bhamitono/projects/emopass/benchmarks/specfem3d_globe/build_armclang_20221027/bin/xspecfem3D | ../build_intel_20221024/bin/xspecfem3D |
Timestamp | 2022-10-27 15:38:51 | 2022-10-25 17:51:17 |
Experiment Type | MPI; | MPI; OpenMP; |
Machine | o118 | o201 |
Architecture | arm64 | x86_64 |
Micro Architecture | ARM_NEOVERSE_N1 | ICELAKE_SP |
Model Name | | Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz |
Cache Size | | 49152 KB |
Number of Cores | | 32 |
Maximal Frequency | | 2.601 GHz |
OS Version | Linux 4.18.0-240.el8.aarch64 #1 SMP Wed Sep 23 05:09:38 EDT 2020 | Linux 4.18.0-305.7.1.el8_4.x86_64 #1 SMP Mon Jun 14 17:25:42 EDT 2021 |
Architecture used during static analysis | arm64 | x86_64 |
Micro Architecture used during static analysis | ARM_NEOVERSE_N1 | ICELAKE_SP |
Compilation Options | xspecfem3D: | xspecfem3D: Intel 2021.7.0 -I./obj -I. -I. -I./setup -I/opt/intel/oneapi/mpi/2021.7.0//include -I/opt/intel/oneapi/mpi/2021.7.0/include -fno-omit-frame-pointer -g -O3 -qopenmp -xCORE-AVX512 -mtune=icelake-client -DUSE_FP32 -DOPT_STREAMS -fp-model fast=2 -traceback -mcmodel=large -DUSE_OPENMP -module ./obj -c -o obj/assemble_MPI_scalar.solver.o |
Number of processes observed | 54 | same as r0 |
Number of threads observed | 54 | 108 |
MAQAO version | 2.15.9 | same as r0 |
MAQAO build | f97bf913f6f6bdc7e18da5e6dbbfe115364f721a::20220721-153132 | 3dbe53d8f250d42161e0eeef60fd2f16f9a81c40::20220719-154243 |