options

exec - 2024-08-10 19:32:00 - MAQAO 2.20.7

Help is available by moving the cursor above any symbol or by checking MAQAO website.

Global Metrics

Total Time (s)83.99
Profiled Time (s)83.72
GFLOPS994.456
Time in analyzed loops (%)28.4
Time in analyzed innermost loops (%)28.4
Time in user code (%)28.6
Compilation Options Score (%)100
Array Access Efficiency (%)91.6
Potential Speedups
Perfect Flow Complexity1.00
Perfect OpenMP + MPI + Pthread1.00
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution1.12
No Scalar IntegerPotential Speedup1.01
Nb Loops to get 80%1
FP VectorisedPotential Speedup1.01
Nb Loops to get 80%1
Fully VectorisedPotential Speedup1.34
Nb Loops to get 80%1
FP Arithmetic OnlyPotential Speedup1.15
Nb Loops to get 80%1

CQA Potential Speedups Summary

1
2
1.00
1.05
1.10
1.15
1.20
1.25
1.30
1.35
1.40
If No Scalar Integer
If FP vectorized
If fully vectorized
If FP only

Experiment Summary

Experiment Name
Application/home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/run/binaries/aocc_libalm/exec
Timestamp2024-08-10 19:32:00 Universal Timestamp1723311120
Number of processes observed1 Number of threads observed192
Experiment TypeMPI; OpenMP;
Machinegmz11.benchmarkcenter.megware.com
Model NameAMD EPYC 9654 96-Core Processor
Architecturex86_64 Micro ArchitectureZEN_V4
Cache Size1024 KB Number of Cores96
OS VersionLinux 5.14.0-427.18.1.el9_4.x86_64 #1 SMP PREEMPT_DYNAMIC Tue May 28 06:27:02 EDT 2024
Architecture used during static analysisx86_64 Micro Architecture used during static analysisZEN_V4
Frequency Driveracpi-cpufreq Frequency Governorperformance
Huge Pagesalways Hyperthreadingon
Number of sockets2 Number of cores per socket96
Compilation Optionsexec: AMD clang version 16.0.3 (CLANG: AOCC_4.1.0-Build#270 2023_07_10) /cluster/comp/aocc/4.1.0/bin/clang-16 -I /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/build/HACCmk/CoMD/src-openmp -I /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/build/aocc_libalm -O3 -O3 -march=znver4 -fno-vectorize -fno-slp-vectorize -fno-openmp-simd -flto=full -g -fno-omit-frame-pointer -fcf-protection=none -nopie -grecord-command-line -fopenmp=libomp -fveclib=AMDLIBM -MD -MT CMakeFiles/HACCmk.dir/src/main.c.o -MF CMakeFiles/HACCmk.dir/src/main.c.o.d -o CMakeFiles/HACCmk.dir/src/main.c.o -c /home/eoseret/qaas_runs_CPU_9468/172-289-8348/intel/HACCmk/build/HACCmk/src/main.c
Comments

Configuration Summary

Dataset
Run Command<executable>
MPI Commandmpirun -n <number_processes>
Number Processes1
Number Nodes1
Number Processes per Nodes1
FilterNot Used
Profile StartNot Used
Maximal Path Number4
×