Help is available by moving the cursor above any symbol or by checking MAQAO website.
Total Time (s) | 624.20 | ||
Profiled Time (s) | 615.25 | ||
Time in analyzed loops (%) | 38.1 | ||
Time in analyzed innermost loops (%) | 36.1 | ||
Time in user code (%) | 39.3 | ||
Compilation Options Score (%) | 75.7 | ||
Perfect Flow Complexity | 1.07 | ||
Array Access Efficiency (%) | 74.1 | ||
GFLOPS | 0.0 | ||
Perfect OpenMP + MPI + Pthread | 1.03 | ||
Perfect OpenMP + MPI + Pthread + Perfect Load Distribution | 1.51 | ||
No Scalar Integer | Potential Speedup | 1.10 | |
Nb Loops to get 80% | 4 | ||
FP Vectorised | Potential Speedup | 1.17 | |
Nb Loops to get 80% | 3 | ||
Fully Vectorised | Potential Speedup | 1.42 | |
Nb Loops to get 80% | 4 | ||
FP Arithmetic Only | Potential Speedup | 1.15 | |
Nb Loops to get 80% | 5 | ||
OpenMP perfectly balanced | Potential Speedup | 1.02 | |
Nb Loops to get 80% | 2 |
Source Object | Issue |
---|---|
▼libexanbDefBoxPlugin.so | |
○compute_cell_particles.h | |
▼libexanbMPIPlugin.so | |
○reduce_cell_particles.h | |
▼exaStamp | |
○ | -g is missing for some functions (possibly ones added by the compiler), it is needed to have more accurate reports. Other recommended flags are: -O2/-O3, -march=(target) |
▼libexaStampAnalyticsPlugin.so | |
○grid_connected_components.cpp | |
○grid_cell_particle_splatting.cpp | |
▼libraveloPlugin.so | |
○compute_pair_singlemat_cell.h | |
○log.h | |
○basic_types_operators.h | |
○basic_types_def.h | |
▼libexanbParticleNeighborsPlugin.so | |
○chunk_neighbors_execute.h |
Experiment Name | exaStamp analysis scalability test | ||||
Application | /ccc/scratch/cont001/ocre/oserete/local/exaStamp/Rhel_8__x86_64/milan-bxi-intel/bin/exaStamp | ||||
Timestamp | 2023-07-03 17:23:41 | Universal Timestamp | 1688397821 | ||
Number of processes observed | 64 | Number of threads observed | 512 | ||
Experiment Type | MPI; OpenMP; | ||||
Machine | inti6231,inti6229,inti6230,inti6232 | ||||
Model Name | AMD EPYC 7763 64-Core Processor | ||||
Architecture | x86_64 | Micro Architecture | ZEN_V3 | ||
Cache Size | 512 KB | Number of Cores | 64 | ||
OS Version | Linux 4.18.0-305.88.1.el8_4.x86_64 #1 SMP Thu Apr 6 10:22:46 EDT 2023 | ||||
Architecture used during static analysis | x86_64 | Micro Architecture used during static analysis | ZEN_V3 | ||
Frequency Driver | acpi-cpufreq | Frequency Governor | performance | ||
Huge Pages | always | Hyperthreading | on | ||
Number of sockets | 2 | Number of cores per socket | 64 | ||
Compilation Options | libexanbDefBoxPlugin.so: Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 : /tmp/tmpxft_00149f79_00000000-8_push_vec3_2nd_order_xform1.cudafe1.cpp : -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/defbox -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/apps/exaStamp/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/include -I/ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/lib/cmake/yaml-cpp/../../../include -I/ccc/products2/openmpi-4.1.4.5/Rhel_8__x86_64/intel--20.0.0/test_bxics-2.6/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tkspline -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tinyexpr -I/ccc/products/cuda-12.0/system/nvhpc-231/bin/../include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/onika/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/src/onika/gen/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/core/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/compute/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/defbox/include -I/ccc/products/cuda-12.0/system/nvhpc-231/bin/../targets/x86_64-linux/include -sox -std=c++17 -D__CUDA_ARCH__=800 -D__CUDA_ARCH_LIST__=800 -c -x c++ -DCUDA_DOUBLE_MATH_FUNCTIONS -fPIC -g -Wall -qopenmp -fno-omit-frame-pointer -O3 -isystem /ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/include -m64 -g -o CMakeFiles/exanbDefBoxPlugin.dir/gen/exanbDefBox_SCRS/push_vec3_2nd_order_xform1.cu.o libexanbMPIPlugin.so: Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 : /tmp/tmpxft_0014a2c6_00000000-8_particle_displ_over1.cudafe1.cpp : -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/mpi -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/apps/exaStamp/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/include -I/ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/lib/cmake/yaml-cpp/../../../include -I/ccc/products2/openmpi-4.1.4.5/Rhel_8__x86_64/intel--20.0.0/test_bxics-2.6/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tkspline -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tinyexpr -I/ccc/products/cuda-12.0/system/nvhpc-231/bin/../include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/onika/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/src/onika/gen/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/core/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/compute/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/grid_cell_particles/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/mpi/include -I/ccc/products/cuda-12.0/system/nvhpc-231/bin/../targets/x86_64-linux/include -sox -std=c++17 -D__CUDA_ARCH__=800 -D__CUDA_ARCH_LIST__=800 -c -x c++ -DCUDA_DOUBLE_MATH_FUNCTIONS -fPIC -g -Wall -qopenmp -fno-omit-frame-pointer -O3 -isystem /ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/include -m64 -g -o CMakeFiles/exanbMPIPlugin.dir/gen/exanbMPI_SCRS/particle_displ_over1.cu.o libexanbParticleNeighborsPlugin.so: Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 : /ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/src/exanb/particle_neighbors/chunk_neighbors1.cpp : -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/particle_neighbors -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/apps/exaStamp/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/include -I/ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/lib/cmake/yaml-cpp/../../../include -I/ccc/products2/openmpi-4.1.4.5/Rhel_8__x86_64/intel--20.0.0/test_bxics-2.6/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tkspline -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tinyexpr -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/onika/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/src/onika/gen/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/core/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/amr/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/particle_neighbors/include -sox -DONIKA_CUDA_VERSION=12.0.76 -DONIKA_CU_ENABLE_KERNEL_BOUNDS=0 -DONIKA_CU_MAX_THREADS_PER_BLOCK=128 -DONIKA_CU_MIN_BLOCKS_PER_SM=6 -DONIKA_OMP_MAX_DEPENDS=10 -DSOATL_SIZE_TYPE_32BITS=1 -DUSTAMP_PLUGIN_DIR=\"/ccc/scratch/cont001/ocre/oserete/local/exaStamp/Rhel_8__x86_64/milan-bxi-intel/lib\" -DUSTAMP_VERSION=\"master-2023-07-03\" -DXNB_CONFIG_DIR=\"/ccc/scratch/cont001/ocre/oserete/local/exaStamp/Rhel_8__x86_64/milan-bxi-intel/share/config\" -DXNB_DEFAULT_CONFIG_FILE=\"config_exaStamp.msp\" -DXNB_LOCAL_CONFIG_FILE=\"exaStamp_build.msp\" -DXSTAMP_ADVISED_HW_THREADS=256 -DXSTAMP_CUDA_ARCH=80 -DXSTAMP_CUDA_VERSION=12.0.76 -DXSTAMP_DEFAULT_DATA_DIRS=\".:/ccc/home/cont001/xstampdev/xstampdev/data:/ccc/scratch/cont001/ocre/oserete/local/exaStamp/Rhel_8__x86_64/milan-bxi-intel/share/data\" -DXSTAMP_FIELD_ARRAYS_STORE_COUNT=1 -DXSTAMP_MAX_PARTICLE_NEIGHBORS_DEFAULT=512 -DXSTAMP_OMP_NUM_THREADS_WORKAROUND=1 -DXSTAMP_OMP_VERSION=5.0 -DexanbParticleNeighborsPlugin_EXPORTS -DXSTAMP_SOURCE_ID=72 -isystem /ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/include -g -O3 -g -fno-omit-frame-pointer -march=core-avx2 -DNDEBUG -fPIC -Wall -qopenmp -std=c++17 -MD -MT src/exanb/particle_neighbors/CMakeFiles/exanbParticleNeighborsPlugin.dir/chunk_neighbors1.cpp.o -MF CMakeFiles/exanbParticleNeighborsPlugin.dir/chunk_neighbors1.cpp.o.d -o CMakeFiles/exanbParticleNeighborsPlugin.dir/chunk_neighbors1.cpp.o -c libexaStampAnalyticsPlugin.so: Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 : /ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exaStamp/analytics/grid_cell_particle_splatting.cpp : -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exaStamp/analytics -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/apps/exaStamp/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/include -I/ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/lib/cmake/yaml-cpp/../../../include -I/ccc/products2/openmpi-4.1.4.5/Rhel_8__x86_64/intel--20.0.0/test_bxics-2.6/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tkspline -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tinyexpr -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/onika/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/src/onika/gen/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/core/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exaStamp/mechanical/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exaStamp/compute/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/particle_neighbors/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/amr/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/mpi/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/compute/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/grid_cell_particles/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exaStamp/particle_species/include -sox -DONIKA_CUDA_VERSION=12.0.76 -DONIKA_CU_ENABLE_KERNEL_BOUNDS=0 -DONIKA_CU_MAX_THREADS_PER_BLOCK=128 -DONIKA_CU_MIN_BLOCKS_PER_SM=6 -DONIKA_OMP_MAX_DEPENDS=10 -DSOATL_SIZE_TYPE_32BITS=1 -DUSTAMP_PLUGIN_DIR=\"/ccc/scratch/cont001/ocre/oserete/local/exaStamp/Rhel_8__x86_64/milan-bxi-intel/lib\" -DUSTAMP_VERSION=\"master-2023-07-03\" -DXNB_CONFIG_DIR=\"/ccc/scratch/cont001/ocre/oserete/local/exaStamp/Rhel_8__x86_64/milan-bxi-intel/share/config\" -DXNB_DEFAULT_CONFIG_FILE=\"config_exaStamp.msp\" -DXNB_LOCAL_CONFIG_FILE=\"exaStamp_build.msp\" -DXSTAMP_ADVISED_HW_THREADS=256 -DXSTAMP_CUDA_ARCH=80 -DXSTAMP_CUDA_VERSION=12.0.76 -DXSTAMP_DEFAULT_DATA_DIRS=\".:/ccc/home/cont001/xstampdev/xstampdev/data:/ccc/scratch/cont001/ocre/oserete/local/exaStamp/Rhel_8__x86_64/milan-bxi-intel/share/data\" -DXSTAMP_FIELD_ARRAYS_STORE_COUNT=1 -DXSTAMP_MAX_PARTICLE_NEIGHBORS_DEFAULT=512 -DXSTAMP_OMP_NUM_THREADS_WORKAROUND=1 -DXSTAMP_OMP_VERSION=5.0 -DexaStampAnalyticsPlugin_EXPORTS -DXSTAMP_SOURCE_ID=123 -isystem /ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/include -g -O3 -g -fno-omit-frame-pointer -march=core-avx2 -DNDEBUG -fPIC -Wall -qopenmp -std=c++17 -MD -MT src/exaStamp/analytics/CMakeFiles/exaStampAnalyticsPlugin.dir/grid_cell_particle_splatting.cpp.o -MF CMakeFiles/exaStampAnalyticsPlugin.dir/grid_cell_particle_splatting.cpp.o.d -o CMakeFiles/exaStampAnalyticsPlugin.dir/grid_cell_particle_splatting.cpp.o -c libraveloPlugin.so: Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 Intel(R) C++ Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.5.0 Build 20211109_000000 : /tmp/tmpxft_0014a6c5_00000000-8_eam_potential1.cudafe1.cpp : -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/src/exaStamp/potential/gen/eam_potentials/ravelo -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/apps/exaStamp/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/include -I/ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/lib/cmake/yaml-cpp/../../../include -I/ccc/products2/openmpi-4.1.4.5/Rhel_8__x86_64/intel--20.0.0/test_bxics-2.6/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tkspline -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/thirdparty/tinyexpr -I/ccc/products/cuda-12.0/system/nvhpc-231/bin/../include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/onika/include -I/ccc/scratch/cont001/ocre/oserete/build/exaStamp/Rhel_8__x86_64/milan-bxi-intel/src/onika/gen/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/core/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exaStamp/potential_factory/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/compute/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exaStamp/particle_species/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/particle_neighbors/include -I/ccc/dsku/nfs-server/user/cont001/ocre/oserete/exastamp/ExaStamp/src/exanb/amr/include -I/ccc/products/cuda-12.0/system/nvhpc-231/bin/../targets/x86_64-linux/include -sox -std=c++17 -D__CUDA_ARCH__=800 -D__CUDA_ARCH_LIST__=800 -c -x c++ -DCUDA_DOUBLE_MATH_FUNCTIONS -fPIC -g -Wall -qopenmp -fno-omit-frame-pointer -O3 -isystem /ccc/home/cont001/xstampdev/xstampdev/tools/yaml-cpp-intl21.4-gcc8.4/include -m64 -g -o CMakeFiles/raveloPlugin.dir/gen/ravelo_SCRS/eam_potential1.cu.o exaStamp: N/A |
Dataset | |
Run Command | <executable> /ccc/scratch/cont001/ocre/oserete/exastamp/msp_inputs/run_killian.msp |
MPI Command | ccc_mprun -p milan-bxi -n <number_processes> -c <OMP_NUM_THREADS> -N <number_nodes> -T 3600 -x -E --enable_perf |
Number Processes | 64 |
Number Nodes | 4 |
Number Processes per Nodes | 16 |
Filter | Not Used |
Profile Start | Not Used |
Maximal Path Number | 4 |