options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_10_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_10_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_10_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_10_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_10_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_10_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_10_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_10_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_10_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_10_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_10_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_10_threads) Efficiency (run_10_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_10_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_10_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_10_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_10_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_10_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_10_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_10_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_10_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_10_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_10_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_10_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_10_threads) Efficiency(run_10_threads) Potential Speed-Up (%)
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone ._omp_fn.0]+kmeans-gcc-O3-vectorize96.1096.1296.1395.9695.870.000.000.000.000.00177.0090.5947.7624.3519.490.000.000.000.000.00177.0093.9352.8230.2425.560.000.000.000.000.001248100.005.026.367.848.310.000.020.010.010.01Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.003.536.6511.8320.6724.45GNU C++14 15.1.1 20250425 -march=cascadelake -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi -mn...100.945.560.8415.590.7325.750.6929.48
Loop 2 - main.cpp:72-79 - kmeans-gcc-O3-vectorize+96.1096.1296.1395.9695.873.112.923.063.012.87177.0090.7847.9924.5019.675.732.861.610.860.69177.0093.9352.8230.2425.565.732.851.680.950.761248100.000.020.390.260.390.000.160.100.070.064.148.6915.3127.6434.01101.0100.850.450.760.730.750.71
Loop 1 - main.cpp:73-79 - kmeans-gcc-O3-vectorize92.9993.2093.0792.9593.0092.9993.2093.0792.9593.00171.2687.9246.3823.6418.98171.2687.9246.3823.6418.98171.2691.0851.1329.2924.79171.2691.0851.1329.2924.791248100.005.035.977.738.060.000.130.110.070.063.516.5911.7220.4424.16100.945.570.8415.140.7325.020.6928.76
k_means(int, point_t*, point_t*, int*, point_t*, int, int)+kmeans-gcc-O3-vectorize3.903.783.613.533.530.000.000.000.000.007.177.137.187.177.170.000.000.000.000.007.173.701.991.110.940.000.000.000.000.00111110.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.701.352.524.495.32GNU C++14 15.1.1 20250425 -march=cascadelake -mmmx -mpopcnt -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -mavx -mavx2 -mno-sse4a -mno-fma4 -mno-xop -mfma -mavx512f -mbmi -mbmi2 -maes -mpclmul -mavx512vl -mavx512bw -mavx512dq -mavx512cd -mno-avx512vbmi -mn...100.970.110.90.350.810.690.760.84
Loop 5 - main.cpp:68-107 - kmeans-gcc-O3-vectorize [...]+3.903.783.613.533.530.000.000.000.000.007.177.137.187.177.170.000.000.000.000.007.173.701.991.110.940.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 6 - main.cpp:98-104 - kmeans-gcc-O3-vectorize0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 4 - main.cpp:93-96 - kmeans-gcc-O3-vectorize3.903.783.613.533.533.903.783.613.533.537.177.137.187.177.177.177.137.187.177.177.173.701.991.110.947.173.701.991.110.94111110.000.000.000.000.000.000.000.000.000.000.701.352.524.495.32100.970.110.90.350.810.690.760.84
gomp_barrier_wait_endlibgomp.so.1.0.00.000.060.180.410.520.000.060.180.410.520.000.120.120.120.120.000.120.120.120.120.000.060.100.130.140.000.060.100.130.14013790.000.000.010.030.020.000.000.010.010.00NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.001010101010
gomp_team_barrier_wait_endlibgomp.so.1.0.00.000.030.070.100.080.000.030.070.100.080.000.060.050.040.040.000.060.050.040.040.000.030.040.030.020.000.030.040.030.020137100.000.000.030.020.050.000.000.010.010.01NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.001010101010
×