options

Functions and Loops

Columns Filter

Coverage run_1_thread (%) Coverage run_2_threads (%) Coverage run_4_threads (%) Coverage run_8_threads (%) Coverage run_10_threads (%) Coverage Excluding Loops run_1_thread (%) Coverage Excluding Loops run_2_threads (%) Coverage Excluding Loops run_4_threads (%) Coverage Excluding Loops run_8_threads (%) Coverage Excluding Loops run_10_threads (%) Max Inclusive Time Over Threads run_1_thread (s) Max Inclusive Time Over Threads run_2_threads (s) Max Inclusive Time Over Threads run_4_threads (s) Max Inclusive Time Over Threads run_8_threads (s) Max Inclusive Time Over Threads run_10_threads (s) Max Exclusive Time Over Threads run_1_thread (s) Max Exclusive Time Over Threads run_2_threads (s) Max Exclusive Time Over Threads run_4_threads (s) Max Exclusive Time Over Threads run_8_threads (s) Max Exclusive Time Over Threads run_10_threads (s) Inclusive Time w.r.t. Wall Time run_1_thread (s) Inclusive Time w.r.t. Wall Time run_2_threads (s) Inclusive Time w.r.t. Wall Time run_4_threads (s) Inclusive Time w.r.t. Wall Time run_8_threads (s) Inclusive Time w.r.t. Wall Time run_10_threads (s) Exclusive Time w.r.t. Wall Time run_1_thread (s) Exclusive Time w.r.t. Wall Time run_2_threads (s) Exclusive Time w.r.t. Wall Time run_4_threads (s) Exclusive Time w.r.t. Wall Time run_8_threads (s) Exclusive Time w.r.t. Wall Time run_10_threads (s) Nb Threads run_1_thread Nb Threads run_2_threads Nb Threads run_4_threads Nb Threads run_8_threads Nb Threads run_10_threads Deviation (coverage) run_1_thread Deviation (coverage) run_2_threads Deviation (coverage) run_4_threads Deviation (coverage) run_8_threads Deviation (coverage) run_10_threads Deviation (walltime) run_1_thread Deviation (walltime) run_2_threads Deviation (walltime) run_4_threads Deviation (walltime) run_8_threads Deviation (walltime) run_10_threads Categories run_1_thread Categories run_2_threads Categories run_4_threads Categories run_8_threads Categories run_10_threads GFLOPS run_1_thread GFLOPS run_2_threads GFLOPS run_4_threads GFLOPS run_8_threads GFLOPS run_10_threads Compilation Options (run_1_thread) Efficiency (run_1_thread) Potential Speed-Up (%) (run_2_threads) Efficiency (run_2_threads) Potential Speed-Up (%) (run_4_threads) Efficiency (run_4_threads) Potential Speed-Up (%) (run_8_threads) Efficiency (run_8_threads) Potential Speed-Up (%) (run_10_threads) Efficiency (run_10_threads) Potential Speed-Up (%)
NameModuleCoverage run_1_thread (%)Coverage run_2_threads (%)Coverage run_4_threads (%)Coverage run_8_threads (%)Coverage run_10_threads (%)Coverage Excluding Loops run_1_thread (%)Coverage Excluding Loops run_2_threads (%)Coverage Excluding Loops run_4_threads (%)Coverage Excluding Loops run_8_threads (%)Coverage Excluding Loops run_10_threads (%)Max Inclusive Time Over Threads run_1_thread (s)Max Inclusive Time Over Threads run_2_threads (s)Max Inclusive Time Over Threads run_4_threads (s)Max Inclusive Time Over Threads run_8_threads (s)Max Inclusive Time Over Threads run_10_threads (s)Max Exclusive Time Over Threads run_1_thread (s)Max Exclusive Time Over Threads run_2_threads (s)Max Exclusive Time Over Threads run_4_threads (s)Max Exclusive Time Over Threads run_8_threads (s)Max Exclusive Time Over Threads run_10_threads (s)Inclusive Time w.r.t. Wall Time run_1_thread (s)Inclusive Time w.r.t. Wall Time run_2_threads (s)Inclusive Time w.r.t. Wall Time run_4_threads (s)Inclusive Time w.r.t. Wall Time run_8_threads (s)Inclusive Time w.r.t. Wall Time run_10_threads (s)Exclusive Time w.r.t. Wall Time run_1_thread (s)Exclusive Time w.r.t. Wall Time run_2_threads (s)Exclusive Time w.r.t. Wall Time run_4_threads (s)Exclusive Time w.r.t. Wall Time run_8_threads (s)Exclusive Time w.r.t. Wall Time run_10_threads (s)Nb Threads run_1_threadNb Threads run_2_threadsNb Threads run_4_threadsNb Threads run_8_threadsNb Threads run_10_threadsDeviation (coverage) run_1_threadDeviation (coverage) run_2_threadsDeviation (coverage) run_4_threadsDeviation (coverage) run_8_threadsDeviation (coverage) run_10_threadsDeviation (walltime) run_1_threadDeviation (walltime) run_2_threadsDeviation (walltime) run_4_threadsDeviation (walltime) run_8_threadsDeviation (walltime) run_10_threadsCategories run_1_threadCategories run_2_threadsCategories run_4_threadsCategories run_8_threadsCategories run_10_threadsGFLOPS run_1_threadGFLOPS run_2_threadsGFLOPS run_4_threadsGFLOPS run_8_threadsGFLOPS run_10_threadsCompilation Options(run_1_thread) Efficiency(run_1_thread) Potential Speed-Up (%)(run_2_threads) Efficiency(run_2_threads) Potential Speed-Up (%)(run_4_threads) Efficiency(run_4_threads) Potential Speed-Up (%)(run_8_threads) Efficiency(run_8_threads) Potential Speed-Up (%)(run_10_threads) Efficiency(run_10_threads) Potential Speed-Up (%)
k_means(int, point_t*, point_t*, int*, point_t*, int, int) [clone .extracted]+kmeans-icpx-O3-funroll92.7688.7482.1571.1966.910.000.000.000.000.0094.5248.4025.4813.2811.050.000.000.000.000.0094.5249.5827.2615.0912.760.000.000.000.000.001248100.003.063.674.094.480.000.010.010.100.20Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.006.6112.6022.9341.4248.97clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.2.1 (2024.2.1.20240711) --driver-mode=g++ --intel -I . -MMD -MP -march=native -std=c++14 -g -fiopenmp -O3 -funroll-loops -c -o main.o main.cpp -fveclib=SVML -faltmathlib=SVML -fheinous-gnu-extensions100.954.150.8710.930.7815.440.7417.33
Loop 13 - main.cpp:71-82 - kmeans-icpx-O3-funroll+92.7688.7482.1571.1966.913.433.253.382.762.7094.5248.5925.6413.4611.153.501.801.090.590.4894.5249.5827.2615.0912.763.501.821.120.580.511248100.000.180.190.350.370.000.040.030.050.057.3415.3023.4845.7249.62100.960.120.780.750.750.690.680.86
Loop 15 - main.cpp:73-79 - kmeans-icpx-O3-funroll85.3081.3675.1065.1761.3785.3081.3675.1065.1761.3786.9144.4323.3812.2010.1686.9144.4323.3812.2010.1686.9245.4524.9213.8111.7086.9245.4524.9213.8111.701248100.003.013.433.554.100.000.100.080.150.216.5512.5522.9241.2648.84100.963.570.879.610.7913.90.7415.78
Loop 14 - main.cpp:73-79 - kmeans-icpx-O3-funroll4.034.133.663.272.844.034.133.663.272.844.112.351.170.660.504.112.351.170.660.504.102.311.220.690.544.102.311.220.690.541248100.000.120.170.460.320.000.140.050.060.047.2311.5522.5240.8751.15100.890.460.840.570.740.850.760.69
k_means(int, point_t*, point_t*, int*, point_t*, int, int)+kmeans-icpx-O3-funroll7.226.846.215.304.900.000.000.000.000.007.367.457.707.787.770.000.000.000.000.007.363.822.061.120.930.000.000.000.000.00111110.000.000.000.000.000.000.000.000.000.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.00Exe (%): 100.000.681.312.434.455.35clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.2.1 (2024.2.1.20240711) --driver-mode=g++ --intel -I . -MMD -MP -march=native -std=c++14 -g -fiopenmp -O3 -funroll-loops -c -o main.o main.cpp -fveclib=SVML -faltmathlib=SVML -fheinous-gnu-extensions100.960.250.890.670.820.960.791.04
Loop 5 - main.cpp:68-105 - kmeans-icpx-O3-funroll [...]+7.226.846.215.304.900.000.000.000.000.007.367.457.707.787.770.000.000.000.000.007.363.822.061.120.930.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 9 - main.cpp:93-96 - kmeans-icpx-O3-funroll7.226.846.215.304.907.226.846.215.304.907.367.457.707.787.777.367.457.707.787.777.363.822.061.120.937.363.822.061.120.93111110.000.000.000.000.000.000.000.000.000.000.681.312.434.455.35100.960.250.890.670.820.960.791.04
Loop 6 - main.cpp:98-104 - kmeans-icpx-O3-funroll0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 8 - main.cpp:98-104 - kmeans-icpx-O3-funroll0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
Loop 7 - main.cpp:98-105 - kmeans-icpx-O3-funroll0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00000000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libiomp5.so0.000.100.240.550.600.000.100.240.550.600.000.110.120.140.150.000.110.120.140.150.000.060.080.120.110.000.060.080.120.110138100.000.000.060.250.200.000.000.020.040.03NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.001010101010
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libiomp5.so0.004.2911.3722.8527.490.004.2911.3722.8527.490.004.684.714.905.030.004.684.714.905.030.002.403.774.845.240.002.403.774.845.240138100.000.000.058.718.280.000.000.021.541.24NAOMP (%): 100.00OMP (%): 100.00OMP (%): 100.00OMP (%): 100.000.000.000.000.000.001010101010
×