| Name | Module | Max Thread Time / Walltime orig_0 (%) | Coverage orig_0 (%) | Coverage Excluding Loops orig_0 (%) | Max Inclusive Time Over Threads orig_0 (s) | Max Exclusive Time Over Threads orig_0 (s) | Inclusive Time w.r.t. Wall Time orig_0 (s) | Exclusive Time w.r.t. Wall Time orig_0 (s) | Nb Threads orig_0 | Deviation (coverage) orig_0 | Deviation (walltime) orig_0 | Categories orig_0 | GFLOPS orig_0 | Compilation Options |
| ►ggml_vec_dot_q8_0_q8_0+ | libggml-cpu.so | 82.43 | 61.76 | 0.04 | 56.93 | 0.07 | 37.44 | 0.02 | 192 | 30.88 | 18.76 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 54.18 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REP... |
| ○Loop 2865 - quants.c:1066-1073 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2866 - quants.c:108-1042 - libggml-cpu.so [...] | | 82.41 | 61.72 | 61.72 | 56.92 | 56.92 | 37.42 | 37.42 | 192 | 30.88 | 18.76 | | 54.14 | |
| ○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libiomp5.so | 79.36 | 36.00 | 36.00 | 54.81 | 54.81 | 21.83 | 21.83 | 192 | 30.26 | 18.00 | OMP (%): 100.00 | 0.12 | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libiomp5.so | 2.37 | 1.00 | 1.00 | 1.64 | 1.64 | 0.61 | 0.61 | 192 | 0.79 | 0.47 | OMP (%): 100.00 | 0.17 | |
| ►ggml_compute_forward_mul_mat+ | libggml-cpu.so | 0.43 | 0.19 | 0.04 | 0.30 | 0.10 | 0.12 | 0.02 | 192 | 0.09 | 0.05 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 48.72 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REP... |
| ►Loop 101 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+ | | 0.04 | 0.15 | 0.01 | 0.35 | 0.03 | 0.09 | 0.01 | 120 | 0.01 | 0.01 | | 58.55 | |
| ►Loop 103 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+ | | 0.01 | 0.14 | 0.00 | 0.32 | 0.01 | 0.08 | 0.00 | 15 | 0.01 | 0.01 | | 71.38 | |
| ►Loop 104 - ggml-cpu.c:1163-1198 - libggml-cpu.so [...]+ | | 0.07 | 0.13 | 0.02 | 0.31 | 0.05 | 0.08 | 0.01 | 176 | 0.02 | 0.01 | | 76.63 | |
| ►Loop 105 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+ | | 0.30 | 0.12 | 0.11 | 0.26 | 0.21 | 0.07 | 0.07 | 192 | 0.07 | 0.04 | | 50.82 | |
| ○Loop 106 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 47 | 0.01 | 0.00 | | 333.18 | |
| ○Loop 102 - ggml-cpu.c:1193-1194 - libggml-cpu.so | | 0.06 | 0.01 | 0.01 | 0.04 | 0.04 | 0.01 | 0.01 | 172 | 0.01 | 0.01 | | 157.79 | |
| ○Loop 107 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 109 - ggml-cpu.c:1316-1328 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.04 | 0.01 | 0.00 | 0.00 | 10 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 108 - ggml-cpu.c:1317-1328 - libggml-cpu.so | | 0.04 | 0.00 | 0.00 | 0.03 | 0.03 | 0.00 | 0.00 | 26 | 0.01 | 0.00 | | 1.70 | |
| ►Loop 115 - ggml-cpu.c:1248-1260 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.03 | 0.01 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 114 - ggml-cpu.c:1249-1260 - libggml-cpu.so | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 46 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 110 - ggml-cpu.c:1289-1295 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.05 | 0.01 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 111 - ggml-cpu.c:1290-1295 - libggml-cpu.so+ | | 0.03 | 0.00 | 0.00 | 0.04 | 0.02 | 0.00 | 0.00 | 10 | 0.01 | 0.00 | | 0.00 | |
| ○Loop 113 - ggml-cpu.c:1291-1295 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 112 - ggml-cpu.c:1291-1295 - libggml-cpu.so | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 19 | 0.00 | 0.00 | | 0.00 | |
| ○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libiomp5.so | 0.68 | 0.15 | 0.15 | 0.47 | 0.47 | 0.09 | 0.09 | 190 | 0.22 | 0.13 | OMP (%): 100.00 | 19.46 | |
| ○__GI___sched_yield | libc.so.6 | 0.42 | 0.15 | 0.15 | 0.29 | 0.29 | 0.09 | 0.09 | 142 | 0.13 | 0.08 | System (%): 100.00 | 0.00 | |
| ►void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<4>(long, long, long, long)+ | libggml-cpu.so | 0.19 | 0.12 | 0.00 | 0.13 | 0.00 | 0.07 | 0.00 | 189 | 0.07 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 208.71 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --driver-mode=g++ --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ○Loop 2624 - sgemm.cpp:814-853 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 2626 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+ | | 0.00 | 0.12 | 0.00 | 0.13 | 0.00 | 0.07 | 0.00 | 8 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2625 - sgemm.cpp:138-1044 - libggml-cpu.so [...] | | 0.19 | 0.12 | 0.12 | 0.13 | 0.13 | 0.07 | 0.07 | 189 | 0.07 | 0.04 | | 208.39 | |
| ►void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<2>(long, long, long, long)+ | libggml-cpu.so | 0.20 | 0.12 | 0.00 | 0.14 | 0.00 | 0.07 | 0.00 | 189 | 0.07 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 98.92 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --driver-mode=g++ --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ○Loop 2638 - sgemm.cpp:814-853 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 2640 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+ | | 0.00 | 0.12 | 0.00 | 0.14 | 0.00 | 0.07 | 0.00 | 3 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2639 - sgemm.cpp:138-1044 - libggml-cpu.so [...] | | 0.20 | 0.12 | 0.12 | 0.14 | 0.14 | 0.07 | 0.07 | 189 | 0.07 | 0.04 | | 98.81 | |
| ○Loop 2637 - sgemm.cpp:814-853 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_flash_attn_ext+ | libggml-cpu.so | 0.43 | 0.05 | 0.00 | 0.30 | 0.02 | 0.03 | 0.00 | 69 | 0.15 | 0.09 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 1415.50 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --driver-mode=g++ --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 2044 - ops.cpp:8759-8927 - libggml-cpu.so [...]+ | | 0.01 | 0.05 | 0.00 | 0.40 | 0.01 | 0.03 | 0.00 | 6 | 0.01 | 0.01 | | 63.47 | |
| ○Loop 2049 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2051 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2047 - vec.h:677-682 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 16 | 0.01 | 0.00 | | 237.93 | |
| ○Loop 2050 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2048 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2052 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.04 | 0.00 | 0.00 | 0.03 | 0.03 | 0.00 | 0.00 | 34 | 0.01 | 0.01 | | 239.97 | |
| ►Loop 2053 - ops.cpp:8759-8927 - libggml-cpu.so [...]+ | | 0.10 | 0.05 | 0.01 | 0.35 | 0.07 | 0.03 | 0.01 | 36 | 0.03 | 0.02 | | 1722.43 | |
| ○Loop 2058 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2061 - vec.h:502-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2059 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2062 - vec.h:491-497 - libggml-cpu.so | | 0.39 | 0.04 | 0.04 | 0.27 | 0.27 | 0.02 | 0.02 | 36 | 0.10 | 0.06 | | 1564.38 | |
| ○Loop 2054 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2060 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2057 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2055 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2064 - vec.h:740-745 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 5 | 0.01 | 0.00 | | 190.43 | |
| ○Loop 2056 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2063 - vec.h:750-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2046 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2045 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_graph_compute_thread+ | libggml-cpu.so | 0.14 | 0.04 | 0.02 | 0.10 | 0.05 | 0.03 | 0.01 | 188 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 29.12 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REP... |
| ○Loop 124 - ggml-cpu.c:533-2891 - libggml-cpu.so [...] | | 0.09 | 0.03 | 0.03 | 0.06 | 0.06 | 0.02 | 0.02 | 172 | 0.02 | 0.01 | | 50.55 | |
| ○Loop 125 - ggml-cpu.c:2087-2088 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 126 - ggml-cpu.c:2087-2088 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_vec_dot_f16+ | libggml-cpu.so | 0.32 | 0.04 | 0.01 | 0.22 | 0.06 | 0.03 | 0.00 | 32 | 0.06 | 0.03 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 1198.63 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --driver-mode=g++ --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ○Loop 862 - vec.cpp:324-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 863 - vec.cpp:311-316 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 864 - vec.cpp:311-316 - libggml-cpu.so | | 0.26 | 0.04 | 0.04 | 0.18 | 0.18 | 0.02 | 0.02 | 32 | 0.05 | 0.03 | | 752.21 | |
| ○Loop 861 - vec.cpp:324-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►__intel_avx_rep_memcpy+ | exec | 0.13 | 0.04 | 0.04 | 0.09 | 0.09 | 0.02 | 0.02 | 191 | 0.03 | 0.02 | Memory (%): 100.00 | 62.56 | |
| ○Loop 2062 - - exec | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2063 - - exec | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | | 0.00 | |
| ○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libiomp5.so | 1.00 | 0.04 | 0.04 | 0.69 | 0.69 | 0.02 | 0.02 | 126 | 0.14 | 0.08 | OMP (%): 100.00 | 6.61 | |
| ○__kmp_yield | libiomp5.so | 0.14 | 0.04 | 0.04 | 0.10 | 0.10 | 0.02 | 0.02 | 121 | 0.04 | 0.02 | OMP (%): 100.00 | 0.00 | |
| ►ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+ | libggml-cpu.so | 0.10 | 0.04 | 0.00 | 0.07 | 0.03 | 0.02 | 0.00 | 192 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 264.60 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --driver-mode=g++ --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 1675 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.01 | 0.03 | 0.00 | 0.13 | 0.01 | 0.02 | 0.00 | 18 | 0.01 | 0.00 | | 201.68 | |
| ►Loop 1676 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.03 | 0.03 | 0.00 | 0.12 | 0.02 | 0.02 | 0.00 | 143 | 0.01 | 0.00 | | 245.17 | |
| ○Loop 1689 - ops.cpp:6220-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1694 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1707 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1677 - ops.cpp:6365-6484 - libggml-cpu.so [...]+ | | 0.04 | 0.01 | 0.01 | 0.05 | 0.03 | 0.01 | 0.00 | 189 | 0.01 | 0.01 | | 407.24 | |
| ○Loop 1681 - ops.cpp:6446-6457 - libggml-cpu.so | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 20 | 0.01 | 0.01 | | 67.94 | |
| ○Loop 1685 - ops.cpp:6429-6442 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1682 - ops.cpp:6446-6457 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1679 - ops.cpp:6479-6484 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1680 - ops.cpp:6479-6484 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1686 - ops.cpp:6429-6442 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1683 - ops.cpp:6413-6426 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1678 - ops.cpp:6462-6475 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1684 - ops.cpp:6413-6426 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1702 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1706 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1698 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1701 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1699 - ops.cpp:6220-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1692 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1693 - ops.cpp:6220-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1703 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1690 - ops.cpp:6220-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1697 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1696 - ops.cpp:6220-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1705 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1695 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1688 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1691 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1704 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1708 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.07 | 0.02 | 0.02 | 0.05 | 0.05 | 0.01 | 0.01 | 192 | 0.02 | 0.01 | | 277.71 | |
| ○Loop 1687 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1700 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►quantize_row_q8_0+ | libggml-cpu.so | 0.14 | 0.04 | 0.01 | 0.10 | 0.03 | 0.02 | 0.00 | 154 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 0.35 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REP... |
| ○Loop 2855 - quants.c:298-355 - libggml-cpu.so [...] | | 0.12 | 0.03 | 0.03 | 0.08 | 0.08 | 0.02 | 0.02 | 142 | 0.03 | 0.01 | | 0.43 | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() | libiomp5.so | 0.45 | 0.02 | 0.02 | 0.31 | 0.31 | 0.01 | 0.01 | 128 | 0.09 | 0.05 | OMP (%): 100.00 | 38.29 | |
| ○__kmp_barrier | libiomp5.so | 0.48 | 0.01 | 0.01 | 0.33 | 0.33 | 0.01 | 0.01 | 168 | 0.04 | 0.03 | OMP (%): 100.00 | 99.14 | |
| ○__libm_expf_l9 | exec | 0.10 | 0.01 | 0.01 | 0.07 | 0.07 | 0.01 | 0.01 | 32 | 0.03 | 0.02 | Math (%): 100.00 | 1435.63 | |
| ►ggml_compute_forward_add_non_quantized+ | libggml-cpu.so | 0.16 | 0.01 | 0.00 | 0.11 | 0.02 | 0.00 | 0.00 | 45 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 49.78 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --driver-mode=g++ --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ►Loop 401 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 402 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 404 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 403 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 409 - binary-ops.cpp:10-97 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 411 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 410 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 441 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 443 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.11 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 442 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.16 | 0.00 | 0.00 | 0.11 | 0.11 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 289.66 | |
| ○Loop 444 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 445 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 447 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 446 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 423 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 424 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 425 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 405 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 406 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 407 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 408 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 433 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 435 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 434 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 436 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 437 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 426 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 430 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 429 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 427 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 428 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 438 - binary-ops.cpp:10-97 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 439 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 440 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 416 - binary-ops.cpp:10-97 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 417 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 418 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 419 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 420 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 422 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 421 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 412 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 413 - binary-ops.cpp:10-124 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 415 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 414 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 431 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 432 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_vec_swiglu_f32+ | libggml-cpu.so | 0.32 | 0.00 | 0.00 | 0.22 | 0.00 | 0.00 | 0.00 | 7 | 0.14 | 0.08 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../build/bin/libggml-blas.so (%): 100.00 | 4103.41 | clang based Intel(R) oneAPI DPC++/C++ Compiler 2025.1.0 (2025.1.0.20250317) /cluster/intel/oneapi/2025.1.0/compiler/2025.1/bin/compiler/clang --driver-mode=g++ --intel -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -... |
| ○Loop 870 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 871 - vec.h:1084-1116 - libggml-cpu.so [...] | | 0.32 | 0.00 | 0.00 | 0.22 | 0.22 | 0.00 | 0.00 | 7 | 0.14 | 0.08 | | 4099.26 | |
| ○Loop 869 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |