| Name | Module | Max Thread Time / Walltime aocc_0 (%) | Coverage aocc_0 (%) | Coverage Excluding Loops aocc_0 (%) | Max Inclusive Time Over Threads aocc_0 (s) | Max Exclusive Time Over Threads aocc_0 (s) | Inclusive Time w.r.t. Wall Time aocc_0 (s) | Exclusive Time w.r.t. Wall Time aocc_0 (s) | Nb Threads aocc_0 | Deviation (coverage) aocc_0 | Deviation (walltime) aocc_0 | Categories aocc_0 | GFLOPS aocc_0 | Compilation Options |
| ►ggml_vec_dot_q8_0_q8_0+ | libggml-cpu.so | 82.80 | 61.79 | 0.03 | 57.34 | 0.06 | 37.67 | 0.02 | 192 | 31.00 | 18.95 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 53.88 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ○Loop 2536 - quants.c:1066-1073 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2537 - quants.c:108-1042 - libggml-cpu.so [...] | | 82.80 | 61.76 | 61.76 | 57.34 | 57.34 | 37.65 | 37.65 | 192 | 31.00 | 18.95 | | 53.84 | |
| ○__kmp_hardware_timestamp | libomp.so | 45.57 | 20.04 | 20.04 | 31.56 | 31.56 | 12.22 | 12.22 | 192 | 16.94 | 10.13 | OMP (%): 100.00 | 0.09 | |
| ○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 37.31 | 16.97 | 16.97 | 25.84 | 25.84 | 10.35 | 10.35 | 192 | 14.26 | 8.53 | OMP (%): 100.00 | 0.18 | |
| ○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 5.91 | 0.28 | 0.28 | 4.09 | 4.09 | 0.17 | 0.17 | 191 | 0.70 | 0.43 | OMP (%): 100.00 | 10.22 | |
| ►ggml_compute_forward_mul_mat+ | libggml-cpu.so | 0.30 | 0.19 | 0.04 | 0.21 | 0.09 | 0.12 | 0.02 | 192 | 0.07 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 52.54 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ►Loop 89 - ggml-cpu.c:1289-1297 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.03 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 88 - ggml-cpu.c:1289-1297 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.02 | 0.01 | 0.00 | 0.00 | 6 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 87 - ggml-cpu.c:1289-1297 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 4 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 91 - ggml-cpu.c:1248-1260 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.03 | 0.01 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 90 - ggml-cpu.c:1248-1260 - libggml-cpu.so | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 35 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 86 - ggml-cpu.c:1316-1328 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 85 - ggml-cpu.c:1317-1328 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 13 | 0.00 | 0.00 | | 3.97 | |
| ►Loop 66 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+ | | 0.06 | 0.15 | 0.01 | 0.32 | 0.04 | 0.09 | 0.00 | 107 | 0.01 | 0.01 | | 61.16 | |
| ►Loop 67 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 68 - ggml-cpu.c:1163-1198 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 70 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 69 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 75 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 74 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 73 - ggml-cpu.c:1193-1194 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 72 - ggml-cpu.c:1164-1194 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 71 - ggml-cpu.c:1193-1194 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 76 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+ | | 0.01 | 0.14 | 0.00 | 0.28 | 0.01 | 0.08 | 0.00 | 23 | 0.00 | 0.00 | | 595.29 | |
| ►Loop 77 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+ | | 0.04 | 0.14 | 0.01 | 0.27 | 0.03 | 0.08 | 0.00 | 114 | 0.01 | 0.01 | | 74.23 | |
| ►Loop 81 - ggml-cpu.c:1164-1194 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 80 - ggml-cpu.c:1193-1194 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 79 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 78 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 84 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+ | | 0.19 | 0.13 | 0.08 | 0.24 | 0.13 | 0.08 | 0.05 | 192 | 0.05 | 0.03 | | 70.88 | |
| ○Loop 82 - ggml-cpu.c:1193-1194 - libggml-cpu.so | | 0.13 | 0.05 | 0.05 | 0.09 | 0.09 | 0.03 | 0.03 | 190 | 0.03 | 0.02 | | 53.36 | |
| ○Loop 83 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 41 | 0.01 | 0.00 | | 180.98 | |
| ○__GI___sched_yield | libc.so.6 | 0.46 | 0.15 | 0.15 | 0.32 | 0.32 | 0.09 | 0.09 | 146 | 0.14 | 0.09 | System (%): 100.00 | 0.00 | |
| ►void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<2>(long, long, long, long)+ | libggml-cpu.so | 0.22 | 0.13 | 0.00 | 0.15 | 0.00 | 0.08 | 0.00 | 189 | 0.07 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 91.54 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 2372 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+ | | 0.01 | 0.13 | 0.00 | 0.16 | 0.01 | 0.08 | 0.00 | 7 | 0.01 | 0.00 | | 119.07 | |
| ○Loop 2373 - sgemm.cpp:138-1044 - libggml-cpu.so [...] | | 0.22 | 0.13 | 0.13 | 0.15 | 0.15 | 0.08 | 0.08 | 189 | 0.07 | 0.04 | | 91.50 | |
| ►void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<4>(long, long, long, long)+ | libggml-cpu.so | 0.20 | 0.12 | 0.00 | 0.14 | 0.00 | 0.07 | 0.00 | 189 | 0.07 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 201.52 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 2363 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+ | | 0.00 | 0.12 | 0.00 | 0.14 | 0.00 | 0.07 | 0.00 | 19 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2364 - sgemm.cpp:138-1044 - libggml-cpu.so [...] | | 0.20 | 0.12 | 0.12 | 0.14 | 0.14 | 0.07 | 0.07 | 189 | 0.07 | 0.04 | | 200.87 | |
| ►ggml_graph_compute_thread+ | libggml-cpu.so | 0.13 | 0.06 | 0.01 | 0.09 | 0.04 | 0.03 | 0.01 | 187 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 20.83 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ►Loop 114 - ggml-cpu.c:1572-1579 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 113 - ggml-cpu.c:1572-1579 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 100 - ggml-cpu.c:682-1654 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 101 - ggml-cpu.c:1436-1654 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 107 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 109 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 108 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 111 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 110 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 102 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 104 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 106 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 105 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 103 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 117 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 116 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 115 - ggml-cpu.c:1552-1560 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 112 - ggml-cpu.c:1585-1587 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 99 - ggml-cpu.c:533-2891 - libggml-cpu.so [...] | | 0.12 | 0.05 | 0.05 | 0.08 | 0.08 | 0.03 | 0.03 | 184 | 0.03 | 0.02 | | 25.47 | |
| ○Loop 118 - ggml-cpu.c:2087-2088 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_flash_attn_ext+ | libggml-cpu.so | 0.43 | 0.05 | 0.00 | 0.30 | 0.02 | 0.03 | 0.00 | 54 | 0.15 | 0.09 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 1782.92 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 1700 - vec.h:375-751 - libggml-cpu.so [...]+ | | 0.03 | 0.05 | 0.00 | 0.40 | 0.02 | 0.03 | 0.00 | 4 | 0.01 | 0.01 | | 11.91 | |
| ○Loop 1705 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1709 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.04 | 0.00 | 0.00 | 0.03 | 0.03 | 0.00 | 0.00 | 31 | 0.01 | 0.01 | | 105.59 | |
| ○Loop 1706 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1702 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1703 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1701 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1707 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1704 - vec.h:677-682 - libggml-cpu.so | | 0.07 | 0.00 | 0.00 | 0.05 | 0.05 | 0.00 | 0.00 | 10 | 0.02 | 0.01 | | 7.94 | |
| ►Loop 1710 - ops.cpp:8759-8881 - libggml-cpu.so [...]+ | | 0.10 | 0.04 | 0.01 | 0.30 | 0.07 | 0.03 | 0.00 | 32 | 0.03 | 0.02 | | 3130.72 | |
| ○Loop 1723 - vec.h:491-497 - libggml-cpu.so | | 0.32 | 0.04 | 0.04 | 0.22 | 0.22 | 0.02 | 0.02 | 33 | 0.08 | 0.05 | | 1781.82 | |
| ○Loop 1714 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1722 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1711 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1725 - vec.h:502-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1724 - vec.h:502-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1719 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1720 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1729 - vec.h:750-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1715 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1718 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1716 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1726 - vec.h:502-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1730 - vec.h:740-745 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 12 | 0.01 | 0.00 | | 158.56 | |
| ○Loop 1717 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1712 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1713 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1727 - vec.h:750-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1728 - vec.h:750-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1721 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1708 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+ | libggml-cpu.so | 0.16 | 0.04 | 0.00 | 0.11 | 0.02 | 0.02 | 0.00 | 192 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 311.75 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 1451 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.01 | 0.03 | 0.00 | 0.17 | 0.01 | 0.02 | 0.00 | 20 | 0.01 | 0.00 | | 140.87 | |
| ►Loop 1452 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.04 | 0.03 | 0.00 | 0.16 | 0.03 | 0.02 | 0.00 | 122 | 0.01 | 0.01 | | 128.64 | |
| ○Loop 1460 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1462 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.10 | 0.02 | 0.02 | 0.07 | 0.07 | 0.01 | 0.01 | 192 | 0.02 | 0.01 | | 369.05 | |
| ►Loop 1453 - ops.cpp:6365-6484 - libggml-cpu.so [...]+ | | 0.04 | 0.01 | 0.00 | 0.06 | 0.03 | 0.00 | 0.00 | 168 | 0.01 | 0.01 | | 413.81 | |
| ○Loop 1456 - ops.cpp:6446-6456 - libggml-cpu.so [...] | | 0.04 | 0.00 | 0.00 | 0.03 | 0.03 | 0.00 | 0.00 | 25 | 0.01 | 0.01 | | 124.12 | |
| ○Loop 1454 - ops.cpp:6479-6484 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1457 - ops.cpp:6429-6442 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1458 - ops.cpp:6413-6426 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1455 - ops.cpp:6462-6475 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1461 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1459 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_vec_dot_f16+ | libggml-cpu.so | 0.29 | 0.03 | 0.00 | 0.20 | 0.06 | 0.02 | 0.00 | 33 | 0.06 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 1079.36 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ○Loop 760 - vec.cpp:324-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 761 - vec.cpp:311-316 - libggml-cpu.so | | 0.25 | 0.03 | 0.03 | 0.17 | 0.17 | 0.02 | 0.02 | 33 | 0.06 | 0.03 | | 1005.58 | |
| ►quantize_row_q8_0+ | libggml-cpu.so | 0.13 | 0.03 | 0.00 | 0.09 | 0.02 | 0.02 | 0.00 | 153 | 0.02 | 0.01 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 0.40 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ○Loop 2527 - quants.c:298-355 - libggml-cpu.so [...] | | 0.12 | 0.03 | 0.03 | 0.08 | 0.08 | 0.02 | 0.02 | 139 | 0.02 | 0.01 | | 0.46 | |
| ○f64xsubf128 | libm.so.6 | 0.10 | 0.03 | 0.03 | 0.07 | 0.07 | 0.02 | 0.02 | 192 | 0.03 | 0.02 | Math (%): 100.00 | 630.54 | |
| ○int __kmp_barrier_template<false>(barrier_type, int, int, unsigned long, void*, void (*)(void*, void*)) [clone .isra.33] | libomp.so | 0.51 | 0.01 | 0.01 | 0.35 | 0.35 | 0.01 | 0.01 | 187 | 0.04 | 0.03 | OMP (%): 100.00 | 193.78 | |
| ►ggml_compute_forward_add_non_quantized+ | libggml-cpu.so | 0.17 | 0.01 | 0.00 | 0.12 | 0.03 | 0.00 | 0.00 | 42 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 46.83 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 383 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 382 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 381 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 373 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 375 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 378 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 377 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 376 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 374 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 399 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 400 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 401 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 402 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 398 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.17 | 0.00 | 0.00 | 0.12 | 0.12 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 226.14 | |
| ►Loop 380 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 379 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 390 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 391 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 393 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 394 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 395 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 392 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 397 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 396 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 361 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 360 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 367 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 368 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 369 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 370 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 372 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 371 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 363 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 364 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 362 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 366 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 365 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 384 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 387 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 386 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 388 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 389 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 385 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_rms_norm+ | libggml-cpu.so | 0.13 | 0.00 | 0.00 | 0.09 | 0.01 | 0.00 | 0.00 | 30 | 0.02 | 0.01 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 172.93 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 1166 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1165 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1164 - ops.cpp:4319-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1179 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1180 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1181 - vec.h:673-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1182 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1178 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1177 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1183 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1199 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1198 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1197 - ops.cpp:4321-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1195 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.13 | 0.00 | 0.00 | 0.09 | 0.09 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 528.99 | |
| ○Loop 1196 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1185 - ops.cpp:4319-4338 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1186 - ops.cpp:4319-4338 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1187 - ops.cpp:4321-4338 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1184 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1190 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1191 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1192 - vec.h:687-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1193 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1194 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1188 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1189 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1168 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1169 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1170 - vec.h:687-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1172 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1167 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1171 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1161 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1162 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1163 - ops.cpp:4321-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1176 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1175 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1174 - ops.cpp:4321-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1173 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1203 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1204 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1205 - vec.h:673-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1200 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1202 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1206 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1201 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1207 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_mul+ | libggml-cpu.so | 0.14 | 0.00 | 0.00 | 0.10 | 0.02 | 0.00 | 0.00 | 24 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 57.43 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 449 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 450 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 448 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 451 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 452 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 485 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 486 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.10 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 488 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 484 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.14 | 0.00 | 0.00 | 0.10 | 0.10 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 190.44 | |
| ○Loop 487 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 447 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 446 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 483 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 482 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 466 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 465 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 453 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 455 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 457 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 458 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 456 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 454 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 470 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 473 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 475 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 474 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 472 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 471 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 459 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 460 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 461 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 463 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 464 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 462 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 469 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 468 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 467 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 476 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 479 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 478 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 481 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 480 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 477 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_vec_swiglu_f32+ | libggml-cpu.so | 0.29 | 0.00 | 0.00 | 0.20 | 0.00 | 0.00 | 0.00 | 7 | 0.12 | 0.08 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00 | 4611.39 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ○Loop 764 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 765 - vec.h:1084-1116 - libggml-cpu.so [...] | | 0.29 | 0.00 | 0.00 | 0.20 | 0.20 | 0.00 | 0.00 | 7 | 0.12 | 0.08 | | 4609.13 | |