| Name | Module | Max Thread Time / Walltime aocc_7 (%) | Coverage aocc_7 (%) | Coverage Excluding Loops aocc_7 (%) | Max Inclusive Time Over Threads aocc_7 (s) | Max Exclusive Time Over Threads aocc_7 (s) | Inclusive Time w.r.t. Wall Time aocc_7 (s) | Exclusive Time w.r.t. Wall Time aocc_7 (s) | Nb Threads aocc_7 | Deviation (coverage) aocc_7 | Deviation (walltime) aocc_7 | Categories aocc_7 | GFLOPS aocc_7 | Compilation Options |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ►ggml_vec_dot_q8_0_q8_0+ | libggml-cpu.so | 82.66 | 61.88 | 0.03 | 57.27 | 0.06 | 37.71 | 0.02 | 192 | 31.09 | 19.00 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 53.86 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ○Loop 2158 - quants.c:1066-1073 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2159 - quants.c:108-1042 - libggml-cpu.so [...] | | 82.64 | 61.85 | 61.85 | 57.26 | 57.26 | 37.70 | 37.70 | 192 | 31.09 | 19.00 | | 53.82 | |
| ○__kmp_hardware_timestamp | libomp.so | 45.00 | 20.01 | 20.01 | 31.18 | 31.18 | 12.20 | 12.20 | 192 | 17.00 | 10.18 | OMP (%): 100.00 | 0.08 | |
| ○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 37.05 | 16.96 | 16.96 | 25.67 | 25.67 | 10.34 | 10.34 | 192 | 14.27 | 8.54 | OMP (%): 100.00 | 0.16 | |
| ○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 6.06 | 0.27 | 0.27 | 4.20 | 4.20 | 0.17 | 0.17 | 188 | 0.71 | 0.43 | OMP (%): 100.00 | 11.18 | |
| ►ggml_compute_forward_mul_mat+ | libggml-cpu.so | 0.35 | 0.16 | 0.04 | 0.24 | 0.07 | 0.10 | 0.02 | 192 | 0.07 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 48.17 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ►Loop 65 - ggml-cpu.c:1316-1328 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 64 - ggml-cpu.c:1317-1328 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 19 | 0.00 | 0.00 | | 2.65 | |
| ►Loop 68 - ggml-cpu.c:1289-1297 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.03 | 0.01 | 0.00 | 0.00 | 12 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 67 - ggml-cpu.c:1289-1297 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.02 | 0.01 | 0.00 | 0.00 | 10 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 66 - ggml-cpu.c:1289-1297 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 15 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 70 - ggml-cpu.c:1248-1260 - libggml-cpu.so+ | | 0.01 | 0.00 | 0.00 | 0.03 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 69 - ggml-cpu.c:1248-1260 - libggml-cpu.so | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 35 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 58 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+ | | 0.04 | 0.11 | 0.01 | 0.33 | 0.03 | 0.07 | 0.00 | 100 | 0.01 | 0.01 | | 62.17 | |
| ►Loop 59 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+ | | 0.03 | 0.11 | 0.00 | 0.30 | 0.02 | 0.07 | 0.00 | 42 | 0.01 | 0.01 | | 78.60 | |
| ►Loop 60 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+ | | 0.04 | 0.11 | 0.00 | 0.28 | 0.03 | 0.06 | 0.00 | 112 | 0.01 | 0.01 | | 133.61 | |
| ►Loop 61 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+ | | 0.22 | 0.10 | 0.09 | 0.25 | 0.15 | 0.06 | 0.06 | 192 | 0.05 | 0.03 | | 55.91 | |
| ○Loop 63 - ggml-cpu.c:1193-1194 - libggml-cpu.so | | 0.13 | 0.01 | 0.01 | 0.09 | 0.09 | 0.01 | 0.01 | 140 | 0.02 | 0.01 | | 131.51 | |
| ○Loop 62 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 35 | 0.01 | 0.00 | | 224.58 | |
| ○__GI___sched_yield | libc.so.6 | 0.52 | 0.15 | 0.15 | 0.36 | 0.36 | 0.09 | 0.09 | 150 | 0.15 | 0.09 | System (%): 100.00 | 0.00 | |
| ►void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<4>(long, long, long, long)+ | libggml-cpu.so | 0.19 | 0.11 | 0.00 | 0.13 | 0.00 | 0.07 | 0.00 | 189 | 0.07 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 210.64 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 2011 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+ | | 0.00 | 0.11 | 0.00 | 0.20 | 0.00 | 0.07 | 0.00 | 8 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2010 - sgemm.cpp:205-853 - libggml-cpu.so [...] | | 0.01 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 18 | 0.00 | 0.00 | | 857.56 | |
| ►Loop 2013 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+ | | 0.17 | 0.11 | 0.08 | 0.19 | 0.12 | 0.07 | 0.05 | 189 | 0.05 | 0.03 | | 171.59 | |
| ○Loop 2012 - sgemm.cpp:138-1044 - libggml-cpu.so [...] | | 0.10 | 0.04 | 0.04 | 0.07 | 0.07 | 0.02 | 0.02 | 189 | 0.03 | 0.02 | | 290.25 | |
| ►void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<2>(long, long, long, long)+ | libggml-cpu.so | 0.19 | 0.11 | 0.00 | 0.13 | 0.01 | 0.07 | 0.00 | 189 | 0.06 | 0.04 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 108.17 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 2020 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+ | | 0.00 | 0.11 | 0.00 | 0.13 | 0.00 | 0.07 | 0.00 | 4 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 2021 - sgemm.cpp:138-1044 - libggml-cpu.so [...] | | 0.19 | 0.11 | 0.11 | 0.13 | 0.13 | 0.07 | 0.07 | 189 | 0.06 | 0.04 | | 108.06 | |
| ►ggml_graph_compute_thread+ | libggml-cpu.so | 0.19 | 0.06 | 0.01 | 0.13 | 0.03 | 0.04 | 0.01 | 189 | 0.04 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 17.57 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ►Loop 90 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 89 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 88 - ggml-cpu.c:1552-1560 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 79 - ggml-cpu.c:682-1654 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 80 - ggml-cpu.c:1436-1654 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 81 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 82 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 83 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 84 - ggml-cpu.c:1461-1462 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 86 - ggml-cpu.c:1572-1579 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 87 - ggml-cpu.c:1573-1579 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 85 - ggml-cpu.c:1585-1587 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 78 - ggml-cpu.c:533-2891 - libggml-cpu.so [...] | | 0.16 | 0.05 | 0.05 | 0.11 | 0.11 | 0.03 | 0.03 | 181 | 0.03 | 0.02 | | 17.69 | |
| ○Loop 91 - ggml-cpu.c:2087-2088 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_flash_attn_ext+ | libggml-cpu.so | 0.42 | 0.06 | 0.00 | 0.29 | 0.02 | 0.03 | 0.00 | 53 | 0.16 | 0.10 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 1581.14 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 1436 - vec.h:375-751 - libggml-cpu.so [...]+ | | 0.00 | 0.05 | 0.00 | 0.38 | 0.00 | 0.03 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1437 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1440 - vec.h:677-682 - libggml-cpu.so | | 0.04 | 0.00 | 0.00 | 0.03 | 0.03 | 0.00 | 0.00 | 10 | 0.01 | 0.01 | | 7.32 | |
| ○Loop 1445 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 29 | 0.01 | 0.01 | | 140.58 | |
| ○Loop 1438 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1442 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1443 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1441 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1439 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1444 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1446 - ops.cpp:8759-8881 - libggml-cpu.so [...]+ | | 0.12 | 0.05 | 0.01 | 0.33 | 0.08 | 0.03 | 0.01 | 34 | 0.03 | 0.02 | | 2299.56 | |
| ○Loop 1464 - vec.h:750-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1460 - vec.h:502-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1466 - vec.h:740-745 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 3 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1457 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1465 - vec.h:750-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1459 - vec.h:491-497 - libggml-cpu.so | | 0.36 | 0.04 | 0.04 | 0.25 | 0.25 | 0.03 | 0.03 | 35 | 0.10 | 0.06 | | 1606.01 | |
| ○Loop 1458 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1450 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1454 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1463 - vec.h:750-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1456 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1462 - vec.h:502-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1452 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1449 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1455 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1451 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1448 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1453 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1461 - vec.h:502-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1447 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_vec_dot_f16+ | libggml-cpu.so | 0.30 | 0.04 | 0.00 | 0.21 | 0.06 | 0.02 | 0.00 | 32 | 0.06 | 0.03 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 971.24 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ○Loop 758 - vec.cpp:311-316 - libggml-cpu.so | | 0.27 | 0.03 | 0.03 | 0.19 | 0.19 | 0.02 | 0.02 | 32 | 0.06 | 0.03 | | 878.66 | |
| ○Loop 757 - vec.cpp:324-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+ | libggml-cpu.so | 0.14 | 0.03 | 0.00 | 0.10 | 0.02 | 0.02 | 0.00 | 192 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 324.42 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 1266 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.00 | 0.03 | 0.00 | 0.12 | 0.00 | 0.02 | 0.00 | 5 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 1267 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.01 | 0.03 | 0.00 | 0.12 | 0.01 | 0.02 | 0.00 | 73 | 0.01 | 0.00 | | 328.90 | |
| ○Loop 1274 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.09 | 0.02 | 0.02 | 0.06 | 0.06 | 0.01 | 0.01 | 192 | 0.02 | 0.01 | | 370.49 | |
| ►Loop 1268 - ops.cpp:6365-6484 - libggml-cpu.so [...]+ | | 0.04 | 0.01 | 0.00 | 0.05 | 0.03 | 0.00 | 0.00 | 178 | 0.01 | 0.01 | | 332.93 | |
| ○Loop 1272 - ops.cpp:6429-6442 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1269 - ops.cpp:6479-6484 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1273 - ops.cpp:6413-6426 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1270 - ops.cpp:6462-6475 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 1271 - ops.cpp:6446-6457 - libggml-cpu.so | | 0.03 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 | 0.00 | 17 | 0.01 | 0.01 | | 117.87 | |
| ►quantize_row_q8_0+ | libggml-cpu.so | 0.10 | 0.03 | 0.00 | 0.07 | 0.02 | 0.02 | 0.00 | 155 | 0.02 | 0.01 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 0.95 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US... |
| ○Loop 2149 - quants.c:298-355 - libggml-cpu.so [...] | | 0.09 | 0.03 | 0.03 | 0.06 | 0.06 | 0.02 | 0.02 | 148 | 0.02 | 0.01 | | 1.04 | |
| ○f64xsubf128 | libm.so.6 | 0.10 | 0.02 | 0.02 | 0.07 | 0.07 | 0.01 | 0.01 | 192 | 0.02 | 0.01 | Math (%): 100.00 | 907.15 | |
| ○int __kmp_barrier_template<false>(barrier_type, int, int, unsigned long, void*, void (*)(void*, void*)) [clone .isra.33] | libomp.so | 0.43 | 0.01 | 0.01 | 0.30 | 0.30 | 0.01 | 0.01 | 191 | 0.04 | 0.02 | OMP (%): 100.00 | 196.95 | |
| ►ggml_compute_forward_add_non_quantized+ | libggml-cpu.so | 0.32 | 0.01 | 0.00 | 0.22 | 0.02 | 0.00 | 0.00 | 46 | 0.05 | 0.03 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 36.02 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 323 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 324 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 325 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 326 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 328 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 327 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 345 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 346 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 348 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 350 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 347 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 349 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 335 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 338 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 337 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 336 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 339 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 342 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 344 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 343 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 341 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 340 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 317 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 318 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 320 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 321 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 322 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 319 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 329 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 331 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 332 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 333 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 334 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 330 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 351 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.21 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 354 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.21 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 356 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 355 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 353 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.30 | 0.00 | 0.00 | 0.21 | 0.21 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 120.08 | |
| ○Loop 352 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_compute_forward_mul+ | libggml-cpu.so | 0.17 | 0.01 | 0.00 | 0.12 | 0.03 | 0.00 | 0.00 | 44 | 0.03 | 0.02 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 37.60 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ►Loop 415 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 416 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 418 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 417 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 409 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 411 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 412 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 414 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 413 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 410 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 419 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 422 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 421 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 423 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 424 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 420 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 431 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 432 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 434 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 436 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 433 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.17 | 0.00 | 0.00 | 0.12 | 0.12 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | 194.29 | |
| ○Loop 435 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 397 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 400 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 401 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 402 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 399 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 398 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 403 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 404 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 405 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 407 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 406 - ggml-impl.h:518-541 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 408 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 425 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 426 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►Loop 428 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 430 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 429 - binary-ops.cpp:31-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 427 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ►ggml_vec_swiglu_f32+ | libggml-cpu.so | 0.25 | 0.00 | 0.00 | 0.17 | 0.00 | 0.00 | 0.00 | 7 | 0.10 | 0.06 | /beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00 | 5309.66 | AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL... |
| ○Loop 766 - vec.h:1084-1115 - libggml-cpu.so [...] | | 0.25 | 0.00 | 0.00 | 0.17 | 0.17 | 0.00 | 0.00 | 7 | 0.10 | 0.06 | | 5283.19 | |
| ○Loop 763 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 764 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |
| ○Loop 765 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | 0.00 | |