| Name | Module | Max Thread Time / Walltime armclang_3 (%) | Coverage armclang_3 (%) | Coverage Excluding Loops armclang_3 (%) | Max Inclusive Time Over Threads armclang_3 (s) | Max Exclusive Time Over Threads armclang_3 (s) | Inclusive Time w.r.t. Wall Time armclang_3 (s) | Exclusive Time w.r.t. Wall Time armclang_3 (s) | Nb Threads armclang_3 | Deviation (coverage) armclang_3 | Deviation (walltime) armclang_3 | Categories armclang_3 | Compilation Options |
| ►kai_run_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm+ | libggml-cpu.so | 30.60 | 47.49 | 0.00 | 2.26 | 0.00 | 2.39 | 0.00 | 95 | 5.02 | 0.21 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ►Loop 2527 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2526 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2525 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2530 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.07 | 47.48 | 0.00 | 2.28 | 0.00 | 2.39 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►Loop 2529 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+ | | 0.27 | 47.48 | 0.13 | 2.27 | 0.02 | 2.39 | 0.01 | 62 | 0.10 | 0.00 | | |
| ○Loop 2528 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so | | 30.54 | 47.35 | 47.35 | 2.25 | 2.25 | 2.38 | 2.38 | 95 | 5.01 | 0.21 | | |
| ○kmp_flag_64<false, true>::wait(kmp_info*, int, void*) | libomp.so | 35.95 | 27.23 | 27.23 | 2.65 | 2.65 | 1.37 | 1.37 | 96 | 6.72 | 0.27 | OMP (%): 100.00 | |
| ►ggml_vec_dot_q6_K_q8_K+ | libggml-cpu.so | 8.19 | 13.71 | 0.04 | 0.61 | 0.01 | 0.69 | 0.00 | 96 | 0.60 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ○Loop 2424 - quants.c:2835-2913 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2426 - quants.c:2492-2660 - libggml-cpu.so [...]+ | | 1.42 | 13.67 | 1.58 | 0.66 | 0.11 | 0.69 | 0.08 | 96 | 0.35 | 0.01 | | |
| ○Loop 2425 - quants.c:2506-2590 - libggml-cpu.so [...] | | 7.52 | 12.09 | 12.09 | 0.55 | 0.55 | 0.61 | 0.61 | 96 | 0.63 | 0.02 | | |
| ○__GI___pthread_mutex_lock | libc.so.6 | 2.84 | 2.63 | 2.63 | 0.21 | 0.21 | 0.13 | 0.13 | 95 | 1.11 | 0.05 | Pthread (%): 100.00 | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check() | libomp.so | 2.44 | 1.81 | 1.81 | 0.18 | 0.18 | 0.09 | 0.09 | 96 | 0.65 | 0.03 | OMP (%): 100.00 | |
| ►ggml_compute_forward_flash_attn_ext+ | libggml-cpu.so | 1.15 | 1.02 | 0.03 | 0.09 | 0.01 | 0.05 | 0.00 | 88 | 0.41 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1894 - ops.cpp:8778-8920 - libggml-cpu.so [...]+ | | 0.07 | 0.98 | 0.00 | 0.13 | 0.00 | 0.05 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 1892 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1893 - vec.h:375-751 - libggml-cpu.so [...]+ | | 0.14 | 0.98 | 0.02 | 0.12 | 0.01 | 0.05 | 0.00 | 15 | 0.03 | 0.00 | | |
| ○Loop 1915 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1895 - ops.cpp:8885-8886 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1913 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1914 - vec.h:677-682 - libggml-cpu.so | | 0.14 | 0.04 | 0.04 | 0.01 | 0.01 | 0.00 | 0.00 | 29 | 0.05 | 0.00 | | |
| ○Loop 1912 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1916 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1917 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1896 - ops.cpp:8793-8881 - libggml-cpu.so [...]+ | | 0.74 | 0.92 | 0.60 | 0.10 | 0.06 | 0.05 | 0.03 | 86 | 0.28 | 0.01 | | |
| ○Loop 1904 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1906 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1901 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1907 - vec.h:503-503 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1897 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1900 - vec.h:386-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1905 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1899 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1902 - vec.h:375-381 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1910 - vec.h:751-751 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1898 - vec.h:387-387 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1911 - vec.h:740-745 - libggml-cpu.so | | 0.07 | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | | |
| ○Loop 1908 - vec.h:491-497 - libggml-cpu.so | | 0.54 | 0.32 | 0.32 | 0.04 | 0.04 | 0.02 | 0.02 | 79 | 0.19 | 0.01 | | |
| ○Loop 1903 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1909 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+ | libggml-cpu.so | 1.02 | 0.81 | 0.00 | 0.08 | 0.00 | 0.04 | 0.00 | 96 | 0.31 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1588 - ops.cpp:6210-6484 - libggml-cpu.so [...]+ | | 0.07 | 0.81 | 0.02 | 0.10 | 0.01 | 0.04 | 0.00 | 19 | 0.01 | 0.00 | | |
| ○Loop 1592 - ops.cpp:6413-6426 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1590 - ops.cpp:6479-6484 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1589 - ops.cpp:6462-6475 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1587 - ops.cpp:6210-6462 - libggml-cpu.so [...]+ | | 0.20 | 0.71 | 0.03 | 0.08 | 0.01 | 0.04 | 0.00 | 24 | 0.06 | 0.00 | | |
| ○Loop 1596 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.88 | 0.68 | 0.68 | 0.06 | 0.06 | 0.03 | 0.03 | 95 | 0.28 | 0.01 | | |
| ○Loop 1594 - ops.cpp:6210-6303 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1597 - ops.cpp:6210-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1595 - ops.cpp:6220-6245 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1593 - ops.cpp:6429-6442 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1591 - ops.cpp:6446-6456 - libggml-cpu.so [...] | | 0.27 | 0.07 | 0.07 | 0.02 | 0.02 | 0.00 | 0.00 | 39 | 0.08 | 0.00 | | |
| ►ggml_vec_dot_f16+ | libggml-cpu.so | 0.81 | 0.60 | 0.03 | 0.06 | 0.01 | 0.03 | 0.00 | 86 | 0.30 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ○Loop 907 - vec.cpp:324-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 908 - vec.cpp:311-316 - libggml-cpu.so | | 0.81 | 0.57 | 0.57 | 0.06 | 0.06 | 0.03 | 0.03 | 86 | 0.29 | 0.01 | | |
| ○Loop 906 - vec.cpp:325-325 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○__GI___sched_yield | libc.so.6 | 0.61 | 0.60 | 0.60 | 0.05 | 0.05 | 0.03 | 0.03 | 96 | 0.25 | 0.01 | OMP (%): 100.00 | |
| ○unknown_function | [vdso] | 0.88 | 0.42 | 0.00 | 0.07 | 0.00 | 0.02 | 0.00 | 93 | 0.27 | 0.01 | OMP (%): 100.00 | |
| ○__aarch64_ldadd8_acq_rel | libomp.so | 1.42 | 0.36 | 0.36 | 0.10 | 0.10 | 0.02 | 0.02 | 82 | 0.38 | 0.02 | OMP (%): 100.00 | |
| ○__sincosf_finite | libamath.so | 0.47 | 0.32 | 0.32 | 0.04 | 0.04 | 0.02 | 0.02 | 92 | 0.18 | 0.01 | Math (%): 100.00 | |
| ○__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*) | libomp.so | 0.54 | 0.30 | 0.30 | 0.04 | 0.04 | 0.01 | 0.01 | 86 | 0.17 | 0.01 | OMP (%): 100.00 | |
| ►ggml_vec_swiglu_f32+ | libggml-cpu.so | 1.35 | 0.25 | 0.00 | 0.10 | 0.00 | 0.01 | 0.00 | 16 | 0.46 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 913 - vec.cpp:402-405 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 912 - vec.cpp:402-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 914 - vec.cpp:403-403 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 915 - vec.h:1045-1072 - libggml-cpu.so [...]+ | | 0.00 | 0.25 | 0.00 | 0.10 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 916 - vec.h:1045-1072 - libggml-cpu.so [...] | | 1.35 | 0.25 | 0.25 | 0.10 | 0.10 | 0.01 | 0.01 | 16 | 0.46 | 0.02 | | |
| ○ggml::cpu::kleidiai::extra_buffer_type::get_tensor_traits(ggml_tensor const*) | libggml-cpu.so | 0.54 | 0.22 | 0.22 | 0.04 | 0.04 | 0.01 | 0.01 | 68 | 0.21 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►ggml_graph_compute_thread+ | libggml-cpu.so | 0.54 | 0.22 | 0.00 | 0.04 | 0.00 | 0.01 | 0.00 | 59 | 0.24 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ○Loop 90 - ggml-cpu.c:2087-2088 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 84 - ggml-cpu.c:1585-1587 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 73 - ggml-cpu.c:533-2897 - libggml-cpu.so [...]+ | | 0.54 | 0.22 | 0.22 | 0.04 | 0.04 | 0.01 | 0.01 | 59 | 0.24 | 0.01 | | |
| ○Loop 72 - ggml-cpu.c:533-2897 - libggml-cpu.so [...] | | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►Loop 75 - ggml-cpu.c:1436-1642 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 79 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 78 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 77 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 76 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 83 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 82 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 81 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 80 - ggml-cpu.c:1461-1462 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 74 - ggml-cpu.c:1592-1601 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 89 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 88 - ggml-cpu.c:1552-1560 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 87 - ggml-cpu.c:1552-1560 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 86 - ggml-cpu.c:1572-1579 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 85 - ggml-cpu.c:1573-1579 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►kai_run_lhs_quant_pack_qsi8d32p4x8sb_f32_neon+ | libggml-cpu.so | 2.84 | 0.18 | 0.00 | 0.21 | 0.00 | 0.01 | 0.00 | 4 | 0.57 | 0.02 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ►Loop 2494 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:268-335 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2495 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:271-332 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2497 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:93-264 - libggml-cpu.so [...]+ | | 0.00 | 0.18 | 0.00 | 0.21 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2496 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:96-258 - libggml-cpu.so [...] | | 2.84 | 0.18 | 0.18 | 0.21 | 0.21 | 0.01 | 0.01 | 4 | 0.57 | 0.02 | | |
| ►ggml_compute_forward_mul+ | libggml-cpu.so | 0.81 | 0.18 | 0.09 | 0.06 | 0.01 | 0.01 | 0.00 | 59 | 0.23 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 516 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 517 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 515 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 537 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 538 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 536 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 535 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 534 - binary-ops.cpp:84-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 511 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 512 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 514 - binary-ops.cpp:42-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 513 - binary-ops.cpp:42-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 548 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 550 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 549 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 551 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 555 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 554 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 556 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 558 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 557 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 559 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 561 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 560 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 545 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 544 - binary-ops.cpp:84-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 547 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 546 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 540 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.09 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 539 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 543 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 542 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 541 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.61 | 0.09 | 0.09 | 0.04 | 0.04 | 0.00 | 0.00 | 16 | 0.24 | 0.01 | | |
| ►Loop 553 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 552 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 518 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 519 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 520 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 521 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 524 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 522 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 523 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 532 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 533 - binary-ops.cpp:18-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 531 - binary-ops.cpp:18-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 530 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 525 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 527 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 526 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 528 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 529 - binary-ops.cpp:18-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○__expf_finite | libamath.so | 0.41 | 0.18 | 0.18 | 0.03 | 0.03 | 0.01 | 0.01 | 70 | 0.13 | 0.01 | Math (%): 100.00 | |
| ○__GI___lll_lock_wait | libc.so.6 | 0.34 | 0.16 | 0.16 | 0.03 | 0.03 | 0.01 | 0.01 | 70 | 0.12 | 0.00 | System (%): 100.00 | |
| ►kai_run_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0+ | libggml-cpu.so | 6.16 | 0.11 | 0.00 | 0.45 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ►Loop 2507 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2509 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2508 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-134 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2502 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2501 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2503 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-148 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2504 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2505 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-142 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2506 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2510 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+ | | 0.74 | 0.11 | 0.01 | 0.46 | 0.05 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 2512 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so | | 1.76 | 0.03 | 0.03 | 0.13 | 0.13 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 2511 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-139 - libggml-cpu.so [...] | | 3.66 | 0.07 | 0.07 | 0.27 | 0.27 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*) | libomp.so | 0.27 | 0.11 | 0.11 | 0.02 | 0.02 | 0.01 | 0.01 | 51 | 0.11 | 0.01 | OMP (%): 100.00 | |
| ►ggml_cpu_fp32_to_fp16+ | libggml-cpu.so | 0.41 | 0.10 | 0.00 | 0.03 | 0.01 | 0.01 | 0.00 | 46 | 0.14 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ○Loop 0 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...] | | 0.41 | 0.10 | 0.10 | 0.03 | 0.03 | 0.00 | 0.00 | 46 | 0.14 | 0.01 | | |
| ○Loop 1 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_add_non_quantized+ | libggml-cpu.so | 0.68 | 0.10 | 0.01 | 0.05 | 0.01 | 0.00 | 0.00 | 22 | 0.33 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 433 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 434 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 432 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 430 - binary-ops.cpp:84-101 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 431 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 414 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 415 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 413 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 428 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 427 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 429 - binary-ops.cpp:10-95 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 444 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 446 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 447 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 445 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 448 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 449 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 451 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 450 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 454 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 456 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 457 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 455 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 459 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 458 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 426 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 423 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 425 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 422 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 424 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 421 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 436 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.08 | 0.00 | 0.05 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 439 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 435 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 438 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 437 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.68 | 0.08 | 0.08 | 0.05 | 0.05 | 0.00 | 0.00 | 15 | 0.32 | 0.01 | | |
| ►Loop 441 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 440 - binary-ops.cpp:84-84 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 442 - ggml-impl.h:355-404 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 443 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 409 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 410 - ggml-impl.h:355-404 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 412 - binary-ops.cpp:42-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 411 - binary-ops.cpp:42-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 416 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 417 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 419 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 420 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 418 - binary-ops.cpp:10-32 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 453 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 452 - binary-ops.cpp:10-45 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○__memcpy | libastring.so | 0.74 | 0.09 | 0.09 | 0.05 | 0.05 | 0.00 | 0.00 | 47 | 0.16 | 0.01 | String (%): 100.00 | |
| ○__GI___lll_lock_wake | libc.so.6 | 0.20 | 0.07 | 0.07 | 0.01 | 0.01 | 0.00 | 0.00 | 46 | 0.06 | 0.00 | System (%): 100.00 | |
| ○__kmp_barrier | libomp.so | 0.20 | 0.07 | 0.07 | 0.02 | 0.02 | 0.00 | 0.00 | 42 | 0.07 | 0.00 | OMP (%): 100.00 | |
| ○kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check() | libomp.so | 0.47 | 0.06 | 0.06 | 0.04 | 0.04 | 0.00 | 0.00 | 19 | 0.24 | 0.01 | OMP (%): 100.00 | |
| ○__GI___pthread_mutex_unlock_usercnt | libc.so.6 | 0.27 | 0.06 | 0.06 | 0.02 | 0.02 | 0.00 | 0.00 | 23 | 0.14 | 0.01 | Pthread (%): 100.00 | |
| ○@plt_start@ | libomp.so | 0.27 | 0.05 | 0.05 | 0.02 | 0.02 | 0.00 | 0.00 | 33 | 0.08 | 0.00 | OMP (%): 100.00 | |
| ►ggml_compute_forward_mul_mat+ | libggml-cpu.so | 0.20 | 0.05 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 31 | 0.07 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU... |
| ►Loop 60 - ggml-cpu.c:1289-1297 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 59 - ggml-cpu.c:1289-1297 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 58 - ggml-cpu.c:1289-1297 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 55 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+ | | 0.07 | 0.05 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 2 | 0.00 | 0.00 | | |
| ►Loop 56 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+ | | 0.07 | 0.04 | 0.01 | 0.03 | 0.01 | 0.00 | 0.00 | 5 | 0.00 | 0.00 | | |
| ○Loop 54 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...] | | 0.14 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 12 | 0.03 | 0.00 | | |
| ○Loop 53 - ggml-cpu.c:1183-1194 - libggml-cpu.so [...] | | 0.14 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 17 | 0.03 | 0.00 | | |
| ○Loop 57 - ggml-cpu.c:1197-1198 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►ggml_compute_forward_rms_norm+ | libggml-cpu.so | 0.34 | 0.05 | 0.00 | 0.03 | 0.00 | 0.00 | 0.00 | 18 | 0.14 | 0.01 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1313 - ops.cpp:4319-4365 - libggml-cpu.so [...]+ | | 0.00 | 0.04 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1317 - ops.cpp:4319-4365 - libggml-cpu.so [...]+ | | 0.07 | 0.04 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ►Loop 1330 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1331 - ops.cpp:4321-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1334 - ops.cpp:4320-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1332 - ops.cpp:4321-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1333 - ops.cpp:4321-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1325 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1326 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1327 - vec.h:687-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1329 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1323 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1324 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1328 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1318 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1319 - ops.cpp:4321-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1322 - ops.cpp:4320-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1321 - ops.cpp:4321-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1320 - ops.cpp:4321-4333 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1344 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1343 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1342 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1341 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1357 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.04 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1356 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.04 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1355 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.07 | 0.04 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 1353 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.27 | 0.02 | 0.02 | 0.02 | 0.02 | 0.00 | 0.00 | 11 | 0.12 | 0.01 | | |
| ○Loop 1358 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1354 - vec.h:677-682 - libggml-cpu.so | | 0.14 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 11 | 0.05 | 0.00 | | |
| ►Loop 1337 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1338 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1339 - vec.h:677-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1335 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1336 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1340 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1348 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1349 - ops.cpp:4319-4338 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1350 - vec.h:677-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1352 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1345 - ops.cpp:4325-4326 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1351 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1347 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1346 - vec.h:677-682 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1314 - ops.cpp:4319-4333 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1315 - vec.h:687-688 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1316 - vec.h:688-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1312 - vec.h:687-688 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○ggml_is_empty | libggml-base.so | 0.27 | 0.04 | 0.04 | 0.02 | 0.02 | 0.00 | 0.00 | 26 | 0.09 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BUILD -D GGML_COMMIT="unknown" -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLEID... |
| ○unknown_function | libggml-cpu.so | 0.14 | 0.04 | 0.00 | 0.01 | 0.00 | 0.00 | 0.00 | 30 | 0.04 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | |
| ►ggml_compute_forward_set_rows+ | libggml-cpu.so | 0.14 | 0.04 | 0.04 | 0.01 | 0.01 | 0.00 | 0.00 | 26 | 0.04 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1510 - ops.cpp:5550-5563 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1509 - ops.cpp:5551-5563 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1508 - ops.cpp:5552-5563 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○ggml::cpu::kleidiai::tensor_traits::compute_forward_q4_0(ggml_compute_params*, ggml_tensor*) | libggml-cpu.so | 0.14 | 0.04 | 0.04 | 0.01 | 0.01 | 0.00 | 0.00 | 23 | 0.05 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ○__kmp_yield | libomp.so | 0.14 | 0.03 | 0.03 | 0.01 | 0.01 | 0.00 | 0.00 | 23 | 0.04 | 0.00 | OMP (%): 100.00 | |
| ○__kmp_now_nsec | libomp.so | 0.14 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 18 | 0.04 | 0.00 | OMP (%): 100.00 | |
| ○__kmpc_barrier | libomp.so | 0.14 | 0.02 | 0.02 | 0.01 | 0.01 | 0.00 | 0.00 | 13 | 0.03 | 0.00 | OMP (%): 100.00 | |
| ○__memset | libastring.so | 0.68 | 0.01 | 0.01 | 0.05 | 0.05 | 0.00 | 0.00 | 2 | 0.65 | 0.03 | String (%): 100.00 | |
| ►ggml_cpu_extra_compute_forward+ | libggml-cpu.so | 0.14 | 0.01 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 9 | 0.04 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ○Loop 407 - traits.cpp:13-17 - libggml-cpu.so [...] | | 0.07 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 6 | 0.01 | 0.00 | | |
| ►ggml_compute_forward_add+ | libggml-cpu.so | 0.07 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | 0.00 | 9 | 0.00 | 0.00 | /home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00 | Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR... |
| ►Loop 1091 - vec.h:80-80 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1095 - vec.h:80-80 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1094 - vec.h:80-80 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1096 - vec.h:80-80 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1093 - vec.h:80-80 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1092 - vec.h:80-80 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1097 - vec.h:80-80 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1102 - vec.h:80-80 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1101 - vec.h:80-80 - libggml-cpu.so+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1100 - vec.h:80-80 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 1099 - vec.h:80-80 - libggml-cpu.so [...]+ | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1098 - vec.h:80-80 - libggml-cpu.so | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1104 - ops.cpp:1395-1422 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 1103 - ops.cpp:1395-1424 - libggml-cpu.so [...] | | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |