options

Functions and Loops

Columns Filter

Max Thread Time / Walltime armclang_3 (%) Coverage armclang_3 (%) Coverage Excluding Loops armclang_3 (%) Max Inclusive Time Over Threads armclang_3 (s) Max Exclusive Time Over Threads armclang_3 (s) Inclusive Time w.r.t. Wall Time armclang_3 (s) Exclusive Time w.r.t. Wall Time armclang_3 (s) Nb Threads armclang_3 Deviation (coverage) armclang_3 Deviation (walltime) armclang_3 Categories armclang_3 Compilation Options Max Thread Time / Walltime Coverage Coverage Excluding Loops Max Inclusive Time Over Threads Max Exclusive Time Over Threads Inclusive Time w.r.t. Wall Time Exclusive Time w.r.t. Wall Time Nb Threads Deviation (coverage) Deviation (walltime) Categories Compilation Options
NameModuleMax Thread Time / Walltime armclang_3 (%)Coverage armclang_3 (%)Coverage Excluding Loops armclang_3 (%)Max Inclusive Time Over Threads armclang_3 (s)Max Exclusive Time Over Threads armclang_3 (s)Inclusive Time w.r.t. Wall Time armclang_3 (s)Exclusive Time w.r.t. Wall Time armclang_3 (s)Nb Threads armclang_3Deviation (coverage) armclang_3Deviation (walltime) armclang_3Categories armclang_3Compilation Options
kai_run_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm+libggml-cpu.so30.6047.490.002.260.002.390.00955.020.21/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU...
Loop 2527 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 2526 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 2525 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2530 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.0747.480.002.280.002.390.0010.000.00
Loop 2529 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.2747.480.132.270.022.390.01620.100.00
Loop 2528 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so30.5447.3547.352.252.252.382.38955.010.21
kmp_flag_64<false, true>::wait(kmp_info*, int, void*)libomp.so35.9527.2327.232.652.651.371.37966.720.27OMP (%): 100.00
ggml_vec_dot_q6_K_q8_K+libggml-cpu.so8.1913.710.040.610.010.690.00960.600.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU...
Loop 2424 - quants.c:2835-2913 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2426 - quants.c:2492-2660 - libggml-cpu.so [...]+1.4213.671.580.660.110.690.08960.350.01
Loop 2425 - quants.c:2506-2590 - libggml-cpu.so [...]7.5212.0912.090.550.550.610.61960.630.02
__GI___pthread_mutex_locklibc.so.62.842.632.630.210.210.130.13951.110.05Pthread (%): 100.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::notdone_check()libomp.so2.441.811.810.180.180.090.09960.650.03OMP (%): 100.00
ggml_compute_forward_flash_attn_ext+libggml-cpu.so1.151.020.030.090.010.050.00880.410.02/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 1894 - ops.cpp:8778-8920 - libggml-cpu.so [...]+0.070.980.000.130.000.050.0010.000.00
Loop 1892 - ops.cpp:8885-8886 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1893 - vec.h:375-751 - libggml-cpu.so [...]+0.140.980.020.120.010.050.00150.030.00
Loop 1915 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1895 - ops.cpp:8885-8886 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1913 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1914 - vec.h:677-682 - libggml-cpu.so0.140.040.040.010.010.000.00290.050.00
Loop 1912 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1916 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1917 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1896 - ops.cpp:8793-8881 - libggml-cpu.so [...]+0.740.920.600.100.060.050.03860.280.01
Loop 1904 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1906 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1901 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1907 - vec.h:503-503 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1897 - vec.h:386-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1900 - vec.h:386-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1905 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1899 - vec.h:375-381 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1902 - vec.h:375-381 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1910 - vec.h:751-751 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1898 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1911 - vec.h:740-745 - libggml-cpu.so0.070.000.000.010.010.000.0020.000.00
Loop 1908 - vec.h:491-497 - libggml-cpu.so0.540.320.320.040.040.020.02790.190.01
Loop 1903 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1909 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+libggml-cpu.so1.020.810.000.080.000.040.00960.310.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 1588 - ops.cpp:6210-6484 - libggml-cpu.so [...]+0.070.810.020.100.010.040.00190.010.00
Loop 1592 - ops.cpp:6413-6426 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1590 - ops.cpp:6479-6484 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1589 - ops.cpp:6462-6475 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1587 - ops.cpp:6210-6462 - libggml-cpu.so [...]+0.200.710.030.080.010.040.00240.060.00
Loop 1596 - ops.cpp:6220-6245 - libggml-cpu.so [...]0.880.680.680.060.060.030.03950.280.01
Loop 1594 - ops.cpp:6210-6303 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1597 - ops.cpp:6210-6245 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1595 - ops.cpp:6220-6245 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1593 - ops.cpp:6429-6442 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1591 - ops.cpp:6446-6456 - libggml-cpu.so [...]0.270.070.070.020.020.000.00390.080.00
ggml_vec_dot_f16+libggml-cpu.so0.810.600.030.060.010.030.00860.300.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 907 - vec.cpp:324-325 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 908 - vec.cpp:311-316 - libggml-cpu.so0.810.570.570.060.060.030.03860.290.01
Loop 906 - vec.cpp:325-325 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__GI___sched_yieldlibc.so.60.610.600.600.050.050.030.03960.250.01OMP (%): 100.00
unknown_function[vdso]0.880.420.000.070.000.020.00930.270.01OMP (%): 100.00
__aarch64_ldadd8_acq_rellibomp.so1.420.360.360.100.100.020.02820.380.02OMP (%): 100.00
__sincosf_finitelibamath.so0.470.320.320.040.040.020.02920.180.01Math (%): 100.00
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*)libomp.so0.540.300.300.040.040.010.01860.170.01OMP (%): 100.00
ggml_vec_swiglu_f32+libggml-cpu.so1.350.250.000.100.000.010.00160.460.02/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 913 - vec.cpp:402-405 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 912 - vec.cpp:402-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 914 - vec.cpp:403-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 915 - vec.h:1045-1072 - libggml-cpu.so [...]+0.000.250.000.100.000.010.0000.000.00
Loop 916 - vec.h:1045-1072 - libggml-cpu.so [...]1.350.250.250.100.100.010.01160.460.02
ggml::cpu::kleidiai::extra_buffer_type::get_tensor_traits(ggml_tensor const*)libggml-cpu.so0.540.220.220.040.040.010.01680.210.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
ggml_graph_compute_thread+libggml-cpu.so0.540.220.000.040.000.010.00590.240.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU...
Loop 90 - ggml-cpu.c:2087-2088 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 84 - ggml-cpu.c:1585-1587 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 73 - ggml-cpu.c:533-2897 - libggml-cpu.so [...]+0.540.220.220.040.040.010.01590.240.01
Loop 72 - ggml-cpu.c:533-2897 - libggml-cpu.so [...]0.070.000.000.000.000.000.0010.000.00
Loop 75 - ggml-cpu.c:1436-1642 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 79 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 78 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 77 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 76 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 83 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 82 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 81 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 80 - ggml-cpu.c:1461-1462 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 74 - ggml-cpu.c:1592-1601 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 89 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 88 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 87 - ggml-cpu.c:1552-1560 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 86 - ggml-cpu.c:1572-1579 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 85 - ggml-cpu.c:1573-1579 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
kai_run_lhs_quant_pack_qsi8d32p4x8sb_f32_neon+libggml-cpu.so2.840.180.000.210.000.010.0040.570.02/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU...
Loop 2494 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:268-335 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2495 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:271-332 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2497 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:93-264 - libggml-cpu.so [...]+0.000.180.000.210.000.010.0000.000.00
Loop 2496 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:96-258 - libggml-cpu.so [...]2.840.180.180.210.210.010.0140.570.02
ggml_compute_forward_mul+libggml-cpu.so0.810.180.090.060.010.010.00590.230.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 516 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 517 - binary-ops.cpp:18-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 515 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 537 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 538 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 536 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 535 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 534 - binary-ops.cpp:84-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 511 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 512 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 514 - binary-ops.cpp:42-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 513 - binary-ops.cpp:42-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 548 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 550 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 549 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 551 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 555 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 554 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 556 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 558 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 557 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 559 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 561 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 560 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 545 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 544 - binary-ops.cpp:84-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 547 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 546 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 540 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.090.000.040.000.000.0000.000.00
Loop 539 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 543 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 542 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 541 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.610.090.090.040.040.000.00160.240.01
Loop 553 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 552 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 518 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 519 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 520 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 521 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 524 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 522 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 523 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 532 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 533 - binary-ops.cpp:18-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 531 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 530 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 525 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 527 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 526 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 528 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 529 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__expf_finitelibamath.so0.410.180.180.030.030.010.01700.130.01Math (%): 100.00
__GI___lll_lock_waitlibc.so.60.340.160.160.030.030.010.01700.120.00System (%): 100.00
kai_run_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0+libggml-cpu.so6.160.110.000.450.000.010.0010.000.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU...
Loop 2507 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2509 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2508 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-134 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2502 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2501 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2503 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-148 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2504 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2505 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-142 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2506 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2510 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-154 - libggml-cpu.so [...]+0.740.110.010.460.050.010.0010.000.00
Loop 2512 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so1.760.030.030.130.130.000.0010.000.00
Loop 2511 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-139 - libggml-cpu.so [...]3.660.070.070.270.270.000.0010.000.00
__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*)libomp.so0.270.110.110.020.020.010.01510.110.01OMP (%): 100.00
ggml_cpu_fp32_to_fp16+libggml-cpu.so0.410.100.000.030.010.010.00460.140.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU...
Loop 0 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...]0.410.100.100.030.030.000.00460.140.01
Loop 1 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
ggml_compute_forward_add_non_quantized+libggml-cpu.so0.680.100.010.050.010.000.00220.330.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 433 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 434 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 432 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 430 - binary-ops.cpp:84-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 431 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 414 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 415 - binary-ops.cpp:10-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 413 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 428 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 427 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 429 - binary-ops.cpp:10-95 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 444 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 446 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 447 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 445 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 448 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 449 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 451 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 450 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 454 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 456 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 457 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 455 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 459 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 458 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 426 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 423 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 425 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 422 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 424 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 421 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 436 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.080.000.050.000.000.0000.000.00
Loop 439 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 435 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 438 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 437 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.680.080.080.050.050.000.00150.320.01
Loop 441 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 440 - binary-ops.cpp:84-84 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 442 - ggml-impl.h:355-404 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 443 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 409 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 410 - ggml-impl.h:355-404 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 412 - binary-ops.cpp:42-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 411 - binary-ops.cpp:42-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 416 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 417 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 419 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 420 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 418 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 453 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 452 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__memcpylibastring.so0.740.090.090.050.050.000.00470.160.01String (%): 100.00
__GI___lll_lock_wakelibc.so.60.200.070.070.010.010.000.00460.060.00System (%): 100.00
__kmp_barrierlibomp.so0.200.070.070.020.020.000.00420.070.00OMP (%): 100.00
kmp_flag_native<unsigned long long, (flag_type)1, true>::done_check()libomp.so0.470.060.060.040.040.000.00190.240.01OMP (%): 100.00
__GI___pthread_mutex_unlock_usercntlibc.so.60.270.060.060.020.020.000.00230.140.01Pthread (%): 100.00
@plt_start@libomp.so0.270.050.050.020.020.000.00330.080.00OMP (%): 100.00
ggml_compute_forward_mul_mat+libggml-cpu.so0.200.050.000.010.010.000.00310.070.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU...
Loop 60 - ggml-cpu.c:1289-1297 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 59 - ggml-cpu.c:1289-1297 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 58 - ggml-cpu.c:1289-1297 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 55 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+0.070.050.000.030.000.000.0020.000.00
Loop 56 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+0.070.040.010.030.010.000.0050.000.00
Loop 54 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]0.140.020.020.010.010.000.00120.030.00
Loop 53 - ggml-cpu.c:1183-1194 - libggml-cpu.so [...]0.140.020.020.010.010.000.00170.030.00
Loop 57 - ggml-cpu.c:1197-1198 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
ggml_compute_forward_rms_norm+libggml-cpu.so0.340.050.000.030.000.000.00180.140.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 1313 - ops.cpp:4319-4365 - libggml-cpu.so [...]+0.000.040.000.040.000.000.0000.000.00
Loop 1317 - ops.cpp:4319-4365 - libggml-cpu.so [...]+0.070.040.000.040.000.000.0010.000.00
Loop 1330 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1331 - ops.cpp:4321-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1334 - ops.cpp:4320-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1332 - ops.cpp:4321-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1333 - ops.cpp:4321-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1325 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1326 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1327 - vec.h:687-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1329 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1323 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1324 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1328 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1318 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1319 - ops.cpp:4321-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1322 - ops.cpp:4320-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1321 - ops.cpp:4321-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1320 - ops.cpp:4321-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1344 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1343 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1342 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1341 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1357 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.040.000.040.000.000.0000.000.00
Loop 1356 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.040.000.040.000.000.0000.000.00
Loop 1355 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.070.040.000.040.000.000.0010.000.00
Loop 1353 - ops.cpp:4325-4326 - libggml-cpu.so0.270.020.020.020.020.000.00110.120.01
Loop 1358 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1354 - vec.h:677-682 - libggml-cpu.so0.140.020.020.010.010.000.00110.050.00
Loop 1337 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1338 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1339 - vec.h:677-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1335 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1336 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1340 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1348 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1349 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1350 - vec.h:677-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1352 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1345 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1351 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1347 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1346 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1314 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1315 - vec.h:687-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1316 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1312 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
ggml_is_emptylibggml-base.so0.270.040.040.020.020.000.00260.090.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 -D GGML_BUILD -D GGML_COMMIT="unknown" -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_KLEID...
unknown_functionlibggml-cpu.so0.140.040.000.010.000.000.00300.040.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00
ggml_compute_forward_set_rows+libggml-cpu.so0.140.040.040.010.010.000.00260.040.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 1510 - ops.cpp:5550-5563 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 1509 - ops.cpp:5551-5563 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 1508 - ops.cpp:5552-5563 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
ggml::cpu::kleidiai::tensor_traits::compute_forward_q4_0(ggml_compute_params*, ggml_tensor*)libggml-cpu.so0.140.040.040.010.010.000.00230.050.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
__kmp_yieldlibomp.so0.140.030.030.010.010.000.00230.040.00OMP (%): 100.00
__kmp_now_nseclibomp.so0.140.020.020.010.010.000.00180.040.00OMP (%): 100.00
__kmpc_barrierlibomp.so0.140.020.020.010.010.000.00130.030.00OMP (%): 100.00
__memsetlibastring.so0.680.010.010.050.050.000.0020.650.03String (%): 100.00
ggml_cpu_extra_compute_forward+libggml-cpu.so0.140.010.000.010.010.000.0090.040.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 407 - traits.cpp:13-17 - libggml-cpu.so [...]0.070.010.010.010.010.000.0060.010.00
ggml_compute_forward_add+libggml-cpu.so0.070.010.010.010.010.000.0090.000.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../armclang_3/bin/libggml-blas.so (%): 100.00Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_AmazonLinux-2023/llvm-bin/clang-19 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHAR...
Loop 1091 - vec.h:80-80 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1095 - vec.h:80-80 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 1094 - vec.h:80-80 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1096 - vec.h:80-80 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1093 - vec.h:80-80 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1092 - vec.h:80-80 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1097 - vec.h:80-80 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1102 - vec.h:80-80 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1101 - vec.h:80-80 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 1100 - vec.h:80-80 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1099 - vec.h:80-80 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 1098 - vec.h:80-80 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1104 - ops.cpp:1395-1422 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 1103 - ops.cpp:1395-1424 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
×