options

Functions and Loops

Columns Filter

Max Thread Time / Walltime gcc_5 (%) Coverage gcc_5 (%) Coverage Excluding Loops gcc_5 (%) Max Inclusive Time Over Threads gcc_5 (s) Max Exclusive Time Over Threads gcc_5 (s) Inclusive Time w.r.t. Wall Time gcc_5 (s) Exclusive Time w.r.t. Wall Time gcc_5 (s) Nb Threads gcc_5 Deviation (coverage) gcc_5 Deviation (walltime) gcc_5 Categories gcc_5 Compilation Options Max Thread Time / Walltime Coverage Coverage Excluding Loops Max Inclusive Time Over Threads Max Exclusive Time Over Threads Inclusive Time w.r.t. Wall Time Exclusive Time w.r.t. Wall Time Nb Threads Deviation (coverage) Deviation (walltime) Categories Compilation Options
NameModuleMax Thread Time / Walltime gcc_5 (%)Coverage gcc_5 (%)Coverage Excluding Loops gcc_5 (%)Max Inclusive Time Over Threads gcc_5 (s)Max Exclusive Time Over Threads gcc_5 (s)Inclusive Time w.r.t. Wall Time gcc_5 (s)Exclusive Time w.r.t. Wall Time gcc_5 (s)Nb Threads gcc_5Deviation (coverage) gcc_5Deviation (walltime) gcc_5Categories gcc_5Compilation Options
kai_run_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm+libggml-cpu.so29.1247.790.012.270.002.430.00954.550.20/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 2099 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.0047.780.002.290.002.430.0000.000.00
Loop 2098 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.3247.780.162.290.032.430.01660.120.01
Loop 2097 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so29.1247.6347.632.272.272.422.42954.540.20
Loop 2096 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 2095 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 2094 - kai_matmul_clamp_f32_qsi8d32p4x8_qsi4c32p4x8_16x4_neon_i8mm.c:131-131 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
gomp_team_barrier_wait_endlibgomp.so.1.0.039.0630.0630.063.053.051.531.53967.030.26OMP (%): 100.00
ggml_vec_dot_q6_K_q8_K+libggml-cpu.so7.5012.860.010.590.010.650.00960.690.02/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 1959 - quants.c:2486-2923 - libggml-cpu.so [...]+0.1312.850.070.640.010.650.00450.060.00
Loop 1961 - quants.c:2492-2654 - libggml-cpu.so [...]+1.3512.781.640.630.110.650.08960.380.02
Loop 1960 - quants.c:2506-2575 - libggml-cpu.so [...]6.7311.1411.140.530.530.570.57960.750.03
Loop 1962 - quants.c:2683-2814 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__GI___pthread_mutex_locklibc.so.62.312.432.430.180.180.120.12960.730.03Pthread (%): 100.00
__aarch64_ldadd4_acq_rellibgomp.so.1.0.01.541.511.510.120.120.080.08960.550.02OMP (%): 100.00
ggml_compute_forward_flash_attn_ext+libggml-cpu.so1.031.010.050.080.010.050.00860.340.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 1491 - simd-mappings.h:51-51 - libggml-cpu.so [...]0.060.010.010.010.010.000.00120.000.00
Loop 1498 - ops.cpp:8778-8939 - libggml-cpu.so [...]+0.060.950.000.130.010.050.0040.000.00
Loop 1490 - ops.cpp:8778-8939 - libggml-cpu.so [...]+0.060.060.010.030.010.000.00110.000.00
Loop 1501 - ops.cpp:8885-8886 - libggml-cpu.so [...]0.060.010.010.010.010.000.0050.000.00
Loop 1500 - vec.h:646-653 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1499 - vec.h:646-653 - libggml-cpu.so0.190.040.040.010.010.000.00250.060.00
Loop 1492 - ops.cpp:8817-8817 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1493 - ops.cpp:8817-8881 - libggml-cpu.so [...]+0.710.890.560.100.060.050.03860.280.01
Loop 1494 - vec.h:461-466 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1496 - vec.h:710-717 - libggml-cpu.so0.060.000.000.000.000.000.0020.000.00
Loop 1495 - vec.h:411-458 - libggml-cpu.so0.510.330.330.040.040.020.02830.200.01
Loop 1497 - vec.h:646-653 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
ggml_rope_cache_init(float, float, float const*, float*, long, float, float, float*, float, float)+libggml-cpu.so0.640.600.000.050.000.030.00940.220.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 727 - ops.cpp:6238-6247 - libggml-cpu.so+0.060.600.000.060.010.030.0020.010.00
Loop 726 - ops.cpp:6238-6245 - libggml-cpu.so0.640.600.600.050.050.030.03940.220.01
rope_yarn(float, float, float*, long, float, float, float*, float*)+libggml-cpu.so0.580.540.130.050.020.030.01940.260.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 725 - ops.cpp:6211-6231 - libggml-cpu.so [...]0.510.410.410.040.040.020.02920.200.01
ggml_vec_dot_f16+libggml-cpu.so0.640.500.040.050.020.030.00860.230.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 710 - vec.cpp:224-337 - libggml-cpu.so [...]+0.130.020.020.010.010.000.00160.040.00
Loop 711 - vec.cpp:266-269 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 712 - vec.cpp:231-262 - libggml-cpu.so0.580.440.440.050.050.020.02860.220.01
gomp_barrier_wait_endlibgomp.so.1.0.00.380.320.320.030.030.020.02900.160.01OMP (%): 100.00
ggml_vec_swiglu_f32+libggml-cpu.so1.730.290.000.130.000.010.00160.650.03/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 714 - vec.cpp:385-387 - libggml-cpu.so [...]1.730.290.290.130.130.010.01160.650.03
sincosflibm.so.60.450.270.270.040.040.010.01840.170.01Math (%): 100.00
kai_run_lhs_quant_pack_qsi8d32p4x8sb_f32_neon+libggml-cpu.so2.630.180.000.200.000.010.0040.630.02/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 2063 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:90-340 - libggml-cpu.so [...]+0.000.180.000.200.000.010.0000.000.00
Loop 2061 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:267-340 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 2065 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:93-264 - libggml-cpu.so [...]+0.000.180.000.200.000.010.0000.000.00
Loop 2066 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:96-262 - libggml-cpu.so [...]2.630.180.180.200.200.010.0140.630.02
Loop 2062 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:268-336 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 2064 - kai_lhs_quant_pack_qsi8d32p4x8sb_f32_neon.c:271-332 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__expf_finitelibm.so.60.320.170.170.030.030.010.01680.130.01Math (%): 100.00
kai_run_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0+libggml-cpu.so8.140.160.000.630.000.010.0010.000.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 2081 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-107 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2078 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-158 - libggml-cpu.so+0.260.160.000.630.020.010.0010.000.00
Loop 2074 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:126-154 - libggml-cpu.so [...]+1.280.100.020.390.100.000.0010.000.00
Loop 2079 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-139 - libggml-cpu.so [...]3.780.070.070.290.290.000.0010.000.00
Loop 2073 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:127-134 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2075 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:107-153 - libggml-cpu.so [...]+2.820.050.050.220.220.000.0010.000.00
Loop 2076 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:126-126 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2077 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:115-118 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 2080 - kai_rhs_pack_nxk_qsi4c32pscalef16_qsu4c32s16s0.c:135-141 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+libggml-cpu.so0.320.140.000.020.000.010.00640.120.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 735 - ops.cpp:6389-6490 - libggml-cpu.so [...]+0.060.130.010.060.010.010.00120.000.00
Loop 742 - ops.cpp:6446-6457 - libggml-cpu.so0.190.050.050.020.020.000.00280.080.00
Loop 736 - ops.cpp:6389-6484 - libggml-cpu.so [...]+0.000.070.000.030.000.000.0000.000.00
Loop 741 - ops.cpp:6479-6484 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 732 - ops.cpp:6389-6475 - libggml-cpu.so [...]+0.260.070.060.030.020.000.00340.090.00
Loop 737 - ops.cpp:6462-6475 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 738 - ops.cpp:6413-6426 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 734 - ops.cpp:6389-6411 - libggml-cpu.so [...]+0.060.010.010.010.010.000.0050.000.00
Loop 733 - ops.cpp:6389-6407 - libggml-cpu.so [...]0.060.010.010.010.010.000.0050.000.00
Loop 739 - ops.cpp:6429-6479 - libggml-cpu.so [...]+0.060.000.000.000.000.000.0010.000.00
Loop 740 - ops.cpp:6429-6442 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 743 - ops.cpp:6368-6372 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
__GI___lll_lock_waitlibc.so.60.320.140.140.030.030.010.01680.110.00System (%): 100.00
ggml_compute_forward_add_non_quantized+libggml-cpu.so0.580.120.010.040.010.010.00230.300.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 355 - binary-ops.cpp:10-146 - libggml-cpu.so [...]+0.000.110.000.050.000.010.0000.000.00
Loop 374 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 378 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 377 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 376 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 375 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 369 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 371 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 370 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 372 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 373 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 356 - binary-ops.cpp:10-146 - libggml-cpu.so [...]+0.060.110.000.050.000.010.0020.000.00
Loop 357 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 360 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 359 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 358 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 361 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 365 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.110.000.040.000.010.0000.000.00
Loop 366 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 364 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 368 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 363 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.110.000.040.000.010.0000.000.00
Loop 362 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.580.110.110.040.040.010.01160.220.01
Loop 367 - binary-ops.cpp:31-31 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 379 - binary-ops.cpp:31-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 381 - binary-ops.cpp:31-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 380 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 383 - binary-ops.cpp:42-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 382 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 389 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 392 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 391 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 390 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 393 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 384 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 386 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 385 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 388 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 387 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
ggml_compute_forward_mul+libggml-cpu.so0.510.090.010.040.010.000.00220.240.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 462 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 465 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 466 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 464 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 463 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 467 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 471 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 469 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 468 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 470 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 457 - binary-ops.cpp:31-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 459 - binary-ops.cpp:31-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 458 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 460 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 461 - binary-ops.cpp:42-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 433 - binary-ops.cpp:18-154 - libggml-cpu.so [...]+0.000.080.000.040.000.000.0000.000.00
Loop 434 - binary-ops.cpp:18-154 - libggml-cpu.so [...]+0.060.080.000.040.000.000.0010.000.00
Loop 435 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 439 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 438 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 437 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 436 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 445 - binary-ops.cpp:31-31 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 443 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.080.000.040.000.000.0000.000.00
Loop 441 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.080.000.040.000.000.0000.000.00
Loop 440 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.510.080.080.040.040.000.00160.210.01
Loop 444 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 442 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 446 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 447 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 450 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 449 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 448 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 451 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 452 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 455 - binary-ops.cpp:31-101 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 454 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 453 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 456 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__GI___memcpy_svelibc.so.60.770.080.080.060.060.000.00420.170.01Memory (%): 100.00
ggml_graph_compute_thread+libggml-cpu.so0.190.070.010.020.010.000.00430.070.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 63 - ggml-cpu.c:1500-1560 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 62 - ggml-cpu.c:1553-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 61 - ggml-cpu.c:1554-1560 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 73 - ggml-cpu.c:1552-1554 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 65 - ggml-cpu.c:1424-1642 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 66 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 67 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 68 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 69 - ggml-cpu.c:1461-1462 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 64 - ggml-cpu.c:1585-1587 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 60 - ggml-cpu.c:1664-2898 - libggml-cpu.so [...]+0.190.070.060.020.020.000.00350.060.00
Loop 59 - ggml-cpu.c:2879-2898 - libggml-cpu.so [...]0.060.010.010.010.010.000.0080.000.00
Loop 70 - ggml-cpu.c:1572-1600 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.00
Loop 71 - ggml-cpu.c:1572-1579 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 75 - ggml-cpu.c:1552-1553 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 76 - ggml-cpu.c:2087-2088 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 74 - ggml-cpu.c:1474-1539 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
Loop 72 - ggml-cpu.c:1572-1573 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
__GI___lll_lock_wakelibc.so.60.190.070.070.020.020.000.00420.070.00System (%): 100.00
ggml_cpu_fp32_to_fp16+libggml-cpu.so0.190.070.000.020.000.000.00420.080.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 4 - ggml-cpu.c:3228-3229 - libggml-cpu.so [...]0.190.070.070.020.020.000.00420.080.00
ggml_compute_forward_rms_norm+libggml-cpu.so0.320.060.000.030.000.000.00170.130.01/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 1127 - ops.cpp:4319-4365 - libggml-cpu.so [...]+0.060.060.000.050.000.000.0010.000.00
Loop 1123 - ops.cpp:4319-4343 - libggml-cpu.so [...]+0.000.060.000.040.000.000.0000.000.00
Loop 1125 - ops.cpp:4319-4343 - libggml-cpu.so [...]+0.000.060.000.040.000.000.0000.000.00
Loop 1126 - ops.cpp:4319-4321 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 1124 - ops.cpp:4325-4326 - libggml-cpu.so0.320.050.050.030.030.000.00160.130.01
Loop 1128 - vec.h:646-653 - libggml-cpu.so0.190.010.010.020.020.000.00100.070.00
__GI___pthread_mutex_unlock_usercntlibc.so.60.190.060.060.020.020.000.00350.080.00Pthread (%): 100.00
unknown_functionlibggml-cpu.so0.190.060.000.020.000.000.00380.060.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00
ggml::cpu::kleidiai::extra_buffer_type::get_tensor_traits(ggml_tensor const*)+libggml-cpu.so0.190.060.040.010.010.000.00370.060.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 2037 - kleidiai.cpp:535-547 - libggml-cpu.so [...]0.130.010.010.010.010.000.0090.040.00
ggml_compute_forward_mul_mat+libggml-cpu.so0.190.060.000.020.000.000.00380.050.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 56 - ggml-cpu.c:1289-1297 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 55 - ggml-cpu.c:1290-1297 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.00
Loop 54 - ggml-cpu.c:1291-1297 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 57 - ggml-cpu.c:1289-1291 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 58 - ggml-cpu.c:1289-1290 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 50 - ggml-cpu.c:1125-1397 - libggml-cpu.so [...]+0.000.060.000.030.000.000.0000.000.00
Loop 48 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+0.000.040.000.020.000.000.0000.000.00
Loop 47 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+0.060.040.010.020.010.000.00110.000.00
Loop 53 - ggml-cpu.c:1193-1194 - libggml-cpu.so0.130.030.030.010.010.000.00200.030.00
Loop 52 - ggml-cpu.c:1197-1198 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
Loop 49 - ggml-cpu.c:1132-1198 - libggml-cpu.so [...]0.060.000.000.010.010.000.0010.000.00
Loop 51 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]0.060.010.010.010.010.000.00110.000.00
ggml_cpu_extra_compute_forward+libggml-cpu.so0.130.020.020.010.010.000.00160.050.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 353 - traits.cpp:13-17 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.00
ggml::cpu::kleidiai::tensor_traits::compute_forward_q4_0(ggml_compute_params*, ggml_tensor*) [clone .isra.0]+libggml-cpu.so0.060.020.020.010.010.000.00160.010.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 2021 - std_function.h:247-591 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.00
__GI___memset_genericlibc.so.60.770.020.020.060.060.000.0030.620.03Memory (%): 100.00
ggml_is_emptylibggml-base.so0.060.020.020.010.010.000.00130.000.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml.so (%): 100.00GNU C11 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu11 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC
ggml_backend_cpu_kleidiai_buffer_type+libggml-cpu.so0.130.010.010.010.010.000.00100.050.00/home/eoseret/Tools/QaaS/qaas_runs/ip-172-31-47-249.ec2.internal/176-138-2040/llama.cpp/build/llama.cpp/../gcc_5/bin/libggml-blas.so (%): 100.00GNU C++17 14.2.0 -mcpu=neoverse-v2 -mlittle-endian -mabi=lp64 -g -O2 -std=gnu++17 -funroll-loops -ffast-math -fno-omit-frame-pointer -fcf-protection=none -fno-finite-math-only -fPIC -fopenmp
Loop 2020 - kleidiai.cpp:56-569 - libggml-cpu.so [...]+0.060.000.000.010.000.000.0020.000.00
Loop 2019 - kleidiai.cpp:56-569 - libggml-cpu.so [...]0.060.000.000.010.010.000.0020.000.00
×