| Name | Module | Coverage gcc_0 (%) | Coverage Excluding Loops gcc_0 (%) | Max Inclusive Time Over Threads gcc_0 (s) | Max Exclusive Time Over Threads gcc_0 (s) | Inclusive Time w.r.t. Wall Time gcc_0 (s) | Exclusive Time w.r.t. Wall Time gcc_0 (s) | Nb Threads gcc_0 | Deviation (coverage) gcc_0 | Deviation (walltime) gcc_0 | Categories gcc_0 | Compilation Options |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ○gomp_team_barrier_wait_end | libgomp.so.1.0.0 | 81.80 | 81.80 | 21.50 | 21.50 | 20.96 | 20.96 | 72 | 6.54 | 4.79 | OMP (%): 100.00 | |
| ►ggml_vec_dot_q8_0_q8_0 | libggml-cpu.so | 10.65 | 0.02 | 3.49 | 0.02 | 2.73 | 0.01 | 72 | 2.52 | 0.60 | /scratch/users/amazouz/QAAS/service/Llama.cpp/ortce-gh/175-931-3387/llama.cpp/build/llama.cpp/../gcc/bin/libggml-blas.so (%): 100.00 | GNU C11 14.2.0 -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+sve2-sm4+norng+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu11 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 2043 - quants.c:1010-1026 - libggml-cpu.so [...] | libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 2039 - quants.c:908-1128 - libggml-cpu.so [...] | libggml-cpu.so | 10.63 | 0.03 | 3.52 | 0.01 | 2.72 | 0.01 | 33 | 0.03 | 0.00 | | |
| ○Loop 2041 - quants.c:979-1000 - libggml-cpu.so [...] | libggml-cpu.so | 10.60 | 10.60 | 3.49 | 3.49 | 2.72 | 2.72 | 72 | 2.52 | 0.60 | | |
| ○Loop 2042 - quants.c:910-929 - libggml-cpu.so [...] | libggml-cpu.so | 0.00 | 0.00 | 0.01 | 0.01 | 0.00 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 2040 - quants.c:1117-1124 - libggml-cpu.so [...] | libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 2044 - quants.c:1046-1075 - libggml-cpu.so [...] | libggml-cpu.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○syscall | libc.so.6 | 6.46 | 6.46 | 4.28 | 4.28 | 1.66 | 1.66 | 72 | 4.18 | 0.69 | System (%): 100.00 | |
| ○__aarch64_ldadd4_acq_rel | libgomp.so.1.0.0 | 0.18 | 0.18 | 0.31 | 0.31 | 0.05 | 0.05 | 56 | 0.29 | 0.04 | OMP (%): 100.00 | |
| ►ggml_vec_swiglu_f32 | libggml-cpu.so | 0.05 | 0.00 | 0.39 | 0.00 | 0.01 | 0.00 | 4 | 0.76 | 0.19 | /scratch/users/amazouz/QAAS/service/Llama.cpp/ortce-gh/175-931-3387/llama.cpp/build/llama.cpp/../gcc/bin/libggml-blas.so (%): 100.00 | GNU C++17 14.2.0 -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+sve2-sm4+norng+dotprod+i8mm+sve+nosme -mlittle-endian -mabi=lp64 -g -O3 -O3 -std=gnu++17 -fno-omit-frame-pointer -fcf-protection=none -fPIC -fopenmp |
| ○Loop 756 - vec.cpp:385-387 - libggml-cpu.so [...] | libggml-cpu.so | 0.05 | 0.05 | 0.39 | 0.39 | 0.01 | 0.01 | 4 | 0.76 | 0.19 | | |
| ►llama_token_data_array_partial_sort_inplace(llama_token_data_array*, int) | libllama.so | 0.05 | 0.00 | 0.40 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | /scratch/users/amazouz/QAAS/service/Llama.cpp/ortce-gh/175-931-3387/llama.cpp/build/llama.cpp/../gcc/bin/libllama.so (%): 100.00 | GNU C++17 14.2.0 -mlittle-endian -mabi=lp64 -g -O3 -O3 -fno-omit-frame-pointer -fcf-protection=none -fPIC GNU C17 14.2.0 -mlittle-endian -mabi=lp64 -g -g -g -O2 -O2 -O2 -fbuilding-libgcc -fno-stack-protector -fPIC |
| ►Loop 3482 - stl_heap.h:262-422 - libllama.so [...] | libllama.so | 0.05 | 0.00 | 0.40 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 3483 - stl_algo.h:1594-1595 - libllama.so [...] | libllama.so | 0.05 | 0.05 | 0.40 | 0.40 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | |
| ○Loop 3484 - stl_heap.h:262-422 - libllama.so [...] | libllama.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 3485 - stl_heap.h:355-360 - libllama.so | libllama.so | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►common_sampler_sample(common_sampler*, llama_context*, int, bool) | exec | 0.03 | 0.00 | 0.26 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | Exe (%): 100.00 | GNU C++17 14.2.0 -mlittle-endian -mabi=lp64 -g -O3 -O3 -fno-omit-frame-pointer -fcf-protection=none -fPIC GNU C17 14.2.0 -mlittle-endian -mabi=lp64 -g -g -g -O2 -O2 -O2 -fbuilding-libgcc -fno-stack-protector -fPIC |
| ►Loop 3889 - sampling.cpp:116-382 - exec [...] | exec | 0.03 | 0.00 | 0.26 | 0.00 | 0.01 | 0.00 | 0 | 0.00 | 0.00 | | |
| ►Loop 3890 - stl_vector.h:993-1949 - exec [...] | exec | 0.03 | 0.00 | 0.26 | 0.00 | 0.01 | 0.00 | 1 | 0.00 | 0.00 | | |
| ○Loop 3894 - sampling.cpp:125-126 - exec [...] | exec | 0.03 | 0.03 | 0.25 | 0.25 | 0.01 | 0.01 | 1 | 0.00 | 0.00 | | |
| ○Loop 3893 - sampling.cpp:125-126 - exec | exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 3891 - sampling.cpp:125-126 - exec | exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |
| ○Loop 3892 - sampling.cpp:125-126 - exec [...] | exec | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0 | 0.00 | 0.00 | | |