options

Functions and Loops

36 loops and 74 functions have been discarded from the report because their ratio ((Max Inclusive Time Over Threads * 100) / Max Thread Active Time) is lower than the threshold set by object_coverage_threshold (0.1%). It represents about 0.11% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis.
Inclusive metrics are only related to the given object code and do not include other external objects / libraries.

Columns Filter

Max Thread Time / Walltime aocc_0 (%) Coverage aocc_0 (%) Coverage Excluding Loops aocc_0 (%) Max Inclusive Time Over Threads aocc_0 (s) Max Exclusive Time Over Threads aocc_0 (s) Inclusive Time w.r.t. Wall Time aocc_0 (s) Exclusive Time w.r.t. Wall Time aocc_0 (s) Nb Threads aocc_0 Deviation (coverage) aocc_0 Deviation (walltime) aocc_0 Categories aocc_0 GFLOPS aocc_0 Compilation Options Max Thread Time / Walltime Coverage Coverage Excluding Loops Max Inclusive Time Over Threads Max Exclusive Time Over Threads Inclusive Time w.r.t. Wall Time Exclusive Time w.r.t. Wall Time Nb Threads Deviation (coverage) Deviation (walltime) Categories GFLOPS Compilation Options
NameModuleMax Thread Time / Walltime aocc_0 (%)Coverage aocc_0 (%)Coverage Excluding Loops aocc_0 (%)Max Inclusive Time Over Threads aocc_0 (s)Max Exclusive Time Over Threads aocc_0 (s)Inclusive Time w.r.t. Wall Time aocc_0 (s)Exclusive Time w.r.t. Wall Time aocc_0 (s)Nb Threads aocc_0Deviation (coverage) aocc_0Deviation (walltime) aocc_0Categories aocc_0GFLOPS aocc_0Compilation Options
ggml_vec_dot_q8_0_q8_0+libggml-cpu.so82.8061.790.0357.340.0637.670.0219231.0018.95/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.0053.88AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 2536 - quants.c:1066-1073 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 2537 - quants.c:108-1042 - libggml-cpu.so [...]82.8061.7661.7657.3457.3437.6537.6519231.0018.9553.84
__kmp_hardware_timestamplibomp.so45.5720.0420.0431.5631.5612.2212.2219216.9410.13OMP (%): 100.000.09
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*)libomp.so37.3116.9716.9725.8425.8410.3510.3519214.268.53OMP (%): 100.000.18
__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*)libomp.so5.910.280.284.094.090.170.171910.700.43OMP (%): 100.0010.22
ggml_compute_forward_mul_mat+libggml-cpu.so0.300.190.040.210.090.120.021920.070.04/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.0052.54AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 89 - ggml-cpu.c:1289-1297 - libggml-cpu.so+0.010.000.000.030.010.000.0020.000.000.00
Loop 88 - ggml-cpu.c:1289-1297 - libggml-cpu.so+0.010.000.000.020.010.000.0060.000.000.00
Loop 87 - ggml-cpu.c:1289-1297 - libggml-cpu.so0.010.000.000.010.010.000.0040.000.000.00
Loop 91 - ggml-cpu.c:1248-1260 - libggml-cpu.so+0.010.000.000.030.010.000.0030.000.000.00
Loop 90 - ggml-cpu.c:1248-1260 - libggml-cpu.so0.030.000.000.020.020.000.00350.000.000.00
Loop 86 - ggml-cpu.c:1316-1328 - libggml-cpu.so+0.000.000.000.010.000.000.0000.000.000.00
Loop 85 - ggml-cpu.c:1317-1328 - libggml-cpu.so0.010.000.000.010.010.000.00130.000.003.97
Loop 66 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+0.060.150.010.320.040.090.001070.010.0161.16
Loop 67 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 68 - ggml-cpu.c:1163-1198 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 70 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 69 - ggml-cpu.c:1197-1198 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 75 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 74 - ggml-cpu.c:1197-1198 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 73 - ggml-cpu.c:1193-1194 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 72 - ggml-cpu.c:1164-1194 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 71 - ggml-cpu.c:1193-1194 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 76 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+0.010.140.000.280.010.080.00230.000.00595.29
Loop 77 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+0.040.140.010.270.030.080.001140.010.0174.23
Loop 81 - ggml-cpu.c:1164-1194 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 80 - ggml-cpu.c:1193-1194 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 79 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 78 - ggml-cpu.c:1197-1198 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 84 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+0.190.130.080.240.130.080.051920.050.0370.88
Loop 82 - ggml-cpu.c:1193-1194 - libggml-cpu.so0.130.050.050.090.090.030.031900.030.0253.36
Loop 83 - ggml-cpu.c:1197-1198 - libggml-cpu.so0.030.000.000.020.020.000.00410.010.00180.98
__GI___sched_yieldlibc.so.60.460.150.150.320.320.090.091460.140.09System (%): 100.000.00
void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<2>(long, long, long, long)+libggml-cpu.so0.220.130.000.150.000.080.001890.070.04/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.0091.54AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 2372 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+0.010.130.000.160.010.080.0070.010.00119.07
Loop 2373 - sgemm.cpp:138-1044 - libggml-cpu.so [...]0.220.130.130.150.150.080.081890.070.0491.50
void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<4>(long, long, long, long)+libggml-cpu.so0.200.120.000.140.000.070.001890.070.04/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00201.52AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 2363 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+0.000.120.000.140.000.070.00190.000.000.00
Loop 2364 - sgemm.cpp:138-1044 - libggml-cpu.so [...]0.200.120.120.140.140.070.071890.070.04200.87
ggml_graph_compute_thread+libggml-cpu.so0.130.060.010.090.040.030.011870.030.02/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.0020.83AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 114 - ggml-cpu.c:1572-1579 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 113 - ggml-cpu.c:1572-1579 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 100 - ggml-cpu.c:682-1654 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 101 - ggml-cpu.c:1436-1654 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 107 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 109 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 108 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 111 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 110 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 102 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 104 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 106 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 105 - ggml-cpu.c:1454-1462 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 103 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 117 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 116 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 115 - ggml-cpu.c:1552-1560 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 112 - ggml-cpu.c:1585-1587 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 99 - ggml-cpu.c:533-2891 - libggml-cpu.so [...]0.120.050.050.080.080.030.031840.030.0225.47
Loop 118 - ggml-cpu.c:2087-2088 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
ggml_compute_forward_flash_attn_ext+libggml-cpu.so0.430.050.000.300.020.030.00540.150.09/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.001782.92AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 1700 - vec.h:375-751 - libggml-cpu.so [...]+0.030.050.000.400.020.030.0040.010.0111.91
Loop 1705 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1709 - ops.cpp:8885-8886 - libggml-cpu.so [...]0.040.000.000.030.030.000.00310.010.01105.59
Loop 1706 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1702 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1703 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1701 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1707 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1704 - vec.h:677-682 - libggml-cpu.so0.070.000.000.050.050.000.00100.020.017.94
Loop 1710 - ops.cpp:8759-8881 - libggml-cpu.so [...]+0.100.040.010.300.070.030.00320.030.023130.72
Loop 1723 - vec.h:491-497 - libggml-cpu.so0.320.040.040.220.220.020.02330.080.051781.82
Loop 1714 - vec.h:375-381 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1722 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1711 - vec.h:386-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1725 - vec.h:502-503 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1724 - vec.h:502-503 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1719 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1720 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1729 - vec.h:750-751 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1715 - vec.h:386-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1718 - vec.h:375-381 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1716 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1726 - vec.h:502-503 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1730 - vec.h:740-745 - libggml-cpu.so0.010.000.000.010.010.000.00120.010.00158.56
Loop 1717 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1712 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1713 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1727 - vec.h:750-751 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1728 - vec.h:750-751 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1721 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1708 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+libggml-cpu.so0.160.040.000.110.020.020.001920.030.02/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00311.75AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 1451 - ops.cpp:6210-6484 - libggml-cpu.so [...]+0.010.030.000.170.010.020.00200.010.00140.87
Loop 1452 - ops.cpp:6210-6484 - libggml-cpu.so [...]+0.040.030.000.160.030.020.001220.010.01128.64
Loop 1460 - ops.cpp:6220-6245 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1462 - ops.cpp:6220-6245 - libggml-cpu.so [...]0.100.020.020.070.070.010.011920.020.01369.05
Loop 1453 - ops.cpp:6365-6484 - libggml-cpu.so [...]+0.040.010.000.060.030.000.001680.010.01413.81
Loop 1456 - ops.cpp:6446-6456 - libggml-cpu.so [...]0.040.000.000.030.030.000.00250.010.01124.12
Loop 1454 - ops.cpp:6479-6484 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1457 - ops.cpp:6429-6442 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1458 - ops.cpp:6413-6426 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1455 - ops.cpp:6462-6475 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1461 - ops.cpp:6210-6245 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1459 - ops.cpp:6210-6245 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
ggml_vec_dot_f16+libggml-cpu.so0.290.030.000.200.060.020.00330.060.04/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.001079.36AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 760 - vec.cpp:324-325 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 761 - vec.cpp:311-316 - libggml-cpu.so0.250.030.030.170.170.020.02330.060.031005.58
quantize_row_q8_0+libggml-cpu.so0.130.030.000.090.020.020.001530.020.01/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.000.40AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 2527 - quants.c:298-355 - libggml-cpu.so [...]0.120.030.030.080.080.020.021390.020.010.46
f64xsubf128libm.so.60.100.030.030.070.070.020.021920.030.02Math (%): 100.00630.54
int __kmp_barrier_template<false>(barrier_type, int, int, unsigned long, void*, void (*)(void*, void*)) [clone .isra.33]libomp.so0.510.010.010.350.350.010.011870.040.03OMP (%): 100.00193.78
ggml_compute_forward_add_non_quantized+libggml-cpu.so0.170.010.000.120.030.000.00420.030.02/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.0046.83AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 383 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 382 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 381 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 373 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 375 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 378 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 377 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 376 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 374 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 399 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.120.000.000.0010.000.000.00
Loop 400 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.120.000.000.0000.000.000.00
Loop 401 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 402 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 398 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.170.000.000.120.120.000.0010.000.00226.14
Loop 380 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 379 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 390 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 391 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 393 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 394 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 395 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 392 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 397 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 396 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 361 - binary-ops.cpp:10-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 360 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 367 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 368 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 369 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 370 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 372 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 371 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 363 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 364 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 362 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 366 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 365 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 384 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 387 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 386 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 388 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 389 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 385 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
ggml_compute_forward_rms_norm+libggml-cpu.so0.130.000.000.090.010.000.00300.020.01/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.00172.93AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 1166 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1165 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1164 - ops.cpp:4319-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1179 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1180 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1181 - vec.h:673-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1182 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1178 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1177 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1183 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1199 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.090.000.000.0000.000.000.00
Loop 1198 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.090.000.000.0000.000.000.00
Loop 1197 - ops.cpp:4321-4338 - libggml-cpu.so [...]+0.000.000.000.090.000.000.0000.000.000.00
Loop 1195 - ops.cpp:4325-4326 - libggml-cpu.so0.130.000.000.090.090.000.0010.000.00528.99
Loop 1196 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0010.000.000.00
Loop 1185 - ops.cpp:4319-4338 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 1186 - ops.cpp:4319-4338 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 1187 - ops.cpp:4321-4338 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 1184 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1190 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1191 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1192 - vec.h:687-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1193 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1194 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1188 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1189 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1168 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1169 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1170 - vec.h:687-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1172 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1167 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1171 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1161 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1162 - ops.cpp:4319-4333 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1163 - ops.cpp:4321-4333 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1176 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1175 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1174 - ops.cpp:4321-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1173 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1203 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1204 - ops.cpp:4319-4338 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1205 - vec.h:673-688 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 1200 - ops.cpp:4325-4326 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1202 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1206 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1201 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1207 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
ggml_compute_forward_mul+libggml-cpu.so0.140.000.000.100.020.000.00240.030.02/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.0057.43AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 449 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 450 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 448 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 451 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 452 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 485 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.100.000.000.0000.000.000.00
Loop 486 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.100.000.000.0010.000.000.00
Loop 488 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 484 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.140.000.000.100.100.000.0010.000.00190.44
Loop 487 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 447 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 446 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 483 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 482 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 466 - binary-ops.cpp:18-95 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 465 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 453 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 455 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 457 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 458 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 456 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 454 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 470 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 473 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 475 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 474 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 472 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 471 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 459 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 460 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 461 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 463 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 464 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 462 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 469 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 468 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 467 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 476 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 479 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 478 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 481 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 480 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 477 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
ggml_vec_swiglu_f32+libggml-cpu.so0.290.000.000.200.000.000.0070.120.08/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc/bin/libggml-blas.so (%): 100.004611.39AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 764 - vec.cpp:402-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 765 - vec.h:1084-1116 - libggml-cpu.so [...]0.290.000.000.200.200.000.0070.120.084609.13
×