options

Functions and Loops

38 loops and 72 functions have been discarded from the report because their ratio ((Max Inclusive Time Over Threads * 100) / Max Thread Active Time) is lower than the threshold set by object_coverage_threshold (0.1%). It represents about 0.17% of the application. To include them, change the value of object_coverage_threshold in the experiment directory configuration file, then rerun the command with the additionnal parameter --force-static-analysis.
Inclusive metrics are only related to the given object code and do not include other external objects / libraries.

Columns Filter

Max Thread Time / Walltime aocc_7 (%) Coverage aocc_7 (%) Coverage Excluding Loops aocc_7 (%) Max Inclusive Time Over Threads aocc_7 (s) Max Exclusive Time Over Threads aocc_7 (s) Inclusive Time w.r.t. Wall Time aocc_7 (s) Exclusive Time w.r.t. Wall Time aocc_7 (s) Nb Threads aocc_7 Deviation (coverage) aocc_7 Deviation (walltime) aocc_7 Categories aocc_7 GFLOPS aocc_7 Compilation Options Max Thread Time / Walltime Coverage Coverage Excluding Loops Max Inclusive Time Over Threads Max Exclusive Time Over Threads Inclusive Time w.r.t. Wall Time Exclusive Time w.r.t. Wall Time Nb Threads Deviation (coverage) Deviation (walltime) Categories GFLOPS Compilation Options
NameModuleMax Thread Time / Walltime aocc_7 (%)Coverage aocc_7 (%)Coverage Excluding Loops aocc_7 (%)Max Inclusive Time Over Threads aocc_7 (s)Max Exclusive Time Over Threads aocc_7 (s)Inclusive Time w.r.t. Wall Time aocc_7 (s)Exclusive Time w.r.t. Wall Time aocc_7 (s)Nb Threads aocc_7Deviation (coverage) aocc_7Deviation (walltime) aocc_7Categories aocc_7GFLOPS aocc_7Compilation Options
ggml_vec_dot_q8_0_q8_0+libggml-cpu.so82.6661.880.0357.270.0637.710.0219231.0919.00/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.0053.86AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 2158 - quants.c:1066-1073 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 2159 - quants.c:108-1042 - libggml-cpu.so [...]82.6461.8561.8557.2657.2637.7037.7019231.0919.0053.82
__kmp_hardware_timestamplibomp.so45.0020.0120.0131.1831.1812.2012.2019217.0010.18OMP (%): 100.000.08
__kmp_hyper_barrier_release(barrier_type, kmp_info*, int, int, int, void*)libomp.so37.0516.9616.9625.6725.6710.3410.3419214.278.54OMP (%): 100.000.16
__kmp_hyper_barrier_gather(barrier_type, kmp_info*, int, int, void (*)(void*, void*), void*)libomp.so6.060.270.274.204.200.170.171880.710.43OMP (%): 100.0011.18
ggml_compute_forward_mul_mat+libggml-cpu.so0.350.160.040.240.070.100.021920.070.04/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.0048.17AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 65 - ggml-cpu.c:1316-1328 - libggml-cpu.so+0.000.000.000.010.000.000.0010.000.000.00
Loop 64 - ggml-cpu.c:1317-1328 - libggml-cpu.so0.010.000.000.010.010.000.00190.000.002.65
Loop 68 - ggml-cpu.c:1289-1297 - libggml-cpu.so+0.010.000.000.030.010.000.00120.000.000.00
Loop 67 - ggml-cpu.c:1289-1297 - libggml-cpu.so+0.010.000.000.020.010.000.00100.000.000.00
Loop 66 - ggml-cpu.c:1289-1297 - libggml-cpu.so0.010.000.000.010.010.000.00150.000.000.00
Loop 70 - ggml-cpu.c:1248-1260 - libggml-cpu.so+0.010.000.000.030.010.000.0020.000.000.00
Loop 69 - ggml-cpu.c:1248-1260 - libggml-cpu.so0.030.000.000.020.020.000.00350.000.000.00
Loop 58 - ggml-cpu.c:1125-1395 - libggml-cpu.so [...]+0.040.110.010.330.030.070.001000.010.0162.17
Loop 59 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+0.030.110.000.300.020.070.00420.010.0178.60
Loop 60 - ggml-cpu.c:1162-1198 - libggml-cpu.so [...]+0.040.110.000.280.030.060.001120.010.01133.61
Loop 61 - ggml-cpu.c:1164-1198 - libggml-cpu.so [...]+0.220.100.090.250.150.060.061920.050.0355.91
Loop 63 - ggml-cpu.c:1193-1194 - libggml-cpu.so0.130.010.010.090.090.010.011400.020.01131.51
Loop 62 - ggml-cpu.c:1197-1198 - libggml-cpu.so0.010.000.000.010.010.000.00350.010.00224.58
__GI___sched_yieldlibc.so.60.520.150.150.360.360.090.091500.150.09System (%): 100.000.00
void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<4>(long, long, long, long)+libggml-cpu.so0.190.110.000.130.000.070.001890.070.04/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00210.64AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 2011 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+0.000.110.000.200.000.070.0080.000.000.00
Loop 2010 - sgemm.cpp:205-853 - libggml-cpu.so [...]0.010.000.000.010.010.000.00180.000.00857.56
Loop 2013 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+0.170.110.080.190.120.070.051890.050.03171.59
Loop 2012 - sgemm.cpp:138-1044 - libggml-cpu.so [...]0.100.040.040.070.070.020.021890.030.02290.25
void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<2>(long, long, long, long)+libggml-cpu.so0.190.110.000.130.010.070.001890.060.04/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00108.17AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 2020 - sgemm.cpp:138-1044 - libggml-cpu.so [...]+0.000.110.000.130.000.070.0040.000.000.00
Loop 2021 - sgemm.cpp:138-1044 - libggml-cpu.so [...]0.190.110.110.130.130.070.071890.060.04108.06
ggml_graph_compute_thread+libggml-cpu.so0.190.060.010.130.030.040.011890.040.02/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.0017.57AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 90 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 89 - ggml-cpu.c:1552-1560 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 88 - ggml-cpu.c:1552-1560 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 79 - ggml-cpu.c:682-1654 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 80 - ggml-cpu.c:1436-1654 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 81 - ggml-cpu.c:1436-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 82 - ggml-cpu.c:1437-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 83 - ggml-cpu.c:1438-1465 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 84 - ggml-cpu.c:1461-1462 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 86 - ggml-cpu.c:1572-1579 - libggml-cpu.so+0.000.000.000.000.000.000.0000.000.000.00
Loop 87 - ggml-cpu.c:1573-1579 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 85 - ggml-cpu.c:1585-1587 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 78 - ggml-cpu.c:533-2891 - libggml-cpu.so [...]0.160.050.050.110.110.030.031810.030.0217.69
Loop 91 - ggml-cpu.c:2087-2088 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
ggml_compute_forward_flash_attn_ext+libggml-cpu.so0.420.060.000.290.020.030.00530.160.10/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.001581.14AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 1436 - vec.h:375-751 - libggml-cpu.so [...]+0.000.050.000.380.000.030.0000.000.000.00
Loop 1437 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1440 - vec.h:677-682 - libggml-cpu.so0.040.000.000.030.030.000.00100.010.017.32
Loop 1445 - ops.cpp:8885-8886 - libggml-cpu.so [...]0.030.000.000.020.020.000.00290.010.01140.58
Loop 1438 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1442 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1443 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1441 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1439 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1444 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1446 - ops.cpp:8759-8881 - libggml-cpu.so [...]+0.120.050.010.330.080.030.01340.030.022299.56
Loop 1464 - vec.h:750-751 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1460 - vec.h:502-503 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1466 - vec.h:740-745 - libggml-cpu.so0.000.000.000.000.000.000.0030.000.000.00
Loop 1457 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1465 - vec.h:750-751 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1459 - vec.h:491-497 - libggml-cpu.so0.360.040.040.250.250.030.03350.100.061606.01
Loop 1458 - vec.h:677-682 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1450 - vec.h:375-381 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1454 - vec.h:375-381 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1463 - vec.h:750-751 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1456 - vec.h:688-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1462 - vec.h:502-503 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1452 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1449 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1455 - vec.h:687-688 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1451 - vec.h:386-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1448 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1453 - vec.h:387-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1461 - vec.h:502-503 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 1447 - vec.h:386-387 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
ggml_vec_dot_f16+libggml-cpu.so0.300.040.000.210.060.020.00320.060.03/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00971.24AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 758 - vec.cpp:311-316 - libggml-cpu.so0.270.030.030.190.190.020.02320.060.03878.66
Loop 757 - vec.cpp:324-325 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)+libggml-cpu.so0.140.030.000.100.020.020.001920.030.02/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.00324.42AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 1266 - ops.cpp:6210-6484 - libggml-cpu.so [...]+0.000.030.000.120.000.020.0050.000.000.00
Loop 1267 - ops.cpp:6210-6484 - libggml-cpu.so [...]+0.010.030.000.120.010.020.00730.010.00328.90
Loop 1274 - ops.cpp:6210-6245 - libggml-cpu.so [...]0.090.020.020.060.060.010.011920.020.01370.49
Loop 1268 - ops.cpp:6365-6484 - libggml-cpu.so [...]+0.040.010.000.050.030.000.001780.010.01332.93
Loop 1272 - ops.cpp:6429-6442 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1269 - ops.cpp:6479-6484 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1273 - ops.cpp:6413-6426 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1270 - ops.cpp:6462-6475 - libggml-cpu.so0.000.000.000.000.000.000.0000.000.000.00
Loop 1271 - ops.cpp:6446-6457 - libggml-cpu.so0.030.000.000.020.020.000.00170.010.01117.87
quantize_row_q8_0+libggml-cpu.so0.100.030.000.070.020.020.001550.020.01/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.000.95AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LLAMAFILE -D GGML_US...
Loop 2149 - quants.c:298-355 - libggml-cpu.so [...]0.090.030.030.060.060.020.021480.020.011.04
f64xsubf128libm.so.60.100.020.020.070.070.010.011920.020.01Math (%): 100.00907.15
int __kmp_barrier_template<false>(barrier_type, int, int, unsigned long, void*, void (*)(void*, void*)) [clone .isra.33]libomp.so0.430.010.010.300.300.010.011910.040.02OMP (%): 100.00196.95
ggml_compute_forward_add_non_quantized+libggml-cpu.so0.320.010.000.220.020.000.00460.050.03/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.0036.02AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 323 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 324 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 325 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 326 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 328 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 327 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 345 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 346 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 348 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 350 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 347 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 349 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 335 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 338 - binary-ops.cpp:10-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 337 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 336 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 339 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 342 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 344 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 343 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 341 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 340 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 317 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 318 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 320 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 321 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 322 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 319 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 329 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 331 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 332 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 333 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 334 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 330 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 351 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.210.000.000.0000.000.000.00
Loop 354 - binary-ops.cpp:10-110 - libggml-cpu.so [...]+0.000.000.000.210.000.000.0000.000.000.00
Loop 356 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 355 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 353 - binary-ops.cpp:10-32 - libggml-cpu.so [...]0.300.000.000.210.210.000.0010.000.00120.08
Loop 352 - binary-ops.cpp:10-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
ggml_compute_forward_mul+libggml-cpu.so0.170.010.000.120.030.000.00440.030.02/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.0037.60AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 415 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 416 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 418 - binary-ops.cpp:18-101 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 417 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 409 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 411 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 412 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 414 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 413 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 410 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 419 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 422 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 421 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 423 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 424 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 420 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 431 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.120.000.000.0000.000.000.00
Loop 432 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 434 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.120.000.000.0000.000.000.00
Loop 436 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 433 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.170.000.000.120.120.000.0010.000.00194.29
Loop 435 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 397 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 400 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 401 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 402 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 399 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 398 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 403 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 404 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 405 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 407 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 406 - ggml-impl.h:518-541 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 408 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 425 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 426 - binary-ops.cpp:18-45 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 428 - binary-ops.cpp:18-110 - libggml-cpu.so [...]+0.000.000.000.000.000.000.0000.000.000.00
Loop 430 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 429 - binary-ops.cpp:31-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 427 - binary-ops.cpp:18-32 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
ggml_vec_swiglu_f32+libggml-cpu.so0.250.000.000.170.000.000.0070.100.06/beegfs/hackathon/users/eoseret/qaas_runs_test/175-950-2189/intel/llama.cpp/build/llama.cpp/../aocc_7/bin/libggml-blas.so (%): 100.005309.66AMD clang version 17.0.6 (CLANG: AOCC_5.0.0-Build#1377 2024_09_24) /home/eoseret/aocc-compiler-5.0.0/bin/clang-17 --driver-mode=g++ -D GGML_BACKEND_BUILD -D GGML_BACKEND_SHARED -D GGML_SCHED_MAX_COPIES=4 -D GGML_SHARED -D GGML_USE_CPU_REPACK -D GGML_USE_LL...
Loop 766 - vec.h:1084-1115 - libggml-cpu.so [...]0.250.000.000.170.170.000.0070.100.065283.19
Loop 763 - vec.cpp:402-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 764 - vec.cpp:402-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
Loop 765 - vec.cpp:402-403 - libggml-cpu.so [...]0.000.000.000.000.000.000.0000.000.000.00
×