options

Loops Index

Columns Filter

Level Max Thread Time / Walltime run_0 (%) Exclusive Coverage run_0 (%) Inclusive Coverage run_0 (%) Max Exclusive Time Over Threads run_0 (s) Max Inclusive Time Over Threads run_0 (s) Exclusive Time w.r.t. Wall Time run_0 (s) Inclusive Time w.r.t. Wall Time run_0 (s) Nb Threads run_0 Vectorization Ratio (%) Vector Length Use (%) Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing run_0 Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency Speedup If Data in L1 run_0 Level Max Thread Time / Walltime Exclusive Coverage Inclusive Coverage Max Exclusive Time Over Threads Max Inclusive Time Over Threads Exclusive Time w.r.t. Wall Time Inclusive Time w.r.t. Wall Time Nb Threads Vectorization Ratio Vector Length Use Speedup If No Scalar Integer Speedup If FP Vectorized Speedup If Fully Vectorized Speedup If Perfect Load Balancing Stride 0 Stride 1 Stride n Stride Unknown Stride Indirect Array Access Efficiency Speedup If Data in L1
Loop idSource LocationSource FunctionLevelMax Thread Time / Walltime run_0 (%)Exclusive Coverage run_0 (%)Inclusive Coverage run_0 (%)Max Exclusive Time Over Threads run_0 (s)Max Inclusive Time Over Threads run_0 (s)Exclusive Time w.r.t. Wall Time run_0 (s)Inclusive Time w.r.t. Wall Time run_0 (s)Nb Threads run_0Vectorization Ratio (%)Vector Length Use (%)Speedup If No Scalar IntegerSpeedup If FP VectorizedSpeedup If Fully VectorizedSpeedup If Perfect Load Balancing run_0Stride 0Stride 1Stride nStride UnknownStride IndirectArray Access EfficiencySpeedup If Data in L1 run_0
2499libggml-cpu.so - quants.c:101-1042 [...]ggml_vec_dot_q8_0_q8_0Single87.1784.7884.7894.7894.7890.8890.885272.7338.0711.412.611.050200166.672.8
1768libggml-cpu.so - vec.h:491-497ggml_compute_forward_flash_attn_extInnermost0.850.410.410.920.920.440.443710075111.331.502000100.00NA
767libggml-cpu.so - vec.cpp:311-316ggml_vec_dot_f16Single0.730.310.310.790.790.330.333410066.67111.51.5502000100.00NA
2294libggml-cpu.so - sgemm.cpp:138-1044 [...]void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<4>(long, long, long, long)Innermost0.160.100.100.170.170.110.115293.4346.051.061.192.171.622401181.25NA
61libggml-cpu.so - ggml-cpu.c:1125-1395 [...]ggml_compute_forward_mul_matOutermost0.160.080.190.170.360.090.2152011.49112.061.96NANANANANA0.00NA
2489libggml-cpu.so - quants.c:298-355 [...]quantize_row_q8_0Single0.150.080.080.160.160.080.085259.8329.331.231.673.441.9702000100.00NA
1761libggml-cpu.so - vec.h:375-751 [...]ggml_compute_forward_flash_attn_extInBetween0.180.080.490.191.080.080.523349.6445.961.591.021.461.48NANANANANA0.00NA
2306libggml-cpu.so - sgemm.cpp:138-1044 [...]void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<2>(long, long, long, long)Innermost0.110.070.070.120.120.070.075290.7744.471.041.172.151.640401175.00NA
67libggml-cpu.so - ggml-cpu.c:1193-1194ggml_compute_forward_mul_matInnermost0.090.050.050.100.100.060.0652012.51181.8210000100.00NA
64libggml-cpu.so - ggml-cpu.c:1164-1198 [...]ggml_compute_forward_mul_matInBetween0.090.050.110.100.210.050.1152011.37112.061.94NANANANANA0.00NA
82libggml-cpu.so - ggml-cpu.c:533-2891 [...]ggml_graph_compute_threadSingle0.060.030.030.060.060.030.035209.821114.642.27NANANANANA0.00NA
2297libggml-cpu.so - sgemm.cpp:138-1044 [...]void (anonymous namespace)::tinyBLAS_Q0_AVX<block_q8_0, block_q8_0, float>::gemm4xN<3>(long, long, long, long)Innermost0.080.020.020.090.090.020.025292.3145.31.041.172.153.70501178.57NA
1437libggml-cpu.so - ops.cpp:6220-6245 [...]ggml_compute_forward_rope_f32(ggml_compute_params const*, ggml_tensor*, bool)Innermost0.040.010.010.040.040.020.024815.569.171.091.095.832.412000100.00NA
1760libggml-cpu.so - ops.cpp:8885-8886 [...]ggml_compute_forward_flash_attn_extInnermost0.050.010.010.050.050.010.013106.2511162.540200166.67NA
×