Evaluation of the running time of each component of the network

zhao_yj · July 9, 2025, 2:46am

Hello, I deployed my convolutional neural network on the Raspberry Pi 5 AI Kit (26TOPS), with the Hailort version being 4.20.

However, I now want to optimize the running speed of my network. One feasible method is to examine the compatibility between my network and Hailo8,

That is, I want to check the actual running time of each component during a single inference, and obtain the data as shown in the following figure. Are there any recommended methods?

KlausK · July 9, 2025, 3:01am

You can create a profiler report. It will give you the FPS and other data of each layer. Check the Hailo Dataflow Compiler User Guide or run the tutorials in the Hailo AI Software Suite Docker with the following command to find out how to create a profiler report:

hailo tutorial

You can also run:

hailo profiler --help

Note: The Hailo Dataflow Compiler can implement the same layer running faster or slower by using different amount of resources. So the same layer will run faster in a network that is smaller because the there are more resources available for each layer.

To get the maximum performance for a specific network you can use the performance mode of the compiler. This method of compilation will require significantly longer time to complete, because the compiler tries to use very high utilization levels, that might not allocate successfully. If it fails to allocate, it automatically tries lower utilization, until it finds the highest possible utilization. See Hailo Dataflow Compiler User Guide.

Topic		Replies	Views
Hailo chip utilization rate General raspberry-pi , hailo8	2	91	May 31, 2025
Poor performance of Hailo8L and Rpi5 General raspberry-pi , performance	6	889	March 20, 2025
How to improve processing speed General raspberry-pi , hailo8	1	45	June 25, 2025
Inference Performance Issue of Hailo-8L on RPi5 General	5	274	December 16, 2024
Benchmark Performance General raspberry-pi , hailo8	7	942	October 9, 2024

Evaluation of the running time of each component of the network

Related topics