When running pose_estimation.py example on my raspberry pi 5 for yolo pose estimation, the resulting pose output is very shaky, compared to running yolo pose on my pc. This in turn seems to negatively affect the accuraccy for an st-gcn classification model I am running at the same time (on cpu). Is this shakyness normal?
Also if I compile yolo pose using the compiler, will there be any improvement in the shakyness?
This is likely due to the quantization. The model outputs are quantized to 8-bit. You can see fluctuations also in the floating-point model on the CPU. However because you have more bits available the changes are visually less noticeable.
You can try the following two options.
Filter the outputs before you send them to your second model. This will smooth the transition and can be done in the application.
Quantize the last layers of the model to 16-bit. This will give your model a finer resolution. This will require you to convert the model using the Hailo Dataflow Compiler.
looking at the tutorials, there are options for choosing the quantization level for some percentage of all weights. However, I couldn’t find any way for specifically disabling quantization for specific layers. Could you help with that?
You can have a look at the model script files in the Model Zoo. I just found that the yolov8m_pose and yolov8s_pose model in the Model Zoo already use the quantization parameter command.
It seems that the model in the example was specifically built for the Raspberry Pi without using this command. Using this will affect performance especially with the single PCIe lane.
great that helps a lot! So is there any difference building the model using the modelzoo cli or using the jupyter notebooks? Which approach should I choose? Finally, is the step for data calibration necessary? If so, should i use coco dataset for data calibration or my own data?
The Jupyter notebooks are the way to start. They are interactive and make it easy to experiment and understand the whole workflow.
The Model Zoo flow is very easy especially when you use our retraining docker and popular models like Yolo.
Using the CLI allows you to do some quick tests.
Using a python script provides the best flexibility and full automation of the conversion process. You can add your own validation and integrate everything as you see fit.
Yes, calibration is necessary. You can do a quick calibration if you are less interested in accuracy and want to evaluate the FPS of a model.
To get the best accuracy you want to use data that is representative of the data the model will see when it is running inference.
We use the public dataset like COCO because they are available to everyone and are a good starting point for evaluation.
Due to the different output format (16-bit vs 8-bit) the post processing on gstreamer doesnt work. As a result I can’t get the keypoints, just the bounding box. Can you help with that?