I have successfully converted my model from ONNX to HEF and deployed it on the Hailo8L. While the model runs without issues, the inference results are significantly worse compared to the ONNX outputs. The performance gap is much larger than expected.
I have already verified that the input format remains RGB throughout the process, so I’m unsure what could be causing such a drastic difference.
Could you help identify potential reasons for this discrepancy? Also, would it be possible for me to share both the ONNX and HEF models with you so you can check if there are any issues?
Hi @joy.yen
Did you evaluate the mask mAP on a validation dataset for both the float ONNX and the HEF? This will give a hint on how much actual degradation there is. While some loss due to quantization is expected, if the mAP drop is much larger than that, then pre-processing, calibration, or some compiler setting could be the issue.
Hi @shashi
I haven’t evaluated the mask mAP. The HEF evaluation method only provides FPS and latency. Does this mean that we need to implement our own evaluation method?
Hi @joy.yen
Yes, you should implement a box/mask mAP evaluation method. Since you trained the model yourself, you should already have the eval script and validation dataset.
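As a lighter-weight sanity check before wiring up full COCO-style mAP (e.g. with pycocotools), you can compare mean mask IoU on the validation set for both backends. A minimal sketch (the function name and threshold are my own choices, not from any Hailo API):

```python
import numpy as np

def mask_iou(pred_probs, gt_mask, thresh=0.5):
    """IoU between a predicted mask (float probabilities) and a binary GT mask.

    pred_probs: float array, e.g. sigmoid outputs in [0, 1]
    gt_mask:    binary array of the same shape
    thresh:     probability cutoff for binarizing the prediction
    """
    pred = pred_probs >= thresh
    gt = gt_mask.astype(bool)
    inter = np.logical_and(pred, gt).sum()
    union = np.logical_or(pred, gt).sum()
    # Both masks empty: define IoU as 1.0 (perfect agreement)
    return inter / union if union else 1.0
```

Run the same validation images through the ONNX model and the HEF (after dequantizing the HEF output to float), compute the mean IoU for each, and the gap quantifies the degradation independently of the full mAP pipeline.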
Hi @joy.yen
Hailo devices run quantized models, so their inputs and outputs are quantized. Each output tensor carries its quantization information so that you can convert it back to floating-point values. The formula is `float_value = scale * (quant_value - zero_point)`.
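That formula in code, as a minimal sketch. The `scale` and `zero_point` values below are made up for illustration; in practice you read them from the HEF output tensor's quantization info at runtime:

```python
import numpy as np

# Example quantization parameters -- in a real pipeline these come from
# the output tensor's quantization info, not hard-coded values.
scale = 0.0039
zero_point = 0

# Raw uint8 output from the device
quant_output = np.array([0, 64, 128, 255], dtype=np.uint8)

# Dequantize: float_value = scale * (quant_value - zero_point)
float_output = scale * (quant_output.astype(np.float32) - zero_point)
```

With a sigmoid last layer, the dequantized values land back in roughly [0, 1], recovering the probabilities the original float model produced.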
hi @shashi
But my output is uint8 in the range 0-255, and I can't get any floating-point values out. Is there any way to make Hailo output the original model's output instead of uint8 pixel values?
Hi @joy.yen
In your original model, what do the output values mean? You mentioned the last layer is a sigmoid. Can you share more details on the output tensor size and its interpretation?