Where to quantize inputs

Hey @geoff

Welcome to the Hailo Community!

The mismatch you’re seeing is likely due to the input format defined in your HEF file. If it’s set to HAILO_FORMAT_TYPE_UINT8, the API expects quantized uint8 data, not float32. Here are a few ways to address this:

1. Automatic Quantization: If you prefer HailoRT to handle quantization, request FLOAT32 as the user buffer format when creating your virtual stream parameters:

auto input_params = network_group->make_input_vstream_params(
    false /* quantized=false: HailoRT converts on the host */,
    HAILO_FORMAT_TYPE_FLOAT32,
    HAILO_DEFAULT_VSTREAM_TIMEOUT_MS, HAILO_DEFAULT_VSTREAM_QUEUE_SIZE);

This allows you to pass float32 data directly, and HailoRT will quantize it using the qp_scale and qp_zp values from your HEF file.
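
If it helps, here is a minimal end-to-end sketch of that flow using VStreamsBuilder (error handling abbreviated; "model.hef" is a placeholder path for your compiled model):

#include "hailo/hailort.hpp"
#include <iostream>

int main() {
    // Create a virtual device and load the HEF.
    auto vdevice = hailort::VDevice::create();
    if (!vdevice) { return vdevice.status(); }

    auto hef = hailort::Hef::create("model.hef");
    if (!hef) { return hef.status(); }

    auto network_groups = vdevice.value()->configure(hef.value());
    if (!network_groups || network_groups.value().empty()) { return 1; }
    auto network_group = network_groups.value().at(0);

    // quantized=false + FLOAT32: HailoRT quantizes inputs and dequantizes
    // outputs on the host using the qp_scale/qp_zp stored in the HEF.
    auto vstreams = hailort::VStreamsBuilder::create_vstreams(
        *network_group, false, HAILO_FORMAT_TYPE_FLOAT32);
    if (!vstreams) { return vstreams.status(); }

    auto &input_vstreams = vstreams.value().first;   // accept float32 frames
    auto &output_vstreams = vstreams.value().second; // produce float32 results
    std::cout << "Inputs: " << input_vstreams.size()
              << ", outputs: " << output_vstreams.size() << std::endl;
    return 0;
}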
2. Manual Quantization: If you want to quantize inputs yourself, use the qp_scale and qp_zp parameters from your HEF file:

for (size_t i = 0; i < buffer_size; i++) {
    // Clamp to [0, 255] before casting so out-of-range values don't wrap around.
    float q = (input_buffer[i] / qp_scale) + qp_zp;
    q = (q < 0.0f) ? 0.0f : (q > 255.0f) ? 255.0f : q;
    quantized_buffer[i] = (uint8_t)q;
}
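
You don't need to hard-code qp_scale and qp_zp; at runtime they're carried on the vstream info. A small sketch, assuming input_vstream is a hailort::InputVStream you've already created:

// Read the quantization parameters the HEF recorded for this input.
const auto info = input_vstream.get_info();
const float qp_scale = info.quant_info.qp_scale;
const float qp_zp = info.quant_info.qp_zp;
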
3. Output Dequantization: If you need float32 results from a quantized uint8 output, invert the same mapping:
for (size_t i = 0; i < output_size; i++) {
    dequantized_output[i] = (float32_t)(quantized_output[i] - qp_zp) * qp_scale;
}
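
The same applies on the output side; assuming output_vstream is your hailort::OutputVStream:

// The output's quantization parameters live on its vstream info too.
const auto out_info = output_vstream.get_info();
const float qp_scale = out_info.quant_info.qp_scale;
const float qp_zp = out_info.quant_info.qp_zp;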

Remember, when using async inference, ensure your buffer format matches the expected type (uint8 or float32).
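
If you're on a recent HailoRT release with the InferModel async API, you can also request float32 user buffers there and let HailoRT convert internally. A sketch, assuming a VDevice named vdevice and a placeholder HEF path (please check the async inference example shipped with your HailoRT version):

auto infer_model_exp = vdevice->create_infer_model("model.hef");
if (infer_model_exp) {
    auto infer_model = infer_model_exp.release();
    // Request float32 user buffers; HailoRT handles (de)quantization internally.
    infer_model->input()->set_format_type(HAILO_FORMAT_TYPE_FLOAT32);
    infer_model->output()->set_format_type(HAILO_FORMAT_TYPE_FLOAT32);
}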

The key is to either configure your model to handle float32 inputs through HailoRT’s virtual streams, or manually handle quantization if uint8 input is expected.

If you need more details on C++ API usage or setting up virtual streams, feel free to ask. Good luck with your implementation!