How to Replicate TFLite’s int8 Quantization in Hailo Optimization?

I achieved satisfactory performance when quantizing a TensorFlow model to an int8 TFLite model with 600,000 representative samples, using:

converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
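
For completeness, the full conversion flow looks roughly like this (representative_samples and saved_model_dir are placeholders for my actual data pipeline and model path):

import tensorflow as tf

def representative_dataset():
    # Placeholder for my real 600,000-sample pipeline; each yielded
    # sample is a float32 array that already includes a batch dimension.
    for sample in representative_samples:
        yield [sample.astype("float32")]

converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.representative_dataset = representative_dataset
tflite_model = converter.convert()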

How can I apply the same quantization approach using Hailo’s optimization? Would the following configuration work?

model_script_lines = [
    # Basic flavor: plain calibration only, no compression.
    "model_optimization_flavor(optimization_level=0, compression_level=0)\n",
    # Intended to calibrate on all 600,000 samples, one at a time.
    "model_optimization_config(calibration, batch_size=1, calibset_size=600000)\n",
]
runner.load_model_script("".join(model_script_lines))
runner.optimize(calib_dataset)

Any feedback or suggestions would be appreciated!

The Hailo model conversion starts from floating-point models in TFLite or ONNX format.
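
For example, parsing a floating-point ONNX model into a Hailo archive looks roughly like this (a sketch; the model path, network name, and hardware architecture are placeholders):

from hailo_sdk_client import ClientRunner

# Target architecture is a placeholder; use the one matching your device.
runner = ClientRunner(hw_arch="hailo8")

# Translate the floating-point ONNX model into Hailo's internal representation.
hn, npz = runner.translate_onnx_model("model.onnx", "my_model")
runner.save_har("my_model.har")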

To understand the workflow, I would recommend starting by working through the tutorials in the Hailo AI Software Suite Docker. Just call:

hailo tutorial

This will start a Jupyter Notebook server with notebooks for each step.

Model optimization requires a certain number of images, depending on which optimization level you are using. You can find more details in the Hailo Dataflow Compiler User Guide. More is not always better.
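
As a sketch, these knobs are set via the model script; the values below are illustrative only, and the User Guide lists the calibration set sizes each optimization level actually needs:

# Illustrative values, not recommendations; consult the DFC User Guide
# for the calibration set size your chosen optimization level requires.
model_script = (
    "model_optimization_flavor(optimization_level=2)\n"
    "model_optimization_config(calibration, batch_size=8, calibset_size=1024)\n"
)
runner.load_model_script(model_script)
runner.optimize(calib_dataset)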

Thanks for your reply.

I followed the Hailo DFC tutorial documentation and noticed that with the default settings, only a subset of my entire dataset is being used for calibration. As a result, I observed a performance degradation in the quantized model. In contrast, when quantizing a TensorFlow model to an int8 TFLite model, the process uses every sample from the representative dataset with TFLite's default quantization method, and I didn't see any performance loss.

I would like to apply TFLite's basic quantization approach to Hailo. How can I configure Hailo's optimization to replicate this behavior?
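
Concretely, what I am currently trying is to match calibset_size to the full dataset length (the path and array layout here are placeholders for my data):

import numpy as np

# Full representative set; placeholder path and layout for my data.
calib_dataset = np.load("full_calib_set.npy")  # shape: (600000, H, W, C)

# My guess: set calibset_size to the dataset length so every sample
# is seen during calibration, as TFLite does.
runner.load_model_script(
    "model_optimization_config(calibration, batch_size=1, "
    f"calibset_size={len(calib_dataset)})\n"
)
runner.optimize(calib_dataset)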

Any suggestions or insights would be greatly appreciated!