Are there recommendations to what is the best practice when running a model through the Dataflow Compiler pipeline? Parsing, optimization, compilation etc.
Parsing -
- if using an ONNX model, the ONNX should be simplified using onnx-simplifier Python package.
- Mostly for detection models (but not only), end nodes of the model should be the last neural operations which after it the postprocess operation start.
Optimization\quantization - - The model should have the real pre-trained weights.
- The calibration set should be with real images that are a subset of the training dataset.
- Calibration set images should either be normalized before running optimization, or by adding a normalization layer using an alls command.
- The model should be trained with Batch Normalization in order to limit the range of the output values from the activations layers.
Compilation -
Not always necessary, but to get the best FPS from a compiled model, you can use the following alls command:
performance_param(compiler_optimization_level=max)
How to download it ? I need it