Hi @shashi,
Congratulations for the great job!
The DeGirum PySDK tool looks very nice.
I personally tested the Hailo integration and it works well.
In order to set it up and perform inference on the Hailo-8 locally, I also had to enable the HailoRT Multi-Process service, as per HailoRT documentation:
sudo systemctl enable --now hailort.service  # for Ubuntu
This was not mentioned in your GitHub, so please double check if it is necessary (if DeGirum use the multi-process service, the user should check whether the service is enabled)
I have a question about the implementation (feel free not to answer, if you are not allowed to). Thanks to its dataflow architecture, Hailo-8 can run inference in synchronous (blocking) or asynchronous (non-blocking) mode. Especially for single-context models, asynchronous or multi-threaded inference allows to use the Hailo-8 NN core at its best, achieving higher throughput.
Does DeGirum make use of this functionality?
Again, congratulations for the great tool you developed!
