Question about running inference on multiple Hailo models

Hello,

I’m currently working with YOLO-based models on the Hailo-8L accelerator, such as yolov11n for detection and yolov8s-pose for keypoint estimation.

I have a couple of questions:

  1. Is it feasible to load two different .hef models, say one for detection and another for pose estimation, and run inference with both sequentially on the same input on a single device?

  2. Also, could you share any code examples or documentation on how to run compiled .hef models directly on Hailo hardware using HailoRT?

I’m trying to get a better grasp of how to run multiple models and manage the performance trade-offs involved.

Thanks so much for your assistance!

Best regards.

Yes, the HailoRT scheduler allows you to do that.

See the HailoRT User Guide for details.
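On your second question, below is a minimal sketch using the HailoRT Python API (hailo_platform) that configures both HEFs on a single VDevice with the scheduler enabled and runs them one after the other on the same dummy frame. The .hef file names are placeholders, and details such as the default vstream format (and therefore the input dtype) vary between HailoRT releases, so treat this as a starting point and check it against the API reference for your installed version.

import numpy as np
from hailo_platform import (HEF, VDevice, ConfigureParams,
                            HailoSchedulingAlgorithm, HailoStreamInterface,
                            InferVStreams, InputVStreamParams,
                            OutputVStreamParams)

# Enable the scheduler so multiple network groups can share one device.
params = VDevice.create_params()
params.scheduling_algorithm = HailoSchedulingAlgorithm.ROUND_ROBIN

with VDevice(params) as target:
    network_groups = []
    for hef_path in ('yolov11n.hef', 'yolov8s_pose.hef'):  # placeholder paths
        hef = HEF(hef_path)
        cfg = ConfigureParams.create_from_hef(
            hef, interface=HailoStreamInterface.PCIe)
        # configure() returns a list of network groups; one per HEF here.
        network_groups.append((hef_path, hef, target.configure(hef, cfg)[0]))

    for hef_path, hef, ng in network_groups:
        in_params = InputVStreamParams.make(ng)
        out_params = OutputVStreamParams.make(ng)
        in_info = hef.get_input_vstream_infos()[0]
        # Dummy batch of one frame; replace with your preprocessed image.
        # The dtype must match the vstream format configured above.
        frame = np.zeros((1, *in_info.shape), dtype=np.float32)
        # Note there is no manual activate() call: with the scheduler
        # enabled, it time-shares the device between the network groups.
        with InferVStreams(ng, in_params, out_params) as pipeline:
            results = pipeline.infer({in_info.name: frame})
            print(hef_path, {name: arr.shape for name, arr in results.items()})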

Our Application Code Examples repository contains a GStreamer-based LPR (license plate recognition) pipeline that uses multiple networks:

GitHub - Hailo Application Code Examples - Multistream LPR

You can use the hailortcli run2 command to test running multiple networks without writing an application:

hailortcli run2 set-net model1.hef set-net model2.hef

Use hailortcli run2 --help to see all the options for tuning the scenario, e.g. fixing the frame rate for a model.
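For example, something like the following caps the first model at 15 FPS while the second runs unconstrained (the --framerate option is taken from recent HailoRT releases; confirm the exact flag name in your version's --help output):

hailortcli run2 set-net model1.hef --framerate 15 set-net model2.hef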

You can use the HailoRT monitor to watch the running networks, similar to htop. In a second terminal, use the following command:

hailortcli monitor

You will need to set the HAILO_MONITOR environment variable before you run your app or the CLI command:

export HAILO_MONITOR=1
hailortcli run2 set-net model1.hef ...