Multiple Models on Hailo8 with hailort

Hi all,

Is there example code for running multiple models on one Hailo-8 device using the C++ API of HailoRT? In my case, object detection + image classification on two different streams is required.

This comment in the example code

// Constructor for when using multiple models on the same device
AsyncModelInfer(const std::string &hef_path, const std::string &group_id);

hints that it should be possible using a VDevice and passing a group_id. But the documentation on these classes/methods is relatively thin. Any further hints on docs or reference examples for such an application would be appreciated.

Any replies appreciated.

Best regards,

Wolfram

Hey @Wolfram_Strothmann,

Great to see you back in the Hailo Community!

For what you’re looking to do, there’s actually a really good C++ example that shows how to work with multiple HEFs. Check this one out:

Here’s how you’d approach it:

To run multiple models on a single Hailo-8 device using the HailoRT C++ API, you'll want to use the VDevice (Virtual Device) abstraction along with a group_id when setting up your models. This lets the HailoRT Model Scheduler coordinate the models, automatically switching the device between them.

  • VDevice: This abstracts your physical device(s) into a single logical device - it works even if you’re using just one Hailo-8
  • group_id: This ties the VDevices created for each model to the same physical device, so both models (e.g. object detection + classification) share one scheduler and can run concurrently - see the sketch after this list
  • AsyncModelInfer: The class from our application examples that handles inference with async scheduling
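
For reference, this is roughly what the group_id wiring looks like if you create the VDevice yourself - just a minimal sketch, assuming the standard hailort C++ header; create_shared_vdevice is a hypothetical helper name, not part of the API:

#include "hailo/hailort.hpp"

using namespace hailort;

// Hypothetical helper: every VDevice created with the same group_id
// (even from another process) shares the same physical Hailo-8
Expected<std::unique_ptr<VDevice>> create_shared_vdevice(const std::string &group_id)
{
    hailo_vdevice_params_t params = HailoRTDefaults::get_vdevice_params();
    params.group_id = group_id.c_str(); // same string for every model
    // Round-robin scheduling is the default; set explicitly for clarity
    params.scheduling_algorithm = HAILO_SCHEDULING_ALGORITHM_ROUND_ROBIN;
    return VDevice::create(params);
}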

Here’s the basic structure:

// A shared group_id ties every model to the same physical device
std::string group_id = "multi_model_app";

// No explicit VDevice is needed here - the AsyncModelInfer constructor
// sets one up per model, and the common group_id makes them share the
// one Hailo-8
auto model1 = std::make_shared<AsyncModelInfer>("yolov8s.hef", group_id);
auto model2 = std::make_shared<AsyncModelInfer>("yolov5-seg.hef", group_id);

// Run inference on each model; the scheduler interleaves them on the device
model1->infer(...);
model2->infer(...);
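
And if you’d rather not depend on the AsyncModelInfer wrapper at all, the same idea can be written directly against HailoRT’s InferModel async API. Below is a minimal sketch, not taken from our examples - it assumes a recent HailoRT release (4.16+), uses placeholder HEF file names, and binds plain zero-filled host buffers just to show the mechanics:

#include "hailo/hailort.hpp"

#include <chrono>
#include <cstdint>
#include <string>
#include <vector>

using namespace hailort;

// Allocate a host buffer for every input/output stream of a model and
// attach it to the bindings. The buffers must outlive the inference job.
// (Production code should page-align these buffers for DMA; plain
// vectors are enough for a sketch.)
static std::vector<std::vector<uint8_t>> bind_all_streams(
    InferModel &model, ConfiguredInferModel::Bindings &bindings)
{
    std::vector<std::vector<uint8_t>> buffers;
    for (const auto &input : model.inputs()) {
        buffers.emplace_back(input.get_frame_size());
        bindings.input(input.name())->set_buffer(
            MemoryView(buffers.back().data(), buffers.back().size()));
    }
    for (const auto &output : model.outputs()) {
        buffers.emplace_back(output.get_frame_size());
        bindings.output(output.name())->set_buffer(
            MemoryView(buffers.back().data(), buffers.back().size()));
    }
    return buffers;
}

int main()
{
    // One VDevice for the whole process; the Model Scheduler
    // multiplexes both models onto the single Hailo-8
    auto vdevice = VDevice::create().expect("Failed to create VDevice");

    // "detection.hef" / "classification.hef" are placeholder file names
    auto detection = vdevice->create_infer_model("detection.hef")
        .expect("Failed to load detection HEF");
    auto classification = vdevice->create_infer_model("classification.hef")
        .expect("Failed to load classification HEF");

    auto configured_det = detection->configure().expect("Failed to configure detection");
    auto configured_cls = classification->configure().expect("Failed to configure classification");

    auto det_bindings = configured_det.create_bindings().expect("Failed to create bindings");
    auto cls_bindings = configured_cls.create_bindings().expect("Failed to create bindings");
    auto det_buffers = bind_all_streams(*detection, det_bindings);
    auto cls_buffers = bind_all_streams(*classification, cls_bindings);

    // In a real app each camera stream would fill its model's input
    // buffer per frame; launching both jobs lets the scheduler
    // interleave the two models on the device
    auto det_job = configured_det.run_async(det_bindings).expect("run_async failed");
    auto cls_job = configured_cls.run_async(cls_bindings).expect("run_async failed");

    det_job.wait(std::chrono::milliseconds(1000));
    cls_job.wait(std::chrono::milliseconds(1000));
    return 0;
}

Since your two streams are independent, you can also drive each model from its own thread - the scheduler time-slices the device between them either way.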

Hope this helps!