Simplifying Instance Segmentation on a Hailo Device Using DeGirum PySDK
This guide demonstrates how to leverage PySDK’s built-in segmentation postprocessing—integrated in C++—to run YOLOv8/YOLO11 segmentation models on Hailo devices. With minimal configuration, you can run inference and visualize segmentation outputs, including class labels. Although this example uses a model trained on the COCO dataset, the method works for any YOLOv8/YOLO11 segmentation model with appropriate modifications.
Tip: If you are new to PySDK, consider reviewing User Guide 3: Simplifying Object Detection on a Hailo Device Using DeGirum PySDK before diving into segmentation.
Overview of the Inference Pipeline
For segmentation models, the inference pipeline consists of:
- Pre-Processing: Resize and format the input image (e.g., letterbox padding, bilinear interpolation, quantization) to match the model's requirements.
- Inference: Run the YOLOv8/YOLO11 segmentation model (compiled into a .hef file) on the Hailo device.
- Post-Processing: The integrated C++ postprocessing converts the model's raw outputs into a segmentation mask overlay. To enable it, specify "OutputPostprocessType": "SegmentationYoloV8" in the JSON configuration. In this guide, a COCO labels file is provided for human-readable output.
- Visualization: The processed segmentation overlay is provided as an image, which can be displayed using tools such as OpenCV.
A simple diagram of the pipeline:
Input Image
│
▼
Pre-Processing (resize, letterbox, quantize)
│
▼
Model Inference (.hef file on Hailo device)
│
▼
Built-in Post-Processing (SegmentationYoloV8 in C++)
│
▼
Segmentation Overlay (with COCO labels)
│
▼
Visualization (e.g., via OpenCV)
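PySDK performs the pre-processing stage internally, driven by the JSON configuration shown later in this guide, so you never have to implement it yourself. For intuition, here is a minimal sketch of what letterbox padding with bilinear resizing looks like (an illustration only, not PySDK's actual implementation; the gray padding value of 114 is an assumption borrowed from common YOLO tooling):

import cv2
import numpy as np

def letterbox(image, target=640):
    """Resize with preserved aspect ratio, then pad to target x target."""
    h, w = image.shape[:2]
    scale = target / max(h, w)
    resized = cv2.resize(image, (int(w * scale), int(h * scale)),
                         interpolation=cv2.INTER_LINEAR)  # bilinear resize
    canvas = np.full((target, target, 3), 114, dtype=np.uint8)  # gray padding
    top = (target - resized.shape[0]) // 2
    left = (target - resized.shape[1]) // 2
    canvas[top:top + resized.shape[0], left:left + resized.shape[1]] = resized
    return canvas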
What You’ll Need
Ensure you have the following prerequisites:
- Hailo AI Accelerator: A Hailo8 or Hailo8L device. The host system can be x86 or Arm-based (e.g., Raspberry Pi).
- Drivers and Software Tools: Install the necessary drivers and follow the Hailo + PySDK setup instructions.
- Segmentation Model File (.hef): A YOLOv8/YOLO11 segmentation model trained on COCO and compiled into a .hef file. For example, you can use yolov8n_seg.hef, available from the Hailo Model Zoo.
- Input Image: An image on which to run segmentation. For instance, download this Cat Image. Feel free to experiment with your own images.
- COCO Labels File (labels_coco.json): A file mapping the 80 COCO classes to human-readable labels. You can download this from Hugging Face or another trusted source; a sketch of the expected format follows this list.
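If you prefer to create the labels file yourself, the sketch below shows its general shape, assuming PySDK expects a simple JSON object mapping class indices to class names (verify the exact schema against your PySDK version's documentation):

import json

# A minimal sketch of labels_coco.json, assuming an index-to-name mapping.
# Only the first few of the 80 COCO classes are shown here.
labels = {
    "0": "person",
    "1": "bicycle",
    "2": "car",
    "3": "motorcycle",
    # ... remaining COCO classes ...
}

with open("labels_coco.json", "w") as f:
    json.dump(labels, f, indent=2)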
Summary
We’ll walk you through the key steps to run segmentation inference on a Hailo device using DeGirum PySDK:
- Configuring the Model JSON File: Set up your JSON file to define pre-processing parameters, specify the segmentation model file, and enable the built-in C++ postprocessing (using "OutputPostprocessType": "SegmentationYoloV8"). This section also covers setting the number of classes, a confidence threshold, and the "SigmoidOnCLS" flag if required by your model.
- Preparing the Model Zoo: Organize your model assets, including the JSON configuration file, the .hef model file, and the COCO labels file, into a structured directory for easy access and management by PySDK.
- Running Inference: Load the segmentation model from the model zoo, execute inference on an input image, and obtain a segmentation overlay that represents the segmentation masks along with human-readable class labels.
- Visualizing the Output: Use tools such as OpenCV to display the segmentation overlay so you can review and analyze the segmented regions.
By following these steps, you can seamlessly deploy and visualize YOLOv8/YOLO11 segmentation models on Hailo devices using PySDK.
Configuring the Model JSON File
Because the segmentation postprocessing is integrated into PySDK in C++, the JSON configuration is straightforward. In addition to specifying the pre-processing parameters and the model file, you will also provide the COCO labels file, the number of classes, and a confidence threshold to filter out low-probability detections.
Example Model JSON (yolov8n_seg.json)
{
"ConfigVersion": 10,
"DEVICE": [
{
"DeviceType": "HAILO8",
"RuntimeAgent": "HAILORT",
"SupportedDeviceTypes": "HAILORT/HAILO8"
}
],
"PRE_PROCESS": [
{
"InputType": "Image",
"InputN": 1,
"InputH": 640,
"InputW": 640,
"InputC": 3,
"InputPadMethod": "letterbox",
"InputResizeMethod": "bilinear",
"InputQuantEn": true
}
],
"MODEL_PARAMETERS": [
{
"ModelPath": "yolov8n_seg.hef"
}
],
"POST_PROCESS": [
{
"OutputPostprocessType": "SegmentationYoloV8",
"LabelsPath": "labels_coco.json",
"OutputNumClasses": 80,
"OutputConfThreshold": 0.3,
"SigmoidOnCLS": true
}
]
}
Key Points
- Pre-Processing Section: The input image is resized to 1 x 640 x 640 x 3 using letterbox padding and bilinear interpolation, with quantization enabled.
- Model Parameters Section: Specifies the segmentation model file (yolov8n_seg.hef).
- Post-Processing Section:
  - "OutputPostprocessType": "SegmentationYoloV8" activates the built-in C++ segmentation postprocessing. This setting works for both YOLOv8 and YOLO11 models.
  - "LabelsPath": "labels_coco.json" provides the COCO labels for human-readable output.
  - "OutputNumClasses": 80 specifies the number of classes in the model.
  - "OutputConfThreshold": 0.3 filters out detections below the confidence threshold; it can also be overridden at runtime, as shown below.
- Understanding SigmoidOnCLS: The "SigmoidOnCLS": true flag indicates that a sigmoid activation is applied on certain output layers. This flag is necessary when models are compiled with vendor-specific settings that apply sigmoid activations; adjust it as needed for your model.
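Postprocessing parameters set in the JSON act as defaults; some can be overridden at runtime on the loaded model object. A short sketch, assuming PySDK's output_confidence_threshold model property:

import degirum as dg

model = dg.load_model(
    model_name='yolov8n_seg',
    inference_host_address='@local',
    zoo_url='<path_to_model_zoo>'
)
# Raise the confidence threshold for this session only;
# the value stored in the JSON file is left untouched.
model.output_confidence_threshold = 0.5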
Preparing the Model Zoo
A model zoo is a structured repository of model assets (configuration JSON files, model files, post-processor code, and labels) that simplifies model management. To organize your assets:
- Save the JSON configuration as yolov8n_seg.json.
- Place the segmentation model file (yolov8n_seg.hef) in the same directory.
- Include the COCO labels file as labels_coco.json.
Your directory structure might look like:
/path/to/model_zoo/
├── yolov8n_seg.json
├── yolov8n_seg.hef
└── labels_coco.json
Tip: For easier maintenance, you can organize models into separate subdirectories. PySDK automatically searches for model JSON files in all subdirectories of the directory specified by zoo_url.
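To confirm that PySDK can discover your model, you can list the models visible in the zoo directory. A quick check, assuming degirum.list_models accepts the same host and zoo arguments as load_model:

import degirum as dg

# Should include 'yolov8n_seg' if the model zoo is laid out correctly.
print(dg.list_models(
    inference_host_address='@local',
    zoo_url='<path_to_model_zoo>'
))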
Running Inference
Once your model zoo is set up, running inference is nearly identical to the process for detection models. The inference output includes an overlay image with segmentation masks and COCO labels.
Python Code Example
import degirum as dg
import cv2
# Load the segmentation model from the model zoo.
# Replace '<path_to_model_zoo>' with the directory path to your model assets.
model = dg.load_model(
    model_name='yolov8n_seg',
    inference_host_address='@local',
    zoo_url='<path_to_model_zoo>'
)
# Run inference on an input image.
# Replace '<path_to_input_image>' with the actual path to your image.
inference_result = model('<path_to_input_image>')
# The segmentation overlay (with masks and labels) is available via the image_overlay attribute.
cv2.imshow("Segmentation Output", inference_result.image_overlay)
# Wait until the user presses 'x' or 'q' to close the window.
while True:
    key = cv2.waitKey(0) & 0xFF
    if key == ord('x') or key == ord('q'):
        break
cv2.destroyAllWindows()
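Beyond the overlay image, the result object also exposes the detections programmatically. A minimal sketch, assuming each entry in the result's results list is a dictionary carrying at least a label and a score (exact keys can vary across PySDK versions):

# Print the class label and confidence of each segmented object.
for obj in inference_result.results:
    print(obj.get('label'), obj.get('score'))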
Expected Output
The displayed window should show the input image with segmentation masks overlaid. Each segment will be highlighted with distinct colors, and the COCO class labels will be visible on the overlay.
Example output showing a cat image with segmented regions and labeled classes.
Troubleshooting and Debug Tips
- Verify File Paths: Ensure that the JSON configuration file, model file, and labels file are in the correct locations, and that the paths specified in the JSON (e.g., "ModelPath" and "LabelsPath") match the actual file names; see the sketch after this list.
- Input Dimensions: Confirm that the dimensions specified in the PRE_PROCESS section (e.g., 640×640) match your model's input requirements.
- Number of Classes: Double-check that "OutputNumClasses" is set to the number of classes your model detects.
- SigmoidOnCLS Flag: If you see unexpected behavior in the postprocessed output, verify that the "SigmoidOnCLS" flag matches your model's compiled settings.
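For the file-path check, a few lines of standard-library Python can verify the zoo layout before you load the model (the file names below match this guide's example; adjust them for your own assets):

from pathlib import Path

zoo = Path('<path_to_model_zoo>')  # the same directory you pass as zoo_url
for name in ('yolov8n_seg.json', 'yolov8n_seg.hef', 'labels_coco.json'):
    status = 'OK' if (zoo / name).is_file() else 'MISSING'
    print(f'{name}: {status}')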
Conclusion
This guide has shown you how to run YOLOv8/YOLO11 segmentation models on Hailo devices using DeGirum PySDK. By simply specifying "OutputPostprocessType": "SegmentationYoloV8", providing a COCO labels file (labels_coco.json), and correctly setting parameters such as "OutputNumClasses", "OutputConfThreshold", and "SigmoidOnCLS", you enable the built-in C++ segmentation postprocessing, which converts raw model outputs into visually interpretable segmentation masks with human-readable labels.
The method outlined here also applies to any custom YOLOv8/YOLO11 segmentation model compiled for Hailo devices; simply adjust the JSON configuration (especially the labels file and number of classes) to match your model's specifications.