Error converting model when `num_classes=1`

hgaiser · April 28, 2025, 8:51am

I’m trying to convert a semantic segmentation model to HEF, which works if the num_classes > 1 but not if num_classes = 1. Here’s a simple script to test:



import numpy as np
import onnx
import segmentation_models_pytorch as smp
import torch
import torchvision
import timm
from hailo_sdk_client import ClientRunner
from onnxsim import simplify

# WORKS
# model = smp.FPN("resnet18", classes=2)
# model = torchvision.models.resnet18(num_classes=1)
# model = timm.create_model("resnet18", pretrained=True, num_classes=1)

# DOESN'T WORK
model = smp.FPN("resnet18", classes=1)

torch.onnx.export(model, torch.randn(1, 3, 512, 512), "fpn.onnx")
onnx_model_simplified, check = simplify("fpn.onnx")
onnx.save_model(onnx_model_simplified, "fpn_simplified.onnx")

runner = ClientRunner(hw_arch="hailo8l")
runner.translate_onnx_model("fpn_simplified.onnx", "fpn")
runner.load_model_script("input_normalization1 = normalization([123.675, 116.28, 103.53], [58.395, 57.12, 57.375])\n")
runner.optimize(np.random.randint(0, 255, (128, 512, 512, 3), dtype=np.uint8))
runner.compile()

The error I get:

[error] Mapping Failed (allocation time: 1m 20s)
Compiler could not find a valid partition to contexts. Most common error is: Automri finished with too many resources on context_2 with 24/88 failures.

[error] Failed to produce compiled graph
[error] BackendAllocatorException: Compilation failed: Compiler could not find a valid partition to contexts. Most common error is: Automri finished with too many resources on context_2 with 24/88 failures.

Like I mentioned, setting the num_classes = 2 does work. I tried this both on my laptop (Arch Linux) as well as the DFC docker container, both show the same error.

omria · April 29, 2025, 4:13pm

Hey @hgaiser ,

Welcome to the Hailo Community!

The issue you’re seeing — where segmentation models with num_classes=1 fail to compile to HEF, but work fine when num_classes > 1 — is a known edge case in how the Hailo Dataflow Compiler handles output tensor shapes.

Why This Happens

When num_classes=1, the model outputs a tensor shaped like (1, 1, H, W).
But during simplification/export, this can get flattened to (1, H, W) — effectively dropping the channel dimension.

This breaks things in the compiler’s backend:

It affects how the compiler fuses layers and tiles the model.

Can cause the error:

Automri finished with too many resources on context_2

It’s not that your model is broken — it’s a limitation in how the compiler handles singleton channels.

Quick Fix (Most Reliable)

Use num_classes=2, but just treat one of them as foreground during postprocessing.
You’ll still get a binary output, and the compiler won’t choke.

Example:

model = smp.FPN("resnet18", classes=2)

Then in postprocess:

binary_mask = (output.argmax(dim=1) == 1).float()

hgaiser · May 12, 2025, 9:58am

Hi @omria ,

Thank you for the detailed reply. That was indeed the workaround I used thus far. I hope a better solution becomes available at some time in the future, but for now this works fine.

Topic		Replies	Views
Conversion error from yolov8n.onnx to HEF file General	18	1740	August 21, 2024
Error: While Parsing onnx Based models General dfc , error	4	404	November 19, 2024
Error encountered during onnx to hef converstion General dfc	21	337	August 20, 2024
Error converting ONNX to .hef General dfc , raspberry-pi	2	840	September 27, 2024
Need help on ONNX to Hef General hailo8	3	86	May 27, 2025

Error converting model when `num_classes=1`

Why This Happens

Quick Fix (Most Reliable)

Related topics