Tiled inference of object detection model on high resolution image

Hi everyone!

I am running an object detection model, which sometimes struggles to detect smallish objects. I’ve thought abouting doing several inferences for each image of my camera similar to the operation done for SAHI.

I would like to split the camera image (1920x1080) into three overlapping image tiles, apply downsampling to each of them, stack them as a batch, and do the batched forward pass.

Could I run these additional operations (slicing, resizing, and batching) on the Hailo as well? Has anyone tried something like this before?

Thanks!

Welcome to the Hailo Community!

Have a look at our Tappas Tilling examples.

GitHub - Hailo Tappas Tiling

The Hailo-8 is an AI accelerator for inference and not a GPU, therefore the slicing and resizing has to be done on the host (CPU or GPU if you have one).

Thanks for the response!

As far as I know, the Hailo supports resizing (bilinear interpolation, …) as well as slicing operations. Why isn’t it possible to run a tiled inference completely on the Hailo? I am interested in offloading these operations to the Hailo, as it is quite expensive for me to do on the CPU.