How does each of these runner.infer_context(param) options work?

InferenceContext.SDK_HAILO_HW
InferenceContext.SDK_NATIVE
InferenceContext.SDK_BIT_EXACT
InferenceContext.SDK_FP_OPTIMIZED

I’d also like to know the difference between the two inference methods, runner.infer_context and InferVStreams (used in a with block).

Hey @clear.han

Let me explain the key differences between runner.infer_context and InferVStreams:

  1. runner.infer_context(param):
    This comes from the Hailo SDK’s ClientRunner. It gives you an inference context that selects where, and at which stage of the conversion pipeline, the model is executed; you then run a batch of data through it with runner.infer (see the sketch after this list). You can choose from:

    • SDK_HAILO_HW: runs inference on an attached Hailo device (requires a compiled model)
    • SDK_NATIVE: emulates the original full-precision model on the host CPU (useful for testing without Hailo hardware)
    • SDK_BIT_EXACT: emulates the quantized model on the host with bit-exact hardware numerics, so results match the device exactly
    • SDK_FP_OPTIMIZED: emulates the model after the optimization modifications, still in floating point
  2. InferVStreams:
    This is the HailoRT runtime API, used as a context manager (with InferVStreams(...) as pipeline:) on a network group configured from a compiled .hef. It manages continuous input/output streams with low latency, making it ideal for applications like video or sensor data processing.
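For illustration, here is a minimal sketch of the infer_context flow. It assumes the Dataflow Compiler’s ClientRunner API and a model already saved as a .har; the file path and dataset shape are placeholders:

import numpy as np
from hailo_sdk_client import ClientRunner, InferenceContext

# Load a model that was translated/optimized by the Dataflow Compiler (placeholder path)
runner = ClientRunner(har='model.har')

# A small batch of preprocessed inputs, shape (N, H, W, C) - placeholder data
dataset = np.zeros((8, 224, 224, 3), dtype=np.float32)

# SDK_NATIVE / SDK_FP_OPTIMIZED / SDK_BIT_EXACT emulate on the host;
# SDK_HAILO_HW runs on an attached Hailo device
with runner.infer_context(InferenceContext.SDK_NATIVE) as ctx:
    results = runner.infer(ctx, dataset)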

The main difference lies in their use cases:

  • runner.infer_context is best for batch inference during model conversion and evaluation, where you process a dataset all at once (for example, to compare accuracy between the native, optimized, and quantized model).
  • InferVStreams is suited for deployment scenarios requiring continuous, real-time data processing with a compiled .hef.

Choose the method that aligns best with your specific application needs. If you’re evaluating a model on batch data during conversion, go with runner.infer_context. For real-time streaming applications with a compiled model, InferVStreams would be more appropriate (see the sketch below).
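For comparison, here is a minimal, self-contained sketch of the InferVStreams flow; the .hef path, the PCIe interface, and the frame loop are assumptions for the example:

import numpy as np
from hailo_platform import (HEF, VDevice, ConfigureParams, HailoStreamInterface,
                            InferVStreams, InputVStreamParams, OutputVStreamParams, FormatType)

hef = HEF('model.hef')  # placeholder path
with VDevice() as vdevice:
    params = ConfigureParams.create_from_hef(hef, interface=HailoStreamInterface.PCIe)
    network_group = vdevice.configure(hef, params)[0]
    in_params = InputVStreamParams.make(network_group, format_type=FormatType.FLOAT32)
    out_params = OutputVStreamParams.make(network_group, format_type=FormatType.FLOAT32)
    in_info = hef.get_input_vstream_infos()[0]

    with network_group.activate(network_group.create_params()):
        with InferVStreams(network_group, in_params, out_params) as pipeline:
            for _ in range(100):  # e.g. frames arriving from a camera
                frame = {in_info.name: np.zeros((1, *in_info.shape), dtype=np.float32)}
                results = pipeline.infer(frame)  # dict: output vstream name -> numpy array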

Regards

In the sample code, InferVStreams uses a .hef file,
while runner.infer_context uses .har files.
Can I use the .hef file with runner.infer_context?
If so, how do I load the .hef on the runner?

You can run an .hef file on Hailo hardware from Python, but as far as I know it goes through the HailoRT API (HEF + VDevice + InferVStreams) rather than through runner.infer_context: the ClientRunner’s infer_context works from the .har produced during model conversion (use InferenceContext.SDK_HAILO_HW there if you want it to run on the device). Here’s how to load and run the .hef with HailoRT:

  1. Load the .hef file and configure the device:
import numpy as np
from hailo_platform import (HEF, VDevice, ConfigureParams, HailoStreamInterface,
                            InferVStreams, InputVStreamParams, OutputVStreamParams, FormatType)

hef = HEF('path_to_your_model.hef')
vdevice = VDevice()
# create_from_hef needs the stream interface; configure() returns a list of network groups
configure_params = ConfigureParams.create_from_hef(hef, interface=HailoStreamInterface.PCIe)
network_group = vdevice.configure(hef, configure_params)[0]
network_group_params = network_group.create_params()
  2. Run inference through InferVStreams:
input_vstreams_params = InputVStreamParams.make(network_group, format_type=FormatType.FLOAT32)
output_vstreams_params = OutputVStreamParams.make(network_group, format_type=FormatType.FLOAT32)
input_info = hef.get_input_vstream_infos()[0]
# Fill the input buffer with your data (zeros here are just a placeholder)
input_data = {input_info.name: np.zeros((1, *input_info.shape), dtype=np.float32)}
with network_group.activate(network_group_params):
    with InferVStreams(network_group, input_vstreams_params, output_vstreams_params) as infer_pipeline:
        results = infer_pipeline.infer(input_data)
# results maps each output vstream name to a numpy array

This is the standard way to run a compiled .hef on Hailo hardware from the Python API. If you specifically need runner.infer_context (for example, to compare the SDK emulation contexts against hardware), keep working from the .har in the ClientRunner rather than trying to load the .hef into it.