Does the Hailo-8 support the BF16 format? Has anyone tried converting Llama 3.2 to the .HAR and HEF formats? Need support @system
The Hailo-8 can perform model inference much more efficiently than a typical GPU. A key reason is that it uses integer arithmetic rather than floating-point, which significantly reduces power consumption.
Because computation on the device is integer-only, floating-point formats such as BF16 are not supported; the Hailo-8 supports 4-bit, 8-bit (default), and 16-bit integer precision.
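To give an intuition for how float model weights end up as integers on such a device, here is a minimal sketch of symmetric post-training quantization in NumPy. This is a generic illustration of the technique, not the actual algorithm used by Hailo's toolchain; the function names and the single per-tensor scale factor are my own simplifying assumptions.

```python
import numpy as np

def quantize_symmetric(weights: np.ndarray, bits: int = 8):
    """Map float weights to signed integers using one scale factor.

    Generic symmetric quantization sketch (not Hailo's actual method).
    Assumes bits <= 8 so the result fits in int8.
    """
    qmax = 2 ** (bits - 1) - 1                  # e.g. 127 for 8-bit
    scale = float(np.max(np.abs(weights))) / qmax
    q = np.clip(np.round(weights / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float values from the integer representation."""
    return q.astype(np.float32) * scale

w = np.array([-1.0, -0.5, 0.0, 0.5, 1.0], dtype=np.float32)
q, scale = quantize_symmetric(w)
print(q)                      # compact int8 representation
print(dequantize(q, scale))   # close to the original floats
```

The accelerator then performs the multiply-accumulate work on the small integers, which is where the power savings over floating-point hardware come from; the scale factor is folded back in once per layer.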
We have announced a new-generation AI accelerator, the Hailo-10H, which will support running LLMs.