Hi all,
I’m currently working on converting a YOLOv8 ONNX model to HEF format using the Hailo toolchain.
I have the hailo_ai_sw
Docker running. I trained the model on Google Colab, exported it to ONNX, and then tried to load it using hailomz_compile
. However, it gave an error saying the ONNX model version is 11, which is higher than the supported version 10.
To work around this, I modified the model’s ir_version
to 10 and saved it again. After that, it did load, but I ran into the following error:
===========================================================================
• inputs=[‘tf.Tensor(shape=(None, 80, 80, 64), dtype=float32)’, ‘tf.Tensor(shape=(None, 80, 80, 3), dtype=float32)’, ‘tf.Tensor(shape=(None, 40, 40, 64), dtype=float32)’, ‘tf.Tensor(shape=(None, 40, 40, 3), dtype=float32)’, ‘tf.Tensor(shape=(None, 20, 20, 64), dtype=float32)’, ‘tf.Tensor(shape=(None, 20, 20, 3), dtype=float32)’]
• training=False
• kwargs=<class ‘inspect._empty’>
(hailo_virtualenv) hailo@kunxianhuan
Then, I tried training directly using the Hailo Model Zoo YOLOv8 training Docker with the same dataset. But for some reason, the training loss stays as nan
across all epochs — the model isn’t learning.
Any guidance would be really appreciated!