Hello.
I have Hailo AI Software Suite. Version 2025-10. I use virtual environment from hailo_ai_sw_suite. I need optimize my model as better, as i could. For this work i need gpu. nvidia-smi work and watch my 5060. But tensorflow don`t watch it. How could i use gpu for optimization?
(hailo_venv) mihail@mihail-MS-7D98:~/hailo_light/light_onnx$ nvidia-smi
Fri Nov 21 13:33:13 2025
±----------------------------------------------------------------------------------------+
| NVIDIA-SMI 580.95.05 Driver Version: 580.95.05 CUDA Version: 13.0 |
±----------------------------------------±-----------------------±---------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 5060 Off | 00000000:01:00.0 On | N/A |
| 0% 42C P8 12W / 145W | 734MiB / 8151MiB | 2% Default |
| | | N/A |
±----------------------------------------±-----------------------±---------------------+
±----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 2519 G /usr/lib/xorg/Xorg 292MiB |
| 0 N/A N/A 2731 G /usr/bin/gnome-shell 53MiB |
| 0 N/A N/A 3332 G …exec/xdg-desktop-portal-gnome 3MiB |
| 0 N/A N/A 3781 G …rack-uuid=3190708988185955192 141MiB |
| 0 N/A N/A 4803 G /usr/bin/hiddify 33MiB |
| 0 N/A N/A 106728 G /proc/self/exe 88MiB |
| 0 N/A N/A 717910 G /usr/bin/nautilus 15MiB |
±----------------------------------------------------------------------------------------+
(hailo_venv) mihail@mihail-MS-7D98:~/hailo_light/light_onnx$ python -c “import tensorflow as tf; print(tf.config.list_physical_devices(‘GPU’))”
2025-11-21 13:33:18.521258: I tensorflow/core/util/port.cc:153] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable TF_ENABLE_ONEDNN_OPTS=0.
2025-11-21 13:33:18.530488: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:477] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
WARNING: All log messages before absl::InitializeLog() is called are written to STDERR
E0000 00:00:1763721198.540700 748114 cuda_dnn.cc:8310] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
E0000 00:00:1763721198.543995 748114 cuda_blas.cc:1418] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
2025-11-21 13:33:18.555496: I tensorflow/core/platform/cpu_feature_guard.cc:210] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 AVX_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
W0000 00:00:1763721200.170204 748114 gpu_device.cc:2433] TensorFlow was not built with CUDA kernel binaries compatible with compute capability 12.0. CUDA kernels will be jit-compiled from PTX, which could take 30 minutes or longer.
[PhysicalDevice(name=‘/physical_device:GPU:0’, device_type=‘GPU’)]
(hailo_venv) mihail@mihail-MS-7D98:~/hailo_light/light_onnx$ hailo --version
[info] No GPU chosen and no suitable GPU found, falling back to CPU.
[info] Current Time: 13:33:28, 11/21/25
[info] CPU: Architecture: x86_64, Model: , Number Of Cores: 12, Utilization: 3.9%
[info] Memory: Total: 31GB, Available: 14GB
[info] System info: OS: Linux, Kernel: 6.14.0-35-generic
[info] Hailo DFC Version: 3.33.0
[info] HailoRT Version: 4.23.0
[info] PCIe: 0000:04:00.0: Number Of Lanes: 1, Speed: 8.0 GT/s PCIe
[info] Running hailo --version
HailoRT v4.23.0
Hailo Dataflow Compiler v3.33.0
(hailo_venv) mihail@mihail-MS-7D98:~/hailo_light/light_onnx$ hailo optimize lg10_kpts.har --calib-set-path ../onnx_models/calib_inputs_correct.npz --hw-arch hailo8
[info] No GPU chosen and no suitable GPU found, falling back to CPU.
[info] Current Time: 13:39:47, 11/21/25
[info] CPU: Architecture: x86_64, Model: , Number Of Cores: 12, Utilization: 8.5%
[info] Memory: Total: 31GB, Available: 14GB
[info] System info: OS: Linux, Kernel: 6.14.0-35-generic
[info] Hailo DFC Version: 3.33.0
[info] HailoRT Version: 4.23.0
[info] PCIe: 0000:04:00.0: Number Of Lanes: 1, Speed: 8.0 GT/s PCIe
[info] Running hailo optimize lg10_kpts.har --calib-set-path ../onnx_models/calib_inputs_correct.npz --hw-arch hailo8
[info] Starting Model Optimization
[warning] Reducing optimization level to 0 (the accuracy won’t be optimized and compression won’t be used) because there’s no available GPU
[warning] Running model optimization with zero level of optimization is not recommended for production use and might lead to suboptimal accuracy results
[info] Model received quantization params from the hn
[info] MatmulDecompose skipped
[info] Starting Mixed Precision
[info] Model Optimization Algorithm Mixed Precision is done (completion time is 00:00:00.54)
[info] Starting LayerNorm Decomposition