Run the CLIP application on Raspberry Pi 5 and Hailo-8

CLIP (Contrastive Language-Image Pre-Training) is a neural network trained on a large variety of (image, text) pairs. It can be instructed in natural language to predict the most relevant text snippet for a given image, without being directly optimized for that task, similar to the zero-shot capabilities of GPT-2 and GPT-3. CLIP matches the performance of the original ResNet-50 on ImageNet "zero-shot", without using any of the 1.28M labeled training examples, overcoming several major challenges in computer vision.
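As a rough illustration of this zero-shot behavior, the sketch below scores an image against a small set of text prompts using the Hugging Face `transformers` CLIP implementation. This is not the Hailo-accelerated pipeline used in this project; the model name, image path, and prompt list are placeholder assumptions.

```python
# Minimal zero-shot classification sketch with CLIP (CPU/GPU via transformers,
# not the Hailo-8 accelerated pipeline from this repository).
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("fruit.jpg")  # placeholder test image
labels = ["a photo of a banana", "a photo of an apple"]

inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

# logits_per_image holds the similarity of the image to each text prompt.
probs = outputs.logits_per_image.softmax(dim=-1)
for label, p in zip(labels, probs[0]):
    print(f"{label}: {p:.2%}")
```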

Here is the link to the repository.
If you like it, please give it a star.

In the video shown below, you can see that when I input "banana," the CLIP model recognizes a banana, and when I input "apple," it recognizes an apple. Simply input a different word, and the CLIP model will recognize the corresponding object. A conceptual sketch of this prompt swapping follows.
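Conceptually, changing the word only changes which text embedding the current camera frame is compared against. The sketch below, again using the `transformers` CLIP model rather than this repository's Hailo pipeline, shows how a typed word could be encoded and matched against a precomputed image embedding; the prompt template, threshold value, and function names are illustrative assumptions.

```python
# Hypothetical sketch: encode a user-typed word and compare it with a frame
# embedding via cosine similarity. The real Hailo pipeline API will differ.
import torch
from transformers import CLIPModel, CLIPTokenizer

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-base-patch32")

def encode_prompt(word: str) -> torch.Tensor:
    """Turn a single word typed by the user into a normalized text embedding."""
    tokens = tokenizer([f"a photo of a {word}"], return_tensors="pt", padding=True)
    with torch.no_grad():
        text_emb = model.get_text_features(**tokens)
    return text_emb / text_emb.norm(dim=-1, keepdim=True)

def prompt_matches(image_emb: torch.Tensor, word: str, threshold: float = 0.25) -> bool:
    """Return True if the (normalized) frame embedding is similar enough to the prompt."""
    text_emb = encode_prompt(word)
    similarity = (image_emb @ text_emb.T).item()  # cosine similarity
    return similarity > threshold
```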