Compile CLIP with the DFC

lukasOST · November 11, 2024, 3:32pm

I have to withdraw my statement. I didn’t notice I was working with the changes I mentioned earlier in this forum.
[my fix]

I still get the error:

spatial_reshape_sizes = [output_shape[1], output_shape[2]]
IndexError: list index out of range

@nina-vilela, do you have another idea where my error is?

nina-vilela · November 12, 2024, 8:43am

Please try again from a fresh clean venv, and make sure that you install clip the recommended way:

pip install git+https://github.com/openai/CLIP.git

Also. please try exactly this snippet:

nina-vilela:

import torch
import clip
from torch.autograd import Variable

vision_arch = "RN50x4"
device = "cpu"
model, preprocess = clip.load("RN50x4", device=device)

vit = model.visual

dummy_input = Variable(torch.randn(1, 3, 288, 288, device="cpu"))

# ==================== #
# Export Model as ONNX #
# ==================== #
vit.eval()
torch.onnx.export(vit,          # model being run 
         dummy_input,                 # model input (or a tuple for multiple inputs) 
         "clip.onnx",       # where to save the model  
         opset_version=12,      # the ONNX version to export the model to
         do_constant_folding=True,          # whether to execute constant folding for optimization 
         )

lukasOST · November 13, 2024, 7:28am

Hey @nina-vilela,
I made a fresh venv and installed the following packages:

pip install torch
pip install torch.onnx
pip install git+https://github.com/openai/CLIP.git

Then i executed your Code but i again had to adjust the opset_version = 14`
The result is this:

lihis · November 17, 2024, 3:48pm

Hi @lukasOST

You can compile the model using one of the following options:

Install DFC 3.28 – Downgrade your DFC version by downloading the previous version(3.28) from the Developer Zone and installing it in a clean virtual environment.
Downgrade your Python version to 3.8 – Alternatively, install Python 3.8 and python3.8-dev, export the model as described above, create a clean virtual environment, and install DFC 3.29.

jinsookim · November 17, 2024, 11:54pm

Hello, @lukasOST
Sorry for the late reply. I got a message from you asking “How did you get the ONNX file?”, I’ll answer it below.

I found an ONNX file that openai pretrained on FP32 at here.
I only succeed the ResNet50 Image Encoder of CLIP.

And on each step I set on this way.

[ Parsing (ONNX->HAR1) ]

        hn, npz = runner.translate_onnx_model(
            onnx_path,
            onnx_model_name,
            start_node_names=['input'],
            end_node_names=['output'],
            net_input_shapes={'input': [1, 3, 224, 224]})

[ (Optimization) HAR1 → HAR2 ]
I set it up as follows by referring to the config file and CLIP_CIFAR100 in hailo_model_zoo github.

I resized CIFAR100 test dataset to 224x224 and converted to .npy format for optimization.

And the model script was as below.

set_seed(seed=23)
normalization1 = normalization([122.7709383, 116.7460125, 104.09373615000001], [68.5005327, 66.6321579, 70.32316304999999])
model_optimization_flavor(optimization_level=0, compression_level=0)
model_optimization_config(checker_cfg, batch_norm_checker=False)

When I used optimization_level=3, I was running out of RAM memory, so I specified it like above.

lukasOST · November 18, 2024, 8:33am

Hey @lihis,

i tried to compile the visual part with DFC 3.28.0.
I got this error:

hailo_sdk_client.model_translator.exceptions.ParsingWithRecommendationException: Parsing failed. The errors found in the graph are:

 UnsupportedShuffleLayerError in op /attnpool/Transpose_5: Failed to determine type of layer to create in node /attnpool/Transpose_5
Please try to parse the model again, using these end node names: /attnpool/MatMul_4

so i tried this code to compile it:

hn, npz = runner.translate_onnx_model(
    str(onnx_path),
    onnx_model_name,
    end_node_names="/attnpool/MatMul_4"
)

This compiled. My question now is did i clip the rest of the graph which would come after /attnpool/MatMul_4 ?

lihis · November 18, 2024, 1:23pm

Hi @lukasOST,

I sent you my onnx in a private message. Can you check if it’s parsing and compiling on version 3.28?
If it works, here is my pip list. Please ensure all the packages match mine exactly.

Package                  Version
------------------------ ----------
clip                     1.0
filelock                 3.16.1
fsspec                   2024.10.0
ftfy                     6.3.1
Jinja2                   3.1.4
MarkupSafe               3.0.2
mpmath                   1.3.0
networkx                 3.4.2
numpy                    2.1.3
nvidia-cublas-cu12       12.4.5.8
nvidia-cuda-cupti-cu12   12.4.127
nvidia-cuda-nvrtc-cu12   12.4.127
nvidia-cuda-runtime-cu12 12.4.127
nvidia-cudnn-cu12        9.1.0.70
nvidia-cufft-cu12        11.2.1.3
nvidia-curand-cu12       10.3.5.147
nvidia-cusolver-cu12     11.6.1.9
nvidia-cusparse-cu12     12.3.1.170
nvidia-nccl-cu12         2.21.5
nvidia-nvjitlink-cu12    12.4.127
nvidia-nvtx-cu12         12.4.127
onnx                     1.17.0
packaging                24.2
pillow                   11.0.0
pip                      22.0.2
protobuf                 5.28.3
regex                    2024.11.6
setuptools               59.6.0
sympy                    1.13.1
torch                    2.5.1
torchvision              0.20.1
tqdm                     4.67.0
triton                   3.1.0
typing_extensions        4.12.2
wcwidth                  0.2.13

Also, please double-check that you’re using this code for exporting and that everything is being done in a clean environment.

import torch
import clip
from torch.autograd import Variable

vision_arch = "RN50x4"
device = "cpu"
model, preprocess = clip.load("RN50x4", device=device)

vit = model.visual

dummy_input = Variable(torch.randn(1, 3, 288, 288, device="cpu"))

# ==================== #
# Export Model as ONNX #
# ==================== #
vit.eval()
torch.onnx.export(vit,          # model being run 
         dummy_input,                 # model input (or a tuple for multiple inputs) 
         "clip.onnx",       # where to save the model  
         opset_version=14,      # the ONNX version to export the model to
         do_constant_folding=True,          # whether to execute constant folding for optimization 
         )

lukasOST · November 20, 2024, 7:37am

Hey @lihis,

First i tried to compile the onnx you have send me. It failed at the same point as my onnx file.

Parsing failed. The errors found in the graph are:
 UnsupportedShuffleLayerError in op /attnpool/Transpose_5: Failed to determine type of layer to create in node /attnpool/Transpose_5
Please try to parse the model again, using these end node names: /attnpool/MatMul_4

Then i checkd the installed packages.
My list is way longer than yours:

Package                      Version
---------------------------- --------------
absl-py                      2.1.0
anyio                        4.6.2.post1
argcomplete                  3.5.1
argon2-cffi                  23.1.0
argon2-cffi-bindings         21.2.0
arrow                        1.3.0
asttokens                    2.4.1
astunparse                   1.6.3
async-lru                    2.0.4
attrs                        24.2.0
babel                        2.16.0
beautifulsoup4               4.12.3
bleach                       6.2.0
cachetools                   5.5.0
certifi                      2024.8.30
cffi                         1.17.1
charset-normalizer           3.4.0
cloudpickle                  3.1.0
coloredlogs                  15.0.1
comm                         0.2.2
contextlib2                  21.6.0
contourpy                    1.3.1
cycler                       0.12.1
debugpy                      1.8.8
decorator                    5.1.1
defusedxml                   0.7.1
disjoint_set                 0.8.0
dm-tree                      0.1.8
exceptiongroup               1.2.2
executing                    2.1.0
fastjsonschema               2.20.0
flatbuffers                  23.5.26
fonttools                    4.55.0
fqdn                         1.5.1
future                       1.0.0
gast                         0.4.0
google-auth                  2.36.0
google-auth-oauthlib         1.0.0
google-pasta                 0.2.0
grpcio                       1.68.0
h11                          0.14.0
h5py                         3.12.1
hailo-dataflow-compiler      3.28.0
httpcore                     1.0.7
httpx                        0.27.2
humanfriendly                10.0
idna                         3.10
importlib_metadata           8.5.0
ipykernel                    6.29.5
ipython                      8.29.0
ipywidgets                   8.1.5
isoduration                  20.11.0
jax                          0.4.30
jaxlib                       0.4.30
jedi                         0.19.2
Jinja2                       3.1.4
json5                        0.9.28
jsonpointer                  3.0.0
jsonref                      1.1.0
jsonschema                   4.23.0
jsonschema-specifications    2024.10.1
jupyter                      1.1.1
jupyter_client               8.6.3
jupyter-console              6.6.3
jupyter_core                 5.7.2
jupyter-events               0.10.0
jupyter-lsp                  2.2.5
jupyter_server               2.14.2
jupyter_server_terminals     0.5.3
jupyterlab                   4.2.6
jupyterlab_pygments          0.3.0
jupyterlab_server            2.27.3
jupyterlab_widgets           3.0.13
keras                        2.12.0
kiwisolver                   1.4.7
libclang                     18.1.1
Markdown                     3.7
markdown-it-py               3.0.0
MarkupSafe                   3.0.2
matplotlib                   3.9.2
matplotlib-inline            0.1.7
mdurl                        0.1.2
mistune                      3.0.2
ml_dtypes                    0.5.0
mpmath                       1.3.0
msgpack                      1.1.0
nbclient                     0.10.0
nbconvert                    7.16.4
nbformat                     5.10.4
nest-asyncio                 1.6.0
networkx                     3.4.2
notebook                     7.2.2
notebook_shim                0.2.4
numpy                        1.23.3
oauthlib                     3.2.2
onnx                         1.14.0
onnx-tf                      1.10.0
onnxruntime                  1.12.0
onnxsim                      0.4.17
opt_einsum                   3.4.0
overrides                    7.7.0
packaging                    24.2
pandas                       2.2.3
pandocfilters                1.5.1
parso                        0.8.4
pbr                          6.1.0
pexpect                      4.9.0
pillow                       11.0.0
pip                          22.0.2
platformdirs                 4.3.6
prettytable                  3.5.0
prometheus_client            0.21.0
prompt_toolkit               3.0.48
protobuf                     3.20.3
psutil                       6.1.0
ptyprocess                   0.7.0
pure_eval                    0.2.3
pwlf                         2.3.0
py                           1.11.0
py-cpuinfo                   9.0.0
pyasn1                       0.6.1
pyasn1_modules               0.4.1
pycparser                    2.22
pydantic                     1.10.8
pyDOE                        0.3.8
Pygments                     2.18.0
pygraphviz                   1.14
pyparsing                    2.4.7
python-dateutil              2.9.0.post0
python-json-logger           2.0.7
pytz                         2024.2
PyYAML                       6.0.2
pyzmq                        26.2.0
referencing                  0.35.1
requests                     2.32.3
requests-oauthlib            2.0.0
rfc3339-validator            0.1.4
rfc3986-validator            0.1.1
rich                         13.9.4
rpds-py                      0.21.0
rsa                          4.9
scipy                        1.13.1
Send2Trash                   1.8.3
setuptools                   68.0.0
six                          1.16.0
sniffio                      1.3.1
soupsieve                    2.6
stack-data                   0.6.3
sympy                        1.13.3
tabulate                     0.9.0
tensorboard                  2.12.3
tensorboard-data-server      0.7.2
tensorflow                   2.12.0
tensorflow-addons            0.23.0
tensorflow-estimator         2.12.0
tensorflow-io-gcs-filesystem 0.37.1
tensorflow-probability       0.20.1
termcolor                    2.5.0
terminado                    0.18.1
testresources                2.0.1
tflite                       2.10.0
tinycss2                     1.4.0
tomli                        2.1.0
tornado                      6.4.1
tqdm                         4.67.0
traitlets                    5.14.3
typeguard                    2.13.3
types-python-dateutil        2.9.0.20241003
typing_extensions            4.12.2
tzdata                       2024.2
uri-template                 1.3.0
urllib3                      2.2.3
verboselogs                  1.7
wcwidth                      0.2.13
webcolors                    24.11.1
webencodings                 0.5.1
websocket-client             1.8.0
Werkzeug                     3.1.3
wheel                        0.37.1
widgetsnbextension           4.0.13
wrapt                        1.14.1
zipp                         3.21.0

I am using a fresh new enviroment and deleted all global packages with pip freeze | xargs pip uninstall -y.
I am using hailo_dataflow_compiler-3.28.0-py3-none-linux_x86_64.whl with python 3.10.12 to install the DFC.

Of the packages on my list and your list, the following are different:

My version | Your version
numpy==1.23.3  | 2.1.3
onnx==1.14.0 | 1.17.0
protobuf==3.20.3 | 5.28.3
regex not installed | 2024.11.6

I also checked again with the DFC 3.29.0 and tried to compile your onnx but there i get the old error:

 raise UnexpectedNodeError(f"Failed to find reshape node in format conversion layer near {self.name}.")
output_shape = consumed_vertices[-1].get_output_shapes()[0]
spatial_reshape_sizes = [output_shape[1], output_shape[2]]
return consumed_vertices, [output_shape], spatial_reshape_sizes

lihis · November 27, 2024, 5:46pm

Hi @lukasOST

The list I sent you is for the clip_venv.
I checked my DFC virtual environment, and it contains the same packages as yours, so I’m not sure why the parsing isn’t working.

Please download the onnx from the link I sent you. I’ve uploaded a new version that I checked today.

Here are the steps I followed:

python3 -m venv 3_28_new  
. ./3_28_new/bin/activate  
pip install ../hailo_dataflow_compiler-3.28.0-py3-none-linux_x86_64.whl  
hailo parser onnx clip.onnx

If this still doesn’t work, you can send me your onnx, and I’ll try to parse it for you.

Just so you know, we passed this issue to our R&D and they will work on it, but it might take some time.

lukasOST · November 29, 2024, 7:37am

Hey @lihis
The new model you sent me compiles fine.
My question now is how did you get the onnx file?
Because for the clip I need torch 2.5, but in the file you sent me I can see you used torch 1.7.

By comparison, this is the first file you sent me that doesn’t compile.

My other question would be if the first file does compile on your computer.

lukasOST · December 7, 2024, 8:21am

Hey @lihis,
I have a suspicion where the error lies.
After the compilation it looks like a network can only have 3 dimensions.
I saw this in the follow har file:

It is a compiled version of the RN50x4 network i got from my code. To compile it i cut of the last part.
This is the onnx file you sent me:

There we see the maximal number of dimensions is 3.
Now this is my onnx file:

There we see that after the MatMul block i got 4 dimensions.
The compilation fails at the Transpose layer after the MatMul block.

I suspect that the compiler combines the dimensions and then doesn’t know what to do at the transpose layer.

Topic		Replies	Views
Compile Zero-Shot Classification General dfc , hailo8 , error	4	162	November 20, 2024
Compiling a .onnx to .hef with DFC v3.28.0 following official documentation General dfc , raspberry-pi	2	435	August 11, 2024
Hailo-Application-Code-Example Hailo-8 CLIP ViT-L/14 error General dfc , hailort , hailo8	3	94	February 17, 2025
Trouble compiling yolov8m compilation guide General dfc , hailo8 , yolov8	2	54	March 31, 2025
About Compiler Upcoming Features/Enhancement General dfc	11	225	January 25, 2025

Compile CLIP with the DFC

Related topics