Hey @_Mab, @Kristijan_Vrban,
Regarding the large models with more than 3-4 contexts: this issue might be related to the Raspberry Pi 5’s single-lane PCIe configuration, which differs from the x86 systems used in the ModelZoo tests.
Thanks for providing that information! I’ve opened a ticket to address the model issue. This appears to be caused by an incorrect model URL (not pointing to the performance-optimized version).
I’ll test both configurations and provide updates on the results.