Whisper integration with PySDK

Recently, the Hailo team compiled the Whisper models for Hailo-8 and Hailo-8L devices, opening new avenues for voice-based application development. Building on this work, we are excited to share DeGirum/degirum_speech, which integrates Whisper with PySDK.

DeGirum Speech has been designed to make development easier with these features:

  • High-performance Whisper transcription on Hailo-8/Hailo-8L accelerators.

  • Transcribe from files, NumPy arrays, or a live microphone stream.

  • Optional audio denoising for noisy environments.

  • Optional Voice Activity Detection (VAD) to ignore silence.

  • Simple CLI to download models for offline use.
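
To give a feel for what a VAD stage does before transcription, here is a minimal energy-based sketch in NumPy. It is only an illustration of the idea and is not how degirum_speech implements VAD; the function names `energy_vad` and `drop_silence`, the 30 ms frame size, and the 0.01 RMS threshold are all assumptions made for this example. The 16 kHz default matches the sample rate Whisper works with.

```python
import numpy as np


def energy_vad(audio, sample_rate=16000, frame_ms=30, threshold=0.01):
    """Return one boolean per frame: True where the frame's RMS exceeds threshold.

    Hypothetical helper for illustration; trailing samples that do not fill
    a whole frame are ignored.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    n_frames = len(audio) // frame_len
    frames = np.asarray(audio[: n_frames * frame_len], dtype=np.float64)
    frames = frames.reshape(n_frames, frame_len)
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    return rms > threshold


def drop_silence(audio, sample_rate=16000, frame_ms=30, threshold=0.01):
    """Return only the samples that belong to frames flagged as active."""
    frame_len = int(sample_rate * frame_ms / 1000)
    mask = energy_vad(audio, sample_rate, frame_ms, threshold)
    frames = np.asarray(audio[: len(mask) * frame_len]).reshape(len(mask), frame_len)
    return frames[mask].reshape(-1)


# Example input: 10 frames of silence followed by 10 frames of a 440 Hz tone.
sample_rate = 16000
frame_len = 480  # 30 ms at 16 kHz
silence = np.zeros(10 * frame_len, dtype=np.float32)
t = np.arange(10 * frame_len) / sample_rate
tone = (0.5 * np.sin(2 * np.pi * 440 * t)).astype(np.float32)
signal = np.concatenate([silence, tone])

mask = energy_vad(signal, sample_rate)
voiced = drop_silence(signal, sample_rate)
```

Only the `voiced` samples would then be passed on to the transcriber. A fixed energy threshold is the simplest possible choice and tends to misfire on noisy input, which is one reason denoising and VAD are often paired.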

We look forward to the community's feedback on this package.
