speech-transcription

Here are 19 public repositories matching this topic...

An offline, AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Ollama). It runs entirely offline to preserve privacy and provides a user-friendly GUI.

  • Updated Feb 23, 2025
  • Python

A production-ready local transcription workflow leveraging OpenAI's Whisper models that addresses the limitations of cloud-based solutions through complete data sovereignty, unlimited scale, reproducible processing and advanced quality control, while maintaining GDPR compliance.

  • Updated Nov 16, 2025
  • Python
