speech-transcription

Here are 19 public repositories matching this topic...

Dadangdut33 / Speech-Translate

A real-time speech transcription and translation application using OpenAI Whisper and a free translation API. The interface is built with Tkinter, and the code is written entirely in Python.

python translate whisper tkinter-python speech-translation speech-transcription

Updated Jan 18, 2024
Python

Appen / UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

speech-recognition speech-processing audio-segmentation gender-classification speaker-diarization synthetic-speech-detection topic-detection speech-seperation speaker-identification accent-detection speech-transcription speech-annotation

Updated Mar 25, 2023
Forth

jhauret / vibravox

Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.

pytorch hydra datasets speaker-verification speech-enhancement pytorch-lightning speech-transcription bandwidth-extension

Updated Jun 16, 2025
Python

KevKibe / African-Whisper

🚀 Framework for fine-tuning Whisper models on multilingual datasets and deploying them to production.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Feb 27, 2025
Python

arashsajjadi / ai-powered-video-analyzer

An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Ollama). It ensures privacy and offline use with a user-friendly GUI.

gui privacy yolo image-captioning object-detection whisper offline-processing speech-transcription llm whisper-ai blip2 ollama panns image-captioning-ai ollama-api yolo11 ai-video-analysis audio-event-detection llm-summarization

Updated Feb 23, 2025
Python

srinivr / kaldi-long-audio-alignment

Long audio alignment using Kaldi

speech-recognition automatic-speech-recognition speech-to-text kaldi transcription asr speechrecognition split-audio longaudio-alignment audio-segments speech-transcription

Updated Apr 22, 2021
Shell

PranavPutsa1006 / Speaker-Diarization

Identifies individual speakers in an audio stream from the unique characteristics of their voices. Implemented in Python.

deep-learning neural-networks speech-to-text mfcc speaker-diarization spectral-clustering voice-activity-detection speech-segmentation speech-detection speech-transcription embeddings-extraction

Updated Jun 18, 2023
Jupyter Notebook

laviprog / speech-transcription

Speech Transcription API is a RESTful service that processes audio input and converts speech into text using state-of-the-art speech recognition models. Ideal for building transcription tools, smart assistants, and voice-controlled applications.

python docker sqlalchemy docker-compose postgresql alembic speech-to-text transcription fastapi speech-transcription whisperx

Updated Nov 7, 2025
Python

capjamesg / awsnap.js

Navigate websites by clicking your fingers and saying the link you want to visit.

webaudio-api audio-classification tensorflow-js speech-transcription

Updated Oct 1, 2023
HTML

otonomee / mic2transcript

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.

audio cli speech openai transcription whisper cli-tool speech-transcription

Updated Jun 22, 2024
Python

umitkacar / transformer-asr-transcription

Real-time transformer-based ASR supporting 100+ languages, with Google Cloud integration, noise cancellation, and low-latency optimization.

Updated Nov 10, 2025
Python

Think-A-Move / SPEAR-SDK-Java-Android

SPEAR-ASR and SPEAR-WakeUp Software Development Kit for Android

Updated Sep 16, 2021
Java

JadenChun / real-time-caption-generator

Real-time caption generator using Microsoft Azure Speech services.

cpp gui-application qt-widgets windows-application speech-translation azure-speech-service speech-transcription real-time-caption

Updated Jun 15, 2023
C++

adam-aalah / Speech-transcription

Speech transcription and speech diarization

python speech-to-text transcription diarization speech-diarization speech-transcription speechbrain whisper-ai

Updated Mar 31, 2024
Python

Infinitode / Scriptify

An open-source AI writing tool for real-time speech transcription.

css python html open-source app ui ai js offline tool writing openai free whisper writing-tool pywebview speech-transcription openai-whisper

Updated Jul 1, 2025
JavaScript

Think-A-Move / SPEAR-SDK-Python-Linux

SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux

Updated Nov 22, 2021
Python

ksquarekumar / whisper-stream

Whisper Transcription Service

deep-learning inference transformer openai automatic-speech-recognition flax speech-to-text whisper jax speech-translation speech-transcription

Updated Sep 14, 2023
Jupyter Notebook

robotology / yarp-device-speechTranscription-whisper

A YARP plugin that performs speech transcription using OpenAI Whisper.

openai speech-to-text yarp whisper speech-transcription

Updated Jun 11, 2025
C++

pablobernabeu / secure_local_HPC_speech_transcription

A production-ready local transcription workflow leveraging OpenAI's Whisper models that addresses the limitations of cloud-based solutions through complete data sovereignty, unlimited scale, reproducible processing and advanced quality control, while maintaining GDPR compliance.

machine-learning torch pytorch artificial-intelligence speech-to-text whisper huggingface transformer-models huggingface-transformers speech-transcription whisper-ai

Updated Nov 16, 2025
Python
