Open source repositories tagged with #transcription, ranked by health score.
Industrial-grade speech recognition toolkit. 170x realtime, 50+ languages, speaker diarization, emotion detection — all in 3 lines of Python. Production-ready.
C++ ggml runtime hub for multilingual ASR models: Cohere Transcribe, Parakeet TDT, Voxtral, Canary 1B v2, etc, plus universal forced alignment via NeMo Forced Aligner-style CTC, and others. Fork of whisper.cpp.
AI that sees your screen, listens to your conversations and tells you what to do
Local speech-to-text for macOS on-device AI, fully private, optional cloud
macOS meeting transcription app with speaker diarization
Turn meetings, dictation, and audio files into local Markdown memory for Claude, Codex, and any agent.