Open source repositories tagged with #speech-to-text, ranked by health score.
Industrial-grade speech recognition toolkit. 170x realtime, 50+ languages, speaker diarization, emotion detection — all in 3 lines of Python. Production-ready.
C++ ggml runtime hub for multilingual ASR models: Cohere Transcribe, Parakeet TDT, Voxtral, Canary 1B v2, etc., plus universal forced alignment via NeMo Forced Aligner-style CTC, and others. Fork of whisper.cpp.
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
Local speech-to-text for macOS: on-device AI, fully private, optional cloud
macOS meeting transcription app with speaker diarization
🎙️ Offline voice productivity for Windows - dictate text into any app and control a local AI assistant by voice. Manages notes, to-do lists, appointments & reminders. 100% local: faster-whisper + Ollama + SQLite. No cloud, no telemetry. Free and open source alternative to Wispr Flow
The original Piper (https://github.com/OHF-Voice/piper1-gpl), now on iOS and macOS
On-device Speech AI for Apple Silicon
Turn meetings, dictation, and audio files into local Markdown memory for Claude, Codex, and any agent.