TOPIC

#speech-to-text

Open source repositories tagged with #speech-to-text, ranked by health score.

Open source voice AI platform. Self-hosted alternative to Vapi and Retell. On Prem, BYOK across Speech to Speech or LLM/STT/TTS, with a visual workflow builder, MCP native and telephony support.

★ 4.8k

huggingface/speech-to-speech

Python

health

Build local voice agents with open-source models

A free, open source, and extensible speech-to-text application that works completely offline.

★ 26.1k

FluidInference/FluidAudio

Swift

health

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

★ 13.5k

TypeWhisper/typewhisper-mac

Swift

health

Local speech-to-text for macOS on-device AI, fully private, optional cloud

A native macOS menu bar dictation app using local speech-to-text with WhisperKit

C++ ggml runtime hub for multilingual ASR and TTS models: Cohere Transcribe, Parakeet TDT, Voxtral, Canary 1B v2, etc, plus universal forced alignment, and more

Muesli - local meeting transcription + dictation for macOS (Granola + WisprFlow alternative)

★ 731

mgsgde/whisper-shortcut

Swift

health

Speech-to-text and voice-to-prompt macOS app with Gemini and Whisper support

★ 64

pasrom/meeting-transcriber

Swift

health

On-device meeting transcriber for macOS — auto-records Teams/Zoom/Webex, transcribes & separates speakers locally. No cloud. Open-source alternative to Otter/Granola/Fireflies.

★ 76

argmaxinc/argmax-oss-swift

Swift

health

On-device Speech AI for Apple Silicon

QVAC - Local AI SDK and libraries for building private, cross-platform, peer-to-peer AI applications. Run LLMs, speech-to-text, translation, and more locally on Linux, macOS, Windows, Android, and iOS.

🎙️ Offline voice productivity for Windows - dictate text into any app and control a local AI assistant by voice. Manages notes, to-do lists, appointments & reminders. 100% local: faster-whisper + Ollama + SQLite. No cloud, no telemetry. Free and open source alternative to Wispr Flow

★ 71