Showing 11 of 11 projects
Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.
A comprehensive open-source toolkit for speech recognition research and development.
Offline speech recognition toolkit supporting 20+ languages with small models and streaming API.
A next-generation Kaldi-based toolkit for offline speech-to-text, text-to-speech, and audio processing across 12 languages and diverse hardware.
A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.
Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.
A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.
A Linux desktop app for offline note-taking, reading, and translation using speech-to-text, text-to-speech, and machine translation.
Enables natural two-way voice conversations with Claude Code and other MCP agents, perfect for hands-free coding assistance.
A command-line interface for blazingly fast audio transcription using optimized Whisper ASR models.
A fork of OpenAI's Whisper speech recognition models optimized with OpenVINO backend for faster CPU inference.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.