Showing 7 of 7 projects
Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.
A comprehensive open-source toolkit for speech recognition research and development.
Offline speech recognition toolkit supporting 20+ languages with small models and streaming API.
A next-generation Kaldi-based toolkit for offline speech-to-text, text-to-speech, and audio processing across 12 languages and diverse hardware.
A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.
Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.
A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.