Asr

12 projects

Showing 12 of 12 projects

Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.

#forced-alignment#vad#automatic-speech-recognition

Stars23.2k

Forks2.4k

Last commit11 days ago

KaldiShell

A comprehensive open-source toolkit for speech recognition research and development.

#cuda#research-toolkit#speaker-id

Stars15.4k

Forks5.4k

Last commit10 months ago

VoskJupyter Notebook

Offline speech recognition toolkit supporting 20+ languages with small models and streaming API.

#ios#embedded-systems#speech-to-text-android

Stars15.0k

Forks1.7k

Last commit21 days ago

sherpa-onnxC++

A next-generation Kaldi-based toolkit for offline speech-to-text, text-to-speech, and audio processing across 12 languages and diverse hardware.

#ios#embedded-systems#aarch64

Stars13.7k

Forks1.6k

Last commit8 hours ago

whisper-diarizationJupyter Notebook

A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.

#nvidia-nemo#automatic-speech-recognition#asr

Stars5.6k

Forks503

Last commit

whisper-standalone-win

Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.

#media-processing#faster-whisper#asr

Stars3.1k

Forks164

Last commit8 months ago

whisper-timestampedPython

A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.

#subtitle-generation#python-library#deep-learning

A Linux desktop app for offline note-taking, reading, and translation using speech-to-text, text-to-speech, and machine translation.

#tts#stt#offline-app

Stars1.5k

Forks69

Last commit5 days ago

VoiceMode MCPPython

Enables natural two-way voice conversations with Claude Code and other MCP agents, perfect for hands-free coding assistance.

#tts#developer-tools#claudecode

Stars1.3k

Forks182

Last commit2 days ago

insanely-fast-whisper-cliPython

A command-line interface for blazingly fast audio transcription using optimized Whisper ASR models.

#asr#command-line-tool#speech-recognition

Stars407

Forks37

Last commit2 years ago

whisper-openvinoJupyter Notebook

A fork of OpenAI's Whisper speech recognition models optimized with OpenVINO backend for faster CPU inference.

#intel#cpu-optimization#asr

Stars184

Forks16

Last commit2 years ago

dicteePython

Push-to-talk voice dictation for Linux with 100% local transcription, multilingual support, and speaker diarization.

#kde#stt#nvidia

Stars48

Forks2

Last commit2 days ago

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub