Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Speech Recognition

Speech Recognition

31 projects

Showing 31 of 31 projects

JAX, Flax & Transformers
JAX, Flax & TransformersPython

A model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks.

#transformer#hacktoberfest#model-training
Stars159.8k
Forks33.0k
Last commit1 day ago
HuggingFace Transformers
HuggingFace TransformersPython

A model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks.

#transformer#hacktoberfest#model-training
Stars159.8k
Forks33.0k
Last commit1 day ago
Bindings for many languages
Bindings for many languagesC++

A high-performance C/C++ port of OpenAI's Whisper model for efficient, cross-platform speech recognition.

#transformer#automatic-speech-recognition#openai
Stars48.9k
Forks5.4k
Last commit4 days ago
whisper.cpp
whisper.cppC++

High-performance C/C++ port of OpenAI's Whisper for efficient, cross-platform speech recognition.

#transformer#ggml#offline
Stars48.9k
Forks5.4k
Last commit4 days ago
faster-whisper
faster-whisperPython

A fast, memory-efficient reimplementation of OpenAI's Whisper speech-to-text model using CTranslate2.

#transformer#ai#python-library
Stars22.4k
Forks1.8k
Last commit5 months ago
WhisperX
WhisperXPython

Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.

#forced-alignment#vad#automatic-speech-recognition
Stars21.4k
Forks2.2k
Last commit20 days ago
Kaldi
KaldiShell

A comprehensive open-source toolkit for speech recognition research and development.

#cuda#research-toolkit#speaker-id
Stars15.4k
Forks5.4k
Last commit7 months ago
Vosk
VoskJupyter Notebook

Offline speech recognition toolkit supporting 20+ languages with small models and streaming API.

#ios#embedded-systems#speech-to-text-android
Stars14.6k
Forks1.7k
Last commit2 months ago
EspNet
EspNetPython

An end-to-end speech processing toolkit for speech recognition, text-to-speech, translation, enhancement, and more.

#chainer#end-to-end#deep-learning
Stars9.8k
Forks2.4k
Last commit2 days ago
RTranslator
RTranslatorC++

An open-source Android app for real-time, offline voice translation between multiple languages using on-device AI models.

#bluetooth-le#open-source#machine-translation
Stars9.8k
Forks885
Last commit3 days ago
buster
busterJavaScript

A browser extension that solves difficult CAPTCHAs by completing reCAPTCHA audio challenges using speech recognition.

#privacy-tools#browser-extension#recaptcha
Stars9.0k
Forks675
Last commit4 days ago
annyang
annyangTypeScript

A tiny JavaScript library for adding speech recognition and voice commands to websites.

#web-accessibility#voice-commands#hands-free
Stars6.7k
Forks1.0k
Last commit17 days ago
wav2letter
wav2letterC++

A high-performance automatic speech recognition toolkit from Facebook AI Research, built with fully convolutional neural networks.

#end-to-end#deep-learning#automatic-speech-recognition
Stars6.4k
Forks994
Last commit3 months ago
Wav2Letter++
Wav2Letter++C++

Facebook AI Research's automatic speech recognition toolkit for end-to-end ASR with modern neural architectures.

#asr-toolkit#end-to-end#neural-architectures
Stars6.4k
Forks994
Last commit3 months ago
whisper-diarization
whisper-diarizationJupyter Notebook

A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.

#nvidia-nemo#automatic-speech-recognition#asr
Stars5.5k
Forks498
Last commit2 months ago
Porcupine
PorcupinePython

A highly-accurate, lightweight, on-device wake word detection engine powered by deep learning.

#iot#embedded-systems#on-device
Stars4.8k
Forks574
Last commit6 days ago
Whisper JAX
Whisper JAXJupyter Notebook

A JAX implementation of OpenAI's Whisper model offering up to 70x faster transcription on TPUs.

#parallel-computing#jax#deep-learning
Stars4.7k
Forks414
Last commit2 years ago
VoiceInk
VoiceInkSwift

A native macOS voice-to-text app that transcribes speech to text instantly with 100% offline processing.

#productivity-tool#ai-assistant#offline-transcription
Stars4.7k
Forks638
Last commit2 days ago
PocketSphinx
PocketSphinxC

A lightweight, open-source continuous speech recognition engine for embedded and offline applications.

#c-library#embedded-systems#python-library
Stars4.3k
Forks729
Last commit10 days ago
Warp-CTC
Warp-CTCCuda

A fast parallel implementation of the Connectionist Temporal Classification (CTC) loss function for CPU and GPU.

#cuda#parallel-computing#torch-binding
Stars4.1k
Forks1.0k
Last commit2 years ago
deep-chat
deep-chatTypeScript

A fully customizable AI chat component for websites, connecting to any API or hosting models directly in the browser.

#chat#ai#openai
Stars3.6k
Forks436
Last commit2 days ago
whisper-standalone-win
whisper-standalone-win

Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.

#media-processing#faster-whisper#asr
Stars3.0k
Forks161
Last commit5 months ago
uncaptcha
uncaptchaPython

A proof-of-concept system that defeats Google's audio reCaptcha with 85% accuracy using speech recognition and browser automation.

#web-security#selenium#captcha-bypass
Stars2.8k
Forks328
Last commit8 years ago
whisper-timestamped
whisper-timestampedPython

A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.

#subtitle-generation#python-library#deep-learning
Stars2.8k
Forks210
Last commit7 months ago
rhasspy
rhasspyShell

An open-source, fully offline voice assistant for many languages, designed for private home automation.

#multi-language#voice-commands#open-source
Stars2.7k
Forks206
Last commit1 year ago
Chat with GPT
Chat with GPTTypeScript

An open-source ChatGPT app with realistic voice capabilities using ElevenLabs text-to-speech.

#chat#gpt-3#chatgpt-api
Stars2.4k
Forks485
Last commit1 year ago
awesome-whisper
awesome-whisper

A curated list of resources, tools, and applications for OpenAI's Whisper speech recognition system.

#ai#openai#awesome-list
Stars2.3k
Forks126
Last commit1 month ago
Whisper
Whisper

A curated list of resources, tools, and applications for OpenAI's Whisper speech recognition model.

#ai#openai#awesome-list
Stars2.3k
Forks126
Last commit1 month ago
Talk-to-ChatGPT
Talk-to-ChatGPTJavaScript

A Chrome/Edge extension that enables voice conversations with ChatGPT using speech recognition and text-to-speech.

#browser-extension#speech-recognition#text-to-speech
Stars1.9k
Forks323
Last commit1 year ago
FluidAudio
FluidAudioSwift

A Swift SDK for fully local, low-latency audio AI on Apple devices, including transcription, text-to-speech, voice activity detection, and speaker diarization.

#ios#apple-neural-engine#speaker-embedding
Stars1.9k
Forks256
Last commit1 day ago
gentle
gentlePython

A robust yet lenient forced aligner built on Kaldi for aligning speech audio with text transcripts.

#forced-alignment#speech-analysis#rest-api
Stars1.7k
Forks312
Last commit11 months ago

Related Tags

#Speech To Text16#Machine Learning12#Deep Learning11#Audio Processing11#Whisper9#Automatic Speech Recognition8#Transcription8#Openai7#Python6#Asr6#Transformer5#Speech5
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub