Categories Alternatives Stacks Self-Hosted Explore

© 2026 Open-Awesome. Curated for the developer elite.

Terms Privacy About GitHub RSS

Speech To Text Open Source Projects | Open Awesome

Home
Tags
Speech To Text

Speech To Text

79 projects

Showing 36 of 79 projects

Bindings for many languagesC++

A high-performance C/C++ port of OpenAI's Whisper model for efficient, cross-platform speech recognition.

#transformer#automatic-speech-recognition#openai

Last commit13 days ago

High-performance C/C++ port of OpenAI's Whisper for efficient, cross-platform speech recognition.

#transformer#ggml#offline

Last commit13 days ago

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

#on-device#embedded#deep-learning

Last commit1 year ago

faster-whisperPython

A fast, memory-efficient reimplementation of OpenAI's Whisper speech-to-text model using CTranslate2.

#transformer#ai#python-library

Last commit8 months ago

Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.

#forced-alignment#vad#automatic-speech-recognition

Last commit11 days ago

An open-source AI memory tool that captures your screen and audio locally, enabling search and automation agents based on your computer activity.

#ai#open-source#productivity

Last commit20 hours ago

An offline desktop application for transcribing and translating audio/video files, live recordings, and YouTube links using OpenAI's Whisper.

#vulkan#cuda#desktop-application

Last commit7 days ago

Open-source AI platform for building private agents, assistants, and enterprise search with document analysis and multi-model support.

#ai#information-retrieval#multi-model-support

Last commit23 hours ago

🧠 Leon is your open-source personal assistant.

#ai#speech-recognition#text-to-speech

Last commit2 days ago

A comprehensive open-source toolkit for speech recognition research and development.

#cuda#research-toolkit#speaker-id

Last commit10 months ago

VoskJupyter Notebook

Offline speech recognition toolkit supporting 20+ languages with small models and streaming API.

#ios#embedded-systems#speech-to-text-android

Last commit22 days ago

A next-generation Kaldi-based toolkit for offline speech-to-text, text-to-speech, and audio processing across 12 languages and diverse hardware.

#ios#embedded-systems#aarch64

Last commit20 hours ago

SDKs for adding private, on-device AI features like LLM chat, speech-to-text, and text-to-speech to mobile and web apps.

#ios#on-device-ai#android

Last commit23 hours ago

SpeechRecognitionPython

Speech recognition module for Python, supporting several engines and APIs, online and offline.

#speech-recognition#python#speech-to-text

Last commit1 month ago

Offline audio/video transcription desktop app using OpenAI Whisper with privacy-focused local processing.

#subtitle-generation#desktop-application#ai

Last commit7 days ago

annyangTypeScript

A tiny JavaScript library for adding speech recognition and voice commands to websites.

#web-accessibility#voice-commands#hands-free

Last commit10 days ago

mycroft-corePython

A hackable open-source voice assistant platform for building and running custom voice-controlled applications.

#ai#open-source#fedora

Last commit1 year ago

Wav2Letter++C++

Facebook AI Research's automatic speech recognition toolkit for end-to-end ASR with modern neural architectures.

#asr-toolkit#end-to-end#neural-architectures

Last commit10 days ago

A high-performance automatic speech recognition toolkit from Facebook AI Research, built with fully convolutional neural networks.

#end-to-end#deep-learning#automatic-speech-recognition

Last commit10 days ago

whisper-diarizationJupyter Notebook

A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.

#nvidia-nemo#automatic-speech-recognition#asr

Whisper JAXJupyter Notebook

A JAX implementation of OpenAI's Whisper model offering up to 70x faster transcription on TPUs.

#parallel-computing#jax#deep-learning

Last commit2 years ago

A Ruby client library for the OpenAI API, supporting GPT-5, Realtime WebRTC, and all major endpoints.

#ai#gpt-4o#assistants-api

Last commit2 months ago

whisper-standalone-win

Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.

#media-processing#faster-whisper#asr

Last commit8 months ago

whisper-timestampedPython

A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.

#subtitle-generation#python-library#deep-learning

awesome-whisper

A curated list of resources, tools, and applications for OpenAI's Whisper speech recognition system.

#ai#openai#awesome-list

Last commit4 months ago

A curated list of resources, tools, and applications for OpenAI's Whisper speech recognition model.

#ai#openai#awesome-list

Last commit4 months ago

A web service providing a GUI and API with queuing for OpenAI Whisper transcription and translation.

#redis#flask#api-service

Last commit22 days ago

A robust yet lenient forced aligner built on Kaldi for aligning speech audio with text transcripts.

#forced-alignment#speech-analysis#rest-api

Last commit2 months ago

A versatile tool for generating, translating, and syncing subtitles from audio/video using Whisper and other AI models via Web UI, CLI, or Python.

#subtitles-generator#webui#cli-tool

Last commit3 months ago

TypeWhisperSwift

A macOS speech-to-text app offering on-device AI transcription, system-wide dictation, and AI text processing with full privacy.

#on-device#llm-integration#on-device-ai

Last commit1 day ago

A Linux desktop app for offline note-taking, reading, and translation using speech-to-text, text-to-speech, and machine translation.

#tts#stt#offline-app

Last commit6 days ago

A free web application that transcribes and translates audio files using OpenAI's Whisper and Chat APIs.

#laravel#openai-whisper#speech-to-text

Last commit24 days ago

node-sdkTypeScript

Node.js client library for accessing IBM Watson AI services like Assistant, Speech-to-Text, and Natural Language Understanding.

#hacktoberfest#ai-api#language-translation

Last commit6 days ago

python-sdkPython

A Python client library for interacting with IBM Watson AI services, available via pip as ibm-watson.

#hacktoberfest#cloud-services#authentication

Last commit5 months ago

yt-whisperPython

A Python tool that uses OpenAI's Whisper to automatically generate subtitle files for YouTube videos.

#ffmpeg#video-processing#subtitles-generated

Last commit2 years ago

whisper-ctranslate2Python

A faster, memory-efficient command-line client for OpenAI's Whisper speech recognition, powered by CTranslate2.

#openai#cli-tool#speech-recognition

Last commit5 months ago

Page 1 of 3

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub

5 months ago

10 months ago

#Speech Recognition29

#Machine Learning18

#Audio Processing17

#Transcription17

#Text To Speech16

#Accessibility14

#Openai Whisper12

#Natural Language Processing11