Whisper

37 projects

Showing 36 of 37 projects

Bindings for many languages (C++)

A high-performance C/C++ port of OpenAI's Whisper model for efficient, cross-platform speech recognition.

#transformer #automatic-speech-recognition #openai

Stars: 52.2k

Forks: 5.9k

Last commit: 13 days ago

whisper.cpp (C++)

High-performance C/C++ port of OpenAI's Whisper for efficient, cross-platform speech recognition.

#transformer #ggml #offline

Stars: 52.2k

Forks: 5.9k

Last commit: 13 days ago

faster-whisper (Python)

A fast, memory-efficient reimplementation of OpenAI's Whisper speech-to-text model using CTranslate2.

#transformer #ai #python-library

Stars: 24.5k

Forks: 2.0k

Last commit: 8 months ago

WhisperX (Python)

Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.

#forced-alignment #vad #automatic-speech-recognition

Stars: 23.2k

Forks: 2.4k

Last commit: 11 days ago

Buzz (Python)

An offline desktop application for transcribing and translating audio/video files, live recordings, and YouTube links using OpenAI's Whisper.

#vulkan #cuda #desktop-application

Stars: 20.3k

Forks: 1.5k

Last commit: 7 days ago

Vibe (TypeScript)

Offline audio/video transcription desktop app using OpenAI Whisper with privacy-focused local processing.

#subtitle-generation #desktop-application #ai

Stars: 6.9k

Forks: 456

Last commit: 7 days ago

whisper-diarization (Jupyter Notebook)

A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.

#nvidia-nemo #automatic-speech-recognition #asr

Stars: 5.6k

Forks: 503

Last commit

Whisper JAX (Jupyter Notebook)

A JAX implementation of OpenAI's Whisper model offering up to 70x faster transcription on TPUs.

#parallel-computing #jax #deep-learning

Stars: 4.7k

Forks: 412

Last commit: 2 years ago

Embark Framework (JavaScript)

A framework for building and deploying serverless decentralized applications on Ethereum, IPFS, and other blockchain platforms.

#ipfs #solidity #serverless

Stars: 3.8k

Forks: 480

Last commit: 2 years ago

Ruby (Ruby)

A Ruby client library for the OpenAI API, supporting GPT-5, Realtime WebRTC, and all major endpoints.

#ai #gpt-4o #assistants-api

Stars: 3.2k

Forks: 385

Last commit: 2 months ago

whisper-standalone-win

Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.

#media-processing #faster-whisper #asr

Stars: 3.1k

Forks: 164

Last commit: 8 months ago

whisper-timestamped (Python)

A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.

#subtitle-generation #python-library #deep-learning

Stars: 2.8k

Forks: 212

Last commit

Chat with GPT (TypeScript)

An open-source ChatGPT app with realistic voice capabilities using ElevenLabs text-to-speech.

#chat #gpt-3 #chatgpt-api

Stars: 2.4k

Forks: 475

Last commit: 2 years ago

OpenSuperWhisper (Swift)

macOS dictation app

#dictation #whisper #parakeet

Stars: 2.3k

Forks: 186

Last commit: 15 days ago

async-openai (Rust)

A Rust library for interacting with OpenAI's APIs with full async/await support and type-safe request/response handling.

#ai-integration #async-await #openai

Stars: 2.0k

Forks: 391

Last commit: 1 month ago

Subs AI (Python)

A versatile tool for generating, translating, and syncing subtitles from audio/video using Whisper and other AI models via Web UI, CLI, or Python.

#subtitles-generator #webui #cli-tool

Stars: 1.7k

Forks: 139

Last commit: 3 months ago

TypeWhisper (Swift)

A macOS speech-to-text app offering on-device AI transcription, system-wide dictation, and AI text processing with full privacy.

#on-device #llm-integration #on-device-ai

A Python tool that uses OpenAI's Whisper to automatically generate subtitle files for YouTube videos.

#ffmpeg #video-processing #subtitles-generated

Stars: 1.4k

Forks: 145

Last commit: 2 years ago

whisper-ctranslate2 (Python)

A faster, memory-efficient command-line client for OpenAI's Whisper speech recognition, powered by CTranslate2.

#openai #cli-tool #speech-recognition

Stars: 1.3k

Forks: 127

Last commit: 5 months ago

gp.nvim (Lua)

A Neovim plugin for AI-powered chat sessions, text/code operations, speech-to-text, and image generation using multiple LLM providers.

#ai-assistant #codeium #vim

Stars: 1.3k

Forks: 128

Last commit: 11 months ago

VoiceMode MCP (Python)

Enables natural two-way voice conversations with Claude Code and other MCP agents, perfect for hands-free coding assistance.

#tts #developer-tools #claudecode

Stars: 1.3k

Forks: 182

Last commit: 3 days ago

hyprwhspr (Python)

A fast, accurate, and private native speech-to-text tool for Linux, offering system-wide dictation with local or cloud backends.

#ai #wayland #archlinux

Stars: 1.1k

Forks: 85

Last commit: 2 days ago

use-whisper (TypeScript)

A React hook for OpenAI Whisper API with built-in speech recording, real-time transcription, and silence removal.

#api #hook #openai

Stars: 785

Forks: 139

Last commit: 2 years ago

Whisper (Java)

An Android Input Method Editor (IME) providing offline voice recognition and translation using the Whisper engine.

#open-source #privacy-protection #input-method-editor

Stars: 608

Forks: 36

Last commit: 5 months ago

VLog (Python)

A video-language understanding framework that treats video narration as vocabulary and videos as long documents for efficient analysis.

#cvpr-2025 #video-understanding #vocabulary

Stars: 587

Forks: 31

Last commit: 1 year ago

YouTube Video Transcription (Jupyter Notebook)

A Google Colab notebook that transcribes YouTube videos using OpenAI's Whisper speech recognition model.

#youtube-transcription #transformer #google-colab

Stars: 421

Forks: 115

Last commit: 2 years ago

insanely-fast-whisper-cli (Python)

A command-line interface for blazingly fast audio transcription using optimized Whisper ASR models.

#asr #command-line-tool #speech-recognition

Stars: 407

Forks: 37

Last commit: 2 years ago

tambourine-voice (Rust)

An open-source voice dictation tool that types your speech at the cursor in any application, powered by customizable AI transcription and formatting.

#productivity #pipecat #desktop-app

Stars: 375

Forks: 71

Last commit: 7 days ago

DelphiOpenAI (Pascal)

A Delphi wrapper for OpenAI, DeepSeek, Azure OpenAI, YandexGPT, Ollama, GigaChat, and Qwen APIs, enabling AI features in Delphi applications.

#gpt-3 #ai-integration #object-pascal

Stars: 309

Forks: 86

Last commit: 10 days ago

OpenScribe (TypeScript)

An open-source AI medical scribe that records patient encounters and generates structured clinical notes automatically.

#ai #medical-ai #clinical-informatics

Stars: 193

Forks: 44

Last commit: 2 months ago

whisper-openvino (Jupyter Notebook)

A fork of OpenAI's Whisper speech recognition models optimized with OpenVINO backend for faster CPU inference.

#intel #cpu-optimization #asr

Stars: 184

Forks: 16

Last commit: 2 years ago

hns (Python)

A local, offline speech-to-text CLI tool that transcribes microphone input directly to your clipboard.

#developer-tools #faster-whisper #cli-tool

Stars: 113

Forks: 14

Last commit: 3 days ago

Meeper (TypeScript)

A browser extension that transcribes and summarizes in-browser conferences using ChatGPT and Whisper AI.

#ai #browser-extension #meeting-assistant

Stars: 84

Forks: 11

Last commit: 1 year ago

OpenDictation (Swift)

A lightweight, notch-integrated dictation utility for macOS that runs locally or with your own API keys.

#open-source #productivity #dictation

Stars: 55

Forks: 4

Last commit: 1 month ago

phonix (Python)

A Python tool that generates video captions using OpenAI's Whisper API, with options for enhanced transcription and stylish formatting.

#stable-ts #video-to-text #gui-application

Stars: 47

Forks: 5

Last commit: 1 year ago

FnKey (Rust)

A macOS menu bar app that transcribes speech to text when you hold the Fn key and pastes the result.

#deepgram #productivity #menu-bar-app

Stars: 46

Forks: 10

Last commit: 4 months ago

