A high-performance C/C++ port of OpenAI's Whisper model for efficient, cross-platform speech recognition.
High-performance C/C++ port of OpenAI's Whisper for efficient, cross-platform speech recognition.
A fast, memory-efficient reimplementation of OpenAI's Whisper speech-to-text model using CTranslate2.
Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.
An offline desktop application for transcribing and translating audio/video files, live recordings, and YouTube links using OpenAI's Whisper.
Offline audio/video transcription desktop app using OpenAI Whisper with privacy-focused local processing.
A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.
A JAX implementation of OpenAI's Whisper model offering up to 70x faster transcription on TPUs.
A framework for building and deploying serverless decentralized applications on Ethereum, IPFS, and other blockchain platforms.
A Ruby client library for the OpenAI API, supporting GPT-5, Realtime WebRTC, and all major endpoints.
Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.
A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.
An open-source ChatGPT app with realistic voice capabilities using ElevenLabs text-to-speech.
A Rust library for interacting with OpenAI's APIs with full async/await support and type-safe request/response handling.
A versatile tool for generating, translating, and syncing subtitles from audio/video using Whisper and other AI models via Web UI, CLI, or Python.