Showing 22 of 58 projects
An Android Input Method Editor (IME) providing offline voice recognition and translation using the Whisper engine.
A Unity SDK for integrating IBM Watson AI services like speech, language, and vision into games and applications.
A customizable iOS overlay that handles voice permission and converts speech to text using native speech recognition.
A Google Colab notebook that transcribes YouTube videos using OpenAI's Whisper speech recognition model.
A joint audio tagging and speech recognition model that adds audio event detection to OpenAI Whisper with minimal computational overhead.
Control Vim and Vim-like editors with voice commands using speech recognition.
A command-line interface for blazingly fast audio transcription using optimized Whisper ASR models.
A curated collection of linguistic resources, datasets, and tools for Natural Language Processing and Computational Linguistics on Spanish.
A curated collection of linguistic resources, tools, and datasets for Natural Language Processing and Computational Linguistics on Spanish.
A Flutter plugin for speech recognition on iOS and Android using native APIs.
An on-device AI teleprompter that listens to your conversations and suggests charismatic quotes in real-time.
A curated collection of datasets, corpora, and resources for Indonesian natural language processing tasks.
An Android overlay that handles voice permission and converts user speech to text with a customizable UI.
Ruby FFI bindings for Pocketsphinx, a lightweight speech recognition engine.
A Chrome extension that adds hands-free voice control to ChatGPT with custom trigger phrases and 60+ language support.
An open-source desktop app that transcribes voice to polished text using AI and types it into any application.
An Android chatbot with voice interaction capabilities powered by IBM Watson's AI services on IBM Cloud.
A fork of OpenAI's Whisper speech recognition models optimized with OpenVINO backend for faster CPU inference.
A collection of iOS sample apps demonstrating Generative AI capabilities including OpenAI, local LLMs, Stable Diffusion, and speech recognition.
Android client library for integrating IBM Watson cognitive services like speech recognition, text-to-speech, and visual recognition.
A Python library for easy access, management, and processing of audio datasets, particularly for machine learning tasks.
A Capacitor plugin for native speech recognition on iOS and Android, enabling voice-to-text in hybrid mobile apps.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.