Speech Recognition

#hacktoberfest#language-translation#ai-services

swift-sdkSwift

Swift SDK for integrating IBM Watson AI services like speech, language, and assistant into iOS and Linux applications.

Stars869

Forks213

#ai#emacs#text-generation

org-aiEmacs Lisp

An Emacs org-mode minor mode that integrates generative AI models like ChatGPT, DALL-E, and Stable Diffusion for text and image generation.

Stars820

Forks64

Last commit6 months ago

SonusJavaScript

A Node.js library for adding voice interfaces with offline hotword detection and cloud speech recognition.

#voice-commands#stt#voice-control

Stars638

Forks76

#open-source#privacy-protection#input-method-editor

WhisperJava

An Android Input Method Editor (IME) providing offline voice recognition and translation using the Whisper engine.

Stars608

Forks36

Last commit5 months ago

unity-sdkC#

A Unity SDK for integrating IBM Watson AI services like speech, language, and vision into games and applications.

#unity3d#hacktoberfest#csharp

Stars565

Forks205

#transformer#audioset#python-library

Voice OverlaySwift

A customizable iOS overlay that handles voice permission and converts speech to text using native speech recognition.

#search#ios#input

Stars556

Forks58

Last commit1 month ago

Whisper-ATPython

A joint audio tagging and speech recognition model that adds audio event detection to OpenAI Whisper with minimal computational overhead.

Stars421

Forks36

YouTube Video TranscriptionJupyter Notebook

A Google Colab notebook that transcribes YouTube videos using OpenAI's Whisper speech recognition model.

#youtube-transcription#transformer#google-colab

Stars421

Forks115

#productivity#voice-control#vim

VimSpeakF#

Control Vim and Vim-like editors with voice commands using speech recognition.

Stars416

Forks30

Last commit5 years ago

insanely-fast-whisper-cliPython

A command-line interface for blazingly fast audio transcription using optimized Whisper ASR models.

#asr#command-line-tool#speech-recognition

Stars407

Forks37

#desktop-application#ai#deepgram

OpenTypelessRust

An open-source desktop app that transcribes voice to polished text using AI and types it into any application.

A curated collection of linguistic resources, tools, and datasets for Natural Language Processing and Computational Linguistics on Spanish.

#computational-linguistics#pos-tagging#machine-translation

Stars351

Forks42

#computational-linguistics#text-analysis#nlp-datasets

awesome-spanish-nlp

A curated collection of linguistic resources, datasets, and tools for Natural Language Processing and Computational Linguistics on Spanish.

Stars351

Forks42

#ai-assistant#charisma-training#on-device-ai

TeleprompterPython

An on-device AI teleprompter that listens to your conversations and suggests charismatic quotes in real-time.

Stars337

Forks38

Speech RecognitionDart

A Flutter plugin for speech recognition on iOS and Android using native APIs.

#dart#ios#android

Stars336

Forks193

#nlp-resources#indonesian-language#text-classification

Indonesian NLP

A curated collection of datasets, corpora, and resources for Indonesian natural language processing tasks.

Stars287

Forks46

#conversational-ui#input#ui-overlay

VoiceOverlayKotlin

An Android overlay that handles voice permission and converts user speech to text with a customizable UI.

Stars264

Forks36

#pocketsphinx#keyword-spotting#command-control

pocketsphinx-rubyRuby

Ruby FFI bindings for Pocketsphinx, a lightweight speech recognition engine.

Stars257

Forks39

Last commit9 years ago

ChassistantGPTJavaScript

A Chrome extension that adds hands-free voice control to ChatGPT with custom trigger phrases and 60+ language support.

#ai-assistant#hands-free#chatgpt-integration

Stars240

Forks33

chatbot-watson-androidJava

An Android chatbot with voice interaction capabilities powered by IBM Watson's AI services on IBM Cloud.

#ibm-cloud#ai-assistant#android

Stars197

Forks180

whisper-openvinoJupyter Notebook

A fork of OpenAI's Whisper speech recognition models optimized with OpenVINO backend for faster CPU inference.

#intel#cpu-optimization#asr

Stars184

Forks16

#dataset-creation#music#audio-analysis

iOS-GenAI-SamplerSwift

A collection of iOS sample apps demonstrating Generative AI capabilities including OpenAI, local LLMs, Stable Diffusion, and speech recognition.

#ios#metal#openai

Stars178

Forks15

Last commit8 months ago

android-sdkJava

Android client library for integrating IBM Watson cognitive services like speech recognition, text-to-speech, and visual recognition.

#ibm-cloud#aar#android

Stars148

Forks90

Last commit5 years ago

audiomatePython

A Python library for easy access, management, and processing of audio datasets, particularly for machine learning tasks.

Stars139

Forks25

#ios#capacitor-plugin#android

Speech recognitionJava

A Capacitor plugin for native speech recognition on iOS and Android, enabling voice-to-text in hybrid mobile apps.

Stars129

Forks69

Last commit1 month ago

TensorFlow Lite Examples - AndroidKotlin

A collection of refactored, high-quality Android examples demonstrating TensorFlow Lite for on-device machine learning tasks.

#android#model-deployment#minst

A deep learning system for automatic spoken language identification from audio files using TensorFlow and Caffe.

#spark#deep-learning#neural-networks

Stars90

Forks24

Last commit7 years ago

hsmmlearnJupyter Notebook

A Python library for unsupervised learning of hidden semi-Markov models with explicit durations.

#python-library#speech-recognition#hidden-semi-markov-models

Stars89

Forks25

max-speech-to-text-converterPython

A Docker-based speech recognition model that converts short English WAV audio files into text using Mozilla's DeepSpeech.

#codait#ibm#ai

Stars78

Forks32

Last commit10 months ago

Speech to TextC

A speech-to-text module for Godot 3 that captures microphone input and converts it to text for game development.

#module#voice-input#speech-recognition

Stars34

Forks2

#concurrent-programming#speech-recognition#text-to-speech

att_speechRuby

A Ruby library for consuming the AT&T Speech API to convert speech to text and text to speech.

Stars20

Forks4

Last commit12 years ago

Speech RecognitionJava

A Capacitor plugin providing natural, low-latency speech recognition for iOS and Android apps with streaming results and permission helpers.

#ios#capacitor-plugin#on-device-ai

Stars18

Forks5

Last commit13 days ago

ex_azure_speechElixir

An unofficial Elixir SDK for Microsoft Azure Speech Service, providing speech-to-text and text-to-speech capabilities.

#elixir#azure#pronunciation-assessment

Stars4

Forks0