Speech

14 projects

A deep learning toolkit for Text-to-Speech generation with pretrained models in over 1100 languages and tools for training.

#tts #vocoder #glow-tts

Stars: 45.8k

Forks: 6.2k

Last commit: 1 year ago

WhisperX (Python)

Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.

#forced-alignment #vad #automatic-speech-recognition

Stars: 23.2k

Forks: 2.3k

Last commit: 9 days ago

Kaldi (Shell)

A comprehensive open-source toolkit for speech recognition research and development.

#cuda #research-toolkit #speaker-id

Stars: 15.4k

Forks: 5.4k

Last commit: 10 months ago

annyang (TypeScript)

A tiny JavaScript library for adding speech recognition and voice commands to websites.

#web-accessibility #voice-commands #hands-free

Stars: 6.8k

Forks: 1.0k

Last commit: 8 days ago

whisper-diarization (Jupyter Notebook)

A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.

#nvidia-nemo #automatic-speech-recognition #asr

An audio library for PyTorch providing data manipulation, transformations, and dataset loaders for machine learning applications.

#deep-learning #signal-processing #gpu-acceleration

Stars: 2.9k

Forks: 785

Last commit: 1 day ago

whisper-timestamped (Python)

A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.

#subtitle-generation #python-library #deep-learning

A Python library and CLI tool that interfaces with Google Translate's text-to-speech API to generate MP3 audio from text.

#pypi #tts #python-library

Stars: 2.6k

Forks: 386

Last commit: 3 months ago

Sonus (JavaScript)

A Node.js library for adding voice interfaces with offline hotword detection and cloud speech recognition.

#voice-commands #stt #voice-control

Stars: 638

Forks: 76

Last commit: 2 years ago

TTS (C)

An Arduino library for text-to-speech synthesis using PWM or DAC outputs with an external amplifier.

#tts #embedded-systems #esp32

Stars: 216

Forks: 49

Last commit: 3 years ago

react-native-dialogflow (JavaScript)

A React Native bridge for integrating Google Dialogflow (API.AI) SDK to build conversational interfaces in mobile apps.

#dialogflow #speech-to-function #speak

Stars: 205

Forks: 60

Last commit: 3 years ago

audiomate (Python)

A Python library for easy access, management, and processing of audio datasets, particularly for machine learning tasks.

#dataset-creation #music #audio-analysis

Stars: 139

Forks: 25

Last commit: 3 years ago

snowboy2mqtt (JavaScript)

Publishes MQTT messages when specific voice hotwords are detected using the Snowboy engine.

#iot #voice-commands #hotword

Stars: 6

Forks: 0

Last commit: 7 years ago

fluent-ssml (TypeScript)

A TypeScript library for building Alexa SSML templates using a fluent, immutable, and composable API.

#voice-apps #alexa-skills #fluent-api

Stars: 2

Forks: 1

Last commit: 7 years ago
