Speech Synthesis

#research-tool#python-library#text-to-audio

BarkJupyter Notebook

A transformer-based text-to-audio model that generates realistic multilingual speech, music, and sound effects.

Stars39.2k

Forks4.7k

#ai#speech-recognition#text-to-speech

LeonTypeScript

🧠 Leon is your open-source personal assistant.

Stars17.4k

Forks1.5k

Last commit2 days ago

TorToiSeJupyter Notebook

A multi-voice text-to-speech system that produces highly realistic prosody and intonation using autoregressive and diffusion decoders.

#autoregressive-models#ai-voice#deep-learning

Stars14.9k

Forks2.0k

#chainer#end-to-end#deep-learning

EspNetPython

An end-to-end speech processing toolkit for speech recognition, text-to-speech, translation, enhancement, and more.

Stars9.9k

Forks2.4k

Last commit3 days ago

OpenUtauC#

A free, open-source singing synthesis editor designed as a modern successor to UTAU for creating vocal tracks.

#music#phonemizer#vogen

Stars4.1k

Forks530

Last commit17 days ago

gttsPython

Python library and CLI tool to interface with Google Translate's text-to-speech API for generating MP3 audio from text.

#pypi#tts#python-library

Stars2.6k

Forks386

Last commit3 months ago

Chat with GPTTypeScript

An open-source ChatGPT app with realistic voice capabilities using ElevenLabs text-to-speech.

#chat#gpt-3#chatgpt-api

Stars2.4k

Forks475

Last commit2 years ago

waveglowPython

A flow-based generative network for fast, high-quality speech synthesis from mel-spectrograms.

#flow-based-models#deep-learning#nvidia-gpu

Stars2.3k

Forks534

Last commit2 years ago

PhonemizerPython

A Python library and CLI tool for converting text to phonetic transcriptions (phones) across multiple languages using various backends.

#computational-linguistics#python-library#ipa

Stars1.6k

Forks200

#vocoder#f0-extraction#audio-analysis

PyWorldVocoderCython

A Python wrapper for the high-quality WORLD vocoder, enabling speech parameterization and synthesis.

Stars790

Forks126

#terminal-utility#translations#language-learning

termitRuby

A Ruby gem for translating text and synthesizing speech directly in the terminal using Bing Translator.

Stars506

Forks20

react-native-speechObjective-C

A React Native library for text-to-speech functionality with voice and rate control.

#ios#voice-control#android

Stars302

Forks51

speech-javascript-sdkJavaScript

A JavaScript library for adding IBM Watson Speech to Text and Text to Speech capabilities to web applications.

#browser-sdk#text-to-speech#watson-api

Stars263

Forks131

Last commit5 months ago

normitJavaScript

A Node.js CLI tool for translating text and speech synthesis directly in your terminal.

#developer-tools#npm#translations

Stars242

Forks22

#tts#embedded-systems#esp32

TTSC

An Arduino library for text-to-speech synthesis using PWM or DAC outputs with external amplifier.

Stars216

Forks49

Last commit3 years ago

espeak-rubyRuby

Ruby wrapper for espeak and lame to generate Text-To-Speech MP3 files with customizable voice parameters.

#tts#ruby-wrapper#ruby-gem

Stars197

Forks21

Last commit3 months ago

alexiaJavaScript

A Node.js framework for building Amazon Alexa skills with simplified intent handling and speech asset generation.

#chatbots#serverless#amazon-echo

Stars166

Forks40

Last commit5 years ago

ssml-builderJavaScript

A JavaScript library for building Speech Synthesis Markup Language (SSML) using a clean builder pattern API.

#voice-apps#alexa-skills#text-to-speech

Stars165

Forks37

Last commit5 years ago

Text to speechJava

A Capacitor plugin for synthesizing speech from text in cross-platform mobile apps.

#tts#ios#capacitor-plugin

Stars129

Forks39

Last commit1 month ago

Blazor.SpeechSynthesisC#

A Blazor class library providing Speech Synthesis API access for client-side and server-side Blazor applications.

#webassembly#server-side#nuget-package

Stars98

Forks19

Last commit5 months ago

react-native-android-speechJava

A React Native module for Android Text-to-Speech functionality, providing speech synthesis and language support.

#android#android-wrapper#text-to-speech

Stars48

Forks14

QHSpeechSynthesizerQueueObjective-C

A queue management system for AVSpeechSynthesizer on iOS/macOS to handle sequential and prioritized speech synthesis.

#audio-queue#ios#objective-c

Stars43

Forks7

Last commit8 years ago

max-audio-sample-generatorPython

Generates short audio clips of speech commands and lo-fi instrumental samples using a WaveGAN-based generative adversarial neural network.

#wavegan#rest-api#machine-learning-models

A DIY robot bartender built with Raspberry Pi and Go that mixes drinks using pumps and speech synthesis.

#iot#home-automation#peristaltic-pump

Stars21

micropython-SYN6988Python

Last commit8 years ago

A MicroPython library for interfacing with the YuTone VoiceTX SYN6988 text-to-speech module via UART.

#iot#tts#embedded-systems

Stars19

#amazon-echo#alexa-skills#text-to-speech

Last commit3 years ago

alexa-uncensorJavaScript

Generate SSML fragments to bypass Alexa's text-to-speech censorship for profane words.

Stars18

Forks8

Speech SynthesisJavaScript

A Capacitor plugin for cross-platform text-to-speech synthesis with full control over voice, pitch, rate, and volume.

#capacitor-plugin#text-to-speech#accessibility

Stars7

Forks0

Last commit13 days ago

micropython-samttsPython

A MicroPython port of the classic Software Automatic Mouth (SAM) text-to-speech synthesizer for embedded systems.

#embedded-systems#retro-computing#micropython

Stars7

Forks0

mjyc/cycle-posenet-driverTypeScript

A collection of Cycle.js drivers and components for programming robots, enabling reactive robot applications.

#robotics#cyclejs#functional-programming

Stars6

#elixir#azure#pronunciation-assessment

Last commit5 years ago

ex_azure_speechElixir

An unofficial Elixir SDK for Microsoft Azure Speech Service, providing speech-to-text and text-to-speech capabilities.

Stars4

Forks0

#voice-apps#alexa-skills#fluent-api

fluent-ssmlTypeScript

A TypeScript library for building Alexa SSML templates using a fluent, immutable, and composable API.

Stars2