Text To Speech

#ai#speech-recognition#text-to-speech

readestTypeScript

An open-source, cross-platform ebook reader with multi-format support, annotations, sync, and accessibility features.

#pdf-reader#tts#nextjs

🧠 Leon is your open-source personal assistant.

Stars17.4k

Forks1.5k

Last commit1 day ago

TorToiSeJupyter Notebook

A multi-voice text-to-speech system that produces highly realistic prosody and intonation using autoregressive and diffusion decoders.

#autoregressive-models#ai-voice#deep-learning

Stars14.9k

Forks2.0k

#youdao#productivity#baidu

EasydictSwift

A concise and elegant macOS dictionary and translation app with OCR, supporting 20+ services including Apple Dictionary, OpenAI, and DeepL.

Stars13.9k

Forks699

Last commit5 days ago

sherpa-onnxC++

A next-generation Kaldi-based toolkit for offline speech-to-text, text-to-speech, and audio processing across 12 languages and diverse hardware.

#ios#embedded-systems#aarch64

Stars13.7k

Forks1.6k

#ios#on-device-ai#android

RunAnywhereC++

SDKs for adding private, on-device AI features like LLM chat, speech-to-text, and text-to-speech to mobile and web apps.

An end-to-end speech processing toolkit for speech recognition, text-to-speech, translation, enhancement, and more.

#chainer#end-to-end#deep-learning

Stars9.9k

Forks2.4k

Last commit3 days ago

WavenetPython

A TensorFlow implementation of DeepMind's WaveNet neural network for generating raw audio waveforms.

#neural-network#deep-learning#generative-model

Stars5.4k

Forks1.3k

Last commit3 years ago

TTS WebUITypeScript

A unified web interface for text-to-speech, voice cloning, and audio generation with support for dozens of AI models.

#music#generator#gradio

Stars3.2k

Forks324

Last commit17 days ago

FolioReaderKitSwift

A Swift ePub reader and parser framework for iOS with rich customization and accessibility features.

#highlight#ios#custom-fonts

Stars2.7k

Forks778

Last commit5 years ago

gttsPython

Python library and CLI tool to interface with Google Translate's text-to-speech API for generating MP3 audio from text.

#pypi#tts#python-library

Stars2.6k

Forks386

Last commit3 months ago

FluidAudioSwift

A Swift SDK for fully local, low-latency audio AI on Apple devices, including transcription, text-to-speech, voice activity detection, and speaker diarization.

#ios#apple-neural-engine#speaker-embedding

Stars2.5k

Forks352

Chat with GPTTypeScript

An open-source ChatGPT app with realistic voice capabilities using ElevenLabs text-to-speech.

#chat#gpt-3#chatgpt-api

Stars2.4k

Forks475

#flow-based-models#deep-learning#nvidia-gpu

waveglowPython

A flow-based generative network for fast, high-quality speech synthesis from mel-spectrograms.

Stars2.3k

Forks534

#desktop-application#qt#open-source

Crow TranslateC++

A lightweight desktop translator that translates and speaks text using multiple online translation APIs.

Stars2.0k

Forks176

Talk-to-ChatGPTJavaScript

A Chrome/Edge extension that enables voice conversations with ChatGPT using speech recognition and text-to-speech.

#browser-extension#speech-recognition#text-to-speech

Stars1.9k

Forks317

#pcm5102a#embedded-audio#esp32

ESP32-audioI2SC

An Arduino library for ESP32 multi-core chips to play audio files and streams from SD card or network via I2S to external DACs/amplifiers.

Stars1.7k

Forks415

#computational-linguistics#python-library#ipa

PhonemizerPython

A Python library and CLI tool for converting text to phonetic transcriptions (phones) across multiple languages using various backends.

Stars1.6k

Forks200

#hacktoberfest#ai-api#language-translation

Speech NoteC++

A Linux desktop app for offline note-taking, reading, and translation using speech-to-text, text-to-speech, and machine translation.

#tts#stt#offline-app

Stars1.5k

Forks69

Last commit5 days ago

node-sdkTypeScript

Node.js client library for accessing IBM Watson AI services like Assistant, Speech-to-Text, and Natural Language Understanding.

Stars1.5k

Forks650

Last commit5 days ago

python-sdkPython

A Python client library for interacting with IBM Watson AI services, available via pip as ibm-watson.

#hacktoberfest#cloud-services#authentication

Stars1.4k

Forks818

Last commit5 months ago

VoiceMode MCPPython

Enables natural two-way voice conversations with Claude Code and other MCP agents, perfect for hands-free coding assistance.

#tts#developer-tools#claudecode

Stars1.3k

Forks182

#hacktoberfest#language-translation#ai-services

swift-sdkSwift

Swift SDK for integrating IBM Watson AI services like speech, language, and assistant into iOS and Linux applications.

Stars870

Forks213

chatgpt-conversationPython

A voice-based conversation interface for ChatGPT that allows users to speak and receive spoken responses.

#conversational-ai#real-time-communication#text-to-speech

A fast and stable translation plugin for PowerToys Run, enabling quick text and clipboard translation with multi-platform support.

#cmdpal#productivity#clipboard-translation

Stars581

Forks36

Last commit10 days ago

ashbuilds / payload-aiTypeScript

An AI plugin for Payload CMS that adds content generation, translation, proofreading, and image/voice creation to your content workflow.

#ai#ai-writing#ai-translate

Stars526

Forks63