Showing 3 of 3 projects
A deep learning toolkit for Text-to-Speech generation with pretrained models in over 1100 languages and tools for training.
A multi-voice text-to-speech system that produces highly realistic prosody and intonation using autoregressive and diffusion decoders.
A unified web interface for text-to-speech, voice cloning, and audio generation with support for dozens of AI models.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.