An open-source AI engine that runs LLMs, vision, voice, and image/video models on any hardware with drop-in OpenAI API compatibility.
A transformer-based text-to-audio model that generates realistic multilingual speech, music, and sound effects.
A modular PyTorch library of state-of-the-art diffusion models for generating images, audio, and 3D molecular structures.
A TensorFlow implementation of DeepMind's WaveNet neural network for generating raw audio waveforms.
A free course teaching diffusion model theory and hands-on implementation using Hugging Face's Diffusers library.
A unified web interface for text-to-speech, voice cloning, and audio generation with support for dozens of AI models.
A Python library and CLI tool that interfaces with Google Translate's text-to-speech API to generate MP3 audio from text.
A flow-based generative network for fast, high-quality speech synthesis from mel-spectrograms.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.