Showing 8 of 8 projects
A comprehensive collection of Chinese NLP resources, datasets, tools, and pre-trained models for developers and researchers.
An open-source Python toolkit for speaker diarization with state-of-the-art pretrained models and pipelines.
An end-to-end speech processing toolkit for speech recognition, text-to-speech, translation, enhancement, and more.
Facebook AI Research's automatic speech recognition toolkit for end-to-end ASR with modern neural architectures.
An audio library for PyTorch providing data manipulation, transformations, and dataset loaders for machine learning applications.
Python library and CLI tool to interface with Google Translate's text-to-speech API for generating MP3 audio from text.
A curated list of Python software and packages for scientific audio and music research.
A Python library and CLI tool for converting text to phonetic transcriptions (phones) across multiple languages using various backends.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.