Speech Processing

17 projects

Showing 17 of 17 projects

A comprehensive collection of Chinese NLP resources, datasets, tools, and pre-trained models for developers and researchers.

#nlp-tools#open-source-resources#text-generation

Stars82.0k

Forks15.2k

Last commit2 years ago

pyannote.audioJupyter Notebook

An open-source Python toolkit for speaker diarization with state-of-the-art pretrained models and pipelines.

#speaker-embedding#python-library#speech-activity-detection

An end-to-end speech processing toolkit for speech recognition, text-to-speech, translation, enhancement, and more.

#chainer#end-to-end#deep-learning

Facebook AI Research's automatic speech recognition toolkit for end-to-end ASR with modern neural architectures.

#asr-toolkit#end-to-end#neural-architectures

Stars6.4k

Forks990

Last commit8 days ago

TorchAudioPython

An audio library for PyTorch providing data manipulation, transformations, and dataset loaders for machine learning applications.

#deep-learning#signal-processing#gpu-acceleration

Python library and CLI tool to interface with Google Translate's text-to-speech API for generating MP3 audio from text.

#pypi#tts#python-library

Stars2.6k

Forks386

Last commit3 months ago

Awesome Python for Scientific Audio

A curated list of Python software and packages for scientific audio and music research.

#audio-analysis#music-information-retrieval#signal-processing

Stars1.7k

Forks185

Last commit1 month ago

PhonemizerPython

A Python library and CLI tool for converting text to phonetic transcriptions (phones) across multiple languages using various backends.

#computational-linguistics#python-library#ipa

Stars1.6k

Forks200

Last commit1 year ago

ParselmouthC++

A Python library that provides direct, Pythonic access to Praat's speech processing algorithms from within Python.

#scientific-computing#audio-analysis#python-library

A flexible Python framework for developing, training, and evaluating conversational AI agents in single or multi-agent environments.

#conversational-ui#chatbots#python-library

Stars981

Forks186

Last commit5 years ago

PyWorldVocoderCython

A Python wrapper for the high-quality WORLD vocoder, enabling speech parameterization and synthesis.

#vocoder#f0-extraction#audio-analysis

Stars790

Forks126

Last commit1 year ago

AutoAgentsRust

A production-grade multi-agent framework in Rust for building, deploying, and coordinating intelligent agents with LLMs.

#ai-agents-framework#ai#agents

Stars713

Forks83

Last commit13 days ago

pystoiMATLAB

Python implementation of the Short-Time Objective Intelligibility (STOI) measure for speech quality assessment.

#audio-analysis#python-library#signal-processing

Stars359

Forks57

Last commit2 years ago

react-native-dialogflowJavaScript

A React Native bridge for integrating Google Dialogflow (API.AI) SDK to build conversational interfaces in mobile apps.

#dialogflow#speech-to-function#speak

Stars205

Forks60

Last commit3 years ago

Awesome Community-Curated NLP List

A community-curated list of NLP tools, libraries, datasets, and resources across speech processing, text analysis, and machine translation.

#community-driven#text-analysis#nlp-tools

Stars202

Forks32

Last commit4 years ago

awesome-danish

A curated list of awesome resources for Danish language technology, including datasets, models, and tools.

#corpora#nlp-tools#natural-language-processing

Stars196

Forks20

Last commit1 year ago

E2E TFLite Tutorials

A community-driven collection of end-to-end tutorials for creating and deploying TensorFlow Lite models on mobile devices.

#community-projects#deep-learning#on-device-ai

Stars135

Forks26

Last commit4 years ago

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub