Audio Processing

222 projects

Showing 36 of 222 projects

A model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks.

#transformer#hacktoberfest#model-training

Stars162.9k

Forks34.0k

Last commit2 hours ago

MediaPipeC++

Cross-platform framework for building customizable on-device machine learning pipelines for live and streaming media.

#media-processing#video-processing#on-device-ml

Stars36.3k

Forks6.1k

Last commit6 days ago

faster-whisperPython

A fast, memory-efficient reimplementation of OpenAI's Whisper speech-to-text model using CTranslate2.

#transformer#ai#python-library

Stars24.5k

Forks2.0k

Last commit8 months ago

WhisperXPython

Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.

#forced-alignment#vad#automatic-speech-recognition

Stars23.2k

Forks2.4k

Last commit10 days ago

BlackHoleC

A modern macOS virtual audio loopback driver for routing audio between applications with zero additional latency.

#virtual-audio#driver#macos-audio

Stars19.4k

Forks830

Last commit20 days ago

AudacityC++

An easy-to-use, multi-track audio editor and recorder for Windows, macOS, GNU/Linux, and other operating systems.

#gpl-licensed#open-source#gplv2

Stars17.4k

Forks2.6k

Last commit14 hours ago

KaldiShell

A comprehensive open-source toolkit for speech recognition research and development.

#cuda#research-toolkit#speaker-id

Stars15.4k

Forks5.4k

Last commit10 months ago

Tone.jsTypeScript

A Web Audio framework for creating interactive music and audio applications in the browser.

#music#audio-scheduling#music-synthesis

Stars14.7k

Forks1.1k

Last commit2 days ago

sherpa-onnxC++

A next-generation Kaldi-based toolkit for offline speech-to-text, text-to-speech, and audio processing across 12 languages and diverse hardware.

#ios#embedded-systems#aarch64

Stars13.7k

Forks1.6k

Last commit6 hours ago

AudioKitSwift

Audio synthesis, processing, and analysis platform for iOS, macOS, and tvOS applications.

#audio-synthesis#music#macos-development

Stars11.4k

Forks1.6k

Last commit6 days ago

Learn FFmpeg libav the Hard WayC

A hands-on tutorial teaching how to use FFmpeg's libav libraries for media processing, from basics to transcoding and transmuxing.

#transcoding#transcode-video#ffmpeg

Stars11.0k

Forks1.0k

Last commit1 month ago

pyannote.audioJupyter Notebook

An open-source Python toolkit for speaker diarization with state-of-the-art pretrained models and pipelines.

#speaker-embedding#python-library#speech-activity-detection

Stars10.3k

Forks1.1k

Last commit3 days ago

fluent-ffmpegJavaScript

A fluent Node.js API for FFmpeg that simplifies complex command-line video and audio processing.

#media-library#ffmpeg#video-processing

Stars8.2k

Forks939

Last commit1 year ago

MiniaudioC

A single-file C audio library for playback, capture, and processing with no external dependencies.

#c-library#embedded-audio#emscripten

Stars7.1k

Forks582

Last commit4 days ago

eqMac2Swift

A system-wide audio equalizer and volume mixer for macOS with free and pro features.

#audio-applications#equalizer#system-audio

Stars6.7k

Forks361

Last commit3 months ago

pyAudioAnalysisPython

A Python library for audio feature extraction, classification, segmentation, and machine learning applications.

#audio-analysis#python-library#signal-processing

Stars6.3k

Forks1.2k

Last commit11 months ago

NAudioC#

A comprehensive .NET audio library for playback, recording, format conversion, MIDI, and audio manipulation.

#midi#audio-library#wasapi

Stars6.2k

Forks1.1k

Last commit3 days ago

whisper-diarizationJupyter Notebook

A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.

#nvidia-nemo#automatic-speech-recognition#asr

Stars5.6k

Forks503

Last commit5 months ago

CinderC++

A peer-reviewed, free, open source C++ library for professional-quality creative coding.

#c-plus-plus-20#visual-programming#cmake

Stars5.5k

Forks966

Last commit4 months ago

PHP FFmpegPHP

An object-oriented PHP library for video and audio manipulation using FFmpeg binaries.

#transcoding#ffmpeg#video-processing

Stars5.0k

Forks896

Last commit6 months ago

EZAudioObjective-C

A simple, intuitive audio visualization and processing framework for iOS and macOS built on Core Audio.

#ios#waveform-plot#audio-framework

Stars5.0k

Forks817

Last commit10 months ago

projectMC++

A cross-platform open-source library for rendering Milkdrop-compatible music visualizations from audio input.

#music#gstreamer-plugin#media-player-integration

Stars4.3k

Forks467

Last commit8 days ago

PocketSphinxC

A lightweight, open-source continuous speech recognition engine for embedded and offline applications.

#c-library#embedded-systems#python-library

Stars4.3k

Forks729

Last commit3 days ago

RecorderjsJavaScript

A JavaScript plugin for recording and exporting audio from Web Audio API nodes as WAV files.

#wav-export#client-side-audio#javascript-plugin

Stars4.2k

Forks1.4k

Last commit2 years ago

ffmpeg.jsJavaScript

FFmpeg compiled to JavaScript via Emscripten for in-browser video/audio processing.

#transcoding#web-worker#webassembly

Stars3.5k

Forks341

Last commit2 years ago

SymphoniaRust

A pure Rust library for demuxing media formats, reading metadata tags, and decoding audio codecs.

#id3v1#flac#media-demuxing

Stars3.3k

Forks221

Last commit1 day ago

whisper-standalone-win

Standalone executables of OpenAI's Whisper and Faster-Whisper for speech-to-text transcription without Python dependencies.

#media-processing#faster-whisper#asr

Stars3.1k

Forks164

Last commit8 months ago

SwiftSwift

A Swift community-driven package for interacting with the OpenAI API and other compatible providers.

#ai#community-driven#ai-integration

Stars2.9k

Forks514

Last commit2 days ago

TorchAudioPython

An audio library for PyTorch providing data manipulation, transformations, and dataset loaders for machine learning applications.

#deep-learning#signal-processing#gpu-acceleration

Stars2.9k

Forks787

Last commit20 hours ago

whisper-timestampedPython

A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.

#subtitle-generation#python-library#deep-learning

Stars2.8k

Forks212

Last commit10 months ago

uncaptchaPython

A proof-of-concept system that defeats Google's audio reCaptcha with 85% accuracy using speech recognition and browser automation.

#web-security#selenium#captcha-bypass

Stars2.8k

Forks327

Last commit8 years ago

OpenAL SoftC++

A cross-platform, LGPL-licensed software implementation of the OpenAL 3D audio API.

#lgpl-licensed#software-rendering#game-audio

Stars2.7k

Forks638

Last commit10 hours ago

matcheringPython

An open-source tool for audio matching and mastering that makes your track sound like a reference song.

#sound#music#comfyui

Stars2.6k

Forks282

Last commit15 days ago

Music

A curated collection of awesome music libraries, tools, frameworks, and software across audio, notation, programming, and theory.

#music-technology#midi#music-programming

Stars2.5k

Forks166

Last commit1 month ago

StreamingKitObjective-C

A fast, extensible gapless audio player and streamer for iOS and macOS with low CPU usage.

#coreaudio#ios#objective-c

Stars2.4k

Forks525

Last commit4 years ago

dopplerJavaScript

A web implementation of the SoundWave paper that detects motion using the Doppler effect with microphone and speakers.

#getusermedia#touchless-interaction#motion-detection

Stars2.4k

Forks176

Last commit9 years ago

Page 1 of 7Next

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub

Audio Processing

Related Tags

Found a gem we're missing?

Audio Processing

Related Tags

Found a gem we're missing?