Showing 36 of 156 projects
A comprehensive .NET audio library for playing, recording, encoding, decoding, and real-time processing of audio in C#.
A curated list of resources, tools, and applications for OpenAI's Whisper speech recognition system.
A Python library for audio data augmentation to improve the robustness of audio machine learning models.
A self-hosted application that downloads music by fetching track info from Spotify and sourcing audio from YouTube.
A painless, high-performance audio library for iOS and macOS using Audio Units with simple APIs.
A Go library for audio playback and processing with a simple Streamer interface.
A C++ command-line tool that generates waveform data and renders PNG images from MP3, WAV, FLAC, Ogg Vorbis, and Opus audio files.
Go bindings for FFmpeg libraries enabling video/audio manipulation in Go applications.
A cross-platform game development library for C/C++ with multimedia, graphics, and input handling capabilities.
A cross-platform, open-source C library for real-time audio input and output with support for multiple host APIs.
A C++ template library for designing and implementing multichannel IIR filters with various response types and seamless parameter interpolation.
A high-speed, cross-platform game engine built with modern C++17 and Vulkan for graphics.
A high-speed, cross-platform game engine built with modern C++17 and Vulkan for graphics rendering.
A minimalistic, single-header MP3 decoder library focused on small size, speed, and ISO conformance.
A comprehensive audio effects library for the Web Audio API, offering overdrive, delay, reverb, and more.
A Laravel package providing a fluent API to integrate FFmpeg for video/audio processing with Laravel's filesystem.
A JavaScript library that simplifies creating and manipulating sounds with the Web Audio API.
A categorized collection of FFmpeg commands for video automation pipelines, from simple conversions to advanced editing.
A C library for reading and writing sound files containing sampled audio data.
A robust yet lenient forced aligner built on Kaldi for aligning speech audio with text transcripts.
A curated list of Python software and packages for scientific audio and music research.
A curated list of Python software and packages for scientific audio and music research.
A statically typed scripting language and backend for multimedia streaming, file generation, automation, and HTTP services.
A JavaScript library for audio feature extraction, supporting both offline and real-time analysis via the Web Audio API.
A free web application that transcribes and translates audio files using OpenAI's Whisper and Chat APIs.
A C++ time unit for exact representation of common media framerates and audio sample rates using std::chrono.
A modular C++20 toolkit for real-time media, WebRTC, and networking, built as a lightweight alternative to libwebrtc.
A curated collection of resources for audio digital signal processing and plugin development.
A faster, memory-efficient command-line client for OpenAI's Whisper speech recognition, powered by CTranslate2.
A C++20 framework for creative coding, enabling 2D/3D games, media art, visualizers, and simulators across Windows, macOS, Linux, and the Web.
A curated collection of FFmpeg libraries, tools, tutorials, and resources for developers working with audio and video.
A React Native library for recording audio on iOS and Android with configurable encoding and quality.
A simple header-only C++ library for reading and writing WAV and AIFF audio files.
An audio processing toolbox using PyTorch 1D convolutional neural networks for on-the-fly spectrogram generation with trainable kernels.
Node.js sample applications demonstrating IBM Watson Speech to Text service features for converting speech to text.
GPU-accelerated audio preprocessing layers for Keras/TensorFlow, enabling real-time audio feature extraction within neural networks.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.