Showing 13 of 13 projects
Open-source data integration platform for building ELT pipelines from APIs, databases, and files to data warehouses, lakes, and lakehouses.
An open-source log collector that unifies logging infrastructure by collecting events from various sources and routing them to multiple destinations.
Open-source customer data infrastructure that collects, validates, and enriches behavioral event data for AI and analytics.
Open-source data pipelines for cloud asset inventory, CSPM, FinOps, and vulnerability management across AWS, Azure, GCP, and 70+ sources.
Open-source data pipelines to sync cloud infrastructure metadata from AWS, Azure, GCP, and 70+ sources into your data warehouse.
An open source iOS framework for creating apps for medical and other research studies.
An open-source Java web crawler that provides a simple interface for multi-threaded web crawling.
A curated collection of publicly accessible JSON datasets across diverse topics like government, finance, climate, and entertainment.
A deprecated tool for collecting, processing, and delivering data from multiple sources with Go and Lua plugin support.
A distributed service for efficiently collecting, aggregating, and moving large amounts of log-like data.
An open-source Python library for low-code data preparation, offering fast EDA, data cleaning, and collection from APIs and databases.
A modular PowerShell framework for enterprise incident response and breach hunting using remote data collection.
A preconfigured web crawler for backing up websites, producing WARC files with a live dashboard and dynamic ignore patterns.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.