Showing 11 of 11 projects
Dolt is a version-controlled SQL database that supports Git-like operations such as fork, clone, branch, merge, push, and pull.
A unified open-source metadata platform for data discovery, observability, and governance with column-level lineage and team collaboration.
An open-source metadata platform for data discovery, governance, and observability across your entire data and AI stack.
An ultra-performant data transformation framework for AI, with incremental processing and data lineage built-in.
A metadata-driven data discovery and catalog platform that helps data teams find, understand, and trust their data resources.
A lightweight Python library for creating portable, expressive, and testable data transformation DAGs with built-in lineage and metadata.
A Python library for defining portable, modular, and testable data transformation DAGs with built-in lineage and metadata.
An open-source metadata service for collecting, aggregating, and visualizing data lineage and ecosystem metadata.
A Python-powered SQL lineage analysis tool that extracts source and target tables from SQL commands without deep parser knowledge.
An open-source data catalog tool that integrates into CI systems to test downstream impacts of data changes, preventing pipeline and dashboard breaks.
An Apache Spark framework for efficient data processing, extraction, and derivation from web archives and archival collections.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.