Showing 6 of 6 projects
A Python library for data quality testing and validation using expressive, extensible Expectations.
An open-source solution for continuous validation of machine learning models and data, from research to production.
A library built on Apache Spark for defining unit tests to measure data quality in large datasets.
A Python API for Deequ, enabling data quality testing and validation on large datasets using Apache Spark.
A Python data validation toolkit that finds data quality issues and generates beautiful, shareable reports for team collaboration.
An open-source data catalog tool that integrates into CI systems to test downstream impacts of data changes, preventing pipeline and dashboard breaks.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.