An open-source tool for PostgreSQL and MySQL database anonymization, synthetic data generation, and logical dumping.
Greenmask is an open-source utility for logical database dumping, anonymization, and synthetic data generation, primarily for PostgreSQL with MySQL support in development. It solves the problem of securing sensitive data in staging, testing, and development environments by providing stateless, compatible backups that can be transformed or subsetted.
Database administrators, DevOps engineers, and developers who need to sanitize production data for non-production use while maintaining referential integrity and compatibility with standard tools.
Developers choose Greenmask for its deterministic transformations, storage-agnostic backups, and extensibility, offering a robust, drop-in replacement for `pg_dump`/`pg_restore` with advanced data masking and subsetting capabilities.
Database anonymization, synthetic data generation and logical dump
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Creates smaller development databases that maintain cyclic and polymorphic references, ensuring data integrity for testing as highlighted in the database subsetting feature.
Supports local directories and S3-compatible services like AWS S3, MinIO, GCS, and Azure, offering flexible backup management without vendor lock-in.
Uses hash functions for reproducible data masking, guaranteeing consistent output for the same input, which is crucial for compliance and testing environments.
Allows custom transformations in any programming language via the CMD transformer or predefined templates, enabling domain-specific data processing.
MySQL functionality is labeled as work-in-progress and beta, making it unreliable for production use with MySQL databases, as admitted in the README.
Setting up advanced transformations and subsetting requires detailed YAML or JSON configurations, which can be steep and error-prone for quick adoption.
Lacks a graphical user interface, which may not suit teams preferring GUI-based tools for database management and anonymization tasks.