A Ruby gem for managing human-curated datasets stored as TOML in Git repositories.
Glean is a Ruby-based data management tool that helps developers and teams curate, share, and use human-editable datasets. It solves the problem of managing collaborative datasets by storing them as TOML files in Git repositories, enabling version control and easy integration into projects.
Glean is aimed at developers, data curators, and teams working with structured datasets that require collaboration, versioning, and easy access via APIs or seeding into applications.
Developers choose Glean for its simplicity, its human-readable data format (TOML), and its Git integration, which together streamline dataset collaboration and avoid much of the complexity of traditional data management systems.
A data management tool for humans
Uses TOML, described as 'Markdown for data,' making datasets easy to edit and understand directly without complex tools.
Stores datasets in Git repositories, enabling version control, revision history, and collaborative changes via Pull Requests, as described in the project's philosophy.
Provides command-line commands such as search, export, and get for accessing and manipulating datasets from the terminal; the README includes examples of exporting to JSON and YAML.
Includes helpers for Ruby on Rails to seed databases with Glean datasets, simplifying data import via the db/seeds.rb file.
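To make the JSON and YAML export idea concrete, here is a minimal sketch using only Ruby's standard library. It does not use Glean's own API, which is not shown here; the record, its field names, and the helper method names are all hypothetical and chosen purely for illustration.

```ruby
require "json"
require "yaml"

# Hypothetical record; the field names are illustrative and are not
# taken from a real Glean dataset.
SAMPLE_RECORD = {
  "name" => "Canada",
  "code" => "CA",
  "languages" => ["en", "fr"]
}.freeze

# Serialize a record as pretty-printed JSON (hypothetical helper).
def to_json_export(record)
  JSON.pretty_generate(record)
end

# Serialize a record as a YAML document (hypothetical helper).
def to_yaml_export(record)
  record.to_yaml
end

puts to_json_export(SAMPLE_RECORD)
puts to_yaml_export(SAMPLE_RECORD)
```

Both outputs parse back into the same hash, which is what makes a human-edited record straightforward to hand to other tools in either format.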
The README explicitly states it is no longer maintained, meaning no updates, bug fixes, or support for future issues.
Marked as experimental with expected breaking changes until v1.0.0, making it risky for any production or long-term use.
User-defined datasets are listed as 'TODO' in the sources, severely limiting flexibility for custom data management needs.
Primarily integrates with Ruby on Rails, with vague support for other frameworks, potentially alienating developers in diverse tech stacks.