Showing 12 of 12 projects
Dolt is a version-controlled SQL database that supports Git-like operations such as fork, clone, branch, merge, push, and pull.
Deep Lake is a multimodal data lake and vector store optimized for AI, enabling scalable data management, retrieval, and training for LLM and deep learning applications.
An open-source storage framework that enables building a Lakehouse architecture with ACID transactions and scalable metadata handling.
An open-source MLOps/LLMOps suite for experiment management, data management, pipelines, orchestration, scheduling, and model serving.
An open-source tool that transforms object storage into a Git-like repository for versioned, atomic, and repeatable data lake operations.
An embeddable C++ storage engine for dense and sparse multi-dimensional arrays, dataframes, and key-value stores.
A transactional catalog for data lakes with Git-like semantics, enabling version control and branching for data assets.
A CLI tool that applies Git-like version control to cloud storage, enabling distributed, decentralized, and deduplicated data repositories.
A Python library that automates the tedious parts of exploratory data analysis with cleaning, feature engineering, visualization, and versioning.
Automatic and reliable PostgreSQL data change tracking using Write-Ahead Log and Change Data Capture.
A Go framework to simplify CRUD operations for arbitrarily deep structured data using graph concepts.
Automatic data change tracking for Prisma with PostgreSQL, enabling audit trails and time travel querying.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.