Showing 15 of 15 projects
A modern, cross-platform shell that treats data as structured tables instead of plain text.
An open-source, event-driven orchestration platform for building reliable scheduled and real-time workflows using declarative YAML.
Open-source data integration platform for building ELT pipelines from APIs, databases, and files to data warehouses, lakes, and lakehouses.
A reactive Python notebook that's reproducible, git-friendly, and deployable as scripts or apps.
A Python library for data quality testing and validation using expressive, extensible Expectations.
A Python framework for creating reproducible, maintainable, and modular data engineering and data science pipelines.
A Go library for writing shell-like scripts with a pipeline API for file reading, subprocess execution, string matching, and more.
A simple, powerful, and extensible CI/CD engine designed for self-hosting.
A fast front-end web application build tool with simple declarative config and seamless incremental compilation.
A Python tool for parameterizing, executing, and analyzing Jupyter Notebooks at scale.
A JavaScript application framework for machine learning and its engineering, designed for Web developers.
A Kubernetes-native, serverless platform for running massively parallel data and streaming jobs with exactly-once semantics.
A cross-platform, dependency-free C++ and Python DAG framework for building parallel computational graphs.
A lightweight and efficient stream processing library for Go, providing a declarative DSL to build data pipelines.
A Go toolkit for building concurrent programs using composable, channel-based pipelines with automatic error propagation.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.