Showing 36 of 45 projects
A modern, cross-platform shell that treats data as structured tables instead of plain text.
An open-source, event-driven orchestration platform for building reliable scheduled and real-time workflows using declarative YAML.
Open-source data integration platform for building ELT pipelines from APIs, databases, and files to data warehouses, lakes, and lakehouses.
A reactive Python notebook that's reproducible, git-friendly, and deployable as scripts or apps.
A Python library for data quality testing and validation using expressive, extensible Expectations.
A Python framework for creating reproducible, maintainable, and modular data engineering and data science pipelines.
A simple, powerful, and extensible CI/CD engine designed for self-hosting.
A Go library for writing shell-like scripts with a pipeline API for file reading, subprocess execution, string matching, and more.
A fast front-end web application build tool with simple declarative config and seamless incremental compilation.
A Python tool for parameterizing, executing, and analyzing Jupyter Notebooks at scale.
A JavaScript application framework for machine learning and its engineering, designed for Web developers.
A Kubernetes-native, serverless platform for running massively parallel data and streaming jobs with exactly-once semantics.
A cross-platform, dependency-free C++ and Python DAG framework for building parallel computational graphs.
A lightweight and efficient stream processing library for Go, providing a declarative DSL to build data pipelines.
A Go toolkit for building concurrent programs using composable, channel-based pipelines with automatic error propagation.
A full-stack DevOps framework for simplifying microservice deployment on AWS ECS and EKS.
A scalable n:m message multiplexer written in Go for routing messages from multiple sources to multiple destinations.
A Haskell library for streaming data processing with constant memory usage, deterministic resource handling, and easy composition.
A Ruby gem that transforms plain text into HTML using a pipeline of composable filters.
A library to define a continuous delivery pipeline as code in Clojure, enabling custom, self-hosted CI/CD.
A Go implementation of the ReactiveX spec providing a declarative and composable API for handling asynchronous data streams.
An Elixir library for elegant error handling using result monads and result tuples.
A CLI tool for processing JSON and text data with functional pipelines using Ramda, supporting both command-line and interactive browser modes.
A command-line tool for neural network inference using Unix pipeline philosophy.
A header-only C++ utility library that simplifies Vulkan graphics programming by reducing boilerplate and verbosity.
A powerful and flexible mediator implementation for .NET that enables clean architecture by decoupling request/response handling.
Write CI/CD pipelines in C# with local debugging, compile-time safety, and automatic parallelization.
A Rust crate providing generic extension methods for tapping, piping, and converting values in method chains.
An open-source machine learning solution for the Home Credit Default Risk Kaggle competition, providing reproducible code and experiments.
A Rust crate for executing and interacting with external processes and pipelines with deadlock-free communication and flexible I/O redirection.
A concurrent task runner and automation toolkit for developers, offering a modern alternative to GNU Make with human-readable YAML/JSON/TOML configurations.
A pipe-like function to write cleaner, more readable JavaScript code by transforming nested calls into vertical pipelines.
A flow-based application layer framework for Elixir that implements Flow-Based Programming (FBP) to structure business logic.
Run Jenkinsfiles inside GitHub Actions using a single-shot Jenkins master in a Docker container.
A Python library for building lazy data processing and machine learning workflows that handle datasets larger than memory.
A Go library for building data processing workflows and pipelines with functional operations, cycles, and fan-out capabilities.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.