A PostgreSQL extension for high-performance time-series aggregation that stores only aggregate results, not raw data.
PipelineDB is a PostgreSQL extension for high-performance time-series aggregation, designed to power realtime reporting and analytics applications. It allows users to define continuous SQL queries that perpetually aggregate streaming data, storing only the aggregate output in regular tables while raw data is never written to disk.
Developers and data engineers building real-time analytics applications, especially those needing efficient time-series data processing within a PostgreSQL environment.
It offers extremely high-throughput, incrementally updated materialized views that never need manual refreshing, making it highly efficient for aggregation workloads by avoiding raw data storage.
High-performance time-series aggregation for PostgreSQL
Continuous queries update materialized views in real-time without manual REFRESH commands, as shown in the example where INSERTs immediately aggregate data into test_view.
Raw data is never written to disk, minimizing storage overhead for high-volume time-series workloads and focusing only on aggregated results, as emphasized in the README.
It runs as a PostgreSQL extension, allowing use of standard SQL tools like psql and compatibility with existing ecosystems, demonstrated in the streaming and querying examples.
Output streams enable chaining continuous queries into networks for complex analytics pipelines, supported by features like continuous transforms as documented.
The project is in maintenance mode with no new releases beyond 1.0.0, only critical bug fixes, reducing its suitability for long-term or evolving projects.
Only supports PostgreSQL versions 10 and 11, which are outdated, limiting adoption with newer database versions and potentially causing upgrade challenges.
Requires building from source with dependencies like ZeroMQ and PostgreSQL dev packages, making installation more involved compared to standard extensions, as outlined in the README.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Incremental engine for long horizon agents 🌟 Star if you like it!
⚡ Fastest SQL ETL pipeline in a single C++ binary, built for stream processing, observability, analytics and AI/ML
Python Stream Processing. A Faust fork
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.