Showing 14 of 86 projects
A distributed, scalable database built for stream processing applications on Apache Kafka using SQL syntax.
An idiomatic Clojure dataframe library that runs on Apache Spark, providing a seamless interface for data processing and machine learning.
A Go library for declarative JSON-to-JSON transformations using JSON specifications.
A collection of import/export commands for the Neo4j shell to load and dump graph data in various formats.
R client for the Elasticsearch HTTP API, enabling data indexing, search, and analysis from R.
A collection of connectors enabling Apache HBase integration with Kafka, Spark, and other data processing systems.
A mapping language and engine for converting complex, nested data between schemas, with extensibility via plugins.
A Go-based toolkit for fast ETL and feature extraction on Hadoop, optimized for rapid development and execution.
A Go-based toolset for data extraction, transformation, and loading, providing powerful data synchronization capabilities.
A Spark library for reading from and writing to Google BigQuery using DataFrames and SQL.
Operator and codec library for building real-time streaming applications on Apache Apex.
A Java library for enriching, transforming, and filtering JSON documents using configurable pipelines.
An experimental Rust client for Apache Spark Connect, providing a DataFrame API to interact with Spark clusters.
A MongoDB to Neo4j document manager for live one-way synchronization, enabling polyglot persistence by converting documents into a graph structure.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.