Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

Apache Arrow

13 projects

polars (Rust)

An extremely fast query engine for DataFrames, written in Rust, with multi-language frontends.

#out-of-core #apache-arrow #simd
Stars: 38.3k
Forks: 2.8k
Last commit: 1 day ago

Perspective (C++)

An interactive analytics and data visualization component for large and streaming datasets, with a high-performance WebAssembly engine.

#columnar-database #custom-elements #webassembly
Stars: 10.4k
Forks: 1.3k
Last commit: 1 day ago

cudf (C++)

A GPU-accelerated DataFrame library for tabular data processing, part of the RAPIDS data science suite.

#cudf #cuda #apache-arrow
Stars: 9.6k
Forks: 1.0k
Last commit: 1 day ago

datafusion (Rust)

An extensible SQL query engine written in Rust, using Apache Arrow as its in-memory format for building fast database and analytic systems.

#columnar-database #apache-arrow #dataframe
Stars: 8.6k
Forks: 2.1k
Last commit: 1 day ago

vaex (Python)

A high-performance Python DataFrame library for lazy out-of-core processing and visualization of billion-row datasets at interactive speeds.

#out-of-core #python-dataframe #apache-arrow
Stars: 8.5k
Forks: 601
Last commit: 23 days ago

CloudQuery (Go)

Open-source data pipelines for cloud asset inventory, CSPM, FinOps, and vulnerability management across AWS, Azure, GCP, and 70+ sources.

#sql-queryable #multi-cloud #apache-arrow
Stars: 6.4k
Forks: 546
Last commit: 1 day ago

CloudQuery (Go)

Open-source data pipelines to sync cloud infrastructure metadata from AWS, Azure, GCP, and 70+ sources into your data warehouse.

#sql-queryable #multi-cloud #apache-arrow
Stars: 6.4k
Forks: 546
Last commit: 1 day ago

Fury (Java)

A high-performance multi-language serialization framework using JIT compilation and zero-copy techniques for fast data exchange.

#multi-language #fast #apache-arrow
Stars: 4.3k
Forks: 409
Last commit: 2 days ago

aws-sdk-pandas (Python)

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow #data-science #glue-catalog
Stars: 4.1k
Forks: 725
Last commit: 2 days ago

aws-data-wrangler (Python)

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow #data-science #redshift
Stars: 4.1k
Forks: 725
Last commit: 2 days ago

dora (Rust)

A Rust-based middleware framework for building low-latency, composable, and distributed AI robotic applications using dataflow graphs.

#robotics #ai #apache-arrow
Stars: 3.7k
Forks: 386
Last commit: 2 days ago

feather (JavaScript)

Feather is a binary columnar serialization format for data frames, enabling fast and interoperable data sharing between Python, R, and other languages.

#julia #data-serialization #apache-arrow
Stars: 2.8k
Forks: 166
Last commit: 4 months ago

RAPIDS cuGraph (Cuda)

A collection of GPU-accelerated graph analytics libraries for creating, manipulating, and executing scalable graph algorithms.

#cuda #high-performance-computing #graph
Stars: 2.2k
Forks: 350
Last commit: 1 day ago

Related Tags

#Python (10), #Data Science (5), #Aws (4), #Etl (4), #Dataframe (4), #Pandas (3), #Rust (3), #Arrow (3), #Analytics (3), #Sql (3)