An async state management library that simplifies fetching, caching, synchronizing, and updating server state for web applications.
An async state management library for fetching, caching, synchronizing, and updating server state across web frameworks.
An async state management library for fetching, caching, synchronizing, and updating server state across web frameworks.
An open-source framework for building LLM-powered applications with data ingestion, indexing, and retrieval capabilities.
A JavaScript library for reading, writing, and processing spreadsheet data across Excel, CSV, and other formats.
A Python library that enables conversational data analysis on SQL, CSV, and Parquet files using LLMs and RAG.
Open-source data integration platform for building ELT pipelines from APIs, databases, and files to data warehouses, lakes, and lakehouses.
An elegant and simple Python library for fetching financial data from various sources, designed for quantitative research.
Generate massive amounts of fake (but realistic) data for testing and development in Node.js and browsers.
A curated list of awesome big data frameworks, resources, and tools across various categories.
A curated list of awesome big data frameworks, resources, and tools across various categories.
A simple fake data generator for C#, F#, and VB.NET, ported from faker.js, to load databases and apps with realistic test data.
Open-source customer data infrastructure that collects, validates, and enriches behavioral event data for AI and analytics.
An ultra-performant data transformation framework for AI, with incremental processing and data lineage built-in.
Machine-readable browser compatibility data for Web APIs, CSS, JavaScript, HTML, and other web technologies.
A comprehensive Go library for generating realistic fake data across 300+ categories with zero dependencies.
An end-to-end framework for building custom AI applications and agents directly integrated with databases.
A distributed stream processing engine in Rust that performs stateful computations on real-time data with sub-second results.
A fast Python library for generating fake data in multiple languages with extensible providers and schema-based generation.
A curated collection of publicly accessible JSON datasets across diverse topics like government, finance, climate, and entertainment.
A curated list of awesome JSON datasets that don't require authentication.
Generate massive amounts of fake data in the browser and Node.js with tree-shakable, fully-typed functions.
A comprehensive, dependency-free statistics library for Go with extensive mathematical functions and thorough testing.
A language and runtime that optimizes performance of data-intensive applications by lazily building and optimizing computations across libraries.
An open-source, AI-first data notebook that extends Jupyter with a sleek UI, reactive execution, and native data integrations.
A Scala API for Apache Beam and Google Cloud Dataflow, enabling unified batch and streaming data processing.
A declarative code-first data integration engine that unlocks 600+ APIs and databases, eliminating the need to write and maintain custom API integrations.
A distributed data integration framework for big data ecosystems, handling ingestion, replication, organization, and lifecycle management for both streaming and batch data.
A distributed data integration framework for big data ecosystems, handling ingestion, replication, organization, and lifecycle management for both streaming and batch data.
A masterless, cloud-scale, fault-tolerant distributed computation system for batch and stream processing written in Clojure.
A flexible and fast package for in-memory tabular data manipulation and analysis in the Julia programming language.
A Ruby framework for writing reliable, concise, and maintainable ETL (Extract-Transform-Load) data processing jobs.
A JavaScript library providing a keyword-to-emoji mapping for making emoji searchable.
A terminal-based tool to interactively scan raw disk partitions and recover deleted or overwritten files by searching for byte patterns.
A fast, modern, cookie-free analytics tool that can be self-hosted or used via cloud, providing AI-powered dashboards with setup in under 30 seconds.
A Rust procedural macro for creating newtypes with built-in sanitization and validation guarantees.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.