Showing 27 of 63 projects
A suite of extremely fast and reliable parsers for Java with a consistent interface for multiple file formats.
A simple, fast, and flexible ETL framework for .NET with built-in readers and writers for CSV, JSON, XML, Parquet, and more.
A data API framework that turns SQL into secure RESTful APIs for AI agents and data applications.
A biomedical knowledge graph integrating 20 resources to describe 17,080 diseases with over 4 million relationships across ten biological scales.
Fast, sensitive, and accurate integration of single-cell RNA-seq data across multiple datasets, batches, or experimental conditions.
A Delphi and Lazarus library for consuming REST services with support for multiple HTTP engines and adapters.
A library enabling Apache Spark to read from and write to Apache HBase tables as external data sources using DataFrames and SQL.
A curated list of awesome system integration software, patterns, and resources.
A deep learning framework for integrating single-cell multi-omics data using graph-linked unified embeddings.
A Python package for benchmarking and evaluating single-cell genomics data integration methods.
A factor analysis framework for unsupervised integration of multi-omics datasets.
An open-source hybrid integration platform for connecting applications, data, and systems with enterprise-grade features.
A web-based platform for data analysis and visualization with support for multiple data sources and interactive dashboards.
An integrative hetnet (heterogeneous network) encoding biomedical knowledge for drug repurposing and discovery.
A Spark library for reading and writing data between Spark SQL and MongoDB collections.
A GraphAware Framework module for bi-directional integration between Neo4j and Elasticsearch, enabling asynchronous data replication and search enrichment.
A collection of connectors enabling Apache HBase integration with Kafka, Spark, and other data processing systems.
A thin integration layer connecting Apache Spark with various NoSQL datastores and JDBC databases.
A Go-based toolset for data extraction, transformation, and loading, providing powerful data synchronization capabilities.
A curated list of awesome HBase projects, clients, frameworks, tools, and resources.
A distributed framework extending Apache Spark with unified SQL access to multiple datastores, optimized connectors, and streaming support.
An Elasticsearch plugin that integrates with Neo4j to personalize search results using graph data.
A Java utility for loading on-premises data into Salesforce Einstein Analytics datasets with autoloading, dataflow control, and dataset inspection.
A PostgreSQL extension that enables sending messages directly to Apache Kafka from within the database.
A lightweight Python parser for EDI 835 Health Care Claim Payment and Remittance Advice files.
An open-source biomedical knowledge graph for drug discovery, precision medicine, and drug repurposing research.
A simple Go package for interacting with the Airtable API, providing a wrapper for common operations.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.