A client layer that enables HBase to function as a TinkerPop 3-compliant graph database.
HGraphDB is a client layer that implements the Apache TinkerPop 3 interfaces, enabling HBase to function as a fully-featured graph database. It solves the problem of storing and querying graph-structured data at scale by leveraging HBase's distributed storage capabilities while providing a standard graph API.
Developers and data engineers working with large-scale graph data who need a scalable, distributed graph database solution integrated with the Hadoop ecosystem.
Developers choose HGraphDB because it combines the scalability of HBase with the rich graph traversal and analytics capabilities of the TinkerPop stack, all while supporting integration with popular big data processing frameworks like Spark and Flink.
HBase as a TinkerPop Graph Database
Provides full Apache TinkerPop 3 API compatibility, allowing seamless use of tools like the Gremlin console and integration with the broader graph ecosystem without vendor lock-in.
Leverages HBase's distributed architecture to handle petabyte-scale graph data, making it ideal for big data applications requiring horizontal scaling and fault tolerance.
Supports range queries, unique constraints, and keyset pagination on indexed results, enabling efficient data retrieval and traversal for large datasets, as detailed in the indexing and pagination sections.
Offers built-in integration with Apache Giraph, Spark GraphFrames, and Flink Gelly for distributed graph processing, as evidenced by dedicated blog posts and code examples in the README.
Creating or dropping indices after data population requires running separate MapReduce jobs via the hbase command-line, adding significant operational complexity and potential downtime.
As admitted in the README, counters cannot be indexed, limiting query efficiency for counter-based properties and forcing workarounds for aggregations.
Relies entirely on HBase's infrastructure and tuning, which introduces steep learning curves and resource overhead for teams not already invested in the Hadoop ecosystem.
A distributed, fast open-source graph database featuring horizontal scalability and high availability
JanusGraph: an open-source, distributed graph database
A graph database that supports more than 100+ billion data, high performance and scalability (Include OLTP Engine & REST-API & Backends)
Apache TinkerPop - a graph computing framework
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.