Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. ArcGIS Developer
  3. gis-tools-for-hadoop

gis-tools-for-hadoop

Apache-2.0v2.0

A collection of GIS tools for spatial analysis of big data using Hadoop, integrating with ArcGIS Geoprocessing.

Visit WebsiteGitHubGitHub
522 stars251 forks0 contributors

What is gis-tools-for-hadoop?

GIS Tools for Hadoop is a collection of tools and resources for performing spatial analysis on big data within Hadoop ecosystems. It provides integration between Hadoop's distributed processing capabilities and geographic information systems, enabling organizations to analyze massive spatial datasets. The project includes sample tools, Java libraries, and ArcGIS Geoprocessing integration for comprehensive spatial data processing.

Target Audience

GIS professionals, data engineers, and analysts working with large spatial datasets who need to perform spatial analysis within Hadoop environments. Organizations using ArcGIS who want to extend their spatial analysis capabilities to big data platforms.

Value Proposition

Developers choose GIS Tools for Hadoop because it provides official Esri-supported integration between Hadoop and ArcGIS, offering both ready-to-use tools and foundational libraries for custom development. It bridges the gap between traditional GIS workflows and modern big data processing environments.

Overview

The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data.

Use Cases

Best For

  • Analyzing large-scale spatial datasets like taxi trip data or sensor networks
  • Integrating Hadoop-based big data processing with ArcGIS workflows
  • Building custom spatial analysis tools for Hadoop ecosystems
  • Performing spatial aggregations and calculations on massive datasets
  • Extending existing GIS infrastructure to handle big data workloads
  • Developing Hive queries with spatial functions for geographic analysis

Not Ideal For

  • Projects requiring real-time or streaming spatial analysis, as Hadoop is batch-oriented and this toolkit focuses on batch processing.
  • Organizations using open-source GIS platforms like QGIS without ArcGIS licenses, due to tight integration with Esri's proprietary software.
  • Teams with big data stacks based on non-Hadoop technologies like Apache Spark, as the tools are specifically designed for Hadoop ecosystems.
  • Simple spatial analyses that don't justify the overhead of distributed computing, as the setup requires a Hadoop cluster and ArcGIS.

Pros & Cons

Pros

ArcGIS Geoprocessing Integration

Provides direct access to Hadoop systems from the ArcGIS Geoprocessing environment, enabling seamless workflow integration for users already in the Esri ecosystem, as highlighted in the README.

Comprehensive Sample Tools

Includes full-stack implementations that demonstrate solving GIS problems with Hadoop, offering practical examples and reducing development time for common spatial analysis tasks.

Flexible Data Format Support

Handles multiple spatial data formats like Esri JSON and GeoJSON through specialized InputFormats, making it easier to work with diverse datasets without extensive preprocessing.

Custom Development Resources

Provides Java geometry libraries and Hive spatial user-defined functions, allowing developers to build tailored spatial solutions beyond the out-of-box tools.

Cons

Proprietary Software Dependencies

Requires ArcGIS for Desktop or Server, which adds licensing costs and vendor lock-in, limiting adoption for teams preferring open-source or cross-platform GIS solutions.

Complex Setup Requirements

Demands access to a Hadoop cluster and additional components like Hive or Oozie, making initial configuration and deployment challenging, especially for smaller teams or projects.

Potential Outdatedness

The project's last major updates were in 2019, raising concerns about compatibility with newer Hadoop versions and reduced ongoing maintenance, as noted in the copyright and update logs.

Frequently Asked Questions

Quick Stats

Stars522
Forks251
Contributors0
Open Issues21
Last commit4 years ago
CreatedSince 2013

Tags

#arcgis#geospatial#apache-hive#java#gis#big-data#data-processing#spatial-analysis#hadoop

Built With

J
Java
A
Apache Hadoop

Links & Resources

Website

Included in

ArcGIS Developer314
Auto-fetched 4 hours ago

Related Projects

Turf.jsTurf.js

A modular geospatial engine written in JavaScript and TypeScript

Stars10,360
Forks1,004
Last commit2 days ago
spatial-framework-for-hadoopspatial-framework-for-hadoop

The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis.

Stars376
Forks158
Last commit2 months ago
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub