A curated list of satellite and aerial imagery datasets with annotations for computer vision and deep learning tasks.
Awesome Satellite Imagery Datasets is a curated, categorized list of publicly available datasets for training computer vision and deep learning models on satellite and aerial imagery. It addresses the problem of fragmented and hard-to-discover geospatial training data by providing a single, structured resource with detailed metadata for each dataset.
Researchers, data scientists, and machine learning engineers working on geospatial computer vision projects, such as object detection in satellite imagery, land cover classification, or disaster damage assessment.
It saves significant time in dataset discovery and evaluation by aggregating and organizing datasets from various sources (academia, competitions, government) with consistent, practical metadata, enabling faster prototyping and benchmarking in geospatial AI.
🛰️ List of satellite image training datasets with annotations for computer vision and deep learning
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Datasets are categorized by computer vision tasks like instance segmentation and object detection, as shown in the README sections, making it easy to find relevant data for specific ML problems.
Each entry includes key statistics, sensor details, and links to papers, providing essential context to assess dataset suitability, such as image counts and resolutions for models.
Covers diverse use cases from disaster assessment to agricultural monitoring, evidenced by datasets like FloodNet and PASTIS listed in the categories.
Part of the 'awesome' list ecosystem with community contributions, ensuring vetted and practical entries for researchers, as indicated by the structured updates and pointers.
The README explicitly states the list is archived with pointers to newer resources, meaning it doesn't include datasets published after its last update, reducing current relevance.
It only indexes datasets; users must navigate external sources for downloads, which can involve complex registration or lack of straightforward access, adding overhead.
While metadata is detailed, licensing terms for commercial use are often missing, requiring additional research on sources like iSAID's academic-only restrictions.