A drop-in replacement for the MNIST dataset, featuring 70,000 Zalando fashion article images for benchmarking machine learning algorithms.
A comprehensive resource of deep learning techniques and models for analyzing satellite and aerial imagery.
A public dataset of 50 million vector drawings across 345 categories, captured from the Quick, Draw! game.
Machine-readable browser compatibility data for Web APIs, CSS, JavaScript, HTML, and other web technologies.
A curated collection of academic papers covering all aspects of face analysis, including detection, recognition, alignment, generation, and anti-spoofing.
A large-scale dataset of images with object segmentation, bounding boxes, and visual relationship annotations.
A curated collection of publicly accessible JSON datasets across diverse topics like government, finance, climate, and entertainment.
A curated list of awesome JSON datasets that don't require authentication.
A high-performance string library leveraging SIMD and SWAR to accelerate search, hashing, sorting, and edit distances across C, C++, Python, Rust, and more.
A collection of autonomous driving datasets and evaluation code for advancing machine perception and self-driving research.
A curated list of awesome open-source OCR software, libraries, datasets, and literature.
A meticulously curated collection of 31,898 unique color names for design systems, creative tools, and data visualization.
A large collection of real-world system log datasets for AI-driven log analytics research.
A collection of 200 Windows EVTX event log samples mapped to MITRE ATT&CK techniques for detection testing and threat hunting.
A large-scale dataset of object-centric video clips with 3D bounding box annotations and AR metadata for 3D object detection research.
A curated list of datasets, tools, methods, review papers, and competitions for remote sensing change detection.
A web/desktop application for collaborative labeling and annotation of images, text, audio, documents, and other data types.
A curated collection of datasets for Simultaneous Localization and Mapping (SLAM) research, categorized by topic, platform, and environment.
A script that generates question/answer pairs from CNN and Daily Mail articles for machine reading comprehension research.
A curated list of awesome LIDAR sensors, datasets, libraries, algorithms, frameworks, and simulators for robotics and autonomous driving.
A curated list of awesome LIDAR sensors, datasets, libraries, algorithms, and simulators for robotics and autonomous driving.
An open dataset and toolkit for training static PE malware machine learning models, featuring extracted features from millions of Windows executable files.
An open dataset and toolkit for training static PE malware machine learning models, featuring millions of labeled Windows executable samples.
A cleaned and normalized time series dataset of global COVID-19 confirmed cases, deaths, and recoveries, updated daily.
An open-source library providing chest X-ray datasets, pre-trained models, and tools for medical imaging research and analysis.
A curated collection of datasets, APIs, and tools for applying artificial intelligence and data mining to video games.
A framework for semantic and instance segmentation of LiDAR point clouds using range images, designed for autonomous driving applications.
A modular framework for Torch providing abstractions for datasets, engines, meters, and logs to encourage code re-use.
C# and F# language binding and extensions for Apache Spark, enabling .NET developers to write Spark driver programs and data processing operations.
A Python API for the Argoverse dataset, providing tools for 3D tracking, motion forecasting, and HD map interaction for autonomous vehicle research.
A curated list of face-related algorithms, datasets, papers, and open-source libraries for computer vision research.
Generate datasets for AI chatbots, NLP tasks, NER, and text classification using a simple domain-specific language.
A visible-infrared paired dataset for low-light vision tasks like pedestrian detection, image fusion, and image-to-image translation.
A curated list of research papers and resources for scene understanding in computer vision, covering 3D reconstruction, layout estimation, and primitive detection.
A biomedical knowledge graph integrating 20 resources to describe 17,080 diseases with over 4 million relationships across ten biological scales.
A benchmark and toolkit for discovering, detecting, recognizing, and tracking UAVs in the wild using RGB and thermal infrared video.