Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.


Dataset

88 projects

Showing 16 of 88 projects

Brno Urban Dataset

A multi-sensor dataset for autonomous vehicle and robot navigation, featuring synchronized camera, LiDAR, IMU, and GNSS data collected in urban environments.

Tags: #lidar #robotics #multi-sensor-fusion
Stars: 164
Forks: 16
Last commit: 4 years ago

CWFID

A public dataset of field images with segmentation masks and plant type annotations for computer vision in precision agriculture.

Tags: #weed #crop-monitoring #precision-agriculture
Stars: 159
Forks: 49
Last commit: 11 years ago

networkdata (R)

An R package providing 2,260 network datasets in igraph format from diverse sources like social networks, animal interactions, and movie co-stars.

Tags: #igraph #r-package #data-science
Stars: 146
Forks: 16
Last commit: 1 month ago

SymJAX (Python)

A symbolic programming library built on JAX for concise, explicit, and optimized machine learning computations.

Tags: #jax #deep-learning #signal-processing
Stars: 131
Forks: 5
Last commit: 3 years ago

dnddata (R)

A weekly updated dataset of Dungeons & Dragons characters submitted to character sheet web applications, with over 7,900 entries and standardized fields.

Tags: #tsv-data #5e #ogan-dnd
Stars: 122
Forks: 20
Last commit: 3 years ago

GitHub repository (Python)

A Python devkit for working with the Boreas and Boreas Road Trip all-weather autonomous driving datasets.

Tags: #lidar #robotics #autonomous-driving
Stars: 119
Forks: 14
Last commit: 1 month ago

Box-score data (HTML)

A dataset of NBA game summaries aligned with box- and line-scores for data-to-text generation research.

Tags: #nlp-research #data-to-text #nba
Stars: 115
Forks: 25
Last commit: 4 years ago

Guidelines (Python)

Replication package and dataset for a research paper on software architecture practices in ROS-based robotic systems.

Tags: #robotics #software-architecture #robotics-programming
Stars: 104
Forks: 20
Last commit: 10 months ago

holicity (Python)

A city-scale dataset and platform for learning holistic 3D structures from panoramic and perspective imagery with detailed annotations.

Tags: #deep-learning #geolocation #3d-reconstruction
Stars: 94
Forks: 8
Last commit: 4 years ago

GraphQuestions (ReScript)

A characteristic-rich dataset for factoid question answering with explicit question specifications to enable fine-grained QA system evaluation.

Tags: #nlp-research #question-answering #natural-language-processing
Stars: 94
Forks: 14
Last commit: 3 years ago

BODMAS (Python)

An open dataset for learning-based temporal analysis of PE malware, containing over 130,000 Windows PE files with feature vectors and metadata.

Tags: #pe-malware #malware-dataset #feature-vectors
Stars: 93
Forks: 17
Last commit: 2 years ago

Packware (Python)

A research project investigating how packers affect the accuracy of static machine-learning malware classifiers.

Tags: #adversarial-robustness #cybersecurity-research #reproducible-research
Stars: 90
Forks: 18
Last commit: 2 years ago

nebula (Rust)

A package manager for machine learning datasets and models with a CLI and self-hostable registry.

Tags: #dataset-management #tonic #nebula
Stars: 85
Forks: 5
Last commit: 1 year ago

TxQuery (Pascal)

A Delphi component that enables SQL queries on TDataSet descendants with its own parser and engine, no DLL required.

Tags: #tdataset #sql-engine #clientdataset
Stars: 71
Forks: 27
Last commit: 26 days ago

WebNLG (Python)

An enriched dataset for Natural Language Generation research, providing intermediate representations for pipeline tasks like lexicalization and aggregation.

Tags: #pipeline-architecture #nlp-research #data-to-text
Stars: 70
Forks: 22
Last commit: 5 years ago

bravefrontier_data

A repository of extracted game data for Brave Frontier, including units, items, skills, and missions across Global, JP, and EU servers.

Tags: #server-data #gaming-tools #json-database
Stars: 65
Forks: 31
Last commit: 4 years ago

Page 3 of 3

Related Tags

#Computer Vision (39)
#Machine Learning (34)
#Deep Learning (22)
#Lidar (12)
#Robotics (11)
#Autonomous Driving (11)
#Open Data (9)
#Awesome (9)
#Awesome List (8)
#Benchmark (8)
#Pytorch (8)
#Object Detection (7)