An R package providing 2,260 network datasets in igraph format from diverse sources like social networks, animal interactions, and movie co-stars.
networkdata is an R package that provides a curated collection of 2,260 network datasets in igraph format, sourced from various public repositories. It solves the problem of fragmented and hard-to-access network data by offering a centralized, easy-to-use resource for researchers and analysts.
Network scientists, data analysts, and researchers in fields like sociology, biology, and computer science who need pre-processed network data for analysis, modeling, or teaching in R.
Developers choose networkdata for its extensive, ready-to-use dataset collection in a standardized format, saving time on data sourcing and preprocessing while ensuring compatibility with the igraph ecosystem.
R package containing several network datasets
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Aggregates 2,260 networks from 987 datasets across diverse domains like movies, animal interactions, and covert networks, as listed in the README, saving significant data sourcing time.
All datasets are pre-formatted for igraph, R's leading network analysis library, ensuring seamless compatibility and immediate use without conversion, as highlighted in the description.
Datasets can be listed and loaded directly in R using 'data(package = "networkdata")', streamlining retrieval and integration into analysis workflows.
Encourages users to add datasets via pull requests, fostering an expanding resource, as explicitly stated in the README with 'Feel free to add your own dataset via a pull request'.
The package is not available on CRAN, requiring installation from GitHub via remotes, which can be less stable and more complex for some R users compared to standard CRAN packages.
The README admits that many datasets were automatically assembled and may contain errors, necessitating manual verification for reliable use, especially in academic contexts.
Datasets are exclusively formatted for igraph in R, limiting utility for users in other programming environments or those preferring different network analysis tools.