A public domain dataset containing metadata for 70,000 artworks and 3,500 artists from the Tate collection.
The Tate Collection dataset is an open metadata repository containing structured information about artworks and artists from the Tate museum's holdings. It provides programmatic access to collection data including artwork details, artist biographies, and acquisition information. The dataset serves as a resource for data analysis, visualization projects, and digital humanities research.
Researchers, data scientists, digital humanities scholars, and developers working with cultural heritage data or creating art-related applications. It's particularly valuable for those exploring data visualization, computational art analysis, or building educational tools around museum collections.
Unlike proprietary museum databases, this dataset is completely open and free under CC0 licensing, allowing unrestricted use and modification. It provides a substantial, real-world cultural dataset that has been used in numerous academic and creative projects, offering both JSON and CSV formats for different technical needs.
Tate Collection metadata
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Includes detailed records for approximately 70,000 artworks with fields like accession numbers, titles, dates, and dimensions, enabling thorough analysis and visualization projects.
Provides contextual information for around 3,500 artists, supporting research on art history and creator backgrounds, as highlighted in the repository structure.
Offers both JSON for hierarchical data and CSV for flattened tables, catering to different technical workflows and ease of use, as described in the README's format section.
Released under CC0, allowing unrestricted use, modification, and distribution without legal barriers, which is rare for museum datasets and encourages creative reuse.
The dataset is a 2014 snapshot with no plans for updates, limiting its relevance for current research or applications requiring contemporary data.
Images are not part of the dataset; users must separately license or source visual content from Tate, adding complexity for multimedia or visualization projects.
The README notes metadata is dynamic and may contain errors, and CSV files have UTF-8 encoding that could cause issues with older software like Excel.