A large-scale image dataset for self-supervised pretraining without humans, designed to reduce privacy concerns.
PASS is a large-scale image dataset specifically designed for self-supervised pretraining in computer vision. It serves as a privacy-conscious alternative to ImageNet by containing no humans or personally identifiable information, allowing researchers and developers to train models without privacy concerns. The project also provides pretrained models using various self-supervised learning methods like MoCo-v2, SwAV, and DINO.
Machine learning researchers and computer vision practitioners who need privacy-compliant datasets for self-supervised pretraining, particularly those working on projects where human imagery presents ethical or legal challenges.
PASS offers a unique combination of large-scale image data suitable for high-quality pretraining while completely eliminating privacy risks associated with human imagery. Unlike ImageNet, it provides built-in privacy protection while maintaining competitive performance in downstream tasks.
The PASS dataset: pretrained models and how to get the data
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
PASS contains 1.4 million images with no humans or personally identifiable information, directly mitigating ethical and legal risks associated with human data in datasets like ImageNet.
Provides multiple models trained with self-supervised methods like MoCo-v2, SwAV, and DINO, with benchmarks for ResNet-50 and ViT architectures, enabling easy comparison and selection.
Models are accessible via PyTorch Hub with simple commands, such as torch.hub.load('yukimasano/PASS:main', 'dino_100ep_vits16'), reducing setup time for experiments.
Includes automated scripts to help remove humans from existing datasets, extending its value for creating custom privacy-compliant datasets beyond the provided one.
Excluding all human imagery means models pretrained on PASS may underperform on downstream tasks involving human recognition, such as surveillance or healthcare applications.
Benchmarks show PASS-trained models have slightly lower ImageNet-1k accuracy compared to those trained on the original dataset, as seen in the table where MoCo-v2 on PASS achieves 59.5% vs. 60.6% on IN-1k.
The dataset is hosted on Zenodo or AWS with large tar files, requiring manual downloading and setup via scripts, which can be slower and more complex than integrated dataset loaders.
PASS: An An ImageNet replacement for self-supervised pretraining without humans is an open-source alternative to the following products: