A tool for automatically detecting and suggesting mitigation for object, attribute, and geography-based biases in visual datasets.
REVISE is a research tool for measuring and mitigating bias in visual datasets. It automatically detects potential biases along object-based, attribute-based, and geography-based patterns, providing actionable insights to improve dataset fairness. The tool helps identify imbalances in representation, attribute distribution, and geographic coverage that could lead to skewed computer vision models.
Computer vision researchers, data scientists, and AI ethics practitioners who need to audit visual datasets for fairness before model training. It is particularly useful for teams building or curating large-scale image datasets for machine learning.
REVISE offers a comprehensive, multi-axis approach to bias detection that goes beyond simple demographic checks. Its automated measurement pipelines and visual summaries enable systematic dataset auditing, reducing manual effort and providing clear pathways for mitigation.
REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets --- https://arxiv.org/abs/2004.07999
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Analyzes biases along object, attribute, and geography axes, providing a holistic view as emphasized in the philosophy section for comprehensive dataset auditing.
Generates summary PDFs with visualizations and interpretations for each bias axis, reducing manual effort in bias analysis as shown in the sample_summary_pdfs folder.
Supports custom datasets via a template dataloader and allows switching between facial detection backends, including free tools like cv2, as detailed in the setup instructions.
Based on peer-reviewed publications from ECCV and IJCV, ensuring rigorous and validated bias measurement techniques for reliable results.
Requires conda environment creation, model downloads, and troubleshooting for issues like PROJ_LIB errors, as noted in the Potential Environment Issues section, increasing initial overhead.
Recommends Amazon Rekognition for facial detection, which incurs charges and introduces vendor lock-in, though free alternatives are available but may require code changes.
Involves running Jupyter notebooks for exploring biases, as per the steps to perform analysis, which may not be fully automated for continuous integration or production pipelines.