A Python toolbox for auditing machine learning models to detect and quantify bias in black-box predictions.
FairML is a Python toolbox that audits machine learning models for bias by quantifying the relative significance of input features. It helps analysts determine if a black-box predictive model exhibits unintentional discrimination based on protected characteristics like race or gender. The toolbox is designed to assess fairness in models deployed in critical domains such as credit scoring, insurance, and employment.
Data scientists, machine learning engineers, and analysts who build or deploy predictive models in regulated or high-impact domains and need to audit for fairness and compliance. Researchers focused on algorithmic fairness and responsible AI will also find it valuable.
FairML provides a practical, model-agnostic toolkit that works with any black-box predictor, making fairness auditing accessible without requiring model interpretability. Its unique combination of model compression and multiple ranking algorithms offers a robust methodology to quantify and visualize discriminatory patterns.
FairML is an end-to-end Python toolbox designed to audit predictive models by quantifying their relative dependence on input features, enabling assessment of potential discrimination. It addresses the critical need to audit black-box models deployed in high-stakes domains like credit, insurance, and employment, where unintentional bias based on race, gender, or other protected characteristics can have serious societal impacts.
FairML is built on the principle that transparency and accountability are essential for responsible AI, providing tools to systematically audit models for fairness without requiring interpretability of the underlying algorithm.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Works with any predictive model that provides a predict function, as demonstrated with scikit-learn's LogisticRegression in the README, without requiring internal model details.
Leverages four input ranking algorithms and model compression to measure predictive dependence, offering a robust methodology for fairness assessment beyond single-method approaches.
Generates feature dependence plots to visually communicate audit results, exemplified in the README with saved EPS files for clear reporting.
Includes an optional flag to check model dependence on feature interactions, adding depth to the auditing process for complex models.
The auditing process involves multiple runs (controlled by number_of_runs), which can be slow for large models or datasets, as admitted by the iterative methodology in the code demo.
The README provides only a basic demo and lacks comprehensive examples, advanced use cases, or troubleshooting guides, which may hinder adoption beyond simple audits.
Requires installation via GitHub URL or cloning rather than a standard PyPI release, making integration into automated workflows or production environments more cumbersome.