Title
Explainability methods to detect and measure discrimination in machine learning models
Abstract
Machine learning models are now commonly used for high-stakes decisions, but this can pose a threat to fairness, as these models can amplify bias present in the training data. There is currently no consensus on a universal method to tackle this, and we argue that such a method is not possible, as the right approach depends on the context of each case. As a solution, we aim to bring transparency to the fairness domain, and in earlier work we proposed a counterfactual-based algorithm (PreCoF) to identify bias in machine learning models. This method attempts to counter the disagreement problem in Explainable AI by reducing the flexibility of the model owner. We envision a future in which transparency tools such as this one are used by independent auditors to perform fairness audits, judging for each case whether the audit revealed discriminatory patterns. This approach would be more in line with the current nature of EU legislation, whose requirements are often too contextual and open to judicial interpretation to be automated.
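To illustrate the general idea of counterfactual-based bias detection described above, the sketch below flips a protected attribute and checks how often a trained model's prediction changes. This is a minimal, hypothetical Python example on synthetic data, not the authors' PreCoF algorithm; the feature layout, threshold, and variable names are assumptions made purely for illustration.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)

    # Synthetic data: column 0 is a protected attribute (0/1), column 1 a legitimate feature.
    X = np.column_stack([rng.integers(0, 2, 500), rng.normal(size=500)])
    # The label partly depends on the protected attribute, so the model can pick up a discriminatory pattern.
    y = ((0.8 * X[:, 0] + X[:, 1]) > 0.5).astype(int)

    model = LogisticRegression().fit(X, y)

    # Counterfactual probe: flip the protected attribute for every instance and
    # count how often the model's prediction changes.
    X_cf = X.copy()
    X_cf[:, 0] = 1 - X_cf[:, 0]
    flipped = model.predict(X) != model.predict(X_cf)
    print(f"Prediction changes when the protected attribute is flipped: {flipped.mean():.1%} of instances")

A high fraction of changed predictions would signal that the model relies directly on the protected attribute; the actual PreCoF method goes further than this toy probe.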
Language
English
Source (journal)
CEUR workshop proceedings
Source (book)
EWAF'23: European Workshop on Algorithmic Fairness, June 07-09, 2023, Winterthur, Switzerland
Publication
CEUR, 2023
Volume/pages
3442, p. 1-5
Full text (open access)
Record
Creation 16.10.2023
Last edited 17.06.2024