How to be fair? A study of label and selection bias

Favier, Marco; Calders, Toon; Pinxteren, Sam; Meyer, Jonathan

doi:10.1007/S10994-023-06401-1

Title

How to be fair? A study of label and selection bias

Author

Favier, Marco

Calders, Toon

Pinxteren, Sam

Meyer, Jonathan

Abstract

It is widely accepted that biased data leads to biased and thus potentially unfair models. Therefore, several measures for bias in data and model predictions have been proposed, as well as bias mitigation techniques whose aim is to learn models that are fair by design. Despite the myriad of mitigation techniques developed in the past decade, however, it is still poorly understood which methods work under which circumstances. Recently, Wick et al. showed, with experiments on synthetic data, that there exist situations in which bias mitigation techniques lead to more accurate models when measured on unbiased data. Nevertheless, in the absence of a thorough mathematical analysis, it remains unclear which techniques are effective under which circumstances. We propose to address this problem by establishing relationships between the type of bias and the effectiveness of a mitigation technique, where we categorize the mitigation techniques by the bias measure they optimize. In this paper we illustrate this principle for label and selection bias on the one hand, and demographic parity and "We're All Equal" on the other hand. Our theoretical analysis allows us to explain the results of Wick et al., and we also show that there are situations where minimizing fairness measures does not result in the fairest possible distribution.

Language

English

Source (journal)

Machine learning. - Boston, Mass., 1986, currens

Publication

Dordrecht : Springer, 2023

ISSN

0885-6125 [print]

1573-0565 [online]

DOI

10.1007/S10994-023-06401-1

Volume/pages

112 (2023), p. 5081-5104

ISI

001071859700002

Full text (Publisher's DOI)

https://doi.org/10.1007/S10994-023-06401-1

Full text (open access)

https://repository.uantwerpen.be/docstore/d:irua:20396

Full text (publisher's version - intranet only)

https://repository.uantwerpen.be/docstore/d:iruaintra:10625

Faculty/Department

Faculty of Sciences. Mathematics and Computer Science

Research group

ADReM Data Lab (ADReM)

Publication type

A1 Journal article

Subject

Computer. Automation

Affiliation

Publications with a UAntwerp address

Identifier

Creation

30.10.2023

Last edited

09.09.2024

To cite this reference

https://hdl.handle.net/10067/2002770151162165141