Publication
Title
Robustness verification for classifier ensembles
Author
Abstract
We give a formal verification procedure that decides whether a classifier ensemble is robust against arbitrary randomized attacks. Such attacks consist of a set of deterministic attacks and a distribution over this set. The robustness-checking problem consists of assessing, given a set of classifiers and a labelled data set, whether there exists a randomized attack that induces a certain expected loss against all classifiers. We show the NP-hardness of the problem and provide an upper bound on the number of attacks that is sufficient to form an optimal randomized attack. These results provide an effective way to reason about the robustness of a classifier ensemble. We provide SMT and MILP encodings to compute optimal randomized attacks or prove that there is no attack inducing a certain expected loss. In the latter case, the classifier ensemble is provably robust. Our prototype implementation verifies multiple neural-network ensembles trained for image-classification tasks. The experimental results using the MILP encoding are promising in terms of both scalability and the general applicability of our verification procedure.
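The abstract's central object, an optimal randomized attack, is a distribution over finitely many deterministic attacks that maximizes the worst-case expected loss across all classifiers in the ensemble. The sketch below is not the paper's SMT or MILP encoding; it is a minimal illustration of that max-min idea as a linear program, assuming a hypothetical precomputed loss matrix loss[i][j] (the loss that attack i induces on classifier j) and using scipy.optimize.linprog.

  # Illustrative sketch (not the paper's encoding): find a distribution p over
  # deterministic attacks maximizing the minimum expected loss over classifiers.
  import numpy as np
  from scipy.optimize import linprog

  # Hypothetical loss matrix: rows = deterministic attacks, columns = classifiers.
  loss = np.array([
      [0.9, 0.1, 0.4],
      [0.2, 0.8, 0.5],
      [0.4, 0.6, 0.7],
  ])
  m, n = loss.shape

  # Variables: p_0..p_{m-1} (attack probabilities) and t (worst-case expected loss).
  # Maximize t  <=>  minimize -t.
  c = np.concatenate([np.zeros(m), [-1.0]])

  # For every classifier j: t - sum_i p_i * loss[i][j] <= 0.
  A_ub = np.hstack([-loss.T, np.ones((n, 1))])
  b_ub = np.zeros(n)

  # The attack probabilities form a distribution.
  A_eq = np.concatenate([np.ones(m), [0.0]]).reshape(1, -1)
  b_eq = np.array([1.0])

  bounds = [(0.0, 1.0)] * m + [(None, None)]
  res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)

  p, t = res.x[:m], res.x[-1]
  print("attack distribution:", p)
  print("guaranteed expected loss against every classifier:", t)

If the optimal value t exceeds a given loss threshold, a witnessing randomized attack exists for this (hypothetical) loss matrix; otherwise no such attack exists and, in the spirit of the abstract, the ensemble would count as robust for that threshold.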
Language
English
Source (journal)
Lecture notes in computer science. - Berlin, 1973, currens
Source (book)
Automated Technology for Verification and Analysis : proceedings of the 18th International Symposium, ATVA 2020, October 19–23, 2020, Hanoi, Vietnam
Source (series)
Lecture notes in computer science ; 12302
Publication
Cham : Springer, 2020
ISBN
978-3-030-59151-9
DOI
10.1007/978-3-030-59152-6_15
Volume/pages
p. 271-287
ISI
000723555700015
UAntwerpen
Faculty/Department
Research group
Project info
SAILor: Safe Artificial Intelligence and Learning for Verification.
Publication type
Subject
Affiliation
Publications with a UAntwerp address
Record
Identifier
Creation 28.12.2020
Last edited 02.10.2024