Publication
Title
Comprehensible credit scoring models using rule extraction from support vector machines
Author
Abstract
In recent years, support vector machines (SVMs) were successfully applied to a wide range of applications. However, since the classifier is described as a complex mathematical function, it is rather incomprehensible for humans. This opacity property prevents them from being used in many real-life applications where both accuracy and comprehensibility are required, such as medical diagnosis and credit risk evaluation. To overcome this limitation, rules can be extracted from the trained SVM that are interpretable by humans and keep as much of the accuracy of the SVM as possible. In this paper, we will provide an overview of the recently proposed rule extraction techniques for SVMs and introduce two others taken from the artificial neural networks domain, being Trepan and G-REX. The described techniques are compared using publicly available datasets, such as Ripleys synthetic dataset and the multi-class iris dataset. We will also look at medical diagnosis and credit scoring where comprehensibility is a key requirement and even a regulatory recommendation. Our experiments show that the SVM rule extraction techniques lose only a small percentage in performance compared to SVMs and therefore rank at the top of comprehensible classification techniques.
Language
English
Source (journal)
European journal of operational research. - Amsterdam
Publication
Amsterdam : 2007
ISSN
0377-2217
DOI
10.1016/J.EJOR.2006.04.051
Volume/pages
183 :3 (2007) , p. 1466-1476
ISI
000248590100039
Full text (Publisher's DOI)
UAntwerpen
Faculty/Department
Publication type
Subject
External links
Web of Science
Record
Identifier
Creation 12.09.2011
Last edited 29.01.2023
To cite this reference