Publication
Title
Value-added tax fraud detection with scalable anomaly detection techniques
Author
Abstract
The tax fraud detection domain is characterized by very few labelled data (known fraud/legal cases) that are not representative for the population due to sample selection bias. We use unsupervised anomaly detection (AD) techniques, which are uncommon in tax fraud detection research, to deal with these domain issues. We analyse a unique dataset containing the VAT declarations and client listings of all Belgian VAT numbers pertaining to ten sectors. Our methodology consists in applying AD methods to firms belonging to the same sector and enables an efficient auditing strategy that can be adopted by tax authorities worldwide. The high lifts and hit rates observed in most sectors demonstrate the success of this approach. Sectoral differences exist due to varying market conditions and legal requirements across sectors and we show that the optimal AD method is sector dependent. We focus on three methodological problems that show issues in the related literature. (1) Can we design suitable input features? We develop new fraud indicators from specific fields of the VAT form and client listings and show the predictive value of the combination of these features. (2) Can we design fast algorithms to deal with the large data sizes that can occur in the tax domain? New methods are developed and we demonstrate their scalability both theoretically as well as empirically. (3) How should fraud detection performance be assessed? A new evaluation methodology is proposed that provides reliable performance indications and guarantees that fraud cases are effectively detected by the proposed methods.
Language
English
Source (journal)
Applied soft computing. - Place of publication unknown
Publication
Place of publication unknown : 2020
ISSN
1568-4946
DOI
10.1016/J.ASOC.2019.105895
Volume/pages
86 (2020) , p. 1-20
Article Reference
105895
ISI
000503388200068
Medium
E-only publicatie
Full text (Publisher's DOI)
Full text (open access)
Full text (publisher's version - intranet only)
UAntwerpen
Faculty/Department
Research group
Project info
Digitalisation and Tax (DigiTax).
Data mining for tax fraud detection.
Publication type
Subject
Law 
Affiliation
Publications with a UAntwerp address
External links
Web of Science
Record
Identifier
Creation 07.11.2019
Last edited 12.12.2024
To cite this reference