Outlier detection for skewed data
Faculty of Sciences. Mathematics and Computer Science
Department of Mathematics - Computer Sciences
Journal of chemometrics. - Chichester
International Chemometric Conference (Conferentia Chemometrica 2007), SEP 02-05, 2007, Budapest, HUNGARY
22(2008):3-4, p. 235-246
University of Antwerp
Most outlier detection rules for multivariate data are based on the assumption of elliptical symmetry of the underlying distribution. We propose an outlier detection method that neither assumes symmetry nor relies on visual inspection. Our method is a generalization of the Stahel-Donoho outlyingness. The latter approach assigns to each observation a measure of outlyingness, which is obtained by projection pursuit techniques that only use univariate robust measures of location and scale. To allow for skewness in the data, we adjust this measure of outlyingness by using a robust measure of skewness as well. Observations with an outlying value of the adjusted outlyingness (AO) are then considered outliers. For bivariate data, our approach leads to two graphical representations. The first is a contour plot of the AO values. The second is an extension of the boxplot to bivariate data, in the spirit of the bagplot, which is based on the concept of half space depth. We illustrate our outlier detection method on several simulated and real data sets. Copyright (c) 2008 John Wiley & Sons, Ltd.
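As a rough illustration of the starting point named in the abstract, the sketch below approximates the classical (unadjusted) Stahel-Donoho outlyingness in Python with NumPy. It is not the authors' implementation and does not include the skewness adjustment that defines the AO. Several choices are assumptions made for this example only: the function name `stahel_donoho_outlyingness`, the use of the median and the MAD as the univariate robust location and scale, the conventional normal-consistency factor 1.4826, and the use of a finite set of random directions in place of a maximum over all directions.

```python
import numpy as np


def stahel_donoho_outlyingness(X, n_directions=500, seed=0):
    """Approximate Stahel-Donoho outlyingness of each row of X.

    Assumptions of this sketch (not taken from the paper): median and MAD
    as robust location and scale, and random unit directions instead of
    an exact maximum over all directions.
    """
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)

    # Draw random directions and normalise them to unit length.
    directions = rng.standard_normal((n_directions, X.shape[1]))
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)

    # Project every observation on every direction:
    # shape (n_samples, n_directions).
    projections = X @ directions.T

    # Univariate robust location and scale of each projected sample.
    med = np.median(projections, axis=0)
    mad = 1.4826 * np.median(np.abs(projections - med), axis=0)

    # Ignore degenerate directions where the MAD is zero.
    mad = np.where(mad > 0, mad, np.nan)

    # Standardised distance to the median, maximised over directions.
    scores = np.abs(projections - med) / mad
    return np.nanmax(scores, axis=1)
```

A possible use is to compute the scores for a data matrix with one row per observation and to inspect the observations with the largest values. Because this version scales distances symmetrically around the median, it can overstate the outlyingness of points in the long tail of a skewed distribution, which is the limitation that the adjusted outlyingness is designed to address.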