Efficient cluster detection by ordered neighborhoods

Akşehirli, Emin; Goethals, Bart; Mueller, Emmanuel

doi:10.1007/978-3-319-22729-0_2

Title

Efficient cluster detection by ordered neighborhoods

Author

Akşehirli, Emin

Goethals, Bart

Mueller, Emmanuel

Abstract

Detecting cluster structures seems to be a simple task, i.e. separating similar from dissimilar objects. However, given today's complex data, (dis-)similarity measures and traditional clustering algorithms are not reliable in separating clusters from each other. For example, when too many dimensions are considered simultaneously, objects become unique and (dis-) similarity does not provide meaningful information to detect clusters anymore. While the (dis-) similarity measures might be meaningful for individual dimensions, algorithms fail to combine this information for cluster detection. In particular, it is the severe issue of a combinatorial search space that results in inefficient algorithms. In this paper we propose a cluster detection method based on the ordered neighborhoods. By considering such ordered neighborhoods in each dimension individually, we derive properties that allow us to detect clustered objects in dimensions in linear time. Our algorithm exploits the ordered neighborhoods in order to find both the similar objects and the dimensions in which these objects show high similarity. Evaluation results show that our method is scalable with both database size and dimensionality and enhances cluster detection w.r.t. state-of-the-art clustering techniques.

Language

English

Source (journal)

Lecture notes in computer science. - Berlin, 1973, currens

Source (book)

17th International Conference on Big Data Analytics and Knowledge, Discovery (DaWaK), SEP 01-04, 2015, Valencia, SPAIN

Publication

Berlin : Springer-verlag berlin , 2015

ISBN

978-3-319-22728-3

978-3-319-22729-0

978-3-319-22728-3

DOI

10.1007/978-3-319-22729-0_2

Volume/pages

9263 (2015) , p. 15-27

ISI

000363583200002

Full text (Publisher's DOI)

https://doi.org/10.1007/978-3-319-22729-0_2

Faculty/Department				Faculty of Sciences. Mathematics and Computer Science

Research group				ADReM Data Lab (ADReM)
Publication type				P1 Proceeding

Subject				Computer. Automation

Affiliation				Publications with a UAntwerp address

Web of Science

View record in Web of Science®

View citing articles in Web of Science®

Identifier

Creation

09.12.2015

Last edited

09.10.2023

To cite this reference

https://hdl.handle.net/10067/1294990151162165141