Publication
Title
Efficiently mining cohesion-based patterns and rules in event sequences
Author
Abstract
Discovering patterns in long event sequences is an important data mining task. Traditionally, research focused on frequency-based quality measures that allow algorithms to use the anti-monotonicity property to prune the search space and efficiently discover the most frequent patterns. In this work, we step away from such measures, and evaluate patterns using cohesion a measure of how close to each other the items making up the pattern appear in the sequence on average. We tackle the fact that cohesion is not an anti-monotonic measure by developing an upper bound on cohesion in order to prune the search space. By doing so, we are able to efficiently unearth rare, but strongly cohesive, patterns that existing methods often fail to discover. Furthermore, having found the occurrences of cohesive itemsets in the input sequence, we use them to discover the representative sequential patterns and the dominant partially ordered episodes, without going through the computationally expensive candidate generation procedures typically associated with sequential pattern and episode mining. Experiments show that our method efficiently discovers important patterns that existing state-of-the-art methods fail to discover.
Language
English
Source (journal)
Data mining and knowledge discovery. - Boston, Mass., 1997, currens
Publication
Boston, Mass. : 2019
ISSN
1384-5810 [print]
1573-756X [online]
DOI
10.1007/S10618-019-00628-0
Volume/pages
33 :4 (2019) , p. 1125-1182
ISI
000471246600010
Full text (Publisher's DOI)
Full text (open access)
Full text (publisher's version - intranet only)
UAntwerpen
Faculty/Department
Research group
Project info
Reliable on-the-fly prediction of future events in data streams.
City of Things
Publication type
Subject
Affiliation
Publications with a UAntwerp address
External links
Web of Science
Record
Identifier
Creation 01.08.2019
Last edited 02.10.2024
To cite this reference