Axiomatization of frequent itemsetsAxiomatization of frequent itemsets
Faculty of Sciences. Mathematics and Computer Science
Department of Mathematics - Computer Sciences
Advanced Database Research and Modeling (ADReM)
Theoretical computer science. - Amsterdam
290(2003):1, p. 669-693
University of Antwerp
Mining association rules is very popular in the data mining community. Most algorithms designed for finding association rules start with searching for frequent itemsets. Typically, in these algorithms, counting phases and pruning phases are interleaved. In the counting phase, partial information about the frequencies of selected itentsets is gathered. In the pruning phase as much as possible of the search space is pruned, based on the counting information. We introduce frequent set expressions to represent (possible partial) information acquired in the counting phase. A frequent set expression is a pair containing an itemset and a fraction that is a lower bound on the actual frequency of the itemset. A system of frequent sets is a collection of such pairs. We give an axiomatization for those systems that are complete in the sense that they explicitly contain all information they logically imply. Every system of frequent sets has a unique completion that actually represents all knowledge that can be derived. We also study sparse systems, in which not for every frequent set an expression is given. Furthermore, we explore the links with probabilistic logics. (C) 2002 Elsevier Science B.V. All rights reserved.