A primer to frequent itemset mining for bioinformatics

Naulaerts, Stefan; Meysman, Pieter; Bittremieux, Wout; Vu, Trung Nghia; Vanden Berghe, Wim; Goethals, B.; Laukens, Kris

doi:10.1093/BIB/BBT074

Title

A primer to frequent itemset mining for bioinformatics

Author

Naulaerts, Stefan

Meysman, Pieter

Bittremieux, Wout

Vu, Trung Nghia

Vanden Berghe, Wim

Goethals, B.

Laukens, Kris

Abstract

Over the past two decades, pattern mining techniques have become an integral part of many bioinformatics solutions. Frequent itemset mining is a popular group of pattern mining techniques designed to identify elements that frequently co-occur. An archetypical example is the identification of products that often end up together in the same shopping basket in supermarket transactions. A number of algorithms have been developed to address variations of this computationally non-trivial problem. Frequent itemset mining techniques are able to efficiently capture the characteristics of (complex) data and succinctly summarize it. Owing to these and other interesting properties, these techniques have proven their value in biological data analysis. Nevertheless, information about the bioinformatics applications of these techniques remains scattered. In this primer, we introduce frequent itemset mining and their derived association rules for life scientists. We give an overview of various algorithms, and illustrate how they can be used in several real-life bioinformatics application domains. We end with a discussion of the future potential and open challenges for frequent itemset mining in the life sciences.

Language

English

Source (journal)

Briefings in bioinformatics. - London

Publication

London : 2015

ISSN

1467-5463

DOI

10.1093/BIB/BBT074

Volume/pages

16 :2 (2015) , p. 216-231

ISI

000352240000004

Pubmed ID

24162173

Full text (Publisher's DOI)

https://doi.org/10.1093/BIB/BBT074

Full text (open access)

https://repository.uantwerpen.be/docman/irua/74aae0/9639.pdf

Faculty/Department				Faculty of Sciences. Mathematics and Computer Science Faculty of Pharmaceutical, Biomedical and Veterinary Sciences . Biomedical Sciences

Research group				ADReM Data Lab (ADReM)
Project info				Intelligent analysis and data-mining of mass spectrometry-based proteome data. Integrative bioinformatics analysis of combined epigenome, transcriptome and proteome data.
Publication type				A1 Journal article

Subject				Mathematics Chemistry Biology Computer. Automation

Affiliation				Publications with a UAntwerp address

Web of Science

View record in Web of Science®

View citing articles in Web of Science®

Identifier

Creation

06.11.2013

Last edited

09.10.2023

To cite this reference

https://hdl.handle.net/10067/1110880151162165141