Mining the enriched subgraphs for specific vertices in a biological graph
Faculty of Sciences. Biology
Faculty of Sciences. Chemistry
Faculty of Sciences. Mathematics and Computer Science
Faculty of Pharmaceutical, Biomedical and Veterinary Sciences. Biomedical Sciences
New York, N.Y.
IEEE/ACM transactions on computational biology and bioinformatics / Institute of Electrical and Electronics Engineers [New York, N.Y.], p. 1-12
University of Antwerp
In this paper, we present a subgroup discovery method for finding subgraphs in a graph that are associated with a given set of vertices. A subgraph pattern is considered associated with the vertex set when it is significantly enriched in that set, as measured by a Bonferroni-corrected hypergeometric probability value. Because this interestingness measure requires counting subgraph matches, it calls for a dedicated pruning procedure that limits the number of matches that must be calculated. The presented mining algorithm is therefore designed to traverse the search space of subgraph patterns efficiently, even in large graphs. We demonstrate the method by applying it to three biological graph data sets and show that it finds associated subgraphs for biologically relevant sets of vertices, and that the subgraphs found are themselves biologically interesting.
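The enrichment measure named in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the exact formulation, so the code below assumes the usual upper-tail hypergeometric test: given `total` vertices in the graph, `interest` vertices in the given set, `matched` vertices covered by a subgraph pattern, and `matched_interest` covered vertices that belong to the given set, it computes the probability of seeing at least that many by chance. All function and parameter names are hypothetical. The `optimistic_p_value` helper shows one generic style of bound used for pruning in subgroup discovery; it is an assumption for illustration and not necessarily the pruning procedure of the paper.

```python
from math import comb


def enrichment_p_value(total, interest, matched, matched_interest):
    """Upper-tail hypergeometric probability P(X >= matched_interest).

    X is the number of interest vertices among `matched` vertices drawn
    without replacement from `total` vertices, `interest` of which are
    in the given set.
    """
    upper = min(matched, interest)
    favourable = sum(
        comb(interest, i) * comb(total - interest, matched - i)
        for i in range(matched_interest, upper + 1)
    )
    return favourable / comb(total, matched)


def bonferroni(p_value, num_tests):
    """Bonferroni correction: scale by the number of tested patterns, cap at 1."""
    return min(1.0, p_value * num_tests)


def optimistic_p_value(total, interest, matched_interest):
    """Smallest p-value any more specific pattern could still reach.

    Assumption (not taken from the paper): a more specific pattern matches
    a subset of the current matches, so its best case is to keep all
    `matched_interest` interest vertices and no others.
    """
    return comb(interest, matched_interest) / comb(total, matched_interest)


# Toy example: 10 vertices, 4 of interest; the pattern matches 3 vertices,
# 2 of which are of interest; 2 patterns were tested in total.
p = enrichment_p_value(total=10, interest=4, matched=3, matched_interest=2)
corrected = bonferroni(p, num_tests=2)
bound = optimistic_p_value(total=10, interest=4, matched_interest=2)
```

Under these assumptions, a pattern whose Bonferroni-corrected optimistic bound already exceeds the significance threshold cannot yield a significant specialization, so that branch of the search space could be skipped without counting further matches.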