Query-based biclustering of gene expression data using probabilistic relational models

Zhao, Hui; Cloots, Lore; van den Bulcke, Tim; Wu, Yan; de Smet, Riet; Storms, Valerie; Meysman, Pieter; Engelen, Kristof; Marchal, Kathleen

doi:10.1186/1471-2105-12-S1-S37

Title

Query-based biclustering of gene expression data using probabilistic relational models

Author

Zhao, Hui

Cloots, Lore

van den Bulcke, Tim

Wu, Yan

de Smet, Riet

Storms, Valerie

Meysman, Pieter

Engelen, Kristof

Marchal, Kathleen

Abstract

Background With the availability of large scale expression compendia it is now possible to view own findings in the light of what is already available and retrieve genes with an expression profile similar to a set of genes of interest (i.e., a query or seed set) for a subset of conditions. To that end, a query-based strategy is needed that maximally exploits the coexpression behaviour of the seed genes to guide the biclustering, but that at the same time is robust against the presence of noisy genes in the seed set as seed genes are often assumed, but not guaranteed to be coexpressed in the queried compendium. Therefore, we developed ProBic, a query-based biclustering strategy based on Probabilistic Relational Models (PRMs) that exploits the use of prior distributions to extract the information contained within the seed set. Results We applied ProBic on a large scale Escherichia coli compendium to extend partially described regulons with potentially novel members. We compared ProBic's performance with previously published query-based biclustering algorithms, namely ISA and QDB, from the perspective of bicluster expression quality, robustness of the outcome against noisy seed sets and biological relevance. This comparison learns that ProBic is able to retrieve biologically relevant, high quality biclusters that retain their seed genes and that it is particularly strong in handling noisy seeds. Conclusions ProBic is a query-based biclustering algorithm developed in a flexible framework, designed to detect biologically relevant, high quality biclusters that retain relevant seed genes even in the presence of noise or when dealing with low quality seed sets.

Language

English

Source (journal)

BMC bioinformatics. - London

Publication

London : 2011

ISSN

1471-2105

DOI

10.1186/1471-2105-12-S1-S37

Volume/pages

12:S:1 (2011) , p. S37,1-S37,11

ISI

000290221000038

Full text (Publisher's DOI)

https://doi.org/10.1186/1471-2105-12-S1-S37

Full text (open access)

https://repository.uantwerpen.be/docman/irua/bcb3cb/a0c7f4ab.pdf

Faculty/Department				Faculty of Sciences. Mathematics and Computer Science Faculty of Medicine and Health Sciences

Research group
Publication type				A1 Journal article

Subject				Biology Human medicine Engineering sciences. Technology Computer. Automation

Affiliation				Publications with a UAntwerp address

Web of Science

View record in Web of Science®

View citing articles in Web of Science®

Identifier

Creation

28.06.2011

Last edited

15.11.2022

To cite this reference

https://hdl.handle.net/10067/897750151162165141