Publication
Title
pBRIT : gene prioritization by correlating functional and phenotypic annotations through integrative data fusion
Author
Abstract
Motivation Computational gene prioritization can aid in disease gene identification. Here, we propose pBRIT (prioritization using Bayesian Ridge regression and Information Theoretic model), a novel adaptive and scalable prioritization tool, integrating Pubmed abstracts, Gene Ontology, Sequence similarities, Mammalian and Human Phenotype Ontology, Pathway, Interactions, Disease Ontology, Gene Association database and Human Genome Epidemiology database, into the prediction model.We explore and address effects of sparsity and inter-feature dependencies within annotation sources, and the impact of bias towards specific annotations. Results pBRIT models feature dependencies and sparsity by an Information-Theoretic (data driven) approach and applies intermediate integration based data fusion. Following the hypothesis that genes underlying similar diseases will share functional and phenotype characteristics, it incorporates Bayesian Ridge regression to learn a linear mapping between functional and phenotype annotations. Genes are prioritized on phenotypic concordance to the training genes. We evaluated pBRIT against 9 existing methods, and on over 2,000 HPO-gene associations retrieved after construction of pBRIT data sources. We achieve maximum AUC scores ranging from 0.92 to 0.96 against benchmark datasets and of 0.80 against the time-stamped HPO entries, indicating good performance with high sensitivity and specificity. Our model shows stable performance with regard to changes in the underlying annotation data, is fast and scalable for implementation in routine pipelines.
Language
English
Source (journal)
Bioinformatics. - Oxford, 1998, currens
Publication
Oxford : Oxford University Press , 2018
ISSN
1367-4803 [print]
1367-4811 [online]
DOI
10.1093/BIOINFORMATICS/BTY079
Volume/pages
34 :13 (2018) , p. 2254-2262
ISI
000438247800078
Pubmed ID
29452392
Full text (Publisher's DOI)
Full text (open access)
UAntwerpen
Faculty/Department
Research group
Project info
Exaptation: Scalable solutions for image-based and across-partner compound activity prediction and application to compound selection
Bicuspid Related Aortopathy, a Vibrant Exploration (BRAVE).
Clinical and (patho)genetic study of bicuspid aortic valve and associated aortic aneurysm.
Deciphering hidden inheritance patterns using advanced data mining techniques on high throughput genomic data.
Deciphering hidden inheritance patterns using frequent itemset mining techniques on high throughput genomic data.
Publication type
Subject
Affiliation
Publications with a UAntwerp address
External links
Web of Science
Record
Identifier
Creation 28.02.2018
Last edited 09.10.2023
To cite this reference