TY - GEN
T1 - Integrative protein function transfer using factor graphs and heterogeneous data sources
AU - Mitrofanova, Antonina
AU - Pavlovic, Vladimir
AU - Mishra, Bud
PY - 2008
Y1 - 2008
N2 - We propose a novel approach for predicting protein functions of an organism by coupling sequence homology and PPI data between two (or more) species with multifunctional Gene Ontology information into a single computational model. Instead of using a network of one organism in isolation, we join networks of different species by inter-species sequence homology links of sufficient similarity. As a consequence, the knowledge of a protein's function is acquired not only from one species' network alone, but also through homologous links to the networks of different species. We apply our method to two largest protein networks, Yeast (Saccharomyces cerevisiae) and Fly (Drosophila melanogaster). Our joint Fly-Yeast network displays statistically significant improvements in precision, accuracy, and false positive rate over networks that consider either of the sources in isolation, while retaining the computational efficiency of the simpler models.
AB - We propose a novel approach for predicting protein functions of an organism by coupling sequence homology and PPI data between two (or more) species with multifunctional Gene Ontology information into a single computational model. Instead of using a network of one organism in isolation, we join networks of different species by inter-species sequence homology links of sufficient similarity. As a consequence, the knowledge of a protein's function is acquired not only from one species' network alone, but also through homologous links to the networks of different species. We apply our method to two largest protein networks, Yeast (Saccharomyces cerevisiae) and Fly (Drosophila melanogaster). Our joint Fly-Yeast network displays statistically significant improvements in precision, accuracy, and false positive rate over networks that consider either of the sources in isolation, while retaining the computational efficiency of the simpler models.
UR - http://www.scopus.com/inward/record.url?scp=58149159887&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=58149159887&partnerID=8YFLogxK
U2 - 10.1109/BIBM.2008.65
DO - 10.1109/BIBM.2008.65
M3 - Conference contribution
AN - SCOPUS:58149159887
SN - 9780769534527
T3 - Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
SP - 314
EP - 318
BT - Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
T2 - 2008 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
Y2 - 3 November 2008 through 5 November 2008
ER -