TY - JOUR
T1 - DREAM3
T2 - Network inference using dynamic context likelihood of relatedness and the inferelator
AU - Madar, Aviv
AU - Greenfield, Alex
AU - Vanden-Eijnden, Eric
AU - Bonneau, Richard
PY - 2010
Y1 - 2010
N2 - Background: Many current works aiming to learn regulatory networks from systems biology data must balance model complexity with respect to data availability and quality. Methods that learn regulatory associations based on unit-less metrics, such as Mutual Information, are attractive in that they scale well and reduce the number of free parameters (model complexity) per interaction to a minimum. In contrast, methods for learning regulatory networks based on explicit dynamical models are more complex and scale less gracefully, but are attractive as they may allow direct prediction of transcriptional dynamics and resolve the directionality of many regulatory interactions. Methodology: We aim to investigate whether scalable information based methods (like the Context Likelihood of Relatedness method) and more explicit dynamical models (like Inferelator 1.0) prove synergistic when combined. We test a pipeline where a novel modification of the Context Likelihood of Relatedness (mixed-CLR, modified to use time series data) is first used to define likely regulatory interactions and then Inferelator 1.0 is used for final model selection and to build an explicit dynamical model. Conclusions/Significance: Our method ranked 2nd out of 22 in the DREAM3 100-gene in silico networks challenge. Mixed- CLR and Inferelator 1.0 are complementary, demonstrating a large performance gain relative to any single tested method, with precision being especially high at low recall values. Partitioning the provided data set into four groups (knock-down, knock-out, time-series, and combined) revealed that using comprehensive knock-out data alone provides optimal performance. Inferelator 1.0 proved particularly powerful at resolving the directionality of regulatory interactions, i.e. "who regulates who" (approximately 93% of identified true positives were correctly resolved). Performance drops for high indegree genes, i.e. as the number of regulators per target gene increases, but not with out-degree, i.e. performance is not affected by the presence of regulatory hubs.
AB - Background: Many current works aiming to learn regulatory networks from systems biology data must balance model complexity with respect to data availability and quality. Methods that learn regulatory associations based on unit-less metrics, such as Mutual Information, are attractive in that they scale well and reduce the number of free parameters (model complexity) per interaction to a minimum. In contrast, methods for learning regulatory networks based on explicit dynamical models are more complex and scale less gracefully, but are attractive as they may allow direct prediction of transcriptional dynamics and resolve the directionality of many regulatory interactions. Methodology: We aim to investigate whether scalable information based methods (like the Context Likelihood of Relatedness method) and more explicit dynamical models (like Inferelator 1.0) prove synergistic when combined. We test a pipeline where a novel modification of the Context Likelihood of Relatedness (mixed-CLR, modified to use time series data) is first used to define likely regulatory interactions and then Inferelator 1.0 is used for final model selection and to build an explicit dynamical model. Conclusions/Significance: Our method ranked 2nd out of 22 in the DREAM3 100-gene in silico networks challenge. Mixed- CLR and Inferelator 1.0 are complementary, demonstrating a large performance gain relative to any single tested method, with precision being especially high at low recall values. Partitioning the provided data set into four groups (knock-down, knock-out, time-series, and combined) revealed that using comprehensive knock-out data alone provides optimal performance. Inferelator 1.0 proved particularly powerful at resolving the directionality of regulatory interactions, i.e. "who regulates who" (approximately 93% of identified true positives were correctly resolved). Performance drops for high indegree genes, i.e. as the number of regulators per target gene increases, but not with out-degree, i.e. performance is not affected by the presence of regulatory hubs.
UR - http://www.scopus.com/inward/record.url?scp=78149461178&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78149461178&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0009803
DO - 10.1371/journal.pone.0009803
M3 - Article
C2 - 20339551
AN - SCOPUS:78149461178
SN - 1932-6203
VL - 5
JO - PloS one
JF - PloS one
IS - 3
M1 - e9803
ER -