TY - JOUR
T1 - Algorithmic methods to infer the evolutionary trajectories in cancer progression
AU - Caravagna, Giulio
AU - Graudenzi, Alex
AU - Ramazzotti, Daniele
AU - Sanz-Pamplona, Rebeca
AU - De Sano, Luca
AU - Mauri, Giancarlo
AU - Moreno, Victor
AU - Antoniotti, Marco
AU - Mishra, Bud
N1 - Funding Information:
We thank the anonymous reviewers and Prof. Giovanni Tonon for their help with improving the quality and rigor of the manuscript. This work was supported by the SysBioNet project, a Ministero dell'Istruzione, dell'Università e della Ricerca initiative for the Italian Roadmap of European Strategy Forum on Research Infrastructures, and Regione Lombardia (Italy) for the research projects RetroNet through ASTIL Program Grant 12-4-5148000-40 (to M.A., G.M., G.C., A.G., and D.R.). This work was also supported by UA 053 and Network Enabled Drug Design Project ID14546A Rif SAL-7, Fondo Accordi Istituzionali 2009, National Science Foundation Grants CCF-0836649 and CCF-0926166, and a National Cancer Institute Physical Sciences-Oncology Center Grant U54 CA193313-01 (to B.M.); and by the Instituto de Salud Carlos III, supported by European Regional Development Fund Grants PI11-01439, PIE13/00022, the Spanish Association Against Cancer Scientific Foundation, and Catalan Government DURSI Grant 2014SGR647 (to V.M. and R.S.-P.).
Publisher Copyright:
© 2016, National Academy of Sciences. All rights reserved.
PY - 2016/7/12
Y1 - 2016/7/12
N2 - The genomic evolution inherent to cancer relates directly to a renewed focus on the voluminous next-generation sequencing data and machine learning for the inference of explanatory models of how the (epi)genomic events are choreographed in cancer initiation and development. However, despite the increasing availability of multiple additional -omics data, this quest has been frustrated by various theoretical and technical hurdles, mostly stemming from the dramatic heterogeneity of the disease. In this paper, we build on our recent work on the "selective advantage" relation among driver mutations in cancer progression and investigate its applicability to the modeling problem at the population level. Here, we introduce PiCnIc (Pipeline for Cancer Inference), a versatile, modular, and customizable pipeline to extract ensemble-level progression models from cross-sectional sequenced cancer genomes. The pipeline has many translational implications because it combines state-of-the-art techniques for sample stratification, driver selection, identification of fitness-equivalent exclusive alterations, and progression model inference. We demonstrate PiCnIc's ability to reproduce much of the current knowledge on colorectal cancer progression as well as to suggest novel experimentally verifiable hypotheses.
AB - The genomic evolution inherent to cancer relates directly to a renewed focus on the voluminous next-generation sequencing data and machine learning for the inference of explanatory models of how the (epi)genomic events are choreographed in cancer initiation and development. However, despite the increasing availability of multiple additional -omics data, this quest has been frustrated by various theoretical and technical hurdles, mostly stemming from the dramatic heterogeneity of the disease. In this paper, we build on our recent work on the "selective advantage" relation among driver mutations in cancer progression and investigate its applicability to the modeling problem at the population level. Here, we introduce PiCnIc (Pipeline for Cancer Inference), a versatile, modular, and customizable pipeline to extract ensemble-level progression models from cross-sectional sequenced cancer genomes. The pipeline has many translational implications because it combines state-of-the-art techniques for sample stratification, driver selection, identification of fitness-equivalent exclusive alterations, and progression model inference. We demonstrate PiCnIc's ability to reproduce much of the current knowledge on colorectal cancer progression as well as to suggest novel experimentally verifiable hypotheses.
KW - Bayesian structural inference
KW - Cancer evolution
KW - Causality
KW - Next generation sequencing
KW - Selective advantage
UR - http://www.scopus.com/inward/record.url?scp=84978107412&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84978107412&partnerID=8YFLogxK
U2 - 10.1073/pnas.1520213113
DO - 10.1073/pnas.1520213113
M3 - Article
C2 - 27357673
AN - SCOPUS:84978107412
SN - 0027-8424
VL - 113
SP - E4025-E4034
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 28
ER -