@article{e13cb5813fa546cda64deb865951ed32,
title = "Conserved Epigenetic Regulatory Logic Infers Genes Governing Cell Identity",
abstract = "Determining genes that orchestrate cell differentiation in development and disease remains a fundamental goal of cell biology. This study establishes a genome-wide metric based on the gene-repressive trimethylation of histone H3 at lysine 27 (H3K27me3) across hundreds of diverse cell types to identify genetic regulators of cell differentiation. We introduce a computational method, TRIAGE, which uses discordance between gene-repressive tendency and expression to identify genetic drivers of cell identity. We apply TRIAGE to millions of genome-wide single-cell transcriptomes, diverse omics platforms, and eukaryotic cells and tissue types. Using a wide range of data, we validate the performance of TRIAGE in identifying cell-type-specific regulatory factors across diverse species including human, mouse, boar, bird, fish, and tunicate. Using CRISPR gene editing, we use TRIAGE to experimentally validate RNF220 as a regulator of Ciona cardiopharyngeal development and SIX3 as required for differentiation of endoderm in human pluripotent stem cells. A record of this paper's transparent peer review process is included in the Supplemental Information. Perturbing genes controlling cell decisions have major implications in development or disease. However, identifying key regulatory genes from the thousands expressed in a cell is challenging. TRIAGE is a computational method that distills patterns of epigenetic repression across diverse cell types to infer regulatory genes using input gene expression data from any cell type. Demonstrating its utility, we combine single-cell RNA-seq and TRIAGE to identify and experimentally confirm novel regulators of heart development in evolutionarily distant species.",
keywords = "EpiMap, H3K27me3, TRIAGE, cell differentiation, computational method, epigenetic, pluripotent stem cell, predictions, single cell RNA-seq, Epigenomics/methods, Humans, Cell Differentiation",
author = "Shim, {Woo Jun} and Enakshi Sinniah and Jun Xu and Burcu Vitrinel and Michael Alexanian and Gaia Andreoletti and Sophie Shen and Yuliangzi Sun and Brad Balderson and Carles Boix and Guangdun Peng and Naihe Jing and Yuliang Wang and Manolis Kellis and Tam, {Patrick P.L.} and Aaron Smith and Michael Piper and Lionel Christiaen and Quan Nguyen and Mikael Bod{\'e}n and Palpant, {Nathan J.}",
note = "Funding Information: E.S. acknowledges funding by the Children{\textquoteright}s Hospital Foundation Queensland (award reference number: 50268 ). B.V. acknowledges funding by the American Heart Association grant # 18PRE33990254 . The Ciona work was supported by NIH/ NHLBI award R01 HL108643 to L.C., M.A. was supported by the Swiss National Science Foundation (project P2LAP3_178056 ), and P.P.L.T. is supported by the National Health and Medical Research Council of Australia (grant 1110751 ). Q.N. is supported by an ARC DECRA (the ARC DECRA DE190100116 ). N.P. is supported by the National Health and Medical Research Council of Australia (grant 1143163 ) and the Australian Research Council (grant SR1101002 ). Funding Information: E.S. acknowledges funding by the Children's Hospital Foundation Queensland (award reference number: 50268). B.V. acknowledges funding by the American Heart Association grant #18PRE33990254. The Ciona work was supported by NIH/NHLBI award R01 HL108643 to L.C. M.A. was supported by the Swiss National Science Foundation (project P2LAP3_178056), and P.P.L.T. is supported by the National Health and Medical Research Council of Australia (grant 1110751). Q.N. is supported by an ARC DECRA (the ARC DECRA DE190100116). N.P. is supported by the National Health and Medical Research Council of Australia (grant 1143163) and the Australian Research Council (grant SR1101002). W.J.S. developed the computational basis for the study, performed data analysis and wrote the manuscript. E.S. contributed to experimental and computational design for the study, performed data analysis, carried out functional genetic studies in hPSCs, and wrote the manuscript. J.X. assisted with computational analysis and developed web interactive interface. M.A. and G.A. performed computational analysis on HF pathogenesis data. S.S. assisted with the computational analysis on different single-cell data platforms. B.B. performed computational analysis on melanoma studies. Y.S. performed computational analysis on MOCA data. C.B. and M.K. contributed EpiMap data. B.V. performed functional analysis on Ciona and validated the findings. G.P. and N.J. assisted with spatiotemporal transcriptomic profiling of mouse gastrulation. Y.W. helped with computational analysis of epigenetic data. M.P. assisted with analysis and interpretation of melanoma data. A.S. carried out experiments involving melanoma analysis. P.P.L.T. supervised work on spatiotemporal transcriptomic profiling of mouse gastrulation. L.C. performed functional analysis on Ciona and validated the findings. Q.N. provided assistance to implement TRIAGE on single-cell datasets. M.B. and N.J.P. supervised the project, raised funding, and wrote the manuscript. The authors declare no competing interests. Publisher Copyright: {\textcopyright} 2020 The Authors",
year = "2020",
month = dec,
day = "16",
doi = "10.1016/j.cels.2020.11.001",
language = "English (US)",
volume = "11",
pages = "625--639.e13",
journal = "Cell Systems",
issn = "2405-4712",
publisher = "Cell Press",
number = "6",
}