TY - JOUR
T1 - Cost-effective solutions for high-throughput enzymatic DNA methylation sequencing
AU - Cayo Biobank Research Unit
AU - Longtin, Amy
AU - Watowich, Marina M.
AU - Sadoughi, Baptiste
AU - Petersen, Rachel M.
AU - Brosnan, Sarah F.
AU - Buetow, Kenneth
AU - Cai, Qiuyin
AU - Gurven, Michael D.
AU - Higham, James P.
AU - Highland, Heather M.
AU - Huang, Yi Ting
AU - Kaplan, Hillard
AU - Kraft, Thomas S.
AU - Lim, Yvonne A.L.
AU - Long, Jirong
AU - Melin, Amanda D.
AU - Montague, Michael J.
AU - Roberson, Jamie
AU - Ng, Kee Seong
AU - Platt, Michael L.
AU - Schneider-Crease, India A.
AU - Stieglitz, Jonathan
AU - Trumble, Benjamin C.
AU - Venkataraman, Vivek V.
AU - Wallace, Ian J.
AU - Wu, Jie
AU - Snyder-Mackler, Noah
AU - Jones, Angela
AU - Bick, Alexander G.
AU - Lea, Amanda J.
N1 - Publisher Copyright:
Copyright: © 2025 Longtin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2025/5
Y1 - 2025/5
N2 - Characterizing DNA methylation patterns is important for addressing key questions in evolutionary biology, development, geroscience, and medical genomics. While costs are decreasing, whole-genome DNA methylation profiling remains prohibitively expensive for most population-scale studies, creating a need for cost-effective, reduced representation approaches (i.e., assays that rely on microarrays, enzyme digests, or sequence capture to target a subset of the genome). Most common whole genome and reduced representation techniques rely on bisulfite conversion, which can damage DNA resulting in DNA loss and sequencing biases. Enzymatic methyl sequencing (EM-seq) was recently proposed to overcome these issues, but thorough benchmarking of EM-seq combined with cost-effective, reduced representation strategies is currently lacking. To address this gap, we optimized the Targeted Methylation Sequencing protocol (TMS)—which profiles ~4 million CpG sites—for miniaturization, flexibility, and multispecies use at a cost of ~USD 80. First, we tested modifications to increase throughput and reduce cost, including increasing multiplexing, decreasing DNA input, and using enzymatic rather than mechanical fragmentation to prepare DNA. Second, we compared our optimized TMS protocol to commonly used techniques, specifically the Infinium MethylationEPIC BeadChip (n = 55 paired samples) and whole genome bisulfite sequencing (n = 6 paired samples). In both cases, we found strong agreement between technologies (R2 = 0.97 and 0.99, respectively). Third, we tested the optimized TMS protocol in three non-human primate species (rhesus macaques, geladas, and capuchins). We captured a high percentage (mean = 77.1%) of targeted CpG sites and produced methylation level estimates that agreed with those generated from reduced representation bisulfite sequencing (R2 = 0.98). Finally, we confirmed that estimates of 1) epigenetic age and 2) tissue-specific DNA methylation patterns are strongly recapitulated using data generated from TMS versus other technologies. Altogether, our optimized TMS protocol will enable cost-effective, population-scale studies of genome-wide DNA methylation levels across human and non-human primate species.
AB - Characterizing DNA methylation patterns is important for addressing key questions in evolutionary biology, development, geroscience, and medical genomics. While costs are decreasing, whole-genome DNA methylation profiling remains prohibitively expensive for most population-scale studies, creating a need for cost-effective, reduced representation approaches (i.e., assays that rely on microarrays, enzyme digests, or sequence capture to target a subset of the genome). Most common whole genome and reduced representation techniques rely on bisulfite conversion, which can damage DNA resulting in DNA loss and sequencing biases. Enzymatic methyl sequencing (EM-seq) was recently proposed to overcome these issues, but thorough benchmarking of EM-seq combined with cost-effective, reduced representation strategies is currently lacking. To address this gap, we optimized the Targeted Methylation Sequencing protocol (TMS)—which profiles ~4 million CpG sites—for miniaturization, flexibility, and multispecies use at a cost of ~USD 80. First, we tested modifications to increase throughput and reduce cost, including increasing multiplexing, decreasing DNA input, and using enzymatic rather than mechanical fragmentation to prepare DNA. Second, we compared our optimized TMS protocol to commonly used techniques, specifically the Infinium MethylationEPIC BeadChip (n = 55 paired samples) and whole genome bisulfite sequencing (n = 6 paired samples). In both cases, we found strong agreement between technologies (R2 = 0.97 and 0.99, respectively). Third, we tested the optimized TMS protocol in three non-human primate species (rhesus macaques, geladas, and capuchins). We captured a high percentage (mean = 77.1%) of targeted CpG sites and produced methylation level estimates that agreed with those generated from reduced representation bisulfite sequencing (R2 = 0.98). Finally, we confirmed that estimates of 1) epigenetic age and 2) tissue-specific DNA methylation patterns are strongly recapitulated using data generated from TMS versus other technologies. Altogether, our optimized TMS protocol will enable cost-effective, population-scale studies of genome-wide DNA methylation levels across human and non-human primate species.
UR - http://www.scopus.com/inward/record.url?scp=105006474018&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105006474018&partnerID=8YFLogxK
U2 - 10.1371/journal.pgen.1011667
DO - 10.1371/journal.pgen.1011667
M3 - Article
C2 - 40402999
AN - SCOPUS:105006474018
SN - 1553-7390
VL - 21
JO - PLoS genetics
JF - PLoS genetics
IS - 5 MAY
M1 - e1011667
ER -