TY - JOUR
T1 - Extensive sequencing of seven human genomes to characterize benchmark reference materials
AU - Zook, Justin M.
AU - Catoe, David
AU - McDaniel, Jennifer
AU - Vang, Lindsay
AU - Spies, Noah
AU - Sidow, Arend
AU - Weng, Ziming
AU - Liu, Yuling
AU - Mason, Christopher E.
AU - Alexander, Noah
AU - Henaff, Elizabeth
AU - McIntyre, Alexa B.R.
AU - Chandramohan, Dhruva
AU - Chen, Feng
AU - Jaeger, Erich
AU - Moshrefi, Ali
AU - Pham, Khoa
AU - Stedman, William
AU - Liang, Tiffany
AU - Saghbini, Michael
AU - Dzakula, Zeljko
AU - Hastie, Alex
AU - Cao, Han
AU - Deikus, Gintaras
AU - Schadt, Eric
AU - Sebra, Robert
AU - Bashir, Ali
AU - Truty, Rebecca M.
AU - Chang, Christopher C.
AU - Gulbahce, Natali
AU - Zhao, Keyan
AU - Ghosh, Srinka
AU - Hyland, Fiona
AU - Fu, Yutao
AU - Chaisson, Mark
AU - Xiao, Chunlin
AU - Trow, Jonathan
AU - Sherry, Stephen T.
AU - Zaranek, Alexander W.
AU - Ball, Madeleine
AU - Bobe, Jason
AU - Estep, Preston
AU - Church, George M.
AU - Marks, Patrick
AU - Kyriazopoulou-Panagiotopoulou, Sofia
AU - Zheng, Grace X.Y.
AU - Schnall-Levin, Michael
AU - Ordonez, Heather S.
AU - Mudivarti, Patrice A.
AU - Giorda, Kristina
AU - Sheng, Ying
AU - Rypdal, Karoline Bjarnesdatter
AU - Salit, Marc
N1 - Funding Information:
National Institutes of Health (R25EB020393, R01NS076465).
PY - 2016/6/7
Y1 - 2016/6/7
N2 - The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.
AB - The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.
UR - http://www.scopus.com/inward/record.url?scp=84976413217&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84976413217&partnerID=8YFLogxK
U2 - 10.1038/sdata.2016.25
DO - 10.1038/sdata.2016.25
M3 - Article
C2 - 27271295
AN - SCOPUS:84976413217
SN - 2052-4463
VL - 3
JO - Scientific Data
JF - Scientific Data
M1 - 160025
ER -