Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing

Jungeui Hong, David Gresham

Research output: Contribution to journalArticlepeer-review

Abstract

Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs.

Original languageEnglish (US)
Pages (from-to)221-226
Number of pages6
JournalBioTechniques
Volume63
Issue number5
DOIs
StatePublished - Nov 2017

Keywords

  • PCR duplicates
  • RNA-Seq
  • TrUMIseq
  • TruSeq
  • Unique molecular identifier (UMI)

ASJC Scopus subject areas

  • Biotechnology
  • General Biochemistry, Genetics and Molecular Biology

Fingerprint

Dive into the research topics of 'Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing'. Together they form a unique fingerprint.

Cite this