Simple, multiplexed, PCR-based barcoding of DNA enables sensitive mutation detection in liquid biopsies using sequencing
Published online 7 April 2016
Nucleic Acids Research, 2016, Vol. 44, No. 11 e105
doi: 10.1093/nar/gkw224
Simple, multiplexed, PCR-based barcoding of DNA
enables sensitive mutation detection in liquid
biopsies using sequencing
Anders Ståhlberg1,2,* , Paul M. Krzyzanowski3 , Jennifer B. Jackson1 , Matthew Egyud1 ,
Lincoln Stein3 and Tony E. Godfrey1,*
1
Department of Surgery, Boston University School of Medicine, 700 Albany Street, Boston, MA 02118, USA,
Department of Pathology, Sahlgrenska Cancer Center, Institute of Biomedicine, Sahlgrenska Academy at University
of Gothenburg, Medicinaregatan 1F, 405 30 Gothenberg, Sweden and 3 Ontario Institute for Cancer Research, MaRS
Centre, 661 University Avenue, Suite 510, Toronto, Ontario M5G 0A3, Canada
2
Received November 6, 2015; Revised March 21, 2016; Accepted March 22, 2016
Detection of cell-free DNA in liquid biopsies offers
great potential for use in non-invasive prenatal testing and as a cancer biomarker. Fetal and tumor DNA
fractions however can be extremely low in these samples and ultra-sensitive methods are required for
their detection. Here, we report an extremely simple and fast method for introduction of barcodes
into DNA libraries made from 5 ng of DNA. Barcoded adapter primers are designed with an oligonucleotide hairpin structure to protect the molecular
barcodes during the first rounds of polymerase chain
reaction (PCR) and prevent them from participating
in mis-priming events. Our approach enables highlevel multiplexing and next-generation sequencing
library construction with flexible library content. We
show that uniform libraries of 1-, 5-, 13- and 31-plex
can be generated. Utilizing the barcodes to generate consensus reads for each original DNA molecule
reduces background sequencing noise and allows
detection of variant alleles below 0.1% frequency in
clonal cell line DNA and in cell-free plasma DNA.
Thus, our approach bridges the gap between the
highly sensitive but specific capabilities of digital
PCR, which only allows a limited number of variants
to be analyzed, with the broad target capability of
next-generation sequencing which traditionally lacks
the sensitivity to detect rare variants.
INTRODUCTION
The ability of massively-parallel, next-generation DNA sequencing (NGS) to identify low prevalence mutations in
heterogeneous samples has revolutionized basic and translational research in cancer and many other fields (1). However, detection of sequence variants below 1% frequency
remains a challenge with standard NGS protocols due to
background noise, much of which is introduced by polymerases during library construction (2). This background
noise is problematic in many clinical and research applications, including detection of rare sequence variants in liquid
biopsies for non-invasive prenatal diagnostics (NIPD) or for
biomarker applications in cancer.
Detection and analysis of fetal DNA in maternal plasma
has led to a revolution in NIPD for Downs Syndrome and
other disorders involving large chromosomal abnormalities (3,4). Moving forward, detection of single nucleotide
variants specific to the fetus offers the potential to diagnose monogenic disorders early on in pregnancy without
the risks associated with chorionic villus sampling or amniocentesis (5–7). In cancer, applications of rare mutation
detection in liquid biopsies include analysis of tumor heterogeneity and identification of therapy resistant clones(8),
monitoring clonal evolution and response to therapy (9)
and early cancer diagnosis using blood/plasma, sputum,
urine or other bodily fluids (10–12). In many cases, these
scenarios potentially require detection of variant allele fractions of 0.1% or less.
In both NIPD and cancer biomarker research, the introduction of COLD polymerase chain reaction (PCR) (13,14)
more recently digital PCR (15) technologies has enabled detection and quantification of ultra-rare sequence variants
in liquid biopsies (16,17). However, digital PCR assays are
specific for both nucleotide position and the specific base
change. Combined with the fact that multiplexing capability is limited, digital PCR is most useful in situations where
a known variant is being sought or where disease-related
variants are well characterized and limited in number. For
* To whom correspondence should be addressed. Tel: +46 31 786 6735; Email:
Correspondence may also be addressed to Tony E. Godfrey. Tel: +1 617 414 8013; Email:
C The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which
permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact
ABSTRACT
e105 Nucleic Acids Research, 2016, Vol. 44, No. 11
MATERIALS AND METHODS
DNA
Wild-type genomic DNA was extracted from a clonally derived Barrett’s esophageal cell line, CP-A, using the QIAamp DNA Mini kit (Qiagen). Wild-type circulating, cellfree DNA (ccfDNA) was extracted from pooled patient
plasma (Innovative Research) using QIAamp Circulating
Nucleic Acid kit (Qiagen). DNA concentrations were quantified with the Qubit 2.0 Fluorometer (Life Technologies)
and stored at −20◦ C. Genomic DNA was sheared using a
M220 focused-ultrasonicator (Covaris).
Melting curve analysis
Hairpin stability was analyzed by melting curve analysis using Varian Cary 300 UV-Vis spectrophotometer (Varian,
Inc). Primers were analyzed at a concentration of 1 M in
PCR buffer (10 mM Tris–HCl (pH 8.0), 50 mM KCl and
5 mM MgCl2 ). Samples were degased using preheating at
90◦ C for 10 min. The absorbance was measured at 260 nm
with a temperature gradient from 25 to 90◦ C, increasing the
temperature stepwise, 0.4◦ C/min. Data were recorded every
0.4◦ C.
Barcoding and library construction
Barcoding of DNA was performed with PCR in 10 l using 1× AccuPrime PCR Buffer II, 0.2 U AccuPrime Taq
DNA Polymerase High Fidelity (both Invitrogen, Thermo
Fisher Scientific), 40 nM of each primer (IDT, Inc) and 5–
100 ng DNA. Primer sequences are shown in Supplementary Table S1. The temperature profile was 98◦ C for 3 min
followed by three cycles of amplification (98◦ C for 10 s, 62◦ C
for 6 min and 72◦ C for 30 s), 65◦ C for 15 min and 95◦ C
for 15 min. Twenty microliter TE buffer, pH 8.0 (Ambion,
Thermo Fisher Scientific) with final concentration of 30
ng/l protease (Streptomyces griseus, Sigma Aldrich) was
added to inactivate the Taq DNA polymerase at the 65◦ C
for 15 min step. The second round of PCR was performed
in 40 l using 1× Q5 Hot Start High-Fidelity Master Mix
(New England BioLabs), 400 nM of each Illumina adaptor primer and 10 l PCR products from the first round of
PCR. The temperature profile was 95◦ C for 3 min followed
by 18–30 cycles of amplification (98◦ C for 10 s, ramping
from 80◦ C down to 72◦ C (...truncated)