Pool-packaged AAV libraries exhibit extensive length-dependent and homology-dependent chimerism

Nature Biotechnology, May 2026

Adeno-associated viruses (AAVs) are preferred gene therapy vectors because of their versatility, durability and safety profile. Here, we demonstrate extensive chimerism, manifesting as pervasive barcode swapping, among complex recombinant AAV (rAAV) libraries that are packaged as a pool. The observed chimerism is length and homology dependent but capsid independent, in some cases affecting the majority of packaged rAAV genomes. These results have implications for the design and deployment of functional rAAV libraries.

Article PDF cannot be displayed. You can download it here:

https://www.nature.com/articles/s41587-026-03097-1.pdf

Pool-packaged AAV libraries exhibit extensive length-dependent and homology-dependent chimerism

nature biotechnology Brief Communication https://doi.org/10.1038/s41587-026-03097-1 Pool-packaged AAV libraries exhibit extensive length-dependent and homology-dependent chimerism Received: 20 December 2024 Accepted: 19 March 2026 Jean-Benoît Lalanne 1,2,8 , Chau Huynh1,3,8, John K. Mich Avery C. Hunker 4, Troy A. McDiarmid 1,3, Haedong Kim Jonathan T. Ting 4 & Jay Shendure 1,3,5,6,7 , , Boaz P. Levi 4 1,3 , 4 Published online: xx xx xxxx Check for updates Adeno-associated viruses (AAVs) are preferred gene therapy vectors because of their versatility, durability and safety profile. Here, we demonstrate extensive chimerism, manifesting as pervasive barcode swapping, among complex recombinant AAV (rAAV) libraries that are packaged as a pool. The observed chimerism is length and homology dependent but capsid independent, in some cases affecting the majority of packaged rAAV genomes. These results have implications for the design and deployment of functional rAAV libraries. Multiplexed functional genomic screens often use linked components within a cargo, for example, single-cell CRISPR screens1,2 (single guide RNA (sgRNA) and barcode) or massively parallel reporter assays (MPRAs; regulatory element and barcode)3. In these, any level of decoupling of expected pairings ultimately degrades signal quality. A prominent example is the frequent recombination seen in lentiviral cargo packaging4,5. Such recombination, which is length and homology dependent and results from lowly processive reverse transcription and template switching during replication6, stymied early single-cell functional genomics efforts using this delivery strategy7. By contrast, pervasive chimeric rearrangements of recombinant adeno-associated virus (rAAV) genetic material during packaging have not been described to our knowledge. Yet, recent studies of long-read sequenced AAV-packaged DNA have revealed unexpected DNA arrangements8,9. In parallel, high levels of noise and limited dynamic range are commonly observed in barcoded MPRA experiments using rAAVs10–14. These hint at possible unknown complexities during rAAV packaging. To explicitly test for chimera formation during rAAV packaging, we performed barcode swap experiments. Specifically, we constructed a series of rAAV libraries harboring a large number of uniquely associated pairs of barcodes separated by different inserts and flanked by AAV2 inverted terminal repeats (ITRs) (Fig. 1a; all plasmids and oligos listed in Supplementary Data 1). The six libraries harbored inserts of three different lengths (short: ~225 bp, mid-sized: ~800 bp, long: ~2.1 kb) each with two homology classes (homologous: identical sequences, nonhomologous: size-matched to homologous counterparts and bearing tagmented, narrowly size-selected Escherichia coli genomic DNA; Fig. 1a and Extended Data Figs. 1 and 2). Each insert further had a short internal sequence index for downstream demultiplexing (Extended Data Fig. 1d). The resulting libraries were complex (>5 million barcode pairs in parental library p146, bottlenecked to 15,000–45,000 pairs in libraries p149–p154). Inserts were size-adjusted outside the BC1–BC2 intervening sequence with filler sequences to fix the total ITR-to-ITR length to ~2.3 kb for inserts of different sizes (Extended Data Fig. 1c; see library p153× below). These libraries were then packaged separately into AAV capsids (all with PHP.eB, some with AAV2 serotypes, 14 packaging conditions total; Supplementary Data 2 and Methods). To assess for chimeras, defined as discordant barcode pairs within a single read, we performed long-read sequencing of the barcoded inserts (Fig. 1b) using PCR-free library preparation, from both sized-selected digested plasmids (‘zero-swap’ controls) and AAV-packaged DNA, on the Oxford Nanopore Technology (ONT) platform (through 1 Department of Genome Sciences, University of Washington, Seattle, WA, USA. 2Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montréal, Québec, Canada. 3Seattle Hub for Synthetic Biology, Seattle, WA, USA. 4Allen Institute for Brain Science, Seattle, WA, USA. 5 Brotman Baty Institute for Precision Medicine, Seattle, WA, USA. 6Howard Hughes Medical Institute, Seattle, WA, USA. 7Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA. 8These authors contributed equally: Jean-Benoît Lalanne, Chau Huynh. e-mail: ; Nature Biotechnology Brief Communication AAV2 ITR ... AAV2 ITR Different lengths short (200–250 bp) Complex library of unique barcode pairs Separately for each library (for example, p151:mid-hom): Barcode swap experimental workflow Doubly barcoded parental plasmid library Plasmid libraries with various inserts mid (760–940 bp) BC1 BC2 Barcode pair dictionary Homologous p149:short-hom p150:short-nonhom p151:mid-hom p152:mid-nonhom p153×:long-hom p154:long-nonhom 1.0 Plasmid DNA 0.5 1.0 0.1 AAV DNA 0.5 *** p146: parental c Fraction disconcordant BC pairs BC1–BC2 plasmid libraries 0 500 1,000 1,500 2,000 BC-to-BC length (bp) 0 500 AAV p150:AAV p154:AAV p150 1,000 1,500 2,000 p146 BC-to-BC length (bp) 0 d Plasmid p149 0 0 0 (4) Discordant pairs? (2) Digest insert from plasmid Nonhomologous long (1,970–2,200 bp) (3) Long-read sequencing (1) AAV packaging Homology class Zoom Inserts BC2 *** BC1 500 Fraction disconcordant BC pairs b Doubly barcoded AAV constructs *** a https://doi.org/10.1038/s41587-026-03097-1 p152:AAV p151 p152 p154 p153 × 1,000 1,500 2,000 BC-to-BC length (bp) 0.4 p151:AAV 0.2 9% per 10× 0 p152:AAV 10−1 15 ng 100 101 102 % Input dose 15 µg per 15 cm Fig. 1 | Chimera formation during rAAV packaging revealed by barcode swapping experiments. a, A complex doubly barcoded cloning dock with associated dictionary of valid BC1–BC2 pairs was constructed and served as the starting point to clone libraries of inserts of varying lengths and homology class within AAV2 ITRs (six separate libraries: [short ~0.2 kb, mid ~0.8 kb, long ~2.1 kb] × [homologous, nonhomologous]). The seven different libraries considered are schematized (Extended Data Figs. 1 and 2). b, Each cloned barcoded library was separately: (1) digested to liberate the barcoded insert and (2) AAV-packaged. Both plasmid-derived insert and AAV DNA were submitted for direct long-read sequencing. Resulting long reads were scanned for barcodes and the fraction of discordant BC1–BC2 pairs, as compared to the bona fide parental dictionary, was determined. c, Quantification of the fraction of discordant barcode pairs as a function of the full-length BC-to-BC average size. Left, plasmid DNA; middle, AAV-packaged DNA; right, zoomed-in view of y axis range 0–0.1. Each point corresponds to swap quantification for a library for both plasmidderived (square) and AAV-derived (circle) material (n = 1 replicate per library). For each data point, we analyzed full-length BC-to-BC reads passing quality control filters and having s (...truncated)


This is a preview of a remote PDF: https://www.nature.com/articles/s41587-026-03097-1.pdf
Article home page: https://www.nature.com/articles/s41587-026-03097-1

Lalanne, Jean-Benoît, Huynh, Chau, Mich, John K., Hunker, Avery C., McDiarmid, Troy A., Kim, Haedong, Levi, Boaz P., Ting, Jonathan T., Shendure, Jay. Pool-packaged AAV libraries exhibit extensive length-dependent and homology-dependent chimerism, Nature Biotechnology, DOI: 10.1038/s41587-026-03097-1