Binary tree-inspired digital dendrimer

Nature Communications, Apr 2019

Digital polymers with precisely ordered units acting as the coded 0- or 1-bit, are introduced as a promising option for molecular data storage. However, the pursuit of better performance in terms of high storage capacity and useful functions never stops. Herein, we propose a concept of an information-coded 2D digital dendrimer. The divergent growth via thiol-maleimide Michael coupling allows precise arrangements of the 0- and 1-bits in the uniform dendrimers. A protocol for calculating the storage capacity of non-linear binary digital dendrimer is established based on data matrix barcode, generated by the tandem mass spectrometry decoding and encryption. Furthermore, the generated data matrix barcode can be read by a common hand-held device to cater the applications such as item identification, traceability and anticouterfeiting purpose. This work demonstrates the high data storage capacity of a uniform dendrimer and uncovers good opportunities for the digital polymers.

A PDF file should load here. If you do not see its contents the file may be temporarily unavailable at the journal website or you do not have a PDF plug-in installed and enabled in your browser.

Alternatively, you can download the file locally and open with any standalone PDF reader:

https://www.nature.com/articles/s41467-019-09957-6.pdf

Binary tree-inspired digital dendrimer

Binary tree-inspired digital dendrimer Zhihao Huang 0 Qiunan Shi 0 Jiang Guo Fanying Meng 0 1 Yajie Zhang 0 Yutong Lu 0 Zhuangfei Qian 0 Xiaopeng Li Nianchen Zhou 0 2 Zhengbiao Zhang 0 College of Chemistry, Chemical Engineering and Materials Science, Soochow University , 215123 Suzhou , China 1 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology , Cambridge, MA 02139 , USA 2 Department of Chemistry, University of South Florida , Tampa , USA Digital polymers with precisely ordered units acting as the coded 0- or 1-bit, are introduced as a promising option for molecular data storage. However, the pursuit of better performance in terms of high storage capacity and useful functions never stops. Herein, we propose a concept of an information-coded 2D digital dendrimer. The divergent growth via thiolmaleimide Michael coupling allows precise arrangements of the 0- and 1-bits in the uniform dendrimers. A protocol for calculating the storage capacity of non-linear binary digital dendrimer is established based on data matrix barcode, generated by the tandem mass spectrometry decoding and encryption. Furthermore, the generated data matrix barcode can be read by a common hand-held device to cater the applications such as item identification, traceability and anticouterfeiting purpose. This work demonstrates the high data storage capacity of a uniform dendrimer and uncovers good opportunities for the digital polymers. - Beach node has at most two children1. Binary-tree-based inary tree in computer science is a data structure in which data structures are widely used in computer science for efficient searching, labeling, and sorting values. Another important application of binary tree is information coding, such as Huffman coding2, used in lossless data compression, encryption and decryption. The basic principle of information coding is that the particular arrangement of nodes in tree structure is considered as the information. Interestingly, in polymer science, the dendron or dendrimer has the similar binary-tree-like structure (Fig. 1). Due to their globular and highly branched structures, the dendrimers usually convey interesting properties in comparison with their linear analogs, such as increased solubility, low intrinsic viscosity, etc.3. Many specific applications using dendrimers have thus been developed3?19. Inspired by the 1?2 connectivity of binary-tree data structure, we synthesized dendrons and dendrimers with similar branching feature to explore their application in information coding with high storage capacity. Note that researchers have constantly been seeking for more efficient polymer system as data storage media; they range from natural polymer, i.e., DNA20?23, to linear synthetic polymer24?36, which is termed as digital polymer. Similar with the life information coding in DNA, the precise order of different monomer units along a digital polymer chain is regarded as the coded information. Owing to the inherent structure induced information coding, the digital polymers can be used as molecular barcodes or tags for anticounterfeiting purpose37?39. Considering its binary-tree like and highly branched structure, we envisioned that using dendrimer as another type of digital polymer24,30,40,41 might allow a high capability of information coding; that is, richer information can be reasonably decoded from a digital dendrimer. Moreover, intriguing applications of this digital dendrimer were also anticipated. In this work, digital dendrimers were precisely built by using binary-coded monomers via divergent strategy42, as illustrated in (Fig. 2a, b). The monomers were varied by assembling two different sub-monomer units (0- and 1-bit), endowing the binary codes and the structural diversity. Reliable decoding of these digital dendrimers was realized by using tandem mass spectrometry (MS/MS) authenticity31,39. The translation of the MS/MS decoded data created a readable data matrix barcode which was useful for item identification and anticounterfeiting tag. Results Design and synthesis of digital dendrimers. Information-coded digital polymers should meet two basic criteria, i.e., uniform chain length (monodisperse) and precisely-defined unit sequence. It is widely accepted that dendrimers are Binary tree in computer science Dendrimer in polymer science monodispersed and highly branched molecules, which meets the first criterion of digital polymer. Moreover, dendrimers are conventionally synthesized via stepwise divergent or convergent approach, offering the possibility to precisely arrange different monomers. In order to encode binary units into dendrimers, as depicted in Fig. 2a, two sub-monomer units containing a hexyl (1-bit) or a butyl group (0-bit) were designed (Supplementary Fig. 1). Then, a library of branched monomers coded by binary information, e.g., t-0(00)-f2, t-0(11)-f2, t-1(00)-f2, t-1(01)-f2, and t-1(11)-f2, (t presents as thiol group at the focal core, while fn presents as furan group at the periphery terminal) was built (Supplementary Figs. 2?4 and 14?26, see experiment details in Supplementary Methods). By deliberately installing different monomers along the direction from focal to periphery, an array of binary digital dendrimers was constructed based on the highly efficient chemistry of orthogonal deprotections of thiol and maleimide, thiol-maleimide Michael addition (Supplementary Figs. 5?7, see experiment details in Supplementary Methods). For example, the dendron DN-011-G2 with the binary information was prepared by iteratively installing t-0 (11)-f2 monomer (DN-011-G0) via the divergent dendrimer growth (Fig. 2b). Figure 2c displayed an example showing a digital dendrimer (DN-011-G1) containing the certain binary information. Using the same synthesis strategy, 12 digital dendrimers were precisely built as summarized in Table 1. These dendrimers were stringently characterized by thermal characterizations (Supplementary Figs. 9, 10), 1H NMR spectrometer (Supplementary Figs. 27?34 and 36?40), size exclusion chromatography (SEC) (Supplementary Fig. 35), and matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) (Supplementary Figs. 41?43, 52?61). The precise constructions of these dendrimers indicated the successful coding of binary information as designed. MS/MS decoding of digital dendrimers. Besides the efficient data coding, reliable decoding is very important for a digital polymer as well. Tandem mass spectrometry (MS/MS) is regarded as an effective tool for decoding of both natural and synthetic polymers, especially for linear digital polymers26,43. During MS/ MS decoding, the polymer chains are fragmented into sequenceinduced species. The mass/charge ratio analysis of the fragments enables the restoration of the sequence information27, thus realizes the information decoding. Up to date, the structure decoding of a non-linear digital polymer has rarely been explored. Here, MS/MS protocol was used to decode the binary information in the digital dendrimer. The 0-bit (butyl group, 184 Da) and 1-bit (hexyl group, 212 Da) coded sub-monomer units were designed with a succinimide thioether linkage as shown in Fig. 3a). Due to the relatively low bond dissociation energy (simulated: 57.16 kcal/ mol) (Supplementary Fig. 8), the S-C bond tends to break during MS/MS with easily recognizable fragmentation pattern. All these dendrimers held an acetyl protected thiol moiety as the focal core (t) and furan protected maleimide as the peripheral terminal (f) (Fig. 3a). The fragments holding the f end-group were named as aLB (all the furan groups were detached during MS/MS analysis), while the complementary fragments with t terminal were named as tLB44,45. Taking DN-011-G1 as an example, Fig. 3b, c showed its MS ([M+Na-4Fu]+, m/z = 1892.69 Da) and typical MS/MS pattern with detail fragmentation assignments, respectively. In Fig. 3c, the MS/MS signals could be well recognized and ascribed to the fragmented species. For example, three mono-protonated fragmented species were detected at m/z of 1680.59, 1283.56, and 1070.46 Da, which matched with the expected values of [t4?+Na]+, [t3?+Na]+, and [t2?+Na]+ (Fragmentation #1, 3 and 6 in Fig. 3c, d and Supplementary Table 1). Two complementary O S ions were found at m/z of 632.23 and 843.37 Da, corresponding to [a3?+Na]+ and [a2?+Na]+, respectively. Moreover, secondary fragmentation ions were also proposed in Supplementary Fig. 51. All the fragmentation species with calculated MS values were summarized in Fig. 3d, agreeing well with the experimental ones in Fig. 3c. These results confirmed that the MS/MS could induce useful fragmentation pattern of a dendrimer. However, due to its highly branched structure, the MS/MS technique cannot directly provide sequence information of the dendrimer like DN-011-G1, i.e., the spatial arrangements of the binary bits encoded in the dendrimer cannot be exactly decoded. This is much different from the linear digital polymer, which can be readily sequenced by using MS/MS decoding. MS/MS translation and storage capacity calculation. Despite having difficulty in sequencing the dendrimer structure by MS/MS technique, it can afford a full sequence-related fragmentation pattern of a specific dendrimer (Fig. 3c, d). The question is how to translate the fragmentation pattern to the binary digital information and thus calculate the data storage capacity in a reasonable manner? Here, a similar manner for sequencing the linear polymer was used, wherein 1 binary path from one chain terminal to the other could be extracted from the fragmentation pattern. For the dendrimer as DN-011-G1, theoretically, there are 6 paths by orderly cleaving the 0- or 1-bit one by one from the periphery terminals to the focal core (see Supplementary Figs. 44?49). The analysis in Fig. 3c confirmed that the MS/ MS decoding results agreed with the theoretical results, i.e., 6 binary t ?0(00) ?f2 Path 1 Path 2 Path 3 Path 4 Path 5 Path 6 paths, e.g., 011001111, 010101111, 010110111, etc, were found, as illustrated in Fig. 4a. It should be noted that each binary path is not the MS/MS induced chain cleavage route. Actually, the S-C bond cleavages during MS/MS decoding are random and nonselective in terms of its locations in the dendrimer. Since that the focal unit (0bit) was connected with two adjacent 1-bits by two S-C bonds (Fig. 3a), the fragmentation of the focal 0-bit from DN-011-G1 cannot provide useful information for sequencing. Therefore, the overall 6 binary paths represented the binary information encoded in of DN-011-G1. Treating each binary path as a row of a 0/1 data matrix, it was interesting that 720 possibilities of data matrix could be obtained with different arrangements of the binary row. Simply put, each data matrix differed from another by different locations of the binary row. In order to obtain a data matrix for a specific dendrimer, we introduced the encryption cryptography46,47 in this study. As a matter of fact, the rule of encryption can be designed and varied on the demand for confidentiality. Here, the encryption rule of ?1 > 0, right to left, upper? was applied. It compared the bits of different rows one by one from right to left. Specifically, if 1-bit was located in the right position, it was applied to the upper row. By using this encryption, 720 or even more possibilities of data matrix could be encrypted into one unique matrix for a specific dendrimer (Fig. 4 and Supplementary Fig. 50). As such, by MS/MS decoding and encryption, the full sequence-related fragmentation pattern can be translated to binary digital information via the data matrix. The data storage capacity is a key parameter for a digital polymer. Obviously, the calculation protocol of the storage capacity of a digital dendrimer differed much from the linear digital polymer. Herein, a protocol for calculating the storage capacity of a uniform binary-coded dendrimer was proposed by using a 2D data matrix barcode with the ever developed rule for calculating storage capacity48 (https://en.wikipedia.org/wiki/ Data_Matrix). Taking DN-011-G1 as the example, the generated specific data matrix by summing and encrypting (Fig. 4) these 6 binary paths, can be translated into a pre-data matrix barcode (PDMB) by replacing 1-bit with a black module and 0-bit with a white module. To facilitate the building of PDMB, especially for digital dendrimers with complex structures, a binary-tree-based computational algorithm consisting of ExtractPath and TraverseTree was developed for the generation of PDMB (Supplementary Fig. 11). With this powerful binary-tree-based computation, a PDMB library was readily constructed including 5 monomers and 12 dendrimers. Each simulated PDMB perfectly matched with that by MS/MS decoding and encryption process. These PDMBs allowed the information storage capacity of a digital dendrimer to be calculated by considering the number of the modules and 11% normal correction level48 (Fig. 5a). The storage capacity of each DN-011-G1 molecule was calculated to be 48 bits. Note that the storage capacity of a PDMB strongly relies on the structural factors of a dendrimer, such as generation and monomer structure. As a dendrimer grows, the storage capacity significantly increases with more binary codes. For instance, the storage capacity increased 612 times from 48 bits of 1st generation DN011-G1 to 3672 bytes of 2nd generation DN-011-G2 (Fig. 5b). In addition, due to different fragmentation patterns upon MS/MS decoding, dendrimers with different monomer structures might also cause different storage capacities. For example, in contrast to the 48 bits-coded DN-011-G1 from monomer t-0(11)-f2, each DN-101-G1 molecule could code for 207 bits of information. The dendrimer DR-011-111-G2 from t-0(11)-f2 and t-1(11)-f2 could code for 2361 bytes of information; however, the dendrimer DR011-G2 from t-0(11)-f2 (Fig. 5b) could only encode 1574 bytes per molecule. Finally, after equipping PDMB with finder patterns (blue and pink) and a dashed pattern (green), the coded specific information in the data matrix barcode, such as chemical information or URL (Universal Resource Locator) (Supplementary Fig. 12), can be extracted with a smartphone. Thus, it can be used like a commercial data matrix barcode for product identification and traceability (Supplementary movie 1). Importantly and uniquely, the generated PDMB stemmed from a specific dendrimer could be used as an anticounterfeiting label (Supplementary Figs. 13, 62). a n m u l o c n i s it b 6 9 (row) ? 6 (col) = 54 bits 9 bits in row Reed-Solomon algorithm with 11% correction level 48 bits b 8000 6000 y )e itc ty 4000 pa (b ca 2000 e g ra ) 200 to itb S ( 100 0 0 1 2 -G -G -G 1 1 1 1 1 1 -0 -0 -0 DN DN DN s its tye b b its 48 672 b3 3 0 1 2 -G -G -G 1 1 1 0 0 0 -1 -1 -1 N N N D D D s it b 6 s it b 7 0 2 s e t y b 5 4 3 7 1 2 -G -G 0 0 11 11 0 1 2 -G -G -1 -1 -11G -11G -11G -110 -111 -110 -110 -0 -0 -0 N N R R N R R D D D D D D D s e t y Discussion Inspired by the binary-tree data structure in computer science, this work married the dendrimer with information storage, and put forward the concept of digital dendrimer for information coding at a molecular level. Through MS/MS decoding and encryption, the coded binary data of a dendrimer was translated to a readable data matrix barcode. Based on the data matrix barcode, a protocol for calculating the storage capacity of nonlinear binary digital dendrimer was established. The results demonstrated that the digital dendrimer had the high information storage capacity. This data codeable/decodable digital dendrimer can be used for item identification with powerful anticounterfeiting abilities. This work opens up a horizon of dendrimer with promising applications. Methods General synthetic protocol for digital dendrimer. DN-011-G0-MA: DN-011-G0 (4.0 g, 5.1 mmol) and 150 mL of toluene were added to a three-neck round-bottom flask equipped with a condenser. The mixture was stirred and refluxed at 110 ?C under argon flow for about 10 h. 1H NMR showed that the reaction was complete. After cooling to room temperature, toluene was evaporated under vacuum. And the residue was dried under vacuum at 25 ?C for 24 h to afford DN-011-G0-MA (3.1 g, yield 95%) as a pale yellow oil. 1H NMR (300 MHz, CDCl3, ppm): ? 6.74 (s, 4 H), 3.50 (m, 8 H), 2.97?2.63 (m, 6 H), 2.34 (s, 3 H), 1.43 (m, 20 H). DN-011-G0-SH: DN-011-G0 (4.0 g, 5.1 mmol) was dissolved in a mix solvent of 80 mL of anhydrous MeOH and 400 mL of DCM in a 1.0-L three-neck roundbottom flask equipped with a 25-mL slow-addition apparatus under argon atmosphere. The mixture was cooled to ?78 ?C. Then acetyl chloride (17.1 mL, 0.24 mol) was added dropwise to the solution. The mixture was warmed to room temperature and stirred for 15 h. 1H NMR showed that the reaction was complete. The mixture was evaporated and the residue was re-dissolved in 100 mL DCM. The organic layer was washed with washed with saturated NaHCO3 (aq.) (2 ? 50 mL), 50 mL of water and dried with anhydrous Na2SO4. Then, DCM was concentrated under vacuum to afford DN-011-G0-SH (3.5 g, yield 93%) as a pale yellow oil. 1H NMR (300 MHz, CDCl3, ppm): ? 6.51 (s, 4 H), 5.26 (s, 4 H), 3.62?3.36 (m, 8 H), 2.92?2.67 (m, 8 H), 2.55 (dd, J = 14.5, 7.0 Hz, 2 H), 1.48 (m, 20 H). DN-011-G0-MA (0.20 g, 0.3 mmol) and DN-011-G0-SH (0.67 g, 0.9 mmol) were dissolved in 45.0 mL of dry CHCl3 in a 100-mL three-neck round-bottom flask under argon atmosphere at 25 ?C. Triethylamine (0.4 mL, 2.2 mmol) was add via a 1-mL syringe over 5 min. The mixture was stirred for about 18 h. 1H NMR indicated that the reaction was complete. The reaction mixture was washed with saturated NaHCO3 (aq.) (15.0 mL) and water (15.0 mL). The combined organic layer was dried with anhydrous Na2SO4 and the solvent was evaporated to afford the crude product which was purified by column chromatography on silica gel eluting with dichloromethane/ methanol (100/1-80/1) to give the DN-011-G1 (0.48 g, yield 74.5%) as a yellow sticky oil. 1H NMR (300 MHz, CDCl3, ppm): ? 6.52 (s, 8 H), 5.26 (s, 8 H), 3.72 (dd, J = 8.9, 3.3 Hz, 2 H), 3.63?3.36 (m, 24 H), 3.13 (dd, J = 18.6, 9.0 Hz, 2 H), 3.01?2.63 (m, 26 H), 2.55?2.41 (m, 2 H), 2.32 (s, 3 H), 1.80?1.18 (m, 60 H). 13C NMR (75 MHz, CDCl3, ppm) ? 195.54, 176.56, 176.27, 174.72, 174.59, 174.52, 136.52, 80.88, 47.35, 46.68, 38.70, 35.97, 31.98, 31.05, 30.63, 28.71, 28.35, 28.06, 27.31, 26.60, 26.46, 26.18, 26.01. MALDI-TOF MS for DN-011-G1, Calcd: m/z = 1892.60 of [M+Na-4Furan] +; Found: 1892.69 of [M+Na-4Fu]+. Instrumentation. MALDI-TOF mass spectroscopy (MS) were acquired on an UltrafleXtreme MALDI-TOF mass spectrometer (Bruker Daltonics, Germany) equipped with an Nd:YAG smart beam-II laser with 355-nm wavelength and 200 Hz firing rate. For high-resolution mass analysis, the instrument was operated in the reflector mode. Tandem MALDI-TOF MS/MS analysis was recorded by using the LIFT mode on the same instrument controlled by the Flexcontrol 1.4 software package. For MS/MS, ions generated by the MALDI process were accelerated at 7.50 kV through a grid at 6.85 kV into a precursor ion selector (PCIS). In this region, the ions pass through a timed-ion-selector device that is able to select one parent ion from a mixture of ions at different m/z values for subsequent fragmentation in the LIFT cell. After the parent ion at a given m/z was selected by the timed-ion-selector, it passed through a retarding lens where the ions were decelerated and then passed into the LIFT cell. Fragmentation was performed in the simple metastable decomposition mode, and the fragments were further accelerated by 19 kV in the LIFT cell, passed through a post lift metastable suppressor (PLMS), into the reflector, and finally to the detector. MS and MS/MS data were further processed using FlexAnalysis 1.3 software package. The compound trans-2-[3-(4-tert-butyl-phenyl)?2-methyl- 2-propenylidene]malononitrile (DCTB, Aldrich, >98%) served as the matrix and was prepared in CHCl3 at a concentration of 20 mg/mL. The cationizing agent sodium trifluoroacetate was prepared in ethanol at a concentration of 10 mg/mL. The matrix and cationizing salt solutions were mixed in a ratio of 10/1 (v/v). The instrument was calibrated prior to each measurement with external PMMA at the molecular weight under consideration. All samples were dissolved in CHCl3 at a concentration of 10 mg/mL. After sample preparation and solvent evaporation, the target plate was inserted into the MALDI-TOF mass spectrometer. Data availability All relevant data are available within the paper and its Supplementary Information. The computational algorithm can be found at https://github.com/jiangfeng1124/digitaldendrimer. The source data underlying Table 1 and Fig. 3c are provided as a Source Data file. All other data are available from the authors upon request. Code availability The computational algorithm can be found at https://github.com/jiangfeng1124/digitaldendrimer Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article?s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article?s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/ licenses/by/4.0/. This work was supported by the National Science Foundation of China (21674072), State and Local Joint Engineering Laboratory for Novel Functional Polymeric Materials, Jiangsu Key Laboratory of Advanced Functional Polymer Design and Application, the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), and the Program of Innovative Research Team of Soochow University. S. , F.Y.M. , Y.J.Z. and Y.T. L carried out the syntheses . Z.H.H. and Q.N.S. characterized each compound by NMR, SEC, MALDI-TOF/TOF-TOF mass spectroscopy, DSC and TGA. Z.H.H . and Z.F.Q. created all the figures and supplementary movie 1 . Z.B.Z. , X.P. Additional information Supplementary Information accompanies this paper at https://doi.org/10.1038/s41467- 019-09957-6. ? The Author(s) 2019


This is a preview of a remote PDF: https://www.nature.com/articles/s41467-019-09957-6.pdf

Zhihao Huang, Qiunan Shi, Jiang Guo, Fanying Meng, Yajie Zhang, Yutong Lu, Zhuangfei Qian, Xiaopeng Li, Nianchen Zhou, Zhengbiao Zhang, Xiulin Zhu. Binary tree-inspired digital dendrimer, Nature Communications, 2019, DOI: 10.1038/s41467-019-09957-6