Spatial effects on the speed and reliability of protein–DNA search (pdf)

Article PDF cannot be displayed. You can download it here:

https://nar.oxfordjournals.org/content/36/11/3570.full.pdf

Spatial effects on the speed and reliability of protein–DNA search

Zeba Wunderlich 2 Leonid A. Mirny 0 1 0 Department of Physics, Massachusetts Institute of Technology , Cambridge, MA 02139, USA 1 Harvard-MIT Division of Health Sciences and Technology 2 Biophysics Program, Harvard University , Cambridge, MA , 02138 Strong experimental and theoretical evidence shows that transcription factors (TFs) and other specific DNA-binding proteins find their sites using a two-mode search: alternating between threedimensional (3D) diffusion through the cell and one-dimensional (1D) sliding along the DNA. We show that, due to the 1D component of the search process, the search time of a TF can depend on the initial position of the TF. We formalize this effect by discriminating between two types of searches: global and local. Using analytical calculations and simulations, we estimate how close a TF and binding site need to be to make a local search likely. We then use our model to interpret the wide range of experimental measurements of this parameter. We also show that local and global searches differ significantly in average search time and the variability of search time. These results lead to a number of biological implications, including suggestions of how prokaryotes achieve rapid gene regulation and the relationship between the search mechanism and noise in gene expression. Lastly, we propose a number of experiments to verify the existence and quantify the extent of spatial effects on the TF search process in prokaryotes. - INTRODUCTION ProteinDNA interactions are vitally important for every cell. Transcription factors (TFs) are proteins that interact with specific DNA sequences to regulate gene expression. The targeting of TFs to their sites is a passive process; therefore, it seems natural to assume that TFs simply diffuse through the nucleus (in eukaryotes) or cell (in prokaryotes) until they find their sites. In the 1970s, this assumption was challenged by the observation that, in vitro, the prokaryotic TF LacI is able to find its binding site 100 times faster than expected by three-dimensional (3D) diffusion in the solvent (1). This led to the suggestion of a facilitated diffusion mechanism in which TFs alternate between 3D diffusion, jumping, through the volume of the cell and one-dimensional (1D) sliding along the DNA to rapidly locate their binding sites (24). This hypothesis was corroborated by several pieces of evidencemost strikingly several single molecule studies in which the authors visualized individual proteins sliding along DNA (57). Several groups have also mathematically modeled this process and shown it to be a plausible way of making the search significantly faster than 3D diffusion alone (3,4,811). Several aspects of facilitated diffusion, however, remain puzzling, e.g. the effect of the DNA sequence composition and conformational transitions in the protein on the rate of sliding (10,12) and role of the DNA conformation (11). Here we consider how spatial effects influence the search process. Specifically, we ask whether and how search time depends on the initial distance between the protein and the target site. The distance dependence of the TF search process has not been considered before because the rate of a bimolecular reaction in 3D is distance-independent (13). Therefore, the time it takes for a protein diffusing in 3D to find its target does not depend on the initial distance between the two, as long as this distance is greater than the size of the target. In contrast, the time of search in two dimensions (2D) (e.g. on a membrane) or in 1D (e.g. along DNA or along a filament) is distance-dependent (13). Therefore, we ask: can the 1D component of facilitated diffusion make search much faster for a protein that starts a small distance from its target site? Here we use simulations and analytical estimates to demonstrate that TF search time indeed depends on the initial position of the TF with respect to its binding site. We show that the trajectories can be naturally separated into fast local and slow global searches (Figure 1A). We find that if a TF starts sufficiently closeless than 1000 base pairs (bp) for our model organism Escherichia coli to its binding site, a local search is likely. While studying how spatial effects contribute to the search process, we observe that upon dissociation from Global Jump Transcription Factor DNA DNA, a protein is likely to quickly re-associate near is dissociation point, thus making a short-range hop, rather than a long-range jump (Figure 1B). We examine how these two types of spatial excursions influence the search process, allowing us to reconcile the widely ranging experimental measurements of the sliding length (6,7,14,15). Finally, we show that the strong non-specific binding of TFs to DNA makes global search rather slow, thus making local search appreciably faster. Moreover, local searches have significantly smaller variance in the search time, making them an attractive mechanism to deliver DNA-binding proteins to their targets quickly and reliably. There are a number of biological implications of these spatial effects. Since transcription and translation are coupled in bacteria, proteins are produced near the location of their genes. Therefore, TFs whose genes are co-localized with their binding sites are likely to use a local search mechanism. The efficiency of local search provides a physical justification for the observed co-localization of TF genes and their binding sites in prokaryotic genomes (1618). We also propose a number of experiments to test the mechanism and its predictions. MATERIALS AND METHODS Characterizing hops using simulations To include hops in the search model, we needed to estimate the relative frequency of hops and jumps and the displacement due to hops. Assuming that DNA could be treated as straight rods on the length scale of a hop, we considered the problem in a cylindrical geometry and simplified it further to a 2D geometry (Figure 2A). In the 2D cross-section, DNA strands are represented as absorbing circles. To simulate diffusion in 2D, we discretized the cross section into a 1 mm2 square lattice with 1 nm spacing and randomly distributed DNA strands, each with an absorbing radius of 2 nm. We simulated a TF trajectory as a random walk on the lattice, starting from its dissociation from DNA and ending with its association to DNA. Trajectories that started and ended on the same DNA strand were called hops; otherwise they were jumps (Figure 2A). From these trajectories, we calculated the DNA probability of a hop as a function of the number DNA strands in the lattice (Figure 2B). Using the length of the hop trajectories, we also calculated the displacement along the DNA strand during a hop for lattices with 1500 strands, the approximate density of DNA in E. coli. We assumed that, in the 3D geometry, two-thirds of the random walk steps were in the 2D plane and one-third were in the z-directionalong the DNA. Therefore, given the l (...truncated)