Modelling the Efficiency of Codon–tRNA Interactions Based on Codon Usage Bias (pdf)

Article PDF cannot be displayed. You can download it here:

https://academic.oup.com/dnaresearch/article-pdf/21/5/511/17462932/dsu017.pdf

Modelling the Efficiency of Codon–tRNA Interactions Based on Codon Usage Bias

DNA RESEARCH 21, 511–525, (2014) Advance Access publication on 6 June 2014 doi:10.1093/dnares/dsu017 Modelling the Efficiency of Codon –tRNA Interactions Based on Codon Usage Bias RENANA Sabi1 and TAMIR Tuller1,2,* Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel1 and The Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel2 *To whom correspondence should be addressed. Tel. þ972-64058363. Fax. þ972-64058363. Email: Edited by Prof. Hiroyuki Toh (Received 10 January 2014; accepted 26 April 2014) Abstract The tRNA adaptation index (tAI) is a widely used measure of the efficiency by which a coding sequence is recognized by the intra-cellular tRNA pool. This index includes among others weights that represent wobble interactions between codons and tRNA molecules. Currently, these weights are based only on the gene expression in Saccharomyces cerevisiae. However, the efficiencies of the different codon–tRNA interactions are expected to vary among different organisms. In this study, we suggest a new approach for adjusting the tAI weights to any target model organism without the need for gene expression measurements. Our method is based on optimizing the correlation between the tAI and a measure of codon usage bias. Here, we show that in non-fungal the new tAI weights predict protein abundance significantly better than the traditional tAI weights. The unique tRNA –codon adaptation weights computed for 100 different organisms exhibit a significant correlation with evolutionary distance. The reported results demonstrate the usefulness of the new measure in future genomic studies. Key words: codon usage bias; tRNA adaptation index; protein levels; wobble interactions; ribosome 1. Introduction Allowed by the redundancy of the genetic code, coding regions exhibit non-uniform usage of synonymous codons. This deviation from uniform codon usage is termed codon usage bias (CUB) and is related among others to various aspects of gene translation (and more generally gene expression) and its efficiency;1 – 10 specifically, it was suggested that it regulates transcription and translation, but may also be related to recombination rate. Indeed, it is known for over 30 years that in most organisms the degree of CUB of genes correlates with their expression levels.11 – 14 Various approaches for quantifying the CUB of a gene have been suggested in the last decades: the effective number of codons, for instance, measures deviations from equal usage of synonymous codons,13 while other indices such as the frequency of optimal codons,15 the codon bias index,11 and the codon adaptation index (CAI)16 define a subset of ‘optimal’ codons and measure the frequency of these codons in the coding region of the gene. The CUB indices have been employed in hundreds of previous studies. For example, it is known that in many organisms (e.g. Escherichia coli) the CAI exhibits a very high correlation with protein levels (similar to the one obtained between mRNA levels and protein levels17); thus, CAI has been frequently used as a proxy of expression levels (see, for example,18 – 20). In addition, it has been employed in a vast number of key studies in the recent years.18,19,21,22 One disadvantage of measures that are based on a set of reference genes11,16,23 is the fact that in the case of organisms with poor genomic data and without large scale cellular measurements, creating a meaningful reference set is challenging. Another disadvantage of these measures is the fact that they cannot separate between the various possible causes of CUB in highly # The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. 512 Inference of Codon –tRNA Interaction Efficiencies expressed genes: some of them may be related directly to the translation process (e.g. co-evolution and/or adaptation to the tRNA pool8,24,25) and others may not be related to translation (e.g. GC content and folding9,26 – 28). In 2004, dos Reis et al. 8 proposed the tRNA adaptation index (tAI), which aims to estimate only the adaptation of codons/genes to some aspects directly related to the elongation step occurs in the ribosome via the adaptation to the tRNA pool, wobble interactions, and properties of the ribosome. Specifically, the tAI considers the fact that different tRNA species can recognize a codon with different affinities.2,8,29 Thus, the tAI is different than CUB-based measurements mentioned above and provides important information related to translation that is not necessarily covered by CUB measures. Indeed, measures of the adaptation of genes to the tRNA pool (such as the tAI) have extensively been used in the recent years for studying questions in diverse biomedical disciplines such as evolutionary biology, functional genomics, and systems biology (see, for example,3,30 – 35). Let ni be the number of tRNA isoacceptors that recognize the ith codon, the absolute adaptiveness value of the ith codon is defined in the following equation: Wi ¼ ni X ð1 Sij Þ tGCNij ð1Þ j¼1 where tGCNij is the gene copy number of the jth tRNA that recognizes the ith codon (a proxy of the tRNA levels24,29,36), and Sij is a selective constraint on the efficiency of the interaction between the ith codon and the jth tRNA, which is scored between 0 ( perfect interaction) and 1 (no interaction);8 specifically, the Sij [Vol. 21, weights can be related to aspects of translation elongation (tRNA, wobble interactions, and properties of the ribosome), as these aspects are expected to affect the efficiency of the codon – anticodon interaction. Wi values are calculated according to Crick’s wobble rules for codon –anticodon pairing (Table 1). The codon relative adaptiveness value wi is obtained by dividing each Wi with the maximum Wi value over all codons.8 The tAI of a gene is defined as the geometric mean of the wi values of its codons. In 1966, Crick37 suggested that in some cases wobble pairing may occur in the third base of the codon. According to Crick, the pairing at the third codon position has to obey chemical constrains; thus, some of the optional parings will not occur. For example, the unlikely pairing of guanine – adenine is due to a side group of guanine, which cannot make one of its bonds. In addition to the four standard nucleotides, modified nucleosides often occupy the wobble position of the anticodon (usually position 34 of the tRNA). In fact, the wobble position is the most frequently modified nucleoside in tRNA.38,39 Inosine, for example, is a common modification of adenine that occurs in the wobble position of many tRNA species.39 – 43 Out of the eight Sij weights, four are related to (...truncated)