Predicting the sizes of large RNA molecules

Citation:

Yoffe AM, Prinsen P, Gopal A, Knobler CM, Gelbart WM, Ben-Shaul A. Predicting the sizes of large RNA molecules. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA. 2008;105 :16153-16158.

Date Published:

OCT 21

Abstract:

We present a theory of the dependence on sequence of the three-dimensional size of large single-stranded (ss) RNA molecules. The work is motivated by the fact that the genomes of many viruses are large ssRNA molecules-often several thousand nucleotides long-and that these RNAs are spontaneously packaged into small rigid protein shells. We argue that there has been evolutionary pressure for the genome to have overall spatial properties-including an appropriate radius of gyration, R-g-that facilitate this assembly process. For an arbitrary RNA sequence, we introduce the (thermal) average maximum ladder distance (< MLD >) and use it as a measure of the ``extendedness'' of the RNA secondary structure. The < MLD > values of viral ssRNAs that package into capsids of fixed size are shown to be consistently smaller than those for randomly permuted sequences of the same length and base composition, and also smaller than those of natural ssRNAs that are not under evolutionary pressure to have a compact native form. By mapping these secondary structures onto a linear polymer model and by using < MLD > as a measure of effective contour length, we predict the R-g values of viral ssRNAs are smaller than those of nonviral sequences. More generally, we predict the average < MLD > values of large nonviral ssRNAs scale as N-0.67 +/- 0.01, where N is the number of nucleotides, and that their R-g values vary as < MLD >(0.5) in an ideal solvent, and hence as N-0.34. An alternative analysis, which explicitly includes all branches, is introduced and shown to yield consistent results.

Last updated on 12/25/2016