Stem-loop

Last updated

An example of an RNA stem-loop Stem-loop.svg
An example of an RNA stem-loop

Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded RNA. The structure is also known as a hairpin or hairpin loop. It occurs when two regions of the same strand, usually complementary in nucleotide sequence when read in opposite directions, base-pair to form a double helix that ends in an unpaired loop. The resulting structure is a key building block of many RNA secondary structures. As an important secondary structure of RNA, it can direct RNA folding, protect structural stability for messenger RNA (mRNA), provide recognition sites for RNA binding proteins, and serve as a substrate for enzymatic reactions. [1]

Contents

Formation and stability

The formation of a stem-loop structure is dependent on the stability of the resulting helix and loop regions. The first prerequisite is the presence of a sequence that can fold back on itself to form a paired double helix. The stability of this helix is determined by its length, the number of mismatches or bulges it contains (a small number are tolerable, especially in a long helix) and the base composition of the paired region. Pairings between guanine and cytosine have three hydrogen bonds and are more stable compared to adenine-uracil pairings, which have only two. In RNA, adenine-uracil pairings featuring two hydrogen bonds are equal to the adenine-thymine bond of the DNA. Base stacking interactions, which align the pi bonds of the bases' aromatic rings in a favorable orientation, also promote helix formation.

The stability of the loop also influences the formation of the stem-loop structure. "Loops" that are fewer than three bases long are sterically impossible and do not form. Large loops with no secondary structure of their own (such as pseudoknot pairing) are also unstable. Optimal loop length tends to be about 4-8 bases long. One common loop with the sequence UUCG is known as the "tetraloop" and is particularly stable due to the base-stacking interactions of its component nucleotides. Therefore, such loops can form on the microsecond time scale. [2]

Structural contexts

Stem-loops occur in pre-microRNA structures and most famously in transfer RNA, which contain three true stem-loops and one stem that meet in a cloverleaf pattern. The anticodon that recognizes a codon during the translation process is located on one of the unpaired loops in the tRNA. Two nested stem-loop structures occur in RNA pseudoknots, where the loop of one structure forms part of the second stem.

Many ribozymes also feature stem-loop structures. The self-cleaving hammerhead ribozyme contains three stem-loops that meet in a central unpaired region where the cleavage site lies. The hammerhead ribozyme's basic secondary structure is required for self-cleavage activity.

Hairpin loops are often elements found within the 5'UTR of prokaryotes. These structures are often bound by proteins or cause the attenuation of a transcript in order to regulate translation. [3]

The mRNA stem-loop structure forming at the ribosome binding site may control an initiation of translation. [4] [5]

Stem-loop structures are also important in prokaryotic rho-independent transcription termination. The hairpin loop forms in an mRNA strand during transcription and causes the RNA polymerase to become dissociated from the DNA template strand. This process is known as rho-independent or intrinsic termination, and the sequences involved are called terminator sequences.

See also

Related Research Articles

<span class="mw-page-title-main">Protein biosynthesis</span> Assembly of proteins inside biological cells

Protein biosynthesis is a core biological process, occurring inside cells, balancing the loss of cellular proteins through the production of new proteins. Proteins perform a number of critical functions as enzymes, structural proteins or hormones. Protein synthesis is a very similar process for both prokaryotes and eukaryotes but there are some distinct differences.

<span class="mw-page-title-main">RNA</span> Family of large biological molecules

Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself or by forming a template for the production of proteins. RNA and deoxyribonucleic acid (DNA) are nucleic acids. The nucleic acids constitute one of the four major macromolecules essential for all known forms of life. RNA is assembled as a chain of nucleotides. Cellular organisms use messenger RNA (mRNA) to convey genetic information that directs synthesis of specific proteins. Many viruses encode their genetic information using an RNA genome.

An inverted repeat is a single stranded sequence of nucleotides followed downstream by its reverse complement. The intervening sequence of nucleotides between the initial sequence and the reverse complement can be any length including zero. For example, 5'---TTACGnnnnnnCGTAA---3' is an inverted repeat sequence. When the intervening length is zero, the composite sequence is a palindromic sequence.

In a chain-like biological molecule, such as a protein or nucleic acid, a structural motif is a common three-dimensional structure that appears in a variety of different, evolutionarily unrelated molecules. A structural motif does not have to be associated with a sequence motif; it can be represented by different and completely unrelated sequences in different proteins or RNA.

<span class="mw-page-title-main">Biomolecular structure</span> 3D conformation of a biological sequence, like DNA, RNA, proteins

Biomolecular structure is the intricate folded, three-dimensional shape that is formed by a molecule of protein, DNA, or RNA, and that is important to its function. The structure of these molecules may be considered at any of several length scales ranging from the level of individual atoms to the relationships among entire protein subunits. This useful distinction among scales is often expressed as a decomposition of molecular structure into four levels: primary, secondary, tertiary, and quaternary. The scaffold for this multiscale organization of the molecule arises at the secondary level, where the fundamental structural elements are the molecule's various hydrogen bonds. This leads to several recognizable domains of protein structure and nucleic acid structure, including such secondary-structure features as alpha helixes and beta sheets for proteins, and hairpin loops, bulges, and internal loops for nucleic acids. The terms primary, secondary, tertiary, and quaternary structure were introduced by Kaj Ulrik Linderstrøm-Lang in his 1951 Lane Medical Lectures at Stanford University.

<span class="mw-page-title-main">Hammerhead ribozyme</span>

The hammerhead ribozyme is an RNA motif that catalyzes reversible cleavage and ligation reactions at a specific site within an RNA molecule. It is one of several catalytic RNAs (ribozymes) known to occur in nature. It serves as a model system for research on the structure and properties of RNA, and is used for targeted RNA cleavage experiments, some with proposed therapeutic applications. Named for the resemblance of early secondary structure diagrams to a hammerhead shark, hammerhead ribozymes were originally discovered in two classes of plant virus-like RNAs: satellite RNAs and viroids. They are also known in some classes of retrotransposons, including the retrozymes. The hammerhead ribozyme motif has been ubiquitously reported in lineages across the tree of life.

<span class="mw-page-title-main">Hairpin ribozyme</span> Enzymatic section of RNA

The hairpin ribozyme is a small section of RNA that can act as a ribozyme. Like the hammerhead ribozyme it is found in RNA satellites of plant viruses. It was first identified in the minus strand of the tobacco ringspot virus (TRSV) satellite RNA where it catalyzes self-cleavage and joining (ligation) reactions to process the products of rolling circle virus replication into linear and circular satellite RNA molecules. The hairpin ribozyme is similar to the hammerhead ribozyme in that it does not require a metal ion for the reaction.

<span class="mw-page-title-main">Purine riboswitch</span>

A purine riboswitch is a sequence of ribonucleotides in certain messenger RNA (mRNA) that selectively binds to purine ligands via a natural aptamer domain. This binding causes a conformational change in the mRNA that can affect translation by revealing an expression platform for a downstream gene, or by forming a translation-terminating stem-loop. The ultimate effects of such translational regulation often take action to manage an abundance of the instigating purine, and might produce proteins that facilitate purine metabolism or purine membrane uptake.

<span class="mw-page-title-main">SAM riboswitch (S-box leader)</span>

The SAM riboswitch is found upstream of a number of genes which code for proteins involved in methionine or cysteine biosynthesis in Gram-positive bacteria. Two SAM riboswitches in Bacillus subtilis that were experimentally studied act at the level of transcription termination control. The predicted secondary structure consists of a complex stem-loop region followed by a single stem-loop terminator region. An alternative and mutually exclusive form involves bases in the 3' segment of helix 1 with those in the 5' region of helix 5 to form a structure termed the anti-terminator form. When SAM is unbound, the anti-terminator sequence sequesters the terminator sequence so the terminator is unable to form, allowing the polymerase to read-through the downstream gene. When S-Adenosyl methionine (SAM) is bound to the aptamer, the anti-terminator is sequestered by an anti-anti-terminator; the terminator forms and terminates the transcription. However, many SAM riboswitches are likely to regulate gene expression at the level of translation.

<span class="mw-page-title-main">Intrinsic termination</span>

Intrinsic, or rho-independent termination, is a process to signal the end of transcription and release the newly constructed RNA molecule. In bacteria such as E. coli, transcription is terminated either by a rho-dependent process or rho-independent process. In the Rho-dependent process, the rho-protein locates and binds the signal sequence in the mRNA and signals for cleavage. Contrarily, intrinsic termination does not require a special protein to signal for termination and is controlled by the specific sequences of RNA. When the termination process begins, the transcribed mRNA forms a stable secondary structure hairpin loop, also known as a stem-loop. This RNA hairpin is followed by multiple uracil nucleotides. The bonds between uracil (rU) and adenine (dA) are very weak. A protein bound to RNA polymerase (nusA) binds to the stem-loop structure tightly enough to cause the polymerase to temporarily stall. This pausing of the polymerase coincides with transcription of the poly-uracil sequence. The weak adenine-uracil bonds lower the energy of destabilization for the RNA-DNA duplex, allowing it to unwind and dissociate from the RNA polymerase. Overall, the modified RNA structure is what terminates transcription.

<span class="mw-page-title-main">Tetraloop</span>

Tetraloops are a type of four-base hairpin loop motifs in RNA secondary structure that cap many double helices. There are many variants of the tetraloop. The published ones include ANYA, CUYG, GNRA, UNAC and UNCG.

<span class="mw-page-title-main">Red clover necrotic mosaic virus translation enhancer elements</span>

Red clover necrotic mosaic virus (RCNMV) contains several structural elements present within the 3′ and 5′ untranslated regions (UTR) of the genome that enhance translation. In eukaryotes transcription is a prerequisite for translation. During transcription the pre-mRNA transcript is processes where a 5′ cap is attached onto mRNA and this 5′ cap allows for ribosome assembly onto the mRNA as it acts as a binding site for the eukaryotic initiation factor eIF4F. Once eIF4F is bound to the mRNA this protein complex interacts with the poly(A) binding protein which is present within the 3′ UTR and results in mRNA circularization. This multiprotein-mRNA complex then recruits the ribosome subunits and scans the mRNA until it reaches the start codon. Transcription of viral genomes differs from eukaryotes as viral genomes produce mRNA transcripts that lack a 5’ cap site. Despite lacking a cap site viral genes contain a structural element within the 5’ UTR known as an internal ribosome entry site (IRES). IRES is a structural element that recruits the 40s ribosome subunit to the mRNA within close proximity of the start codon.

<span class="mw-page-title-main">Nucleic acid tertiary structure</span> Three-dimensional shape of a nucleic acid polymer

Nucleic acid tertiary structure is the three-dimensional shape of a nucleic acid polymer. RNA and DNA molecules are capable of diverse functions ranging from molecular recognition to catalysis. Such functions require a precise three-dimensional structure. While such structures are diverse and seemingly complex, they are composed of recurring, easily recognizable tertiary structural motifs that serve as molecular building blocks. Some of the most common motifs for RNA and DNA tertiary structure are described below, but this information is based on a limited number of solved structures. Many more tertiary structural motifs will be revealed as new RNA and DNA molecules are structurally characterized.

<span class="mw-page-title-main">Riboregulator</span>

In molecular biology, a riboregulator is a ribonucleic acid (RNA) that responds to a signal nucleic acid molecule by Watson-Crick base pairing. A riboregulator may respond to a signal molecule in any number of manners including, translation of the RNA into a protein, activation of a ribozyme, release of silencing RNA (siRNA), conformational change, and/or binding other nucleic acids. Riboregulators contain two canonical domains, a sensor domain and an effector domain. These domains are also found on riboswitches, but unlike riboswitches, the sensor domain only binds complementary RNA or DNA strands as opposed to small molecules. Because binding is based on base-pairing, a riboregulator can be tailored to differentiate and respond to individual genetic sequences and combinations thereof.

<span class="mw-page-title-main">Nucleic acid structure</span> Biomolecular structure of nucleic acids such as DNA and RNA

Nucleic acid structure refers to the structure of nucleic acids such as DNA and RNA. Chemically speaking, DNA and RNA are very similar. Nucleic acid structure is often divided into four different levels: primary, secondary, tertiary, and quaternary.

<span class="mw-page-title-main">Nucleic acid secondary structure</span>

Nucleic acid secondary structure is the basepairing interactions within a single nucleic acid polymer or between two polymers. It can be represented as a list of bases which are paired in a nucleic acid molecule. The secondary structures of biological DNAs and RNAs tend to be different: biological DNA mostly exists as fully base paired double helices, while biological RNA is single stranded and often forms complex and intricate base-pairing interactions due to its increased ability to form hydrogen bonds stemming from the extra hydroxyl group in the ribose sugar.

<span class="mw-page-title-main">Complementarity (molecular biology)</span> Lock-and-key pairing between two structures

In molecular biology, complementarity describes a relationship between two structures each following the lock-and-key principle. In nature complementarity is the base principle of DNA replication and transcription as it is a property shared between two DNA or RNA sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position in the sequences will be complementary, much like looking in the mirror and seeing the reverse of things. This complementary base pairing allows cells to copy information from one generation to another and even find and repair damage to the information stored in the sequences.

<span class="mw-page-title-main">Kissing stem-loop</span>

In genetics, a kissing stem-loop, or kissing stem loop interaction, is formed in ribonucleic acid (RNA) when two bases between two hairpin loops pair. These intra- and intermolecular kissing interactions are important in forming the tertiary or quaternary structure of many RNAs.

Non-canonical base pairs are planar hydrogen bonded pairs of nucleobases, having hydrogen bonding patterns which differ from the patterns observed in Watson-Crick base pairs, as in the classic double helical DNA. The structures of polynucleotide strands of both DNA and RNA molecules can be understood in terms of sugar-phosphate backbones consisting of phosphodiester-linked D 2’ deoxyribofuranose sugar moieties, with purine or pyrimidine nucleobases covalently linked to them. Here, the N9 atoms of the purines, guanine and adenine, and the N1 atoms of the pyrimidines, cytosine and thymine, respectively, form glycosidic linkages with the C1’ atom of the sugars. These nucleobases can be schematically represented as triangles with one of their vertices linked to the sugar, and the three sides accounting for three edges through which they can form hydrogen bonds with other moieties, including with other nucleobases. The side opposite to the sugar linked vertex is traditionally called the Watson-Crick edge, since they are involved in forming the Watson-Crick base pairs which constitute building blocks of double helical DNA. The two sides adjacent to the sugar-linked vertex are referred to, respectively, as the Sugar and Hoogsteen edges.

Coronavirus genomes are positive-sense single-stranded RNA molecules with an untranslated region (UTR) at the 3′ end which is called the 3′ UTR. The 3′ UTR is responsible for important biological functions, such as viral replication. The 3′ UTR has a conserved RNA secondary structure but different Coronavirus genera have different structural features described below.

References

  1. Svoboda, P., & Cara, A. (2006). Hairpin RNA: A secondary structure of primary importance. Cellular and Molecular Life Sciences, 63(7), 901-908.
  2. Ma H, Proctor D, Bevilacqua P, Gruebele M (2006). "Exploring the energy landscape of a small RNA hairpin". J Am Chem Soc. 128 (5): 1523–1530. doi:10.1021/ja0553856. PMID   16448122.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  3. Meyer, Michelle; Deiorio-Haggar K; Anthony J (July 2013). "RNA structures regulating ribosomal protein biosynthesis in bacilli". RNA Biology. 7. 10 (7): 1160–1164. doi:10.4161/rna.24151. PMC   3849166 . PMID   23611891.
  4. Malys N, Nivinskas R (2009). "Non-canonical RNA arrangement in T4-even phages: accommodated ribosome binding site at the gene 26-25 intercistronic junction". Mol Microbiol. 73 (6): 1115–1127. doi:10.1111/j.1365-2958.2009.06840.x. PMID   19708923. S2CID   8187771.
  5. Malys N, McCarthy JEG (2010). "Translation initiation: variations in the mechanism can be anticipated". Cellular and Molecular Life Sciences. 68 (6): 991–1003. doi:10.1007/s00018-010-0588-z. PMID   21076851. S2CID   31720000.