Retron

Last updated
Retron msr RNA
RF00170.jpg
Identifiers
Symbolmsr
Rfam RF00170
Other data
RNA type Gene
Domain(s) Bacteria
SO SO:0000233
PDB structures PDBe

A retron is a distinct DNA sequence found in the genome of many bacteria species that codes for reverse transcriptase and a unique single-stranded DNA/RNA hybrid called multicopy single-stranded DNA (msDNA). Retron msr RNA is the non-coding RNA produced by retron elements and is the immediate precursor to the synthesis of msDNA. The retron msr RNA folds into a characteristic secondary structure that contains a conserved guanosine residue at the end of a stem loop. Synthesis of DNA by the retron-encoded reverse transcriptase (RT) results in a DNA/RNA chimera which is composed of small single-stranded DNA linked to small single-stranded RNA. The RNA strand is joined to the 5′ end of the DNA chain via a 2′–5′ phosphodiester linkage that occurs from the 2′ position of the conserved internal guanosine residue.

Contents

Sequence and structure

The retron operon carries a promoter sequence P that controls the synthesis of an RNA transcript carrying three loci: msr, msd, and ret. The ret gene product, a reverse transcriptase, processes the msd/msr portion of the RNA transcript into msDNA. Retron organization.png
The retron operon carries a promoter sequence P that controls the synthesis of an RNA transcript carrying three loci: msr, msd, and ret. The ret gene product, a reverse transcriptase, processes the msd/msr portion of the RNA transcript into msDNA.

Retron elements are about 2 kb long. They contain a single operon controlling the synthesis of an RNA transcript carrying three loci, msr, msd, and ret, that are involved in msDNA synthesis. The DNA portion of msDNA is encoded by the msd gene, the RNA portion is encoded by the msr gene, while the product of the ret gene is a reverse transcriptase similar to the RTs produced by retroviruses and other types of retroelements. [1] Like other reverse transcriptases, the retron RT contains seven regions of conserved amino acids (labeled 1–7 in the figure), including a highly conserved tyr-ala-asp-asp (YADD) sequence associated with the catalytic core. The ret gene product is responsible for processing the msd/msr portion of the RNA transcript into msDNA.

Classification and occurrence

For many years after their discovery in animal viruses, reverse transcriptases were believed to be absent from prokaryotes. Currently, however, RT-encoding elements, i.e. retroelements, have been found in a wide variety of different bacteria:

Function

Since retrons are not mobile, their appearance in diverse bacterial species is not a "selfish DNA" phenomenon. Rather, bacterial retrons confer some protection from phage infection to bacterial hosts. Several retrons are located in DNA regions next to certain protein effector-coding genes. When their expression is activated, most of these effectors and their associated retrons function together to block phage infection. [5] [6]

Retrons are being developed into genome-editing tools. [7]

Related Research Articles

<span class="mw-page-title-main">Complementary DNA</span> Single-stranded DNA synthesized from RNA

In genetics, complementary DNA (cDNA) is DNA synthesized from a single-stranded RNA template in a reaction catalyzed by the enzyme reverse transcriptase. cDNA is often used to express a specific protein in a cell that does not normally express that protein, or to sequence or quantify mRNA molecules using DNA based methods. cDNA that codes for a specific protein can be transferred to a recipient cell for expression, often bacterial or yeast expression systems. cDNA is also generated to analyze transcriptomic profiles in bulk tissue, single cells, or single nuclei in assays such as microarrays, qPCR, and RNA-seq.

<span class="mw-page-title-main">Retrovirus</span> Family of viruses

A retrovirus is a type of virus that inserts a DNA copy of its RNA genome into the DNA of a host cell that it invades, thus changing the genome of that cell. After invading a host cell's cytoplasm, the virus uses its own reverse transcriptase enzyme to produce DNA from its RNA genome, the reverse of the usual pattern, thus retro (backwards). The new DNA is then incorporated into the host cell genome by an integrase enzyme, at which point the retroviral DNA is referred to as a provirus. The host cell then treats the viral DNA as part of its own genome, transcribing and translating the viral genes along with the cell's own genes, producing the proteins required to assemble new copies of the virus. Many retroviruses cause serious diseases in humans, other mammals, and birds.

<span class="mw-page-title-main">Retroposon</span>

Retroposons are repetitive DNA fragments which are inserted into chromosomes after they had been reverse transcribed from any RNA molecule.

<span class="mw-page-title-main">Reverse transcriptase</span> Enzyme which generates DNA

A reverse transcriptase (RT) is an enzyme used to generate complementary DNA (cDNA) from an RNA template, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genomes, by retrotransposon mobile genetic elements to proliferate within the host genome, and by eukaryotic cells to extend the telomeres at the ends of their linear chromosomes. Contrary to a widely held belief, the process does not violate the flows of genetic information as described by the classical central dogma, as transfers of information from RNA to DNA are explicitly held possible.

DNA primase is an enzyme involved in the replication of DNA and is a type of RNA polymerase. Primase catalyzes the synthesis of a short RNA segment called a primer complementary to a ssDNA template. After this elongation, the RNA piece is removed by a 5' to 3' exonuclease and refilled with DNA.

<span class="mw-page-title-main">Ribonuclease H</span> Enzyme family

Ribonuclease H is a family of non-sequence-specific endonuclease enzymes that catalyze the cleavage of RNA in an RNA/DNA substrate via a hydrolytic mechanism. Members of the RNase H family can be found in nearly all organisms, from bacteria to archaea to eukaryotes.

A cDNA library is a combination of cloned cDNA fragments inserted into a collection of host cells, which constitute some portion of the transcriptome of the organism and are stored as a "library". cDNA is produced from fully transcribed mRNA found in the nucleus and therefore contains only the expressed genes of an organism. Similarly, tissue-specific cDNA libraries can be produced. In eukaryotic cells the mature mRNA is already spliced, hence the cDNA produced lacks introns and can be readily expressed in a bacterial cell. While information in cDNA libraries is a powerful and useful tool since gene products are easily identified, the libraries lack information about enhancers, introns, and other regulatory elements found in a genomic DNA library.

<span class="mw-page-title-main">Retrotransposon</span> Type of genetic component

Retrotransposons are a type of genetic component that copy and paste themselves into different genomic locations (transposon) by converting RNA back into DNA through the reverse transcription process using an RNA transposition intermediate.

<span class="mw-page-title-main">Library (biology)</span>

In molecular biology, a library is a collection of DNA fragments that is stored and propagated in a population of micro-organisms through the process of molecular cloning. There are different types of DNA libraries, including cDNA libraries, genomic libraries and randomized mutant libraries. DNA library technology is a mainstay of current molecular biology, genetic engineering, and protein engineering, and the applications of these libraries depend on the source of the original DNA fragments. There are differences in the cloning vectors and techniques used in library preparation, but in general each DNA fragment is uniquely inserted into a cloning vector and the pool of recombinant DNA molecules is then transferred into a population of bacteria or yeast such that each organism contains on average one construct. As the population of organisms is grown in culture, the DNA molecules contained within them are copied and propagated.

<span class="mw-page-title-main">Multicopy single-stranded DNA</span>

Multicopy single-stranded DNA (msDNA) is a type of extrachromosomal satellite DNA that consists of a single-stranded DNA molecule covalently linked via a 2'-5'phosphodiester bond to an internal guanosine of an RNA molecule. The resultant DNA/RNA chimera possesses two stem-loops joined by a branch similar to the branches found in RNA splicing intermediates. The coding region for msDNA, called a "retron", also encodes a type of reverse transcriptase, which is essential for msDNA synthesis.

<span class="mw-page-title-main">Group II intron</span> Class of self-catalyzing ribozymes

Group II introns are a large class of self-catalytic ribozymes and mobile genetic elements found within the genes of all three domains of life. Ribozyme activity can occur under high-salt conditions in vitro. However, assistance from proteins is required for in vivo splicing. In contrast to group I introns, intron excision occurs in the absence of GTP and involves the formation of a lariat, with an A-residue branchpoint strongly resembling that found in lariats formed during splicing of nuclear pre-mRNA. It is hypothesized that pre-mRNA splicing may have evolved from group II introns, due to the similar catalytic mechanism as well as the structural similarity of the Group II Domain V substructure to the U6/U2 extended snRNA. Finally, their ability to site-specifically insert into DNA sites has been exploited as a tool for biotechnology. For example, group II introns can be modified to make site-specific genome insertions and deliver cargo DNA such as reporter genes or lox sites

<span class="mw-page-title-main">Bacteriophage MS2</span> Species of virus

Bacteriophage MS2, commonly called MS2, is an icosahedral, positive-sense single-stranded RNA virus that infects the bacterium Escherichia coli and other members of the Enterobacteriaceae. MS2 is a member of a family of closely related bacterial viruses that includes bacteriophage f2, bacteriophage Qβ, R17, and GA.

<span class="mw-page-title-main">Group I catalytic intron</span>

Group I introns are large self-splicing ribozymes. They catalyze their own excision from mRNA, tRNA and rRNA precursors in a wide range of organisms. The core secondary structure consists of nine paired regions (P1-P9). These fold to essentially two domains – the P4-P6 domain and the P3-P9 domain. The secondary structure mark-up for this family represents only this conserved core. Group I introns often have long open reading frames inserted in loop regions.

Numerous key discoveries in biology have emerged from studies of RNA, including seminal work in the fields of biochemistry, genetics, microbiology, molecular biology, molecular evolution and structural biology. As of 2010, 30 scientists have been awarded Nobel Prizes for experimental work that includes studies of RNA. Specific discoveries of high biological significance are discussed in this article.

Rauchvirus is a genus of viruses in the order Caudovirales, in the family Podoviridae. Bacteria serve as natural hosts. The genus contains only one species: Bordetella virus BPP1.

RT RNA motifs refers to conserved RNA motifs discovered by bioinformatics and that are usually or always located nearby to genes predicted to encode reverse transcriptase. Known RNAs located nearby to RT genes include self-splicing introns, retrons and diversity-generating retroelements (DGR), and RT RNA motifs could function as part of such elements.

<i>Riboviria</i> Realm of viruses

Riboviria is a realm of viruses that includes all viruses that use a homologous RNA-dependent polymerase for replication. It includes RNA viruses that encode an RNA-dependent RNA polymerase, as well as reverse-transcribing viruses that encode an RNA-dependent DNA polymerase. RNA-dependent RNA polymerase (RdRp), also called RNA replicase, produces RNA from RNA. RNA-dependent DNA polymerase (RdDp), also called reverse transcriptase (RT), produces DNA from RNA. These enzymes are essential for replicating the viral genome and transcribing viral genes into messenger RNA (mRNA) for translation of viral proteins.

In virology, realm is the highest taxonomic rank established for viruses by the International Committee on Taxonomy of Viruses (ICTV), which oversees virus taxonomy. Six virus realms are recognized and united by specific highly conserved traits:

Diversity-generating retroelements (DGRs) are a family of retroelements that were first found in Bordetella phage (BPP-1), and since been found in bacteria, Archaea, Archaean viruses, temperate phages, and lytic phages. DGRs benefit their host by mutating particular regions of specific target proteins, for instance, phage tail fiber in BPP-1, lipoprotein in legionella pneumophila, and TvpA in Treponema denticola . An error-prone reverse transcriptase is responsible for generating these hypervariable regions in target proteins. In mutagenic retrohoming, a mutagenized cDNA is reverse transcribed from a template region (TR), and is replaced with a segment similar to the template region called variable region (VR). Accessory variability determinant (Avd) protein is another component of DGRs, and its complex formation with the error-prone RT is of importance to mutagenic rehoming.

<span class="mw-page-title-main">Alan Lambowitz</span> American academic

Alan Lambowitz is a professor for the University of Texas at Austin in Molecular Biosciences and Oncology and has been instrumental in many bio-molecular processes and concepts, such as intron splicing and mitochondrial ribosomal assembly.

References

  1. Lampson BC, Inouye M, Inouye S (2005). "Retrons, msDNA, and the bacterial genome" (PDF). Cytogenet Genome Res. 110 (1–4): 491–499. doi:10.1159/000084982. PMID   16093702. S2CID   24854188.
  2. 1 2 Medhekar B, Mille JF (2007). "Diversity-Generating Retroelements". Current Opinion in Microbiology. 10 (4): 388–395. doi:10.1016/j.mib.2007.06.004. PMC   2703298 . PMID   17703991.
  3. Simon DM, Zimmerly S (2008). "A diversity of uncharacterized reverse transcriptases in bacteria". Nucleic Acids Res. 36 (22): 7219–7229. CiteSeerX   10.1.1.358.8390 . doi:10.1093/nar/gkn867. PMC   2602772 . PMID   19004871.
  4. Liu M, Gingery M, Doulatov SR, Liu Y, Hodes A, Baker S, Davis P, Simmonds M, Churcher C, Mungall K, Quail MA, Preston A, Harvill ET, Maskell DJ, Eiserling FA, Parkhill J, Miller JF (2004). "Genomic and Genetic Analysis of Bordetella Bacteriophages Encoding Reverse Transcriptase-Mediated Tropism-Switching Cassettes". J. Bacteriol. 186 (5): 1503–1517. doi:10.1128/JB.186.5.1503-1517.2004. PMC   344406 . PMID   14973019.
  5. Bobonis, Jacob; Mitosch, Karin; Mateus, André; Karcher, Nicolai; Kritikos, George; Selkrig, Joel; Zietek, Matylda; Monzon, Vivian; Pfalz, Birgit; Garcia-Santamarina, Sarela; Galardini, Marco; Sueki, Anna; Kobayashi, Callie; Stein, Frank; Bateman, Alex (2022-09-01). "Bacterial retrons encode phage-defending tripartite toxin–antitoxin systems". Nature. 609 (7925): 144–150. doi:10.1038/s41586-022-05091-4. ISSN   0028-0836. PMID   35850148. S2CID   250643138.
  6. Millman A, Bernheim A, Stokar-Avihail A, Fedorenko T, Voichek M, Leavitt A, Oppenheimer-Shaanan Y, Sorek R (2020). "Bacterial Retrons Function In Anti-Phage Defense". Cell. 183 (6): 1551–1561. doi: 10.1016/j.cell.2020.09.065 . PMID   33157039.
  7. Simon AJ, Ellington AD, Finkelstein IJ (2019). "Retrons and their applications in genome engineering". Nucleic Acids Research. 47 (21): 11007–11019. doi: 10.1093/nar/gkz865 . PMC   6868368 . PMID   31598685.