HUH-tag

Last updated
Generic HUH endonuclease binding to single-stranded DNA. HUH reaction.tif
Generic HUH endonuclease binding to single-stranded DNA.

HUH endonucleases (HUH-tags) are sequence-specific single-stranded DNA (ssDNA) binding proteins originating from numerous species of bacteria and viruses. [1] Viral HUH endonucleases are involved in initiating rolling circle replication while ones of bacterial origin initiate bacterial conjugation. In biotechnology, they can be used to create protein-DNA linkages, [2] akin to other methods such as SNAP-tag. In doing so, they create a 5' covalent bond between the ssDNA and the protein. HUH endonucleases can be fused with other proteins or used as protein tags.

Contents

The name HUH stands for "histidine-hydrophobic-histidine," referring to the three amino acids at the active site of the endonuclease. Some DNA viruses code for an HUH endonuclease which initiates rolling circle replication of the viral genome, and this process defines the realm Monodnaviria . [3]

Types of HUH endonucleases

HUH endonucleases are broadly split into two categories of enzymes: replication initiator proteins (Rep) or relaxase / mobilization proteins. They both contain small protein domains that recognize sequence-specific origins of replication or origin of transfer at which site they nick DNA. The nicking domain of Reps tend to be smaller, on the order of 10-20 kDa while nicking domains from relaxases are larger, roughly 20-40 kDa in size. [2]

Mode of action

HUH endonucleases generally have two histidine (H) residues in the active site coordinating a metal cation (Mg2+ or Mn2+) that interacts with the phosphate backbone of DNA. These residues allow for a nucleophilic attack, most commonly by an activated tyrosine of the scissile phosphate in the DNA backbone, generating a 5' covalent bond with the ssDNA. In contrast to other DNA-protein linkage approaches, this reaction occurs at ambient conditions and does not require any additional modifications. X-ray crystallography and NMR structures have provided insight into the sequence specificity of DNA binding. [4] [5]

WDV Rep reaction mechanism of nucleophilic attack on the DNA backbone. WDV Rep cleavage mechanism.png
WDV Rep reaction mechanism of nucleophilic attack on the DNA backbone.

Applications

Related Research Articles

<span class="mw-page-title-main">DNA virus</span> Virus that has DNA as its genetic material

A DNA virus is a virus that has a genome made of deoxyribonucleic acid (DNA) that is replicated by a DNA polymerase. They can be divided between those that have two strands of DNA in their genome, called double-stranded DNA (dsDNA) viruses, and those that have one strand of DNA in their genome, called single-stranded DNA (ssDNA) viruses. dsDNA viruses primarily belong to two realms: Duplodnaviria and Varidnaviria, and ssDNA viruses are almost exclusively assigned to the realm Monodnaviria, which also includes some dsDNA viruses. Additionally, many DNA viruses are unassigned to higher taxa. Reverse transcribing viruses, which have a DNA genome that is replicated through an RNA intermediate by a reverse transcriptase, are classified into the kingdom Pararnavirae in the realm Riboviria.

<span class="mw-page-title-main">Integrase</span> Class of enzymes

Retroviral integrase (IN) is an enzyme produced by a retrovirus that integrates its genetic information into that of the host cell it infects. Retroviral INs are not to be confused with phage integrases (recombinases) used in biotechnology, such as λ phage integrase, as discussed in site-specific recombination.

<i>Parvoviridae</i> Family of viruses

Parvoviruses are a family of animal viruses that constitute the family Parvoviridae. They have linear, single-stranded DNA (ssDNA) genomes that typically contain two genes encoding for a replication initiator protein, called NS1, and the protein the viral capsid is made of. The coding portion of the genome is flanked by telomeres at each end that form into hairpin loops that are important during replication. Parvovirus virions are small compared to most viruses, at 23–28 nanometers in diameter, and contain the genome enclosed in an icosahedral capsid that has a rugged surface.

DNA primase is an enzyme involved in the replication of DNA and is a type of RNA polymerase. Primase catalyzes the synthesis of a short RNA segment called a primer complementary to a ssDNA template. After this elongation, the RNA piece is removed by a 5' to 3' exonuclease and refilled with DNA.

<span class="mw-page-title-main">Protein splicing</span> The post-translational removal of peptide sequences from within a protein sequence

Protein splicing is an intramolecular reaction of a particular protein in which an internal protein segment is removed from a precursor protein with a ligation of C-terminal and N-terminal external proteins on both sides. The splicing junction of the precursor protein is mainly a cysteine or a serine, which are amino acids containing a nucleophilic side chain. The protein splicing reactions which are known now do not require exogenous cofactors or energy sources such as adenosine triphosphate (ATP) or guanosine triphosphate (GTP). Normally, splicing is associated only with pre-mRNA splicing. This precursor protein contains three segments—an N-extein followed by the intein followed by a C-extein. After splicing has taken place, the resulting protein contains the N-extein linked to the C-extein; this splicing product is also termed an extein.

In molecular biology, endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain. Some, such as deoxyribonuclease I, cut DNA relatively nonspecifically, while many, typically called restriction endonucleases or restriction enzymes, cleave only at very specific nucleotide sequences. Endonucleases differ from exonucleases, which cleave the ends of recognition sequences instead of the middle (endo) portion. Some enzymes known as "exo-endonucleases", however, are not limited to either nuclease function, displaying qualities that are both endo- and exo-like. Evidence suggests that endonuclease activity experiences a lag compared to exonuclease activity.

<span class="mw-page-title-main">Rolling circle replication</span>

Rolling circle replication (RCR) is a process of unidirectional nucleic acid replication that can rapidly synthesize multiple copies of circular molecules of DNA or RNA, such as plasmids, the genomes of bacteriophages, and the circular RNA genome of viroids. Some eukaryotic viruses also replicate their DNA or RNA via the rolling circle mechanism.

<span class="mw-page-title-main">Porcine circovirus</span> Species of virus

Porcine circovirus (PCV) is a group of four single-stranded DNA viruses that are non-enveloped with an unsegmented circular genome. They are members of the genus Circovirus that can infect pigs. The viral capsid is icosahedral and approximately 17 nm in diameter.

A relaxase is a single-strand DNA transesterase enzyme produced by some prokaryotes and viruses. Relaxases are responsible for site- and strand-specific nicks in unwound double-stranded DNA. Known relaxases belong to the rolling circle replication (RCR) initiator superfamily of enzymes and fall into two broad classes: replicative (Rep) and mobilization (Mob). The nicks produced by Rep relaxases initiate plasmid or virus RCR. Mob relaxases nick at origin of transfer (oriT) to initiate the process of DNA mobilization and transfer known as bacterial conjugation. Relaxases are so named because the single-stranded DNA nick that they catalyze lead to relaxation of helical tension.

<span class="mw-page-title-main">Flap structure-specific endonuclease 1</span> Protein-coding gene in the species Homo sapiens

Flap endonuclease 1 is an enzyme that in humans is encoded by the FEN1 gene.

Helitrons are one of the three groups of eukaryotic class 2 transposable elements (TEs) so far described. They are the eukaryotic rolling-circle transposable elements which are hypothesized to transpose by a rolling circle replication mechanism via a single-stranded DNA intermediate. They were first discovered in plants and in the nematode Caenorhabditis elegans, and now they have been identified in a diverse range of species, from protists to mammals. Helitrons make up a substantial fraction of many genomes where non-autonomous elements frequently outnumber the putative autonomous partner. Helitrons seem to have a major role in the evolution of host genomes. They frequently capture diverse host genes, some of which can evolve into novel host genes or become essential for Helitron transposition.

<span class="mw-page-title-main">Genome editing</span> Type of genetic engineering

Genome editing, or genome engineering, or gene editing, is a type of genetic engineering in which DNA is inserted, deleted, modified or replaced in the genome of a living organism. Unlike early genetic engineering techniques that randomly inserts genetic material into a host genome, genome editing targets the insertions to site-specific locations. The basic mechanism involved in genetic manipulations through programmable nucleases is the recognition of target genomic loci and binding of effector DNA-binding domain (DBD), double-strand breaks (DSBs) in target DNA by the restriction endonucleases, and the repair of DSBs through homology-directed recombination (HDR) or non-homologous end joining (NHEJ).

<span class="mw-page-title-main">Cas9</span> Microbial protein found in Streptococcus pyogenes M1 GAS

Cas9 is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic engineering applications. Its main function is to cut DNA and thereby alter a cell's genome. The CRISPR-Cas9 genome editing technique was a significant contributor to the Nobel Prize in Chemistry in 2020 being awarded to Emmanuelle Charpentier and Jennifer Doudna.

Cas12a is an RNA-guided endonuclease of that forms part of the CRISPR system in some bacteria and is used by scientists to modify DNA. It originates as part of a bacterial immune mechanism, where it serves to destroy the genetic material of viruses and thus protect the cell and colony from viral infection. Cas12a and other CRISPR associated endonucleases are unique in that their use of a guide RNA makes this action highly selective; they only cut DNA when it is adjacent to a certain sequence of nucleotides. In the organisms from which it originates, this guide RNA is a copy of a piece of the genome from a virus that previously infected the cell.

Prime editing is a 'search-and-replace' genome editing technology in molecular biology by which the genome of living organisms may be modified. The technology directly writes new genetic information into a targeted DNA site. It uses a fusion protein, consisting of a catalytically impaired Cas9 endonuclease fused to an engineered reverse transcriptase enzyme, and a prime editing guide RNA (pegRNA), capable of identifying the target site and providing the new genetic information to replace the target DNA nucleotides. It mediates targeted insertions, deletions, and base-to-base conversions without the need for double strand breaks (DSBs) or donor DNA templates.

<i>Ground squirrel hepatitis virus</i> Species of virus

Ground squirrel hepatitis virus, abbreviated GSHV, is a partially double-stranded DNA virus that is closely related to human Hepatitis B virus (HBV) and Woodchuck hepatitis virus (WHV). It is a member of the family of viruses Hepadnaviridae and the genus Orthohepadnavirus. Like the other members of its family, GSHV has high degree of species and tissue specificity. It was discovered in Beechey ground squirrels, Spermophilus beecheyi, but also infects Arctic ground squirrels, Spermophilus parryi. Commonalities between GSHV and HBV include morphology, DNA polymerase activity in genome repair, cross-reacting viral antigens, and the resulting persistent infection with viral antigen in the blood (antigenemia). As a result, GSHV is used as an experimental model for HBV.

<i>Monodnaviria</i> Realm of viruses

Monodnaviria is a realm of viruses that includes all single-stranded DNA viruses that encode an endonuclease of the HUH superfamily that initiates rolling circle replication of the circular viral genome. Viruses descended from such viruses are also included in the realm, including certain linear single-stranded DNA (ssDNA) viruses and circular double-stranded DNA (dsDNA) viruses. These atypical members typically replicate through means other than rolling circle replication.

<i>Cressdnaviricota</i> Phylum of viruses

Cressdnaviricota is a phylum of viruses with small, circular single-stranded DNA genomes and encoding rolling circle replication-initiation proteins with the N-terminal HUH endonuclease and C-terminal superfamily 3 helicase domains. While the replication-associated proteins are homologous among viruses within the phylum, the capsid proteins are very diverse and have presumably been acquired from RNA viruses on multiple independent occasions. Nevertheless, all cressdnaviruses for which structural information is available appear to contain the jelly-roll fold.

<i>Bacilladnaviridae</i> Family of viruses

Bacilladnaviridae is a family of single-stranded DNA viruses that primarily infect diatoms.

Faba bean necrotic yellows virus (FBNYV) is a Nanovirus disease of legumes.

References

  1. Chandler, Michael; de la Cruz, Fernando; Dyda, Fred; Hickman, Alison B.; Moncalian, Gabriel; Ton-Hoang, Bao (2013-07-08). "Breaking and joining single-stranded DNA: the HUH endonuclease superfamily". Nature Reviews Microbiology. 11 (8): 525–538. doi:10.1038/nrmicro3067. ISSN   1740-1526. PMC   6493337 . PMID   23832240.
  2. 1 2 Lovendahl, Klaus N.; Hayward, Amanda N.; Gordon, Wendy R. (2017-05-24). "Sequence-Directed Covalent Protein–DNA Linkages in a Single Step Using HUH-Tags". Journal of the American Chemical Society. 139 (20): 7030–7035. doi:10.1021/jacs.7b02572. ISSN   0002-7863. PMC   5517037 . PMID   28481515.
  3. Koonin EV, Dolja VV, Krupovic M, Varsani A, Wolf YI, Yutin N, Zerbini M, Kuhn JH (18 October 2019). "Create a megataxonomic framework, filling all principal taxonomic ranks, for ssDNA viruses" (docx). International Committee on Taxonomy of Viruses. Retrieved 27 May 2020.
  4. Vega-Rocha, Susana; Byeon, In-Ja L.; Gronenborn, Bruno; Gronenborn, Angela M.; Campos-Olivas, Ramón (2007). "Solution Structure, Divalent Metal and DNA Binding of the Endonuclease Domain from the Replication Initiation Protein from Porcine Circovirus 2". Journal of Molecular Biology. 367 (2): 473–487. doi:10.1016/j.jmb.2007.01.002. ISSN   0022-2836. PMID   17275023.
  5. Everett, Blake A.; Litzau, Lauren A.; Tompkins, Kassidy; Shi, Ke; Nelson, Andrew; Aihara, Hideki; Evans Iii, Robert L.; Gordon, Wendy R. (2019-12-01). "Crystal structure of the Wheat dwarf virus Rep domain". Acta Crystallographica Section F. 75 (Pt 12): 744–749. doi:10.1107/S2053230X19015796. ISSN   2053-230X. PMC   6891580 . PMID   31797816.
  6. Zdechlik, Alina C.; He, Yungui; Aird, Eric J.; Gordon, Wendy R.; Schmidt, Daniel (2019-12-06). "Programmable Assembly of Adeno-Associated Virus–Antibody Composites for Receptor-Mediated Gene Delivery". Bioconjugate Chemistry. 31 (4): 1093–1106. doi:10.1021/acs.bioconjchem.9b00790. ISSN   1043-1802. PMC   7676631 . PMID   31809024.
  7. Aird, Eric J.; Lovendahl, Klaus N.; Martin, Amber St; Harris, Reuben S.; Gordon, Wendy R. (2018-05-31). "Increasing Cas9-mediated homology-directed repair efficiency through covalent tethering of DNA repair template". Communications Biology. 1 (1): 54. doi:10.1038/s42003-018-0054-2. ISSN   2399-3642. PMC   6123678 . PMID   30271937.
  8. Ali, Zahir; Shami, Ashwag; Sedeek, Khalid; Kamel, Radwa; Alhabsi, Abdulrahman; Tehseen, Muhammad; Hassan, Norhan; Butt, Haroon; Kababji, Ahad; Hamdan, Samir M.; Mahfouz, Magdy M. (2020-01-23). "Fusion of the Cas9 endonuclease and the VirD2 relaxase facilitates homology-directed repair for precise genome engineering in rice". Communications Biology. 3 (1): 44. doi:10.1038/s42003-020-0768-9. ISSN   2399-3642. PMC   6978410 . PMID   31974493.
  9. Guo, Wei; Mashimo, Yasumasa; Kobatake, Eiry; Mie, Masayasu (2020-03-16). "Construction of DNA-displaying nanoparticles by enzymatic conjugation of DNA and elastin-like polypeptides using a replication initiation protein". Nanotechnology. 31 (25): 255102. doi:10.1088/1361-6528/ab8042. ISSN   0957-4484. PMID   32176872.
  10. Sagredo, Sandra; Pirzer, Tobias; Aghebat Rafat, Ali; Goetzfried, Marisa A.; Moncalian, Gabriel; Simmel, Friedrich C.; de la Cruz, Fernando (2016). "Orthogonal Protein Assembly on DNA Nanostructures Using Relaxases". Angewandte Chemie International Edition. 55 (13): 4348–4352. doi:10.1002/anie.201510313. ISSN   1521-3773. PMC   5067690 . PMID   26915475.
  11. Mie, Masayasu; Niimi, Takahiro; Mashimo, Yasumasa; Kobatake, Eiry (2019-01-03). "Construction of DNA-NanoLuc luciferase conjugates for DNA aptamer-based sandwich assay using Rep protein". Biotechnology Letters. 41 (3): 357–362. doi:10.1007/s10529-018-02641-7. ISSN   0141-5492. PMID   30603832.