The HUH Catalytic site
Mechanism and
Overall Protein Architecture.
Historically,
bacteriophage fX174 protein A (gpA) was the first identified
HUH superfamily member (see (Kornberg &
Baker, 1992)) although, surprisingly, no structural information is
available.
Many
related proteins subsequently identified by bioinformatics (Ilyina & Koonin, 1992, Koonin & Ilyina, 1993) included proteins involved in catalysis of viral and plasmid rolling circle replication
(RCR), conjugative plasmid transfer, and DNA transposition (Kapitonov & Jurka, 2001, Garcillan-Barcia, et al., 2002, Ronning, et al., 2005, Ton-Hoang, et al., 2005, Toleman, et al., 2006, Garcillan-Barcia, et al., 2009). They all
carry conserved protein motifs, including the "HUH" motif composed
of two histidine (H) residues separated by a bulky hydrophobic (U) residue, and
the Y-motif containing either one or two tyrosine (Tyr) residues (found in Y1
and Y2 enzymes respectively).
Y1 HUH enzymes (Fig 1.41.1) include Rep
proteins of some plasmids with ssDNA replication intermediates (such as pUB110 (Gruss & Ehrlich, 1989), a wide range of
eukaryotic viruses (Rosario, et al., 2012), most conjugative plasmid relaxases (de la Cruz, et
al., 2010, Guglielmini, et al.,
2011), ISCR (insertion
sequences related to IS91(Toleman, et al., 2006)) and IS200/IS605 insertion
sequence family transposases (Ronning, et al., 2005, Ton-Hoang, et al., 2005). Y2 enzymes
include fX174 gpA itself, Rep proteins of other isometric
ssDNA and dsDNA phages (e.g. phage P2 (Odegrip
& Haggard-Ljungquist, 2001)), some cyanobacterial and archaeal
plasmids and parvoviruses (e.g. adeno-associated virus, AAV) as well as
transposases of the IS91 and helitron
families (Kapitonov & Jurka, 2001),
and MOBF family plasmid relaxases. In some cases, both Y residues
are mechanistically important while for others, only one of the pair appears to
be essential.
HUH enzymes use a unique mechanism for catalysing ssDNA breakage and
joining. The active site tyrosine creates a 5'-phosphotyrosine intermediate and
a free 3'-OH at the cleavage site (Fig 1.41.1). The
3'-OH can be used for different tasks. The most obvious is to prime
replication, as observed for HUH domains in single-stranded phage Rep proteins,
RCR plasmids and conjugative relaxases. The 3'-OH group can also act as the
nucleophile for strand transfer to resolve the phosphotyrosine intermediate in
the termination step of RCR replication, conjugative transfer and
transposition.
The HUH enzyme cleavage polarity is inverse to that of the tyrosine
recombinases, which make 3' phosphotyrosine intermediates and generate free
5'-OH groups that cannot be used as replication primers (Grindley, et al., 2006).
HUH enzymes also require a divalent metal ion to facilitate cleavage by
localizing and polarising the scissile phosphodiester bond in contrast to the
cofactor-independent tyrosine recombinases. Depending on the enzyme, Mg2+,
Mn2+ or other divalent metal ions can be used in vitro (Datta, et al., 2003, Larkin, et
al., 2005, Boer, et al., 2006,
Boer, et al., 2009, Hickman, et al., 2010, Edwards, et al., 2013). It is presumed
that Mg2+ or Mn2+ are the physiological cofactors. The
HUH histidine pair provides two of the three ligands necessary for metal ion
coordination (Fig 1.41.1). The
location and identity of the third, invariably polar (Glu, Asp, His or Gln)
residue varies across the superfamily.
3D
structures of several Rep and relaxase HUH domains with and without bound DNA
are available e.g. (Datta, et al., 2003, Guasch, et
al., 2003, Larkin, et al., 2003,
Hickman, et al., 2004, Boer, et al., 2006, Boer, et al., 2009, Messing, et
al., 2012). The order of HUH and Y motifs varies in the primary
sequence: in the Relaxase group, the Y-motif is upstream of the HUH-motif
whereas in the Rep group it is downstream (Fig1.41.1). This
"circular permutation" (Koonin & Ilyina,
1993, Dyda & Hickman, 2003, Guasch,
et al., 2003) changes the domain topology. Nevertheless, the
three-dimensional constellation of active site residues is virtually identical
across the superfamily.
Given the
diverse HUH protein functions, it is not surprising that other domains are
often appended to the HUH domain (Fig1.41.1). These are often of unknown function but, ATP dependent helicase, zinc
binding, primase and multimerisation domains are recurring themes (Petit, et al., 1998, Bruand & Ehrlich,
2000, Odegrip, et al., 2000,
Kapitonov & Jurka, 2001, Chang, et
al., 2002, Hickman, et al., 2004,
Clerot & Bernardi, 2006). For example, the ssDNA substrates needed by HUH enzymes can be generated by
a dedicated DNA helicase domain C-terminal to the HUH domain (Im & Muzyczka, 1990, Brister & Muzyczka,
1999, Kapitonov & Jurka, 2001, Clerot & Bernardi, 2006) or
alternatively by recruitment of a host helicase (Petit, et al., 1998, Bruand & Ehrlich,
2000, Odegrip, et al., 2000, Chang, et al., 2002). RCR processes use 3'-5'helicase activity acting
on the template strand to facilitate DNA unwinding at the replication fork
while in conjugation, helicases (as part of the relaxase) are transported into
the recipient cell and track 5' to 3' on the transported ssDNA.
DNA Recognition
Many HUH
nucleases recognize and bind DNA hairpin structures with cleavage sites located
within the hairpin or in the ssDNA on the 5' or 3' side of the stem. The
crucial role of hairpins has been firmly established in many systems including
plasmid conjugation, eukaryotic viral and plasmid replication and transposition (Orozco &
Hanley-Bowdoin, 1996, Brister & Muzyczka, 2000, Ronning, et al., 2005, Ton-Hoang, et
al., 2005, Boer, et al., 2006,
Messing, et al., 2012, Ton-Hoang, et al., 2012). In other
systems, palindromic sequences that can form DNA hairpins are present near the
probable HUH nuclease cleavage sites [Feschotte, 2001] (del Solar, et
al., 1998). Such hairpins can be formed in vivo under a
number of physiological conditions (see (Bikard, et al., 2011)).
Structural studies revealed that
small DNA hairpins can be recognized in several different ways:
sequence-specific recognition of the dsDNA stem; structure-specific recognition
of irregularities in the stem; or sequence-specific recognition of the hairpin
loop (Guasch,
et al., 2003, Hickman, et al.,
2004, Ronning, et al., 2005, Hickman, et al., 2010, Messing, et al., 2012, Edwards, et al., 2013).
The hairpin-flanking DNA - in
many cases in single-stranded form - is also often important for recognition.
Relaxases, for example, make extensive contacts with the bases extending
between the hairpin and cleavage site (Guasch, et al., 2003, Larkin, et al., 2003, Edwards, et al., 2013), and for a
relative of IS200/IS605 transposases, TnpAREP,
nucleotides on the 5' side of the hairpin are crucial for binding and
sequence-specific recognition (Messing, et al., 2012). Other family
members (Hickman, et al., 2004, Ruiz-Maso, et
al., 2007), have more complex binding modes.
HUH enzymes as transposases
Transposases of members of the IS200/IS605(Ton-Hoang, et al.,
2005), IS91 (Mendiola, et al., 1994) and ISCR (Toleman, et al.,
2006) insertion sequence families and the
eukaryotic helitrons (Kapitonov &
Jurka, 2001) are also HUH enzymes. Those
IS200/IS605 family are the best understood.
IS200/IS605 family
IS200/IS605 family transposases are single domain proteins with only the essential HUH motif and a single
catalytic Tyr (Y1 transposases, Fig 1.41.1). Both TnpAIS608 and TnpAISDra2 are obligatory dimers and the
active sites are believed to adopt two functionally important conformations,
one in which each is composed of the HUH motif from one monomer and the Tyr
residue carried by an alpha-helix (aD) from
the other (trans configuration), and
the other in which both motifs are contributed by the same monomer (cis configuration). Only the former has
been observed crystallographically.
Similar proteins are
sometimes found associated with repeated Extragenic Palindromes (REP sequences
whose hairpin structures resemble the ends of IS200/IS605 family members.
RCR transposons: IS91, ISCR and Helitron families
The earliest identified HUH domain transposases were those of the IS91 family (Garcillan-Barcia, et al., 2002) and are significantly larger than Y1
transposases (Fig 1.41.1), carry
a Y2 motif and include an N-terminal zinc binding motif and additional domains
of yet unidentified function.
A group of related elements, the ISCRs
often associated with a variety of antibiotic resistance genes (see (Toleman, et al.,
2006)) carry an orf (the CR or
common region) resembling IS91 family transposases but
with only a single Tyr (Fig 1.41.1). In
addition, eukaryotic relatives, the Helitrons, have been identified by
bioinformatic approaches.
References :
- Bikard
D, Loot C, Baharoglu Z & Mazel D (2011) Folded DNA in action: hairpin
formation and biological functions in prokaryotes. Microbiol Mol Biol Rev 74:
570-588.
- Boer
DR, Ruiz-Maso JA, Lopez-Blanco JR, et al. (2009) Plasmid replication initiator RepB forms a hexamer reminiscent of ring
helicases and has mobile nuclease domains. EMBO
J 28: 1666-1678.
- Boer R,
Russi S, Guasch A, et al. (2006)
Unveiling the molecular mechanism of a conjugative relaxase: The structure of
TrwC complexed with a 27-mer DNA comprising the recognition hairpin and the
cleavage site. J Mol Biol 358: 857-869.
- Brister
JR & Muzyczka N (1999) Rep-mediated nicking of the adeno-associated virus
origin requires two biochemical activities, DNA helicase activity and
transesterification. J Virol 73: 9325-9336.
- Brister
JR & Muzyczka N (2000) Mechanism of Rep-mediated adeno-associated virus
origin nicking. J Virol 74: 7762-7771.
- Bruand
C & Ehrlich SD (2000) UvrD-dependent replication of rolling-circle plasmids
in Escherichia coli. Mol Microbiol 35: 204-210.
- Chang
TL, Naqvi A, Anand SP, Kramer MG, Munshi R & Khan SA (2002) Biochemical
characterization of the Staphylococcus aureus PcrA helicase and its role in
plasmid rolling circle replication. J
Biol Chem 277: 45880-45886.
- Clerot
D & Bernardi F (2006) DNA helicase activity is associated with the
replication initiator protein rep of tomato yellow leaf curl geminivirus. J Virol 80: 11322-11330.
- Datta
S, Larkin C & Schildbach JF (2003) Structural insights into single-stranded
DNA binding and cleavage by F factor TraI. Structure 11: 1369-1379.
- de la
Cruz F, Frost LS, Meyer RJ & Zechner EL (2010) Conjugative DNA metabolism
in Gram-negative bacteria. FEMS Microbiol
Rev 34: 18-40.
- del
Solar G, Giraldo R, Ruiz-Echevarria MJ, Espinosa M & Diaz-Orejas R (1998)
Replication and control of circular bacterial plasmids. Microbiol Mol Biol Rev 62:
434-464.
- Dyda F
& Hickman AB (2003) A mob of reps. Structure
(Camb) 11: 1310-1311.
- Edwards
JS, Betts L, Frazier ML, et al. (2013) Molecular basis of antibiotic multiresistance transfer in Staphylococcus
aureus. Proc Natl Acad Sci U S A 110: 2804-2809.
- Garcillan-Barcia
MP, Francia MV & de la Cruz F (2009) The diversity of conjugative relaxases
and its application in plasmid classification.FEMS Microbiol Rev 33: 657-687.
- Garcillan-Barcia MP, Bernales
I, Mendiola MV & De la Cruz F (2002) IS91 rolling circle transposition. Mobile DNA, Vol. II (Craig NL, Craigie R, Gellert M & Lambowitz A, eds.),
pp. 891-904. ASM press, Washington DC.
- Grindley
ND, Whiteson KL & Rice PA (2006) Mechanisms of site-specific recombination. Annu Rev Biochem 75: 567-605.
- Gruss A
& Ehrlich SD (1989) The family of highly interrelated single-stranded
deoxyribonucleic acid plasmids. Microbiol
Rev 53: 231-241.
- Guasch
A, Lucas M, Moncalian G, et al. (2003) Recognition and processing of the origin of transfer DNA by conjugative
relaxase TrwC. Nat Struct Biol 10: 1002-1010.
- Guglielmini
J, Quintais L, Garcillan-Barcia MP, de la Cruz F & Rocha EP (2011) The
repertoire of ICE in prokaryotes underscores the unity, diversity, and ubiquity
of conjugation. PLoS Genet 7: e1002222.
- Hickman
AB, Ronning DR, Perez ZN, Kotin RM & Dyda F (2004) The nuclease domain of
adeno-associated virus rep coordinates replication initiation using two
distinct DNA recognition interfaces. Mol
Cell 13: 403-414.
- Hickman
AB, James JA, Barabas O, et al. (2010) DNA recognition and the precleavage state during single-stranded DNA
transposition in D. radiodurans. Embo J 29: 3840-3852.
- Ilyina
TV & Koonin EV (1992) Conserved sequence motifs in the initiator proteins
for rolling circle DNA replication encoded by diverse replicons from
eubacteria, eucaryotes and archaebacteria. Nucleic.Acids.Res. 20: 3279-3285.
- Im DS
& Muzyczka N (1990) The AAV origin binding protein Rep68 is an
ATP-dependent site-specific endonuclease with DNA helicase activity. Cell 61: 447-457.
- Kapitonov
VV & Jurka J (2001) Rolling-circle transposons in eukaryotes. Proc Natl Acad Sci U S A 98: 8714-8719.
- Koonin
EV & Ilyina TV (1993) Computer-assisted dissection of rolling circle DNA
replication. Biosystems 30: 241-268.
- Kornberg
A & Baker TA (1992) DNA Replication.
W.H.Freeman, New York.
- Larkin
C, Datta S, Nezami A, Dohm JA & Schildbach JF (2003) Crystallization and
preliminary X-ray characterization of the relaxase domain of F factor TraI. Acta Crystallogr D Biol Crystallogr 59: 1514-1516.
- Larkin
C, Datta S, Harley MJ, Anderson BJ, Ebie A, Hargreaves V & Schildbach JF
(2005) Inter- and intramolecular determinants of the specificity of
single-stranded DNA binding and cleavage by the F factor relaxase. Structure 13: 1533-1544.
- Mendiola
MV, Bernales I & de la Cruz F (1994) Differential roles of the transposon
termini in IS91 transposition. Proc.Natl.Acad.Sci.U.S.A. 91: 1922-1926.
- Messing SA, Ton-Hoang B,
Hickman AB, et al.(2012) The processing of repetitive extragenic palindromes: the structure
of a repetitive extragenic palindrome bound to its associated nuclease. Nucleic Acids Res 40: 9964-9979.
- Odegrip
R & Haggard-Ljungquist E (2001) The two active-site tyrosine residues of
the a protein play non-equivalent roles during initiation of rolling circle
replication of bacteriophage p2. J Mol
Biol 308: 147-163.
- Odegrip
R, Schoen S, Haggard-Ljungquist E, Park K & Chattoraj DK (2000) The interaction
of bacteriophage P2 B protein with Escherichia coli DnaB helicase. J Virol 74: 4057-4063.
- Orozco
BM & Hanley-Bowdoin L (1996) A DNA structure is required for geminivirus
replication origin function. J Virol 70: 148-158.
- Petit
MA, Dervyn E, Rose M, Entian KD, McGovern S, Ehrlich SD & Bruand C (1998)
PcrA is an essential DNA helicase of Bacillus subtilis fulfilling functions
both in repair and rolling-circle replication. Mol Microbiol 29:
261-273.
- Ronning
DR, Guynet C, Ton-Hoang B, Perez ZN, Ghirlando R, Chandler M & Dyda F
(2005) Active site sharing and subterminal hairpin recognition in a new class
of DNA transposases. Mol Cell 20: 143-154.
- Rosario
K, Duffy S & Breitbart M (2012) A field guide to eukaryotic circular single-stranded
DNA viruses: insights gained from metagenomics. Arch Virol 157:
1851-1871.
- Ruiz-Maso
JA, Lurz R, Espinosa M & del Solar G (2007) Interactions between the RepB
initiator protein of plasmid pMV158 and two distant DNA regions within the
origin of replication. Nucleic Acids Res 35: 1230-1244.
- Toleman
MA, Bennett PM & Walsh TR (2006) ISCR Elements: Novel Gene-Capturing
Systems of the 21st Century? Microbiol
Mol Biol Rev 70: 296-316.
- Ton-Hoang
B, Guynet C, Ronning DR, Cointin-Marty B, Dyda F & Chandler M (2005)
Transposition of ISHp608, member of an unusual family of bacterial insertion
sequences. Embo J 24: 3325-3338.
- Ton-Hoang B, Siguier P, Quentin Y,
Onillon S, Marty B, Fichant G & Chandler M (2012) Structuring the bacterial
genome: Y1-transposases associated with REP-BIME sequences. Nucleic Acids Res 40: 3596-3609.