ISMex38
- Family Tn3
- Group
- Isoform
- Synonym(s) TnMex38
Accession number | Transposition | Origin | Host |
---|---|---|---|
 | ND | Methylobacterium extorquens | Methylobacterium extorquens AM1; Methylobacterium chloromethanicum CM4 plasmid pMCHL01 |
DNA section
IS Length : 3841 bp
Ends
IR Length : 31/36
IRL : GAGAGCATCGTGTTGGTTGTTCGCCCATTTACGGTTGGCAGGACCGGGCC
IRR : GAGGGCATCGTGTTGACTGTTCGCCCAGTTACGCTTTGTGCGAACGTTCA
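The IR length value can be checked directly against the two end sequences. The sketch below assumes that "31/36" means 31 identical positions within the first 36 bp of IRL and IRR, and that IRR is listed as the reverse complement of the right end of the element. The helper names are illustrative and not part of the record.

```python
IRL = "GAGAGCATCGTGTTGGTTGTTCGCCCATTTACGGTTGGCAGGACCGGGCC"
IRR = "GAGGGCATCGTGTTGACTGTTCGCCCAGTTACGCTTTGTGCGAACGTTCA"
# Final 41 bases of the element, copied from the last line of the DNA sequence.
ELEMENT_3PRIME_END = "GCACAAAGCGTAACTGGGCGAACAGTCAACACGATGCCCTC"

def count_matches(a: str, b: str, window: int) -> int:
    """Count identical positions in the first `window` bases of a and b."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))

def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

# Assumption: the IR window is the first 36 bp of each listed end.
matches = count_matches(IRL, IRR, 36)
print(f"IR identity: {matches}/36")

# Assumption: IRR is shown in the same orientation as IRL, so its reverse
# complement should coincide with the 3' end of the element sequence.
print(reverse_complement(IRR).endswith(ELEMENT_3PRIME_END))
```

Under these assumptions, the mismatches in the 36 bp window fall at positions 4, 16, 17, 28 and 34.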
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AGGATCTCT | TGGTA | GAGACAGCC | 5 |
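As an illustration of how the insertion-site row can be read, the sketch below lays out the occupied and the presumed empty target site. It assumes the usual target-site-duplication model, in which the 5 bp direct repeat occurs once in the empty site and flanks the element on both sides after insertion. The function names and the `[ISMex38]` placeholder are hypothetical.

```python
left_flank, direct_repeat, right_flank = "AGGATCTCT", "TGGTA", "GAGACAGCC"

def occupied_site(left: str, dr: str, right: str, element: str = "[ISMex38]") -> str:
    """Layout after insertion: the direct repeat brackets the element."""
    return f"{left}{dr}{element}{dr}{right}"

def empty_site(left: str, dr: str, right: str) -> str:
    """Presumed layout before insertion: a single copy of the target sequence."""
    return f"{left}{dr}{right}"

print(occupied_site(left_flank, direct_repeat, right_flank))
print(empty_site(left_flank, direct_repeat, right_flank))
```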
DNA sequence
GAGAGCATCGTGTTGGTTGTTCGCCCATTTACGGTTGGCAGGACCGGGCCGTGAACGGGACATGCTTTTCTGATGGGCGCAACGGTCGGCTACGCCGGTC
GGGCGACGCCCGCCAAGGCCATTTCGCGGTAGACGGTCGCGCGTCCGAGCCCGACCTGGCGAGCCGCCTCGGTCGGTGAGAGGCCGGCCTGGACGAGCTT
CAGGGCGGCGCCGACCTTGTCGGGATCGATCGGCTGCCGACCCGGGCGCCGACCCTTGGCCCGGGCCGCCGCGATGCCGTCCTTGGTGCGCTCGGCGATG
AGCCGCCGCTCGAAGTGCGCGATGGCGCCGAACACGTGGAAGACCAGCTCGCCGGCGGCCGAGCCGGTGTCGATCTTCTCCTCGAGGCTGAGGAGTGCGA
TGCCGCGCTCCTTGAGCAGGGTCACGCTCGACAGCAGCTCGGCGAGGGATCGGCCCAGACGATCGAGGCGCACAACAGCCAGGGTGTCCCCTTTTCGAGC
ATAGGCCAGGAGTTCGCTCAGACCGGGTCGATCCATGGATTTGCCGGACCGCACGTCCGTGAACACCCGGATGGCGCCGGCGTGTTCCAGCCGCAGCCTC
TGTCCGGCGACGTCCTGGTCCCCCGTCGACACCCGGGCATAGCCCAGCACATCGCCCATCGACGCCCTCTGCCCGTCCCGCAACCGGCCGTTCTGGGGAC
GGGTGCGCCGACGATCCGCGACGCGCCTCGATCCCGTCCACCAACTATGTCCCGTTAAGCCGCACTGTCTAGCTTCGACCGCGACCTTTCCTGGACGGTT
CGGGCATAGCGGGACGATGGCAAGACGGTCGCTTCTGAGCACGGCGGAGCGCGCGCGCCTGTTCGGCATCCCGGTCGATCGCGATGGGCTGGCGCGGCAC
TACACCTTCGATCGCCAGGACCTCGCCCTCATCGCCACGCGCCGCGGCGACGCCAACCGGATCGGCTTCGCGGTGCAGCTCGCGCTGCTGCGTCATCCTG
GCTTCGGCTTGTCCCCGGCGATCACGGTCGAGCCGGCACTGGTCACCCGCATCGCCGAACAGCTCGCGATCGACCCGAGGGCCTTCACCGCCTATGCGGG
GCGCAGTCCCACGATATCCGATCACGCCCGCGCCCTCGAACGGGTGCTGGGACTTCGCCCCTGCGCGAGGGCCGACCTGCCGTCGATGATCACGGCCGCG
GCCCGGGCGGCATGGCCGACCGACCAGGGGGAGCCGATCGCGGTCGCGGTGATGGCCGCGTTGCGCGACAGCGGCATCGTCCTGCCGGCACCCGACACGA
TCGAGCGCGCGGGGCTCGCCGGTCGCGCGCGTGCGAGGAAACAGGTCGCCGCCGCGCTGCTCGCGGGCATGACCGACGCGCTCGCGGCGCAGCTCGACGC
CCTTCTGGCGATCGATCCGAAGATCGGGCGGCCACCCCTCTCCTGGATGAAGGACCTGCCCCGTGCGCCCAAGCCCAACCATGTCCGCGAACTCCTCGAC
AAGCTCGCTGCCGTGCGGGATCTCGGGCTCGATCCGCGGGCGGCCGAGCGCATCCACCCCGACCGGCTCGCGCTCCTGATGCGCGAAGGTCGGATCACGC
CGGCCTCTACCCTCGAGCGTTATGCCCCGTCGCGGCGGCGTGCCATCGTGGTCGCGACGTTGCTCGATCTCGAACGCCGCTTGACCGATGCGGCGCTGTC
CATGGCGGACCGCCTCATCGGCGCGAGCTTCACACGCGGCAAAGCCGCGCGCGAGAGGACCTTCGTGGCCACCTCGCGCGACGTCGGCCGGCTCATGCGC
CTTCTCGCGGGGACCGCCGGCGCAGTCGCGACGGCCATGAAGGAGAACGGCGACGCGCTCGCCGCGATCGATGCCGCCGTCGGGCTCGACAGGCTCATCG
CGGCAAAGCCCCAGGCGGCCGAGATCGCCGACGTCGCAGAGGAGGATCCGCTCGTGCGCGCCGCCGATCGCTGGATGAGGCTGCGCAAGTACGGGCCGAT
GCTGATCGAGGCGATCGACTTCAAGGCGGCGCGCGCCGATGACCGCACGGTCGCGGCCCTGACCGCGTTGCGCGATCTGAACCGCTCGGGCAAGCGGGAC
CTTCCCAAGGGTACGCCGATGCCGTTCAAGAAGGAATGGCGCCGGCTCGTGGCCGGGGCGGACGGCAGGCTCGATCGCCGACTGTTCGAGACGGCCCTGT
TCGCCCATCTGAGGAACAAATGGCGCTCGGGTGACCTGTGGGTCGAGCGCTCGACCCACTACCGTCGCTTCGACAGTTATCTCCTGCCCCTCGACGAAGC
GCGGACTATCGTCGCCCCGCTCGGCCTGCCTTGCGACCCCGACGCCTGGCTGGCGGCCCGCGCGGAGCGGCTCGACCGGCGGCTGAAGCGCCTCGGCCGG
CATCTCGGCCGCGGGACTCTCGAAGGCGTGAGCCTGAGGAACGGCAAGCTCTCCATCGCGCCGGTCCGTGCCGACAAGAACCCGGAGGCAGAGGCTCTCG
CAGCCCGCATCGGCGCGCTGATGCCGCGTATCCGCATCACCGAACTCCTCCACGAGGTGGCGCGCGAGACCGGGTTCCTGTCCGCCTTCACCAACGTCCG
CACCCGGCAGCCGGTCGAGGACGGGAACGCGCTGCTGGCCGTCATCCTCGCCGACGCGACCAATCTCGGCCTGTCCCGCATGGCCGAAGCCAGCCAGGGG
GTCACGCGCGACCAGCTGTTCTGGACGCGCGACGCCTTCATCCGCGACGAGACCTACAAGGCCGCCCTCGGCCGGATCGTCGATGCGCATCATGCACTCC
CGATCGCGGCCGTGTGGGGCGAGGGCACCACCGCGGCGAGCGACGGCCAGTTCTTCCGCTCCGGCAAGCGCGGCGACGGTGCCGGCCAGGTCAACGCCCG
CCACGGCATCGAGCCGGGCTACTGCTTCTACACCCACACCTCCGACCAGCACGGGCCGATGCGTTCGGTCAGCATGGCGGCGGCCGAGCACGAGGCCCCC
TACGTGCTCGACGGGCTTCTGCACCACGGCACCGGCCTGACCATCGCCGAGCACTACACCGATACCGGGGGCAGTTCGGATCACGTCCACTTCCTGTGCG
ACAGCCTGGGCATCCGGTTCTGTCCGCGGCTGCGCGACTTCCCCGATCGGCGGCTCGCCTGCCTGGAGCCACCGTCACGCTACCCGGCGCTCGGCGGCCT
CCTGGGCAAGCGGGTCAAGGCCGATCTGATCCGCGCGCACTGGAACGACATCGTCCGGCTGGTCGCCACCCTGAAGGCGGGCGTCGTCGCGCCCTCCACG
ATGTTGAAGAAGCTCGCGGCCTACGAGCGGCAGAACCAGCTGGACCTCGCGATCGGGGAAGCCGGCCGTCTCGTGCGGGCCGAGTTCATGATCGACTGGA
TGGAGGGACCGGCCCTGCGACGGCGCAGCCAGGCCGGGCTCAACAAATCCGAGCAACGCCATACGCTGGCGAGCGTCGTGTGCACCTACGGGCAGGGCCG
CATCGCCGATCGGAGCCAGGAGGTGCAGGAGTACCGGGCCTCGGGGCTCAACCTGGTGATCGCCGCCATCGTGTACTGGAACTCGACCTACATGGCCGAC
GCGGTCGCGCATCTGCGTCGCAGCGGTGATCCCACCCCCGATCGTCTGCTCGCCCACACGTCGCCGGTCGAATGGGAGCACATCGGCTTCTCGGGCGACT
TCCTGTGGCACCGCGCCGCCATGATGCCTGCCAGCCGGCGAAGGCTCAATCTCACCAAGACCCAGCCCGCAGCCGCTTGAACCACGTTCACTGAACGTTC
GCACAAAGCGTAACTGGGCGAACAGTCAACACGATGCCCTC
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
780 | 259 | 90 | 869 | - | No |
AG : Tn3 resolvase
ORF sequence :
MPNRRARSAVLRSDRLAIVPLCPNRPGKVAVEARQCGLTGHSWWTGSRRVADRRRTRPQNGRLRDGQRASMGDVLGYARVSTGDQDVAGQRLRLEHAGAI
RVFTDVRSGKSMDRPGLSELLAYARKGDTLAVVRLDRLGRSLAELLSSVTLLKERGIALLSLEEKIDTGSAAGELVFHVFGAIAHFERRLIAERTKDGIA
AARAKGRRPGRQPIDPDKVGAALKLVQAGLSPTEAARQVGLGRATVYREMALAGVARPA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
2964 | 987 | 817 | 3780 | + | No |
Chemistry : DDE
ORF sequence :
MARRSLLSTAERARLFGIPVDRDGLARHYTFDRQDLALIATRRGDANRIGFAVQLALLRHPGFGLSPAITVEPALVTRIAEQLAIDPRAFTAYAGRSPTI
SDHARALERVLGLRPCARADLPSMITAAARAAWPTDQGEPIAVAVMAALRDSGIVLPAPDTIERAGLAGRARARKQVAAALLAGMTDALAAQLDALLAID
PKIGRPPLSWMKDLPRAPKPNHVRELLDKLAAVRDLGLDPRAAERIHPDRLALLMREGRITPASTLERYAPSRRRAIVVATLLDLERRLTDAALSMADRL
IGASFTRGKAARERTFVATSRDVGRLMRLLAGTAGAVATAMKENGDALAAIDAAVGLDRLIAAKPQAAEIADVAEEDPLVRAADRWMRLRKYGPMLIEAI
DFKAARADDRTVAALTALRDLNRSGKRDLPKGTPMPFKKEWRRLVAGADGRLDRRLFETALFAHLRNKWRSGDLWVERSTHYRRFDSYLLPLDEARTIVA
PLGLPCDPDAWLAARAERLDRRLKRLGRHLGRGTLEGVSLRNGKLSIAPVRADKNPEAEALAARIGALMPRIRITELLHEVARETGFLSAFTNVRTRQPV
EDGNALLAVILADATNLGLSRMAEASQGVTRDQLFWTRDAFIRDETYKAALGRIVDAHHALPIAAVWGEGTTAASDGQFFRSGKRGDGAGQVNARHGIEP
GYCFYTHTSDQHGPMRSVSMAAAEHEAPYVLDGLLHHGTGLTIAEHYTDTGGSSDHVHFLCDSLGIRFCPRLRDFPDRRLACLEPPSRYPALGGLLGKRV
KADLIRAHWNDIVRLVATLKAGVVAPSTMLKKLAAYERQNQLDLAIGEAGRLVRAEFMIDWMEGPALRRRSQAGLNKSEQRHTLASVVCTYGQGRIADRS
QEVQEYRASGLNLVIAAIVYWNSTYMADAVAHLRRSGDPTPDRLLAHTSPVEWEHIGFSGDFLWHRAAMMPASRRRLNLTKTQPAAA
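The ORF lengths follow from the Begin and End coordinates. The sketch below assumes 1-based inclusive coordinates and a nucleotide span that includes the stop codon, so the protein length is one less than the codon count. The `Orf` class and `overlap_bp` helper are illustrative only.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Orf:
    name: str
    begin: int   # 1-based, inclusive (assumed)
    end: int     # 1-based, inclusive (assumed)
    strand: str

    @property
    def length_bp(self) -> int:
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        # Assumption: the span includes the stop codon, which is not translated.
        return self.length_bp // 3 - 1

def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two coordinate ranges (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)

orf1 = Orf("ORF 1", 90, 869, "-")
orf2 = Orf("ORF 2", 817, 3780, "+")

for orf in (orf1, orf2):
    print(orf.name, orf.length_bp, "bp", orf.length_aa, "aa")
print("overlap:", overlap_bp(orf1, orf2), "bp")
```

With these coordinates the two ORFs, which lie on opposite strands, share a 53 bp region (positions 817 to 869).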
Blast result :
Comments
The ISMex38 proteins show 93% (ORFA) and 74% (ORFB, the transposase) amino acid similarity to those of ISAli20.
References
[1] Stephane Vuilleumier et al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. (2009) PLoS ONE. Submitted.