ISMex22
- Family Tn3
- Group
Isoform Synonym(s) TnMex22
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 |
DNA section
IS Length : 3864 bp
Ends
IR Length : 36
IRL : GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAGGGGCTTGCGAGCG
IRR : GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAATGCGCTCAGGGTG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGAGCAGGCG | ATGGGT | CAGCGACGCG | 6 |
TGATCCCGCG | TTCAC | GGGGCGGCGG | 5 |
AGGCCACCCC | ATGAT | ATGATCCTGT | 5 |
GGTGCTTCCC | ACCGT | GGGCAGCGCG | 5 |
CAGGTGCTGA | TGGA | TGCCCTGCGT | 4 |
DNA sequence
GGCCCCTGAACATTAAAGGGGCACGGATATACGGTAGGGGCTTGCGAGCGGCCGTCGGCGCCGCGGGCTAGGCTCGCATGCGTCTCTGGGCAAGAAGCTT
TTCGCCGTCAGGGCGGAGTTCCCCGCGCGGCCCGACGAACCGGTAGAGGGTTTGTCGGGTGATGCCGAGTTCGACGCAGAGGTCGGCGACACAGGTTTCA
GGGTTTCCCATCGCGGCCTGAGCCAAGCGCAGCTTGGCGGGCGTCATCTTGAAGGGGCGGCCGCCCTTGCGGCCTCGTGCCCGTGCGGCAGCCAGCCCGG
CCTTTGTGCGCTCGGAGATCAGCTCGCGCTCGAACTCGGCCAAGCCGGCGAAGATCGCGAACACCAAGCGGCCGTTGGCCGTGGTGGTGTCGATCGAGGC
GCCTTCGCCCGACAGCACCTTCAGGCCGACGTGGCGCCGAGTGAGGTCTCCAACCAGGTTGACCAAGTGGCGCAGGTCGCGGCCGAGCCGGTCCAGCTTC
CACACCGCGAGCGTATCGCCGGCGCGCAGCGCCTTGAGGCAGGCCTCGAGCCCGGGACGGTCGTCGCTCCGTCCGGACGCGGCATCCTCGTAGATGTGCT
CGGGATCGACCCCGGCCTTCCTCAGCGCATCGCGCTGCAGGTCGTGCACCTGGCTGCCGTCGGCCTTCGACACCCGCGCGTAGCCGATCAGCACCGTCAT
TGATACGTCCGTTGGCGTGACACTGCCGACCGTGCCCTGTTGAGGCCGCTCGTTTGTCACATAATCCGTCTTGCAACGTAAGCTCTGCCAACGGCGTCTG
CGCGGGATTTCGTGACACCCTGGAAGAGCCGATGCCGCGCCGCCTGATCCTGACCGATGCCGAGCGGCGGACGATCCTCGCCCTGCCGACCGATGAGGCG
ACTTTGATCCGTCACTGGAGCCTCGATGACCAGGATCTCGCGCTCCTCGACACGCGGCGGCGCGACGACACCCGGCTCGGCCTGGCGCTCCAGCTCTGCG
CCCTGCGCTACCCCGGCCGCCTGATCCGCCCCGGTGAGACGATCCCCGAGGCCGCGGCGGTGTTCCTGGCCGACCAGCTCGGGGGCGATCCGGACGCGCT
CGCCAGCTTCGCGCGCCGCGCCCCCACCCGCTACGAGCAACTGACGATCCTGCGCCGACGCTTCGGCTTCACCGACCTGTGCCGCCCGTTGCGCGGCGAT
CTCGTCGCCTTCGCACGGGGCATCGCTCTGGCGGTTGCCAAGGATCGCCTCGTCGTCACGGCCCTGGCCGAGGAGATGCGGCGGCGGCGCATCGTCATCC
CCGGCATCACGGTGCTGGAGCGCCTCGCGGCTCAGGCCTGCACCGAGGCCGAGGACGCTCTTCTGGCCGACGTCGCGGGGCGGCTGACGCCCGACCTCGT
CATCCGCATGGAGGCGCTGCTCACTGTGGGACCGCTCGCCATGGGACCACGACACGCCCGGCAGAGCGGGATCTCCTGGCTGCGCGAGCCGCCGGGATCC
GCCGGCACGGCGGCCATGCGCGGCCTCGTCGACCGCCTCGAAGCCGTGCGTCACGTCGGCGTTCCCGCAACCGTGCTCGGGGGCGTTCCGGCCCACCGCA
TCCGCCGCATGGCGCAGGAAGGCCGTCGCCTCACGGCCCAGAACTTCGCGCAAATGCGCCCCAGCCGCCGGCACGCGACCTTGGCCGCCTTCCTGCACGA
CACGCAGACGGCGCTGACCGATGCGGCGATCGGCATGTTCGAGATCCTGGTCGGCCGCGCGTTCCGGCAGGCCGAGGCCGATCGTGAGGCACATCTCACC
GCCAGCGTCGTCGCGGCGGCCGAGGCGCTCGACTTCTTCGCAGGGTTCGGCGACGCCCTTGTGGCCCACAAAGGCGTCGGCCTGTCGCTCGATGCGGCGA
TCACGACCGTCGCGACTTGGGAGGCGCTCGCCCGAGCCACCGCGGCGGCCCAGGCCAACAGGCAGGCCCGGCACGGTGACGACACGATCGCCTTCCTGCG
TCGGCATCATGGCCGCATCCGCGCCTTCGCGGCCCCCTTCCTGACGCGCTTCACGTTCGAGGCCGCCCGGCCCGGCATGGCCCTCGTCACCGCCGTCTCC
CAACTCGGGGAGGCCTGGAAGGCCGGGCGCCGCTCACCGGGCCAGGCCTGGATCGACGCCGCCTTGTCGTTGCTCGACCGGCGCTGGTCCAGGCACGTCC
GTGCCCCGGACGGTACCATCGACCGCAAGATGCTGGAGATCTTCCTCGTCGTCGAGCTGAAGAACCGGATCACCGCCGGCGAGGTCTGGGTGGCGGGGTC
ACGGACCTACCGGGCGCTCGAGGAGAAGCTGATCCCGCCGCAGACCTTCGCGATCATCAAGGCGGAGGCCCGCGTACCCGTCGCTATCCCGGTCGATGTG
GAGATCTACCTGGCCGAGAAGGCCGCCGCGCTCGAAGGGAAGCTGCAGGCGGCGGCGCGCCGCCTGAAGACGGGACGCGGCGAGACGCGCATCGGCGCCA
AGGGTCTACGGGTGCCGGCCGTCAGGACAGCGGAGACCGAGGCGGCCGTCGCCCTGGCCCGGCAGGTGGCCGCGACCATGCCGCCGATCCGGCTCACCGA
CCTCATGGCCGACGTCGACCGGATGACCGGCTTCAGTGCCCTGTTCGAACATCTGCAGACCGGACGGCCGCCGGCCGATCGGCGCGTCTTCCTCGCCGCC
CTGATCGCCGAGGCGACCAACCTCGGCTTCGGCAAGATGGCCTTGGCCTGCCCCGGCCTCACGCGGCGCCAGCTGCAGCAGGTGGCGATCTGGCACTTCC
GGGAAGACACCTTCGCCCTGGCTCTGGCCCGGCTGGTCGAGGCCCAACACGCCGCCCCGTTCTCCGCCACCTTCGGATCGCACGCCATCGCGTCGTCCGA
CGGCCAGCACATCTACCTGGGCGACGGCGGCGAGATCGCCGGCGGCGTCAACGGCCACTACGGCTCCGACCCGATCACCAAGCTCTACACCACGATCTCG
GGCCGCTATGCGCCCTTCCACGTCAAGATCATCGCCGCCACGGCGAGCGAGGCCGTGCACGTGCTCGACGCGTTGCTTGAGACTGAGGCCGGCGCGGCCG
TCACCCGGCACCATGTCGATGGCGGCGGCGTCAGCGACCTGGTGTTCGCGCTCTGCCATGGGCTCGGCTTCGCCTTCGTGCCGCGCATCCCCGATCTCGA
CGGCCGCTGCCTCTACGGCTTTGCACCAGCCCGGCACTACGGCGTGCTGCAATCGGTCATGGGCGAGCGCCTCGACGCCGGCCTGATCCGCCGCCATTGG
GATGACATCCTGCGCCTTCTGACCTCGCTCAGGACCCGCACCGTCAGCGCCTCGCTGGTGCTGCGACAGCTGTCGGCGACGCCGCGCCAGAGTGGCCTCG
TCCAGGCGCTGCGGCAGATGGGGCGCGTCGAGCGCACCCTCTTCACCCTCGACTGGATCGGTGACGAACAGCTCCGCAAAGGTACCACGGCCGAACTCAA
CAAGGGCGAGCGCCGCAACGGCCTCGTGCGCGCCGTCAACCTGCATCGGCTCGGCCGCTTCCGCGACCGCAGCCAGGACAGCCTGGCGATCCGGGCCTCC
GCCCTCAACCTGGTGGTCACCGCCATCATCTACTGGAACACGATCTACACGGGCCGCGTCGTCGACGCCTTGCGAGCCAGGGGTGCACTCCTTCCCGACC
ACCTCCTCACCGGCCTGTCGCCCCTCGGCTGGGAGCATATCGGCCTCACCGGCGACTATCTCTGGGAGGAAACGCCCGGCATCGATCAGACCGGGTTCCG
GGCTATCCCGATCACACCCTGAGCGCATTACCGTATATCCGTGCCCCTTTAATGTTCAGGGGCC
TTCGCCGTCAGGGCGGAGTTCCCCGCGCGGCCCGACGAACCGGTAGAGGGTTTGTCGGGTGATGCCGAGTTCGACGCAGAGGTCGGCGACACAGGTTTCA
GGGTTTCCCATCGCGGCCTGAGCCAAGCGCAGCTTGGCGGGCGTCATCTTGAAGGGGCGGCCGCCCTTGCGGCCTCGTGCCCGTGCGGCAGCCAGCCCGG
CCTTTGTGCGCTCGGAGATCAGCTCGCGCTCGAACTCGGCCAAGCCGGCGAAGATCGCGAACACCAAGCGGCCGTTGGCCGTGGTGGTGTCGATCGAGGC
GCCTTCGCCCGACAGCACCTTCAGGCCGACGTGGCGCCGAGTGAGGTCTCCAACCAGGTTGACCAAGTGGCGCAGGTCGCGGCCGAGCCGGTCCAGCTTC
CACACCGCGAGCGTATCGCCGGCGCGCAGCGCCTTGAGGCAGGCCTCGAGCCCGGGACGGTCGTCGCTCCGTCCGGACGCGGCATCCTCGTAGATGTGCT
CGGGATCGACCCCGGCCTTCCTCAGCGCATCGCGCTGCAGGTCGTGCACCTGGCTGCCGTCGGCCTTCGACACCCGCGCGTAGCCGATCAGCACCGTCAT
TGATACGTCCGTTGGCGTGACACTGCCGACCGTGCCCTGTTGAGGCCGCTCGTTTGTCACATAATCCGTCTTGCAACGTAAGCTCTGCCAACGGCGTCTG
CGCGGGATTTCGTGACACCCTGGAAGAGCCGATGCCGCGCCGCCTGATCCTGACCGATGCCGAGCGGCGGACGATCCTCGCCCTGCCGACCGATGAGGCG
ACTTTGATCCGTCACTGGAGCCTCGATGACCAGGATCTCGCGCTCCTCGACACGCGGCGGCGCGACGACACCCGGCTCGGCCTGGCGCTCCAGCTCTGCG
CCCTGCGCTACCCCGGCCGCCTGATCCGCCCCGGTGAGACGATCCCCGAGGCCGCGGCGGTGTTCCTGGCCGACCAGCTCGGGGGCGATCCGGACGCGCT
CGCCAGCTTCGCGCGCCGCGCCCCCACCCGCTACGAGCAACTGACGATCCTGCGCCGACGCTTCGGCTTCACCGACCTGTGCCGCCCGTTGCGCGGCGAT
CTCGTCGCCTTCGCACGGGGCATCGCTCTGGCGGTTGCCAAGGATCGCCTCGTCGTCACGGCCCTGGCCGAGGAGATGCGGCGGCGGCGCATCGTCATCC
CCGGCATCACGGTGCTGGAGCGCCTCGCGGCTCAGGCCTGCACCGAGGCCGAGGACGCTCTTCTGGCCGACGTCGCGGGGCGGCTGACGCCCGACCTCGT
CATCCGCATGGAGGCGCTGCTCACTGTGGGACCGCTCGCCATGGGACCACGACACGCCCGGCAGAGCGGGATCTCCTGGCTGCGCGAGCCGCCGGGATCC
GCCGGCACGGCGGCCATGCGCGGCCTCGTCGACCGCCTCGAAGCCGTGCGTCACGTCGGCGTTCCCGCAACCGTGCTCGGGGGCGTTCCGGCCCACCGCA
TCCGCCGCATGGCGCAGGAAGGCCGTCGCCTCACGGCCCAGAACTTCGCGCAAATGCGCCCCAGCCGCCGGCACGCGACCTTGGCCGCCTTCCTGCACGA
CACGCAGACGGCGCTGACCGATGCGGCGATCGGCATGTTCGAGATCCTGGTCGGCCGCGCGTTCCGGCAGGCCGAGGCCGATCGTGAGGCACATCTCACC
GCCAGCGTCGTCGCGGCGGCCGAGGCGCTCGACTTCTTCGCAGGGTTCGGCGACGCCCTTGTGGCCCACAAAGGCGTCGGCCTGTCGCTCGATGCGGCGA
TCACGACCGTCGCGACTTGGGAGGCGCTCGCCCGAGCCACCGCGGCGGCCCAGGCCAACAGGCAGGCCCGGCACGGTGACGACACGATCGCCTTCCTGCG
TCGGCATCATGGCCGCATCCGCGCCTTCGCGGCCCCCTTCCTGACGCGCTTCACGTTCGAGGCCGCCCGGCCCGGCATGGCCCTCGTCACCGCCGTCTCC
CAACTCGGGGAGGCCTGGAAGGCCGGGCGCCGCTCACCGGGCCAGGCCTGGATCGACGCCGCCTTGTCGTTGCTCGACCGGCGCTGGTCCAGGCACGTCC
GTGCCCCGGACGGTACCATCGACCGCAAGATGCTGGAGATCTTCCTCGTCGTCGAGCTGAAGAACCGGATCACCGCCGGCGAGGTCTGGGTGGCGGGGTC
ACGGACCTACCGGGCGCTCGAGGAGAAGCTGATCCCGCCGCAGACCTTCGCGATCATCAAGGCGGAGGCCCGCGTACCCGTCGCTATCCCGGTCGATGTG
GAGATCTACCTGGCCGAGAAGGCCGCCGCGCTCGAAGGGAAGCTGCAGGCGGCGGCGCGCCGCCTGAAGACGGGACGCGGCGAGACGCGCATCGGCGCCA
AGGGTCTACGGGTGCCGGCCGTCAGGACAGCGGAGACCGAGGCGGCCGTCGCCCTGGCCCGGCAGGTGGCCGCGACCATGCCGCCGATCCGGCTCACCGA
CCTCATGGCCGACGTCGACCGGATGACCGGCTTCAGTGCCCTGTTCGAACATCTGCAGACCGGACGGCCGCCGGCCGATCGGCGCGTCTTCCTCGCCGCC
CTGATCGCCGAGGCGACCAACCTCGGCTTCGGCAAGATGGCCTTGGCCTGCCCCGGCCTCACGCGGCGCCAGCTGCAGCAGGTGGCGATCTGGCACTTCC
GGGAAGACACCTTCGCCCTGGCTCTGGCCCGGCTGGTCGAGGCCCAACACGCCGCCCCGTTCTCCGCCACCTTCGGATCGCACGCCATCGCGTCGTCCGA
CGGCCAGCACATCTACCTGGGCGACGGCGGCGAGATCGCCGGCGGCGTCAACGGCCACTACGGCTCCGACCCGATCACCAAGCTCTACACCACGATCTCG
GGCCGCTATGCGCCCTTCCACGTCAAGATCATCGCCGCCACGGCGAGCGAGGCCGTGCACGTGCTCGACGCGTTGCTTGAGACTGAGGCCGGCGCGGCCG
TCACCCGGCACCATGTCGATGGCGGCGGCGTCAGCGACCTGGTGTTCGCGCTCTGCCATGGGCTCGGCTTCGCCTTCGTGCCGCGCATCCCCGATCTCGA
CGGCCGCTGCCTCTACGGCTTTGCACCAGCCCGGCACTACGGCGTGCTGCAATCGGTCATGGGCGAGCGCCTCGACGCCGGCCTGATCCGCCGCCATTGG
GATGACATCCTGCGCCTTCTGACCTCGCTCAGGACCCGCACCGTCAGCGCCTCGCTGGTGCTGCGACAGCTGTCGGCGACGCCGCGCCAGAGTGGCCTCG
TCCAGGCGCTGCGGCAGATGGGGCGCGTCGAGCGCACCCTCTTCACCCTCGACTGGATCGGTGACGAACAGCTCCGCAAAGGTACCACGGCCGAACTCAA
CAAGGGCGAGCGCCGCAACGGCCTCGTGCGCGCCGTCAACCTGCATCGGCTCGGCCGCTTCCGCGACCGCAGCCAGGACAGCCTGGCGATCCGGGCCTCC
GCCCTCAACCTGGTGGTCACCGCCATCATCTACTGGAACACGATCTACACGGGCCGCGTCGTCGACGCCTTGCGAGCCAGGGGTGCACTCCTTCCCGACC
ACCTCCTCACCGGCCTGTCGCCCCTCGGCTGGGAGCATATCGGCCTCACCGGCGACTATCTCTGGGAGGAAACGCCCGGCATCGATCAGACCGGGTTCCG
GGCTATCCCGATCACACCCTGAGCGCATTACCGTATATCCGTGCCCCTTTAATGTTCAGGGGCC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
633 bp | 210 aa | 68 | 700 | - | No |
AG : Tn3 resolvase
ORF sequence :
MTVLIGYARVSKADGSQVHDLQRDALRKAGVDPEHIYEDAASGRSDDRPGLEACLKALRAGDTLAVWKLDRLGRDLRHLVNLVGDLTRRHVGLKVLSGEG
ASIDTTTANGRLVFAIFAGLAEFERELISERTKAGLAAARARGRKGGRPFKMTPAKLRLAQAAMGNPETCVADLCVELGITRQTLYRFVGPRGELRPDGE
KLLAQRRMRA
ASIDTTTANGRLVFAIFAGLAEFERELISERTKAGLAAARARGRKGGRPFKMTPAKLRLAQAAMGNPETCVADLCVELGITRQTLYRFVGPRGELRPDGE
KLLAQRRMRA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
2991 bp | 996 aa | 832 | 3822 | + | No |
Chemistry : DDE
ORF sequence :
MPRRLILTDAERRTILALPTDEATLIRHWSLDDQDLALLDTRRRDDTRLGLALQLCALRYPGRLIRPGETIPEAAAVFLADQLGGDPDALASFARRAPTR
YEQLTILRRRFGFTDLCRPLRGDLVAFARGIALAVAKDRLVVTALAEEMRRRRIVIPGITVLERLAAQACTEAEDALLADVAGRLTPDLVIRMEALLTVG
PLAMGPRHARQSGISWLREPPGSAGTAAMRGLVDRLEAVRHVGVPATVLGGVPAHRIRRMAQEGRRLTAQNFAQMRPSRRHATLAAFLHDTQTALTDAAI
GMFEILVGRAFRQAEADREAHLTASVVAAAEALDFFAGFGDALVAHKGVGLSLDAAITTVATWEALARATAAAQANRQARHGDDTIAFLRRHHGRIRAFA
APFLTRFTFEAARPGMALVTAVSQLGEAWKAGRRSPGQAWIDAALSLLDRRWSRHVRAPDGTIDRKMLEIFLVVELKNRITAGEVWVAGSRTYRALEEKL
IPPQTFAIIKAEARVPVAIPVDVEIYLAEKAAALEGKLQAAARRLKTGRGETRIGAKGLRVPAVRTAETEAAVALARQVAATMPPIRLTDLMADVDRMTG
FSALFEHLQTGRPPADRRVFLAALIAEATNLGFGKMALACPGLTRRQLQQVAIWHFREDTFALALARLVEAQHAAPFSATFGSHAIASSDGQHIYLGDGG
EIAGGVNGHYGSDPITKLYTTISGRYAPFHVKIIAATASEAVHVLDALLETEAGAAVTRHHVDGGGVSDLVFALCHGLGFAFVPRIPDLDGRCLYGFAPA
RHYGVLQSVMGERLDAGLIRRHWDDILRLLTSLRTRTVSASLVLRQLSATPRQSGLVQALRQMGRVERTLFTLDWIGDEQLRKGTTAELNKGERRNGLVR
AVNLHRLGRFRDRSQDSLAIRASALNLVVTAIIYWNTIYTGRVVDALRARGALLPDHLLTGLSPLGWEHIGLTGDYLWEETPGIDQTGFRAIPITP*
YEQLTILRRRFGFTDLCRPLRGDLVAFARGIALAVAKDRLVVTALAEEMRRRRIVIPGITVLERLAAQACTEAEDALLADVAGRLTPDLVIRMEALLTVG
PLAMGPRHARQSGISWLREPPGSAGTAAMRGLVDRLEAVRHVGVPATVLGGVPAHRIRRMAQEGRRLTAQNFAQMRPSRRHATLAAFLHDTQTALTDAAI
GMFEILVGRAFRQAEADREAHLTASVVAAAEALDFFAGFGDALVAHKGVGLSLDAAITTVATWEALARATAAAQANRQARHGDDTIAFLRRHHGRIRAFA
APFLTRFTFEAARPGMALVTAVSQLGEAWKAGRRSPGQAWIDAALSLLDRRWSRHVRAPDGTIDRKMLEIFLVVELKNRITAGEVWVAGSRTYRALEEKL
IPPQTFAIIKAEARVPVAIPVDVEIYLAEKAAALEGKLQAAARRLKTGRGETRIGAKGLRVPAVRTAETEAAVALARQVAATMPPIRLTDLMADVDRMTG
FSALFEHLQTGRPPADRRVFLAALIAEATNLGFGKMALACPGLTRRQLQQVAIWHFREDTFALALARLVEAQHAAPFSATFGSHAIASSDGQHIYLGDGG
EIAGGVNGHYGSDPITKLYTTISGRYAPFHVKIIAATASEAVHVLDALLETEAGAAVTRHHVDGGGVSDLVFALCHGLGFAFVPRIPDLDGRCLYGFAPA
RHYGVLQSVMGERLDAGLIRRHWDDILRLLTSLRTRTVSASLVLRQLSATPRQSGLVQALRQMGRVERTLFTLDWIGDEQLRKGTTAELNKGERRNGLVR
AVNLHRLGRFRDRSQDSLAIRASALNLVVTAIIYWNTIYTGRVVDALRARGALLPDHLLTGLSPLGWEHIGLTGDYLWEETPGIDQTGFRAIPITP*
Blast result :
Comments
ISMex22 was found with one copy in each of the five replicons in AM1.
ISMex22 is 64% aa similar to ISThsp9 and 54%(ORFB) to ISSod9.
ISMex22 is 64% aa similar to ISThsp9 and 54%(ORFB) to ISSod9.
References
1] Stephane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE Submitted.
2] Ming-Chun Lee (2009) Direct submission.
2] Ming-Chun Lee (2009) Direct submission.