ISMex16
- Family IS3
- Group IS150
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 Methylobacterium dichloromethanicum DM4 |
DNA section
IS Length : 1511 bp
Ends
IR Length : 20/26
IRL : TGTCCGACGTCAGCACCTGTGGGACAGATTGAGGGTCTTCACGAACGGAG
IRR : TGTCCGTCGTCAGATGATTTGGGACATTCAGCGGTGTTCAGGAGCGGAGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGGCGCGATG | CT | CCTACCTCTA | 2 |
CGTCGAGACG | GGTGCAGGGC | 0 | |
TCGAACTGCG | CTCGATCAGC | 0 | |
TGCGGCCCAC | GCGAACCGGT | 0 |
DNA sequence
TGTCCGACGTCAGCACCTGTGGGACAGATTGAGGGTCTTCACGAACGGAGGATGGCTGGTTCATCGTAGCCCCTGAGGAGCGAAGATGAAGCAGACATCC
GGACCGGCCAAGAAACCGGCAGAAGCGGTGATCAAGGACATCCGTCGGGCCACACGTCGGCAGTTCTCCTCCGAGGAGAAGATCCGCGTGGTGCTGGAAG
GCTTGCGTGGCGAGGACAGCATCGCCGAGTTGTGCCGACGCGAGGGGATCGCCGCCTCGATGTACTACGGCTGGTCCAAGGAGTTCCTGGAGGCTGGCAA
GAAGCGCCTGTCGGGGGACACGGCTCGTGCGGCTACCACGGACGAGGTGCGGGATCTGCGGCGCGAGGCGGGGGCGCTGAAGGAGGTCGTAGCTGATCTC
GTCCTAGAGAACCGTCTGCTCAAAAAAAGTATGAGCGGGGCTGGGGGCGACGAGGCATGAGGTACCCGGCCTCCGAGAAGCTGGAGATCATCCGGCTGGT
CGAGCAATCCCACCTGCCGGTGCGCGCCACCCTGGACAAGCTCGGCATCACGCGCTCGACGTTTTACCGCTGGTACGACGCCTACCGGCGCGGCGGCCCC
GAGGCATTGCATGACCAACCTTCGCAGCCGAGCCGGGTCTGGAACCGTCTACCTGCGGAGATCCGCGAGCAGATCGTTGCCTTGGCTCTGGAGCAGCCTG
AACTCAGTCCGCGTGAGCTGGCGGTGCGCTTCACCGACGAGCGCCGCTACTTCGTCTCGGAAGCGACCGTCTACCGGTTGCTCAAGTCTCAGGACCTGAT
CACCAGCCCGGCCTATATCGTCATCAAGGCTGCCGACGAGTTCCGCGATAAGACCACCGCGCCCAACCAGCTTTGGCAGACGGACTTCACCTACCTGAAG
GTCGTTGGCTGGGGCTGGTACTACCTGTCGACGGTCCTGGACGACTTCTCACGCTACATCGTCGCCTGGAAGCTGTGCGCCACGATGCAGGCCAGCGACG
TCACCGCCACGCTCGACCTGGCGCTGGGTGCGGCTGGGCTCGATCAGGCGCGGGTGATGCAGCGGCCACGCCTGCTCTCCGACAACGGGCCGAGCTACGT
CGCCAGCGACCTGGCTGACTGGCTCGGCATTCGGGGCATGACCCACATCCGCGGTGCGCCATGCCATCCTCAAACGCAGGGCAAGATCGAGCGTTGGCAT
CAGACACTCAAAAACCGCATCCTGCTCGAACACGCCTACTTGCCCGGCGAGCTGGAGACGCAGGTTGCCGACTTCGTCGAACACTACAATCATGCCCGAG
CCCATGAGAGCCTGAGCAACCTGACGCCCGCTGACGTCTACTTCGGACGCGGCGAAGGGATCCTGGCCGAGCGGGAACGCATCAAACGCCAGACCCTGAT
GGATCGCCGCTTGCGCCATCACGCGCAGGCTGCCTAACCTCTCACCCCAGATGGACCAGAGCCTCCGCTCCTGAACACCGCTGAATGTCCCAAATCATCT
GACGACGGACA
GGACCGGCCAAGAAACCGGCAGAAGCGGTGATCAAGGACATCCGTCGGGCCACACGTCGGCAGTTCTCCTCCGAGGAGAAGATCCGCGTGGTGCTGGAAG
GCTTGCGTGGCGAGGACAGCATCGCCGAGTTGTGCCGACGCGAGGGGATCGCCGCCTCGATGTACTACGGCTGGTCCAAGGAGTTCCTGGAGGCTGGCAA
GAAGCGCCTGTCGGGGGACACGGCTCGTGCGGCTACCACGGACGAGGTGCGGGATCTGCGGCGCGAGGCGGGGGCGCTGAAGGAGGTCGTAGCTGATCTC
GTCCTAGAGAACCGTCTGCTCAAAAAAAGTATGAGCGGGGCTGGGGGCGACGAGGCATGAGGTACCCGGCCTCCGAGAAGCTGGAGATCATCCGGCTGGT
CGAGCAATCCCACCTGCCGGTGCGCGCCACCCTGGACAAGCTCGGCATCACGCGCTCGACGTTTTACCGCTGGTACGACGCCTACCGGCGCGGCGGCCCC
GAGGCATTGCATGACCAACCTTCGCAGCCGAGCCGGGTCTGGAACCGTCTACCTGCGGAGATCCGCGAGCAGATCGTTGCCTTGGCTCTGGAGCAGCCTG
AACTCAGTCCGCGTGAGCTGGCGGTGCGCTTCACCGACGAGCGCCGCTACTTCGTCTCGGAAGCGACCGTCTACCGGTTGCTCAAGTCTCAGGACCTGAT
CACCAGCCCGGCCTATATCGTCATCAAGGCTGCCGACGAGTTCCGCGATAAGACCACCGCGCCCAACCAGCTTTGGCAGACGGACTTCACCTACCTGAAG
GTCGTTGGCTGGGGCTGGTACTACCTGTCGACGGTCCTGGACGACTTCTCACGCTACATCGTCGCCTGGAAGCTGTGCGCCACGATGCAGGCCAGCGACG
TCACCGCCACGCTCGACCTGGCGCTGGGTGCGGCTGGGCTCGATCAGGCGCGGGTGATGCAGCGGCCACGCCTGCTCTCCGACAACGGGCCGAGCTACGT
CGCCAGCGACCTGGCTGACTGGCTCGGCATTCGGGGCATGACCCACATCCGCGGTGCGCCATGCCATCCTCAAACGCAGGGCAAGATCGAGCGTTGGCAT
CAGACACTCAAAAACCGCATCCTGCTCGAACACGCCTACTTGCCCGGCGAGCTGGAGACGCAGGTTGCCGACTTCGTCGAACACTACAATCATGCCCGAG
CCCATGAGAGCCTGAGCAACCTGACGCCCGCTGACGTCTACTTCGGACGCGGCGAAGGGATCCTGGCCGAGCGGGAACGCATCAAACGCCAGACCCTGAT
GGATCGCCGCTTGCGCCATCACGCGCAGGCTGCCTAACCTCTCACCCCAGATGGACCAGAGCCTCCGCTCCTGAACACCGCTGAATGTCCCAAATCATCT
GACGACGGACA
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
375 bp | 124 aa | 86 | 460 | + | No |
Description : First part of the transposase
ORF sequence :
MKQTSGPAKKPAEAVIKDIRRATRRQFSSEEKIRVVLEGLRGEDSIAELCRREGIAASMYYGWSKEFLEAGKKRLSGDTARAATTDEVRDLRREAGALKE
VVADLVLENRLLKKSMSGAGGDEA
VVADLVLENRLLKKSMSGAGGDEA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1041 bp | 346 aa | 397 | 1437 | + | No |
Description : Second part of the transposase
ORF sequence :
SRPREPSAQKKYERGWGRRGMRYPASEKLEIIRLVEQSHLPVRATLDKLGITRSTFYRWYDAYRRGGPEALHDQPSQPSRVWNRLPAEIREQIVALALEQ
PELSPRELAVRFTDERRYFVSEATVYRLLKSQDLITSPAYIVIKAADEFRDKTTAPNQLWQTDFTYLKVVGWGWYYLSTVLDDFSRYIVAWKLCATMQAS
DVTATLDLALGAAGLDQARVMQRPRLLSDNGPSYVASDLADWLGIRGMTHIRGAPCHPQTQGKIERWHQTLKNRILLEHAYLPGELETQVADFVEHYNHA
RAHESLSNLTPADVYFGRGEGILAERERIKRQTLMDRRLRHHAQAA
PELSPRELAVRFTDERRYFVSEATVYRLLKSQDLITSPAYIVIKAADEFRDKTTAPNQLWQTDFTYLKVVGWGWYYLSTVLDDFSRYIVAWKLCATMQAS
DVTATLDLALGAAGLDQARVMQRPRLLSDNGPSYVASDLADWLGIRGMTHIRGAPCHPQTQGKIERWHQTLKNRILLEHAYLPGELETQVADFVEHYNHA
RAHESLSNLTPADVYFGRGEGILAERERIKRQTLMDRRLRHHAQAA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1352 bp | 450 aa | 86 | 1437 | + | Yes |
Chemistry : DDE
ORF sequence :
MKQTSGPAKKPAEAVIKDIRRATRRQFSSEEKIRVVLEGLRGEDSIAELCRREGIAASMYYGWSKEFLEAGKKRLSGDTARAATTDEVRDLRREAGALKE
VVADLVLENRLLKKKYERGWGRRGMRYPASEKLEIIRLVEQSHLPVRATLDKLGITRSTFYRWYDAYRRGGPEALHDQPSQPSRVWNRLPAEIREQIVAL
ALEQPELSPRELAVRFTDERRYFVSEATVYRLLKSQDLITSPAYIVIKAADEFRDKTTAPNQLWQTDFTYLKVVGWGWYYLSTVLDDFSRYIVAWKLCAT
MQASDVTATLDLALGAAGLDQARVMQRPRLLSDNGPSYVASDLADWLGIRGMTHIRGAPCHPQTQGKIERWHQTLKNRILLEHAYLPGELETQVADFVEH
YNHARAHESLSNLTPADVYFGRGEGILAERERIKRQTLMDRRLRHHAQAA
VVADLVLENRLLKKKYERGWGRRGMRYPASEKLEIIRLVEQSHLPVRATLDKLGITRSTFYRWYDAYRRGGPEALHDQPSQPSRVWNRLPAEIREQIVAL
ALEQPELSPRELAVRFTDERRYFVSEATVYRLLKSQDLITSPAYIVIKAADEFRDKTTAPNQLWQTDFTYLKVVGWGWYYLSTVLDDFSRYIVAWKLCAT
MQASDVTATLDLALGAAGLDQARVMQRPRLLSDNGPSYVASDLADWLGIRGMTHIRGAPCHPQTQGKIERWHQTLKNRILLEHAYLPGELETQVADFVEH
YNHARAHESLSNLTPADVYFGRGEGILAERERIKRQTLMDRRLRHHAQAA
Blast result :
Comments
ISMex16 is 86% aa similar to ISSsp2.
The third ORF is a putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
The third ORF is a putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
References
1] Stéphane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE Submitted. (2009)