ISMex37
- Family IS3
- Group IS150
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 |
DNA section
IS Length : 1510 bp
Ends
IR Length : 22/30
IRL : TGTCTGCCGTCAGCGCTCTTGAGACAGTTTAGGGGTTTTCGGGAAGGGAG
IRR : TGTCCGTCGTCAGATGATTTGAGACTGTTTCGGCGTCTCAGGAACGGAGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGTTCTCAGG | CCATCACGTT | 0 |
DNA sequence
TGTCTGCCGTCAGCGCTCTTGAGACAGTTTAGGGGTTTTCGGGAAGGGAGGATTTCTGGATCATCGCAGCCATCATGGAGCGCAGATGAGACAGAAATCG
GGTCCGGAGAAAGCACCGGCAGAGCAGGTCGTGAAGGACATCCGCCGAGCGACGCGCCGGCAGTTCTCGGCCGAAGAGAAGATCCGCATCGTGCTGGAAG
GCGTGCGCGGCGAGGAGAGCATTGCCGAGCTGTGCCGGCGCGAGGGGATCGCCTCGTCGATGTATTACGGCTGGTCGAAGGAGTTCCTCGACGCCGGCAA
GCGCCGTCTCGCCGGTGACACGGCCCGCGCCGCGACGTCGGACGAGGTGAAGGAACTGCGTCGCGAGGCGCAGGCCCTGAAGGAGGCCGTGGCCGATCTG
ACCCTGGAAAACCGCCTGCTCAAAAAAAGCATGCTGGCGGATGGGGAGGACGACACATGAGGTATCCCGCCTCCGAGAAGGCCGAGATCATCCGGCTGGT
CGAGGGATCGCATCTACCGGCGCGGCGCACCCTGGACAAACTGGGCATCCCGCGCGCCACGTTCTATCGCTGGTACGATCGCTATCTCACCGGCGGAGTT
GAGGCGCTCGCCGATCATCGCTCGCGGCCGGATCGAGTCTGGAACCGCATCCCCGACCCGATCCGGGCCGAGATCATCGACCTGGCGCTGCGCGAAACGG
AGCTGAGTCCACGCGAGCTTGCGGTGCGCTTCACCGACGAGAAACGCTACTTCGTCTCGGAAGCGTCGGTGTATCGGTTGCTGAAGGCGCATGACCTGAT
CACCAGCCCGGCCTACATCGTCATCAAGGCGGCGTCCGAGTTCAAAGACAAGACGACGGCGCCCAACCAGCTCTGGCAGACCGACTTCACCTACCTGAAG
ATCACCGGATGGGGCTGGTATTATCTCTCGACCGTGCTCGACGACTTCTCACGCTTCATCGTCGCCTGGAAGCTCTGCGCCACGATGCGCACTGACGACG
TCACCGCCACGCTCGATCTGGCGCTGGCGGCATCGGGGCTCGACCAGATCACGGTCGCACATCGGCCGAGACTGTTGAGCGATAACGGCGCCTCATACAT
CTCGGCCGACCTCGCTACCTGGCTCGACGGCAAAGGCATGAAACACGTTCGCGGCGCGCCCTATCATCCCCAGACGCAGGGCAAGATCGAGCGCTGGCAC
CAGACCCTGAAGAACCGCATCCTGCTGGAAAACTACTATCTGCCCGGTGACCTTGAACGGCAGGTCGCGGCCTTCGTTGAGCATTACAACCATGCCCGGT
CTCACGAGAGCCTCGGCAACCTGACGCCCGCCGATGTCTACTTCAACCGCGGGCAGGCCATCCTCACCGAACGCGAAAGGATCAAGCGCCAGACAATCCA
GCACAGACGCTTGCAGCACCAGCTGCAGGCCGCATAACCTCAAACCCAGATGAGCCAGAACCTCCGTTCCTGAGACGCCGAAACAGTCTCAAATCATCTG
ACGACGGACA
GGTCCGGAGAAAGCACCGGCAGAGCAGGTCGTGAAGGACATCCGCCGAGCGACGCGCCGGCAGTTCTCGGCCGAAGAGAAGATCCGCATCGTGCTGGAAG
GCGTGCGCGGCGAGGAGAGCATTGCCGAGCTGTGCCGGCGCGAGGGGATCGCCTCGTCGATGTATTACGGCTGGTCGAAGGAGTTCCTCGACGCCGGCAA
GCGCCGTCTCGCCGGTGACACGGCCCGCGCCGCGACGTCGGACGAGGTGAAGGAACTGCGTCGCGAGGCGCAGGCCCTGAAGGAGGCCGTGGCCGATCTG
ACCCTGGAAAACCGCCTGCTCAAAAAAAGCATGCTGGCGGATGGGGAGGACGACACATGAGGTATCCCGCCTCCGAGAAGGCCGAGATCATCCGGCTGGT
CGAGGGATCGCATCTACCGGCGCGGCGCACCCTGGACAAACTGGGCATCCCGCGCGCCACGTTCTATCGCTGGTACGATCGCTATCTCACCGGCGGAGTT
GAGGCGCTCGCCGATCATCGCTCGCGGCCGGATCGAGTCTGGAACCGCATCCCCGACCCGATCCGGGCCGAGATCATCGACCTGGCGCTGCGCGAAACGG
AGCTGAGTCCACGCGAGCTTGCGGTGCGCTTCACCGACGAGAAACGCTACTTCGTCTCGGAAGCGTCGGTGTATCGGTTGCTGAAGGCGCATGACCTGAT
CACCAGCCCGGCCTACATCGTCATCAAGGCGGCGTCCGAGTTCAAAGACAAGACGACGGCGCCCAACCAGCTCTGGCAGACCGACTTCACCTACCTGAAG
ATCACCGGATGGGGCTGGTATTATCTCTCGACCGTGCTCGACGACTTCTCACGCTTCATCGTCGCCTGGAAGCTCTGCGCCACGATGCGCACTGACGACG
TCACCGCCACGCTCGATCTGGCGCTGGCGGCATCGGGGCTCGACCAGATCACGGTCGCACATCGGCCGAGACTGTTGAGCGATAACGGCGCCTCATACAT
CTCGGCCGACCTCGCTACCTGGCTCGACGGCAAAGGCATGAAACACGTTCGCGGCGCGCCCTATCATCCCCAGACGCAGGGCAAGATCGAGCGCTGGCAC
CAGACCCTGAAGAACCGCATCCTGCTGGAAAACTACTATCTGCCCGGTGACCTTGAACGGCAGGTCGCGGCCTTCGTTGAGCATTACAACCATGCCCGGT
CTCACGAGAGCCTCGGCAACCTGACGCCCGCCGATGTCTACTTCAACCGCGGGCAGGCCATCCTCACCGAACGCGAAAGGATCAAGCGCCAGACAATCCA
GCACAGACGCTTGCAGCACCAGCTGCAGGCCGCATAACCTCAAACCCAGATGAGCCAGAACCTCCGTTCCTGAGACGCCGAAACAGTCTCAAATCATCTG
ACGACGGACA
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
375 bp | 124 aa | 86 | 460 | + | No |
Description : First part of the transposase
ORF sequence :
MRQKSGPEKAPAEQVVKDIRRATRRQFSAEEKIRIVLEGVRGEESIAELCRREGIASSMYYGWSKEFLDAGKRRLAGDTARAATSDEVKELRREAQALKE
AVADLTLENRLLKKSMLADGEDDT
AVADLTLENRLLKKSMLADGEDDT
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1119 bp | 372 aa | 319 | 1437 | + | No |
Description : Second part of the transposase
ORF sequence :
HGPRRDVGRGEGTASRGAGPEGGRGRSDPGKPPAQKKHAGGWGGRHMRYPASEKAEIIRLVEGSHLPARRTLDKLGIPRATFYRWYDRYLTGGVEALADH
RSRPDRVWNRIPDPIRAEIIDLALRETELSPRELAVRFTDEKRYFVSEASVYRLLKAHDLITSPAYIVIKAASEFKDKTTAPNQLWQTDFTYLKITGWGW
YYLSTVLDDFSRFIVAWKLCATMRTDDVTATLDLALAASGLDQITVAHRPRLLSDNGASYISADLATWLDGKGMKHVRGAPYHPQTQGKIERWHQTLKNR
ILLENYYLPGDLERQVAAFVEHYNHARSHESLGNLTPADVYFNRGQAILTERERIKRQTIQHRRLQHQLQAA
RSRPDRVWNRIPDPIRAEIIDLALRETELSPRELAVRFTDEKRYFVSEASVYRLLKAHDLITSPAYIVIKAASEFKDKTTAPNQLWQTDFTYLKITGWGW
YYLSTVLDDFSRFIVAWKLCATMRTDDVTATLDLALAASGLDQITVAHRPRLLSDNGASYISADLATWLDGKGMKHVRGAPYHPQTQGKIERWHQTLKNR
ILLENYYLPGDLERQVAAFVEHYNHARSHESLGNLTPADVYFNRGQAILTERERIKRQTIQHRRLQHQLQAA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1352 bp | 450 aa | 86 | 1437 | + | Yes |
Chemistry : DDE
ORF sequence :
MRQKSGPEKAPAEQVVKDIRRATRRQFSAEEKIRIVLEGVRGEESIAELCRREGIASSMYYGWSKEFLDAGKRRLAGDTARAATSDEVKELRREAQALKE
AVADLTLENRLLKKKHAGGWGGRHMRYPASEKAEIIRLVEGSHLPARRTLDKLGIPRATFYRWYDRYLTGGVEALADHRSRPDRVWNRIPDPIRAEIIDL
ALRETELSPRELAVRFTDEKRYFVSEASVYRLLKAHDLITSPAYIVIKAASEFKDKTTAPNQLWQTDFTYLKITGWGWYYLSTVLDDFSRFIVAWKLCAT
MRTDDVTATLDLALAASGLDQITVAHRPRLLSDNGASYISADLATWLDGKGMKHVRGAPYHPQTQGKIERWHQTLKNRILLENYYLPGDLERQVAAFVEH
YNHARSHESLGNLTPADVYFNRGQAILTERERIKRQTIQHRRLQHQLQAA
AVADLTLENRLLKKKHAGGWGGRHMRYPASEKAEIIRLVEGSHLPARRTLDKLGIPRATFYRWYDRYLTGGVEALADHRSRPDRVWNRIPDPIRAEIIDL
ALRETELSPRELAVRFTDEKRYFVSEASVYRLLKAHDLITSPAYIVIKAASEFKDKTTAPNQLWQTDFTYLKITGWGWYYLSTVLDDFSRFIVAWKLCAT
MRTDDVTATLDLALAASGLDQITVAHRPRLLSDNGASYISADLATWLDGKGMKHVRGAPYHPQTQGKIERWHQTLKNRILLENYYLPGDLERQVAAFVEH
YNHARSHESLGNLTPADVYFNRGQAILTERERIKRQTIQHRRLQHQLQAA
Blast result :
Comments
ISMex37 is 87% aa similar to ISMex16.
The third ORF is a putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
The third ORF is a putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
References
1] Stéphane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE Submitted. (2009)