ISMex1
- Family IS3
- Group IS407
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
| | ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 |
DNA section
IS Length : 1200 bp
Ends
IR Length : 15
IRL : TGACCTTGCCCCCTGGCCTGCCCTCATTCTGAAGCTAGGGTCCGTCGAGC
IRR : TGACCTTGCCCCCTGTTTGCCCCCGTTCATAACCAGAGTCCGTTCTGGTT
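The listed IR length can be checked against the two 50 bp end sequences. The sketch below is a minimal illustration, not part of the original record: the helper names `reverse_complement` and `shared_prefix_length` are hypothetical, and it counts only the perfectly matching run at the start of IRL and IRR (IR definitions that tolerate mismatches could give a different value).

```python
# IRL and IRR exactly as listed in the "Ends" block of this record.
IRL = "TGACCTTGCCCCCTGGCCTGCCCTCATTCTGAAGCTAGGGTCCGTCGAGC"
IRR = "TGACCTTGCCCCCTGTTTGCCCCCGTTCATAACCAGAGTCCGTTCTGGTT"

# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def shared_prefix_length(a, b):
    """Count identical positions from the start until the first mismatch."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# Length of the perfect terminal match between the two ends.
print(shared_prefix_length(IRL, IRR))    # 15

# IRR is listed in the same orientation as IRL, so its reverse
# complement should reproduce the 3' end of the element.
print(reverse_complement(IRR)[-15:])     # CAGGGGGCAAGGTCA
```

The first 15 positions of IRL and IRR are identical, which agrees with the listed IR length, and the reverse complement of IRR ends with the same 15 nucleotides as the DNA sequence of the element.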
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGAGTTCTGC | | CTCCAGATCG | 0 |
CGGGCTCTCC | | ACTCATTAGA | 0 |
AGCGGTCTGC | CTGG | TTCTGGATGC | 4 |
ACCCTGATCC | AGGC | CCTCTGCTAA | 4 |
CGAAGCCTCC | | AACAGGGGGC | 0 |
GCGAAGCCCC | | CTGTGTCTGC | 0 |
CCGTACCGTG | | GGTCGGATCT | 0 |
CGGCCTTCAC | CTAC | GACCTGCGCC | 4 |
AGGGGCAGGC | TGGC | CGGCGCGGAT | 4 |
ACATCGATGA | GGCT | CCAGCCGGGG | 4 |
CTCAGGCGCG | CGGT | AGATCCGTGA | 4 |
GGAGGACGAC | | CCAGCAGGGC | 0 |
GAAATCGCGT | | CAGCGCTTGC | 0 |
CGCTGACCAC | | CGCCAACCAT | 0 |
GCCCCCTGTT | | GGACGACCGC | 0 |
CGAGGCATCC | | CGTCCCCCGC | 0 |
CCATCCGATG | | GGTGGCCCTG | 0 |
TAACGGGTGC | | GTCGGGTCAA | 0 |
CTCGGCTTAC | CGGC | TAGTTCAGCA | 4 |
TCTCCTGGAG | CGTC | GACATGATGG | 4 |
TCTCGGGCTC | GCCC | TCACTGCCAC | 4 |
TCTCGACAAC | GCGG | CCACACACAA | 4 |
CAAGCGCTCT | | GCGGCGTTCG | 0 |
CCCAAGTAAC | | GAACGCTCAT | 0 |
CCCGCTGACA | | AACAGGGGGC | 0 |
GGTTCGAAAC | CCGC | GTCTGTCATG | 4 |
CCCACGATAC | GCTT | GTATGCCCAA | 4 |
GGGCAGCTGA | | AACGGCTTCA | 0 |
CGAGCCATAC | GAGA | TAATCTGCGA | 4 |
GCCCCCTGTT | | CTGGACGACC | 0 |
TGCTGGGGCA | TCAG | GTTTTGAGCC | 4 |
GCAGCCCCAG | GCGT | CACTGAGCCA | 4 |
CGCTAGACCC | | AACAGGGGGC | 0 |
TATGGTATGC | TCTC | GCATCTGCGG | 4 |
CGATATCCCT | | ACAGCTCGGC | 0 |
CGCCCCTGGC | | CCATGGACGA | 0 |
TGGTGCTTGG | | CGAAGCCTCC | 0 |
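Rows of the insertion site table can be read programmatically whether or not the empty direct repeat cell is written out. The following is a minimal sketch under that assumption; `parse_insertion_row` is a hypothetical helper, and the three sample rows are copied from the table above.

```python
def parse_insertion_row(row):
    """Split one table row into (left flank, direct repeat, right flank, DR length).

    Rows without a direct repeat carry only three non-empty cells
    (left flank, right flank, 0); the direct repeat is then returned as "".
    """
    cells = [c.strip() for c in row.split("|")]
    cells = [c for c in cells if c]          # ignore empty cells
    if len(cells) == 4:
        left, dr, right, dr_len = cells
    elif len(cells) == 3:
        left, right, dr_len = cells
        dr = ""
    else:
        raise ValueError(f"unexpected row layout: {row!r}")
    dr_len = int(dr_len)
    if len(dr) != dr_len:
        raise ValueError(f"DR length does not match the repeat in: {row!r}")
    return left, dr, right, dr_len


sample_rows = [
    "GGAGTTCTGC | CTCCAGATCG | 0 | |",        # no direct repeat, empty cell dropped
    "GGAGTTCTGC | | CTCCAGATCG | 0 |",        # same row with the empty cell kept
    "AGCGGTCTGC | CTGG | TTCTGGATGC | 4 |",   # 4 bp direct repeat
]
for row in sample_rows:
    print(parse_insertion_row(row))
```

The length check inside the helper guards against a flank sequence being mistaken for a direct repeat when the columns are shifted.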
DNA sequence
TGACCTTGCCCCCTGGCCTGCCCTCATTCTGAAGCTAGGGTCCGTCGAGCAGGAGACGGACGATGAAGAAGAGCCGGTTTAGCGAAGAGCAGATCATCGG
CATCCTGAAGGAGCAGCAGGCTGGGCTGCCGGTGGCTGAGATCTGTCGCCGCCACGGCATCAGCGACGCGACGTTCTACACGTGGCGCTCGCGCTTCGGC
GGCATGGAGGTCTCGGACGCGCGGCGTCTGAAGGCGCTCGATGAGGAGAACCGCAAGCTCAAGAAGCTCCTGGCCGAGGCGATGCTCGACGTGGCCACGC
TGCGTGAGGCGCTGGGAAAAAACTTCTGACGCCCGGCGCACGGAGAACGGCCGTGAGCTGGGCCATCGAGGAGAAAGGTTATTCGCAGCGTCGTGCCTGC
GGGCTGATCGGCCTTGAGCCGAAGACGTACCGCTACGCCTCGACCCGTGGTGACGACGCGGCTGTGCGGGTGCGCCTGCGCGGCCTGGCCGGCGAGCGCC
GCCGGTTCGGCTACCGGCGCCTGCTCATCCTACTGCGGCGGGAGGGCCTCGCTCTCAACCACAAGAAGCTCTTCCGGCTCTACCGAGAGGAGCGGCTGTC
GGTGCGCAAGCGCGGAGGTCGCAAGCGAGCACTTGGCACGCGAGCGCCCGCCGCGGTGCCGCAGGAGCCGAACCAGCGCTGGAGCCTCGACTTCGTCTCC
GACACGCTCGACGACGGGCGGCGCTTCCGCATCCTCGTCGTGGTCGATGACTGCACGCGGGAGTGCCTGGCGCTGGTGGTCGACACGTCGCTGTCCGGGC
GGCGGGTCACGCGTGAACTCGACCGGATCATCAAGGGCCGGGGCAAGCCGCTGATGATCGTCTCGGACAACGGCACCGAGCTGACCTCGCACGCCATCCT
GCGCTGGCAGGAGGAGCGTGCGGTCGAGTGGCATTACATCGCGCCCGGCAAGCCGCAGCAGAACGGTTTTGTCGAGAGCTTGAACGGGCGCTTGCGCGAC
GAGTGCCTGAACGAGCATCTGTTCCGGAGCCTGCCGGCGGCCCGGACCATCATCGAGGCGTGGCGGGTCGACTATAACACCTGCCGCCCCCACACGAGCC
TCGGCGGGCTCACCCCGAACGCGTTTGCAACCCGGTCCCGACAGGACCAGAACCAGAACGGACTCTGGTTATGAACGGGGGCAAACAGGGGGCAAGGTCA
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
267 bp | 88 aa | 63 | 329 | + | No |
Description : First part of the transposase
ORF sequence :
MKKSRFSEEQIIGILKEQQAGLPVAEICRRHGISDATFYTWRSRFGGMEVSDARRLKALDEENRKLKKLLAEAMLDVATLREALGKNF
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
867 bp | 288 aa | 308 | 1174 | + | No |
Description : Second part of the transposase
ORF sequence :
GAGKKLLTPGARRTAVSWAIEEKGYSQRRACGLIGLEPKTYRYASTRGDDAAVRVRLRGLAGERRRFGYRRLLILLRREGLALNHKKLFRLYREERLSVR
KRGGRKRALGTRAPAAVPQEPNQRWSLDFVSDTLDDGRRFRILVVVDDCTRECLALVVDTSLSGRRVTRELDRIIKGRGKPLMIVSDNGTELTSHAILRW
QEERAVEWHYIAPGKPQQNGFVESLNGRLRDECLNEHLFRSLPAARTIIEAWRVDYNTCRPHTSLGGLTPNAFATRSRQDQNQNGLWL
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1112 bp | 370 aa | 63 | 1174 | + | Yes |
Chemistry : DDE
ORF sequence :
MKKSRFSEEQIIGILKEQQAGLPVAEICRRHGISDATFYTWRSRFGGMEVSDARRLKALDEENRKLKKLLAEAMLDVATLREALGKKLLTPGARRTAVSW
AIEEKGYSQRRACGLIGLEPKTYRYASTRGDDAAVRVRLRGLAGERRRFGYRRLLILLRREGLALNHKKLFRLYREERLSVRKRGGRKRALGTRAPAAVP
QEPNQRWSLDFVSDTLDDGRRFRILVVVDDCTRECLALVVDTSLSGRRVTRELDRIIKGRGKPLMIVSDNGTELTSHAILRWQEERAVEWHYIAPGKPQQ
NGFVESLNGRLRDECLNEHLFRSLPAARTIIEAWRVDYNTCRPHTSLGGLTPNAFATRSRQDQNQNGLWL
Blast result :
Comments
ISMex1 shares 85% amino acid similarity with ISGdi11. ORF 3 is a putative OrfAB fusion transposase, reconstructed in silico by assuming a -1 frameshift between ORF 1 and ORF 2.
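The ORF coordinates in the protein section are consistent with a -1 frameshift, which can be checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, coordinates are taken as 1-based and inclusive on the + strand, and the fusion length is computed on the assumption that a -1 shift reads one nucleotide twice.

```python
# ORF coordinates (begin, end) as listed in the protein section.
ORFS = {
    "ORF 1": (63, 329),
    "ORF 2": (308, 1174),
    "ORF 3": (63, 1174),
}


def length_bp(begin, end):
    """Length in bp of a 1-based, inclusive interval."""
    return end - begin + 1


def frame(begin):
    """Reading frame (0, 1 or 2) of a + strand ORF starting at `begin`."""
    return (begin - 1) % 3


def relative_shift(upstream_begin, downstream_begin):
    """Signed frame change (-1, 0 or +1) from the upstream to the downstream ORF."""
    delta = (frame(downstream_begin) - frame(upstream_begin)) % 3
    return delta - 3 if delta == 2 else delta


def fused_protein_length(begin, end, shift=-1):
    """Amino acids encoded by a fused ORF, stop codon excluded.

    A -1 shift re-reads one nucleotide, so the number of translated
    nucleotides is the span length minus the shift.
    """
    translated = length_bp(begin, end) - shift
    return translated // 3 - 1


orf1, orf2, orf3 = ORFS["ORF 1"], ORFS["ORF 2"], ORFS["ORF 3"]

print(length_bp(*orf1), length_bp(*orf2), length_bp(*orf3))  # 267 867 1112
print(relative_shift(orf1[0], orf2[0]))                      # -1
print(orf1[1] - orf2[0] + 1)                                 # 22 (bp overlap of ORF 1 and ORF 2)
print(fused_protein_length(*orf3))                           # 370
```

ORF 2 starts in the -1 frame relative to ORF 1 and overlaps its 3' end by 22 bp, and the fused span of 1112 bp gives the 370 aa listed for ORF 3 once the re-read nucleotide and the stop codon are accounted for.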
References
[1] Stéphane Vuilleumier et al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE. Submitted.
[2] Ming-Chun Lee (2009) Direct submission.