ISMex27
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 |
DNA section
IS Length : 2493 bp
Ends
IR Length : 30
IRL : TGTTGATTTCCGCTGAGAGTTGACCCGGCGGAGCCCGAAGTTTCCATCGA
IRR : TGTTGATTTCCGCTGAGAGTTGACCCGGCGTTTTCGTTGAGAAGTGACCC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGCGCCAGCT | CCGTGGCGTT | 0 |
DNA sequence
TGTTGATTTCCGCTGAGAGTTGACCCGGCGGAGCCCGAAGTTTCCATCGAGAAGTGACCCGTGTCTGAACTCACCCCTGGCTTGCTGGTCGGGGGTCATG
GAGTGATCGACATGGCATTATTGAGCGTCATCCGGCGCTGGGCATTTCGAGAAGGGATGCCGATCCGGGAGATTTCGCGACGGACAGGATTGTCGCGCAA
CACCATTCGCAAATATCTGCGATCGGACGCGCAGGAGCCGGTGTATCGAGCGCCGGACCGGCCCAGCAGGTTGGACCCGTTCGCCGCAAAACTGTCAGGC
TGGTTGAAGGCCGAGGCGAACCGGTCCCGCAAGCAACAGCGTACGGCGCGGCAGATGCACGCCGACTTGGTGCTCCTCGGCTATGACGGCTCCTACGGCA
GGGTCGCGGCCTTTATTCGACGCTGGAAGGAGAACCGGCAGACCGAACAGCAGACCACGGGACGCGGGACCTTCGTACCTCTGATCTTCGCGCCCGGTGA
AGCGTTCCAGTTCGACTGGAGCGAGGACTGGGCGGATCTCGCAGGCGAGCGCGTCAAGCTGCAGGTGGCGCACACCAAGCTCTCGCACAGTCGAGCCTTT
GTCGTGCGCGCCTACCCGCTTCAGAGCCACGAGATGCTGTTCGACGCGCACTACCATGCCTTCCGCGTGCTCGGCGGTATCCCGGGTCGCGGCATCTACG
ACAACATGAAGACCGCCGTGGATCGGGTCGGCTCCGGCAAGGCCCGGCAGGTGAACGCGCGCTTCGCCGCCATGGCGAGCCACTATCTATTCGACACGAC
CTTCTGCAATCCCGCCTCAGGCTGGGAGAAGGGGCAGGTCGAGAAGAACGTGCAGGACGCGCGTCGCCGGCTGTGGCAGCCGCTCCCCAGCTTCGCCGAT
CTCGATGGGTTGAACGCCTGGCTTGAGATGCGCTGCATCGCGGCCTGGAAGGAGATCCCGCATGGAGATCTCCCCGGCACGCTCGCCGATGCGTGGAGCG
CCGAGGTCGCGAGCCTGATGCCGCTCGGGCGACCGTTCGACGGCTTCGTCGAGCACACAAAGCGGGTCTCGCCGACCTGCCTCGTGCACTTCGAGCGCAA
CCGCTATTCGGTTCCGGCATCGCTTGCCAATCGCCCCGTAAGCGTGAGGATCTACCCCGAGCGCATCGTCGTGGCCGCAGAAGGCCGGATCCTGTGCGAG
CATCTGCGAGTCATATCGCGCTCGCACAGCGTGGCCGGGCGCACGATCTACGACTGGCGCCACTATCTGGCGGTCATCCAACGCAAGCCAGGCGCCCTGC
GCAATGGGGCTCCCTTCGCCGAGATGCCGGATGCATTCCGGCGTTTGCAGGGGCACCTCCTCAAGCACCCTGGCGGCGATCGGGAGATGGTAGAGATTCT
GGCACTCGTCCTGCAGCACGACGAGCAGGCTGTTCTATGCGCGGTCGAACTGGCGCTCGAGAGCGGCGTCCCCACCAAGATCCATGTTCTCAACATCCTG
CACCGGCTGCTCGACGGTAAGCCATCGAGCATGCCAACCATCGATGCCCCGCAGGCGCTCGTCTTGCGTCAGGAGCCGAAGGCCGATGTCGAGCGCTACG
ACGCGCTCCGTAGCAAGGCGGTCCGCCATGCGTCATGACCCCGCCAGCGCCGCCATCATCGTCATGCTGCGCGGGCTCAAGATGTACGGCATGTCTCAAG
CCGTCGGCGACCTTGTGGAGCAAGGCGCGCCTGCCTTCGAGGCAGCCGTCCCGCTCTTGGCCCAGTTGCTCAAGGCCGAGATGGCCGAGCGGGAGGTCCG
ATCCATCGCCTACCAGATCAAGGCCGCCCGCTTTCCGGCCTACAAGGACCTGACCGGGTTCGACTTCGCTGCCAGCGAGGTCAACGAGGCCGTGGTGCGC
ACGCTGCACGGGGGCGACTTCATCGACGGGGCACACAACGTTGTGCTGATCGGGGGGCCAGGAACCGGCAAAACGCACGTCGCAACGGCCCTAGGCGTGC
AGGCCATCGAGCACCATCGCAAGAAGGTCCGATTCTTCGCCACCGTCGACCTGGTCAATGCGCTCGAACAGGAGAAGGCCCTGAACAAGGCCGGCCAATT
GGCGGATCGATTGCTGCGCCTGGACCTGATCATCCTGGACGAGCTGGGCTATCTCCCCTTCAGCACCTCAGGAGGAGCCTTGCTGTTCCACCTGCTGTCC
AAGCTCTACGAGCGCACCAGCGTCGTCATCACGACCAACCTCAGCTTCAGCGAGTGGGCTGACATCTTCGGCGATGCCAAGATGACCACTGCGTTGCTCG
ATCGCCTCACCCACCACTGCCACATTCTGGAAACCGGAAACGACAGCTTCCGCTTCCGGGCCAGTGCCGCAGCCGGAAAACCAAAAAGGAGCCGCGCTGA
GTTGACCAAGCCCGTCCAATGACAAACATCAGCCCCTGAACCCGGGTCACTTCTCAACGAAAACGCCGGGTCAACTCTCAGCGGAAATCAACA
GAGTGATCGACATGGCATTATTGAGCGTCATCCGGCGCTGGGCATTTCGAGAAGGGATGCCGATCCGGGAGATTTCGCGACGGACAGGATTGTCGCGCAA
CACCATTCGCAAATATCTGCGATCGGACGCGCAGGAGCCGGTGTATCGAGCGCCGGACCGGCCCAGCAGGTTGGACCCGTTCGCCGCAAAACTGTCAGGC
TGGTTGAAGGCCGAGGCGAACCGGTCCCGCAAGCAACAGCGTACGGCGCGGCAGATGCACGCCGACTTGGTGCTCCTCGGCTATGACGGCTCCTACGGCA
GGGTCGCGGCCTTTATTCGACGCTGGAAGGAGAACCGGCAGACCGAACAGCAGACCACGGGACGCGGGACCTTCGTACCTCTGATCTTCGCGCCCGGTGA
AGCGTTCCAGTTCGACTGGAGCGAGGACTGGGCGGATCTCGCAGGCGAGCGCGTCAAGCTGCAGGTGGCGCACACCAAGCTCTCGCACAGTCGAGCCTTT
GTCGTGCGCGCCTACCCGCTTCAGAGCCACGAGATGCTGTTCGACGCGCACTACCATGCCTTCCGCGTGCTCGGCGGTATCCCGGGTCGCGGCATCTACG
ACAACATGAAGACCGCCGTGGATCGGGTCGGCTCCGGCAAGGCCCGGCAGGTGAACGCGCGCTTCGCCGCCATGGCGAGCCACTATCTATTCGACACGAC
CTTCTGCAATCCCGCCTCAGGCTGGGAGAAGGGGCAGGTCGAGAAGAACGTGCAGGACGCGCGTCGCCGGCTGTGGCAGCCGCTCCCCAGCTTCGCCGAT
CTCGATGGGTTGAACGCCTGGCTTGAGATGCGCTGCATCGCGGCCTGGAAGGAGATCCCGCATGGAGATCTCCCCGGCACGCTCGCCGATGCGTGGAGCG
CCGAGGTCGCGAGCCTGATGCCGCTCGGGCGACCGTTCGACGGCTTCGTCGAGCACACAAAGCGGGTCTCGCCGACCTGCCTCGTGCACTTCGAGCGCAA
CCGCTATTCGGTTCCGGCATCGCTTGCCAATCGCCCCGTAAGCGTGAGGATCTACCCCGAGCGCATCGTCGTGGCCGCAGAAGGCCGGATCCTGTGCGAG
CATCTGCGAGTCATATCGCGCTCGCACAGCGTGGCCGGGCGCACGATCTACGACTGGCGCCACTATCTGGCGGTCATCCAACGCAAGCCAGGCGCCCTGC
GCAATGGGGCTCCCTTCGCCGAGATGCCGGATGCATTCCGGCGTTTGCAGGGGCACCTCCTCAAGCACCCTGGCGGCGATCGGGAGATGGTAGAGATTCT
GGCACTCGTCCTGCAGCACGACGAGCAGGCTGTTCTATGCGCGGTCGAACTGGCGCTCGAGAGCGGCGTCCCCACCAAGATCCATGTTCTCAACATCCTG
CACCGGCTGCTCGACGGTAAGCCATCGAGCATGCCAACCATCGATGCCCCGCAGGCGCTCGTCTTGCGTCAGGAGCCGAAGGCCGATGTCGAGCGCTACG
ACGCGCTCCGTAGCAAGGCGGTCCGCCATGCGTCATGACCCCGCCAGCGCCGCCATCATCGTCATGCTGCGCGGGCTCAAGATGTACGGCATGTCTCAAG
CCGTCGGCGACCTTGTGGAGCAAGGCGCGCCTGCCTTCGAGGCAGCCGTCCCGCTCTTGGCCCAGTTGCTCAAGGCCGAGATGGCCGAGCGGGAGGTCCG
ATCCATCGCCTACCAGATCAAGGCCGCCCGCTTTCCGGCCTACAAGGACCTGACCGGGTTCGACTTCGCTGCCAGCGAGGTCAACGAGGCCGTGGTGCGC
ACGCTGCACGGGGGCGACTTCATCGACGGGGCACACAACGTTGTGCTGATCGGGGGGCCAGGAACCGGCAAAACGCACGTCGCAACGGCCCTAGGCGTGC
AGGCCATCGAGCACCATCGCAAGAAGGTCCGATTCTTCGCCACCGTCGACCTGGTCAATGCGCTCGAACAGGAGAAGGCCCTGAACAAGGCCGGCCAATT
GGCGGATCGATTGCTGCGCCTGGACCTGATCATCCTGGACGAGCTGGGCTATCTCCCCTTCAGCACCTCAGGAGGAGCCTTGCTGTTCCACCTGCTGTCC
AAGCTCTACGAGCGCACCAGCGTCGTCATCACGACCAACCTCAGCTTCAGCGAGTGGGCTGACATCTTCGGCGATGCCAAGATGACCACTGCGTTGCTCG
ATCGCCTCACCCACCACTGCCACATTCTGGAAACCGGAAACGACAGCTTCCGCTTCCGGGCCAGTGCCGCAGCCGGAAAACCAAAAAGGAGCCGCGCTGA
GTTGACCAAGCCCGTCCAATGACAAACATCAGCCCCTGAACCCGGGTCACTTCTCAACGAAAACGCCGGGTCAACTCTCAGCGGAAATCAACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1449 bp | 482 aa | 190 | 1638 | + | No |
Chemistry : DDE
ORF sequence :
LSRNTIRKYLRSDAQEPVYRAPDRPSRLDPFAAKLSGWLKAEANRSRKQQRTARQMHADLVLLGYDGSYGRVAAFIRRWKENRQTEQQTTGRGTFVPLIF
APGEAFQFDWSEDWADLAGERVKLQVAHTKLSHSRAFVVRAYPLQSHEMLFDAHYHAFRVLGGIPGRGIYDNMKTAVDRVGSGKARQVNARFAAMASHYL
FDTTFCNPASGWEKGQVEKNVQDARRRLWQPLPSFADLDGLNAWLEMRCIAAWKEIPHGDLPGTLADAWSAEVASLMPLGRPFDGFVEHTKRVSPTCLVH
FERNRYSVPASLANRPVSVRIYPERIVVAAEGRILCEHLRVISRSHSVAGRTIYDWRHYLAVIQRKPGALRNGAPFAEMPDAFRRLQGHLLKHPGGDREM
VEILALVLQHDEQAVLCAVELALESGVPTKIHVLNILHRLLDGKPSSMPTIDAPQALVLRQEPKADVERYDALRSKAVRHAS
APGEAFQFDWSEDWADLAGERVKLQVAHTKLSHSRAFVVRAYPLQSHEMLFDAHYHAFRVLGGIPGRGIYDNMKTAVDRVGSGKARQVNARFAAMASHYL
FDTTFCNPASGWEKGQVEKNVQDARRRLWQPLPSFADLDGLNAWLEMRCIAAWKEIPHGDLPGTLADAWSAEVASLMPLGRPFDGFVEHTKRVSPTCLVH
FERNRYSVPASLANRPVSVRIYPERIVVAAEGRILCEHLRVISRSHSVAGRTIYDWRHYLAVIQRKPGALRNGAPFAEMPDAFRRLQGHLLKHPGGDREM
VEILALVLQHDEQAVLCAVELALESGVPTKIHVLNILHRLLDGKPSSMPTIDAPQALVLRQEPKADVERYDALRSKAVRHAS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
795 bp | 264 aa | 1628 | 2422 | + | No |
AG : IS21 helper
ORF sequence :
MRHDPASAAIIVMLRGLKMYGMSQAVGDLVEQGAPAFEAAVPLLAQLLKAEMAEREVRSIAYQIKAARFPAYKDLTGFDFAASEVNEAVVRTLHGGDFID
GAHNVVLIGGPGTGKTHVATALGVQAIEHHRKKVRFFATVDLVNALEQEKALNKAGQLADRLLRLDLIILDELGYLPFSTSGGALLFHLLSKLYERTSVV
ITTNLSFSEWADIFGDAKMTTALLDRLTHHCHILETGNDSFRFRASAAAGKPKRSRAELTKPVQ
GAHNVVLIGGPGTGKTHVATALGVQAIEHHRKKVRFFATVDLVNALEQEKALNKAGQLADRLLRLDLIILDELGYLPFSTSGGALLFHLLSKLYERTSVV
ITTNLSFSEWADIFGDAKMTTALLDRLTHHCHILETGNDSFRFRASAAAGKPKRSRAELTKPVQ
Blast result :
Comments
ISMex27 is 85% (istA, the transposase) and 94% (istB, the helper of transposition) aa similar to ISRsp3.
References
1] Stephane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. (2009) PLoS ONE Submitted.