ISMex39
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 |
DNA section
IS Length : 2456 bp
Ends
IR Length : 27/35
IRL : TGACCCTGGATCGGCGTCCAATTCTGACCCCCTAACCTCGTTCTCGCAGA
IRR : TGAGGCGGGGGATCGGCGTCGAACTTTGACCCCCTGGATGGGTATGGCGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CAT | A | 0 |
DNA sequence
TGACCCTGGATCGGCGTCCAATTCTGACCCCCTAACCTCGTTCTCGCAGACTGCCCTCTGGTGCGGCCGGAAGGGTGGTGCAGGGGGATGAAGGGCGTGG
ACACGATTGCGCGCATCCGGCGCGAGTTCTTCACCCGCGGCCGCGCGATCAAGGACATCGTCCGCGACCTGCACGTCTCCCGCAACACGGTCCGGAAGGT
CATCCGGTCGGGGGCCACCGCCTTCACCTACGAGCGCGACGTGCAGCCTCTGCCCAAGCTCGGGCCGTGGCGCGACGATCTCGATCGGATGCTGACCACC
AACGCCGGCAGGTCCGCCCGCGAGCGACTGACGCTGATCCGGATCTACGAGGCGCTGCGCGGGCTCGGCTACGAGGGCGGCTACGATGCCGTTCGGCGCT
ATGCCAAAACCTGGAAGCGCGAGCGCGCCTCGGTCACGGCGCAGGCCTTCGTGCCGCTCATCTTCGCCCCAGGCGAGGCCTACCAGTTCGACTGGAGCCA
CGAGATCGTGCTGATCGGCGGCACGACCGTCACGGTCAAGGTCGCTCACGTCCGGCTCTGCCACTCGCGCATGCTGTTCGTGCGGGCTTATCCGCGCGAG
AGCCAGGAGATGGTGTTCGACGCCCACGACCGGGCGTTCGCGTTCTTCCGCGGCACCTGCCAGCGCGGCATCTACGACAACATGAAGACCGCGGTGGAGA
CGATCTTCGTCGGTCGCGAGCGCGCCTACAACCGTCGCTTCCTGCAGATGTGCTCGCACTACCTCGTCGATCCGGTCGCCTGCACGCCGGCCTCGGGCTG
GGAGAAGGGGCAGGTCGAGAACCAGGTCGGGCTGGTGCGCGAGCGCTTCTTCACCCCGCGCGTCCGGGTGAAGAGCTACGACGAGTTGAACGCGCTCCTA
CTCGATGGAGCTATCGCCTACGCCAAGGCCCATCCGCATCCCGAGGAGCGCGAGCGCACCGTCTGGGAGCGGTTCGAGGCCGAGCGGGCGGCGCTGGTGC
CCTATGCTGGCCGCTTCGACGGCTTCCACGCCGTCACCGCTTCGGTGTCCGCGACCTGCCTCGTACGCTTCGACAACAACCGATACTCGGTCATGGCCTC
CGCCATTGGCCGCCCGGTCGAAGTGCGAGCCTACGCCGAGCGCATCGAGATCCGGCAGGACGGGCGCGTCGTCGCCGAGCACGCGCGTGCCTTCGGCCGG
GATCAGACCGTGTTCGATCCCTGGCACTACGTCCCTGTGCTCGCCCGCAAACCTGGCGCGCTCCGGAACGGCGCACCGTTCAAGGACTGGGTGCTGCCCG
CCGCCCTCGACCGGGTTCGACGCAAGCTCGCCGGCAGTGCCGGGGGTGACCGGCAGATGGTCGAGATCCTCACCGCTGTGCTCGGTGACGGGTTGCCGGC
GGTCGAGGCGGCCTGTGCCGAGGCCCTGCGCGAGGGCGTCCACTCGGCCGATGTCGTCCTCAACATCCTCGCCCGGCAGCGCGAGCCGACCACGCCCGTC
ACCATCCTGACGCCCGAGAGCCTGCGGCTGCGCCACGAGCCGGTCGCCGACTGTGCCCGCTACGACAGCCTAAGGAGAGCACCATGATGGAACGCCAGCA
GATCCTCGCCACCATGGGCGATCTGAAGCTCTTCGGCATGAAGGCCGCCTACGACGAGATCATCAAGGTCGCAGTGAAGCGCACCCACGAGCCGCAGCAG
ATCGTCGGCGACCTGCTCCAGGCAGAGATCTCCGAGAAGCAGGCACGCTCGATCCGCTACCAGATGACCATCGCCAAACTGCCCCTGGCCAAGGACATCG
CCGAGTTCGCCTTCGACGGCACTCCGATCAACGAGGTGCTGGTGCGTGATCTCGCCGGCGGCGAGTTCCTGGCGCATCAGCGCAACGTCGTGCTCGTCGG
CGGCACCGGGACCGGCAAGACCCACCTCGCCATCGCCCTTGCACGAGCCTGCATCCGGGATGGCATGCGGGGCCGGTTCTACAACGTCGTCGATCTCGTC
AATCGGCTCGAAGCTGAGGCACGCGCCGGCCGGCACGGCCGCATCGCCGACCATCTGGCGCGGCTCGACTTCGTGGTGCTCGACGAACTCGGCTACCTGC
CGTTCGCGCAGTCGGGCGGCCAACTCCTGTTCCACCTGATCAGCAAGCTCTACGAGACGACCTCGATCGTGGTCACCACCAACCTCGCCTTCGGGGAATG
GCCGAGCGTGTTCGCCGGCGATGCCAAGATGACCACCGCGCTCCTCGACCGGCTCACCCACCACTGCGAGATCGTCGAGACCGGCAACGAGAGCTGGCGC
TTCAAGAACCGCGCCTGAGGCCAGCACACACGTCGCGCCTGCTGGACCCCTGATCAGCCCGGCTGCGCCACCCCGACCCGCCTTCGCCGGGCTGATCAGG
GGCGTCCCGCCATACCCATCCAGGGGGTCAAAGTTCGACGCCGATCCCCCGCCTCA
ACACGATTGCGCGCATCCGGCGCGAGTTCTTCACCCGCGGCCGCGCGATCAAGGACATCGTCCGCGACCTGCACGTCTCCCGCAACACGGTCCGGAAGGT
CATCCGGTCGGGGGCCACCGCCTTCACCTACGAGCGCGACGTGCAGCCTCTGCCCAAGCTCGGGCCGTGGCGCGACGATCTCGATCGGATGCTGACCACC
AACGCCGGCAGGTCCGCCCGCGAGCGACTGACGCTGATCCGGATCTACGAGGCGCTGCGCGGGCTCGGCTACGAGGGCGGCTACGATGCCGTTCGGCGCT
ATGCCAAAACCTGGAAGCGCGAGCGCGCCTCGGTCACGGCGCAGGCCTTCGTGCCGCTCATCTTCGCCCCAGGCGAGGCCTACCAGTTCGACTGGAGCCA
CGAGATCGTGCTGATCGGCGGCACGACCGTCACGGTCAAGGTCGCTCACGTCCGGCTCTGCCACTCGCGCATGCTGTTCGTGCGGGCTTATCCGCGCGAG
AGCCAGGAGATGGTGTTCGACGCCCACGACCGGGCGTTCGCGTTCTTCCGCGGCACCTGCCAGCGCGGCATCTACGACAACATGAAGACCGCGGTGGAGA
CGATCTTCGTCGGTCGCGAGCGCGCCTACAACCGTCGCTTCCTGCAGATGTGCTCGCACTACCTCGTCGATCCGGTCGCCTGCACGCCGGCCTCGGGCTG
GGAGAAGGGGCAGGTCGAGAACCAGGTCGGGCTGGTGCGCGAGCGCTTCTTCACCCCGCGCGTCCGGGTGAAGAGCTACGACGAGTTGAACGCGCTCCTA
CTCGATGGAGCTATCGCCTACGCCAAGGCCCATCCGCATCCCGAGGAGCGCGAGCGCACCGTCTGGGAGCGGTTCGAGGCCGAGCGGGCGGCGCTGGTGC
CCTATGCTGGCCGCTTCGACGGCTTCCACGCCGTCACCGCTTCGGTGTCCGCGACCTGCCTCGTACGCTTCGACAACAACCGATACTCGGTCATGGCCTC
CGCCATTGGCCGCCCGGTCGAAGTGCGAGCCTACGCCGAGCGCATCGAGATCCGGCAGGACGGGCGCGTCGTCGCCGAGCACGCGCGTGCCTTCGGCCGG
GATCAGACCGTGTTCGATCCCTGGCACTACGTCCCTGTGCTCGCCCGCAAACCTGGCGCGCTCCGGAACGGCGCACCGTTCAAGGACTGGGTGCTGCCCG
CCGCCCTCGACCGGGTTCGACGCAAGCTCGCCGGCAGTGCCGGGGGTGACCGGCAGATGGTCGAGATCCTCACCGCTGTGCTCGGTGACGGGTTGCCGGC
GGTCGAGGCGGCCTGTGCCGAGGCCCTGCGCGAGGGCGTCCACTCGGCCGATGTCGTCCTCAACATCCTCGCCCGGCAGCGCGAGCCGACCACGCCCGTC
ACCATCCTGACGCCCGAGAGCCTGCGGCTGCGCCACGAGCCGGTCGCCGACTGTGCCCGCTACGACAGCCTAAGGAGAGCACCATGATGGAACGCCAGCA
GATCCTCGCCACCATGGGCGATCTGAAGCTCTTCGGCATGAAGGCCGCCTACGACGAGATCATCAAGGTCGCAGTGAAGCGCACCCACGAGCCGCAGCAG
ATCGTCGGCGACCTGCTCCAGGCAGAGATCTCCGAGAAGCAGGCACGCTCGATCCGCTACCAGATGACCATCGCCAAACTGCCCCTGGCCAAGGACATCG
CCGAGTTCGCCTTCGACGGCACTCCGATCAACGAGGTGCTGGTGCGTGATCTCGCCGGCGGCGAGTTCCTGGCGCATCAGCGCAACGTCGTGCTCGTCGG
CGGCACCGGGACCGGCAAGACCCACCTCGCCATCGCCCTTGCACGAGCCTGCATCCGGGATGGCATGCGGGGCCGGTTCTACAACGTCGTCGATCTCGTC
AATCGGCTCGAAGCTGAGGCACGCGCCGGCCGGCACGGCCGCATCGCCGACCATCTGGCGCGGCTCGACTTCGTGGTGCTCGACGAACTCGGCTACCTGC
CGTTCGCGCAGTCGGGCGGCCAACTCCTGTTCCACCTGATCAGCAAGCTCTACGAGACGACCTCGATCGTGGTCACCACCAACCTCGCCTTCGGGGAATG
GCCGAGCGTGTTCGCCGGCGATGCCAAGATGACCACCGCGCTCCTCGACCGGCTCACCCACCACTGCGAGATCGTCGAGACCGGCAACGAGAGCTGGCGC
TTCAAGAACCGCGCCTGAGGCCAGCACACACGTCGCGCCTGCTGGACCCCTGATCAGCCCGGCTGCGCCACCCCGACCCGCCTTCGCCGGGCTGATCAGG
GGCGTCCCGCCATACCCATCCAGGGGGTCAAAGTTCGACGCCGATCCCCCGCCTCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1527 bp | 508 aa | 61 | 1587 | + | No |
Chemistry : DDE
ORF sequence :
VRPEGWCRGMKGVDTIARIRREFFTRGRAIKDIVRDLHVSRNTVRKVIRSGATAFTYERDVQPLPKLGPWRDDLDRMLTTNAGRSARERLTLIRIYEALR
GLGYEGGYDAVRRYAKTWKRERASVTAQAFVPLIFAPGEAYQFDWSHEIVLIGGTTVTVKVAHVRLCHSRMLFVRAYPRESQEMVFDAHDRAFAFFRGTC
QRGIYDNMKTAVETIFVGRERAYNRRFLQMCSHYLVDPVACTPASGWEKGQVENQVGLVRERFFTPRVRVKSYDELNALLLDGAIAYAKAHPHPEERERT
VWERFEAERAALVPYAGRFDGFHAVTASVSATCLVRFDNNRYSVMASAIGRPVEVRAYAERIEIRQDGRVVAEHARAFGRDQTVFDPWHYVPVLARKPGA
LRNGAPFKDWVLPAALDRVRRKLAGSAGGDRQMVEILTAVLGDGLPAVEAACAEALREGVHSADVVLNILARQREPTTPVTILTPESLRLRHEPVADCAR
YDSLRRAP
GLGYEGGYDAVRRYAKTWKRERASVTAQAFVPLIFAPGEAYQFDWSHEIVLIGGTTVTVKVAHVRLCHSRMLFVRAYPRESQEMVFDAHDRAFAFFRGTC
QRGIYDNMKTAVETIFVGRERAYNRRFLQMCSHYLVDPVACTPASGWEKGQVENQVGLVRERFFTPRVRVKSYDELNALLLDGAIAYAKAHPHPEERERT
VWERFEAERAALVPYAGRFDGFHAVTASVSATCLVRFDNNRYSVMASAIGRPVEVRAYAERIEIRQDGRVVAEHARAFGRDQTVFDPWHYVPVLARKPGA
LRNGAPFKDWVLPAALDRVRRKLAGSAGGDRQMVEILTAVLGDGLPAVEAACAEALREGVHSADVVLNILARQREPTTPVTILTPESLRLRHEPVADCAR
YDSLRRAP
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
735 bp | 244 aa | 1584 | 2318 | + | No |
AG : IS21 helper
ORF sequence :
MMERQQILATMGDLKLFGMKAAYDEIIKVAVKRTHEPQQIVGDLLQAEISEKQARSIRYQMTIAKLPLAKDIAEFAFDGTPINEVLVRDLAGGEFLAHQR
NVVLVGGTGTGKTHLAIALARACIRDGMRGRFYNVVDLVNRLEAEARAGRHGRIADHLARLDFVVLDELGYLPFAQSGGQLLFHLISKLYETTSIVVTTN
LAFGEWPSVFAGDAKMTTALLDRLTHHCEIVETGNESWRFKNRA
NVVLVGGTGTGKTHLAIALARACIRDGMRGRFYNVVDLVNRLEAEARAGRHGRIADHLARLDFVVLDELGYLPFAQSGGQLLFHLISKLYETTSIVVTTN
LAFGEWPSVFAGDAKMTTALLDRLTHHCEIVETGNESWRFKNRA
Blast result :
Comments
ISMex39 is 69% (istA, the transposase) and 79% (istB, the helper of transposition) aa similar to ISSsp4.
References
1] Stephane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. (2009) PLoS ONE Submitted.