ISMdi7
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium dichloromethanicum | Methylobacterium extorquens AM1, Methylobacterium dichloromethanicum DM4 |
DNA section
IS Length : 2568 bp
Ends
IR Length : 10/12
IRL : TGTGTAGGGCGGTGACAAAACCATCCACTGGAGCGTCGGCTTTGTGCTGA
IRR : TGTAAAGGGCGGACCAATTTTAGGCCGCCGTGGCGGAGTAAAACCAGGCC
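The relationship between the two ends can be checked with a short Python sketch. It assumes that "10/12" means 10 matching bases over the first 12 positions of IRL and IRR, and that IRR is listed as the reverse complement of the right end of the DNA sequence; neither convention is stated in the record. `RIGHT_END` is the last 50 bp of the DNA sequence in this record, and the helper names are illustrative.

```python
# Terminal sequences copied from this record (50 bp each).
IRL = "TGTGTAGGGCGGTGACAAAACCATCCACTGGAGCGTCGGCTTTGTGCTGA"
IRR = "TGTAAAGGGCGGACCAATTTTAGGCCGCCGTGGCGGAGTAAAACCAGGCC"
# Last 50 bp of the DNA sequence as listed in this record (top strand).
RIGHT_END = "GGCCTGGTTTTACTCCGCCACGGCGGCCTAAAATTGGTCCGCCCTTTACA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_matches(a: str, b: str, window: int) -> int:
    """Count identical bases over the first `window` positions of a and b."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


# Assumption: IRR is displayed as the reverse complement of the right end.
irr_is_revcomp = reverse_complement(RIGHT_END) == IRR
# Assumption: "10/12" reads as matches / compared positions.
matches = count_matches(IRL, IRR, 12)

print(irr_is_revcomp, f"{matches}/12")  # True 10/12
```

Under these assumptions the two mismatches fall at positions 4 and 5 of the terminal repeats (GT in IRL, AA in IRR).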
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTTGTCGGAG | | CACCTGCGCG | 0 |
CTAATCGCGT | | CCCTTATCCT | 0 |
DNA sequence
TGTGTAGGGCGGTGACAAAACCATCCACTGGAGCGTCGGCTTTGTGCTGATGCGGGCGGTGTAAAAGCCGGCCAGTGGAGAGGCTTCTCTTTCGAGAGGA
GCCGAGGATGTTCGTCGTGGAAGTCTACGCAGCCGTTCGGCAGTTCGTGTTCATCGAGGGCCAGTCCCGGCGTGAGGCGGCTCGGGTCTTCGGGTTGAGC
CGAGAGACGATCGCCAAGATGTGTCGGTTCTCGTTGCCGCCGGGCTACACACGCTCGAAGCCGGTCGAGAAGCCGAAGCTCGGGCCTCTTCTGCCAGTGA
TCGCGGCGATCCTTGAGGCGGACCGGACTGCACCGCTCAAGCAACGCCATACGGCCAAGCGGATCTTCGAGCGCCTACGCGACGAGCACGGCTATGCCGG
TGGCTATACCGTGGTGAAGGACCACGTACGGATCTGCCGAGCACGGGGGCAGGAGACTTTCGTGCCGCTCGCCCACCCGCCCGGCCATGCCCAGGTCGAT
TTTGGCGAGGCGGTCGCCACCATCGCTGGCGTGCGCCGCAAGATCCATTTCTTCTGCATGGATCTGCCGCACTCCGACGCCTGCTTCGTGAAGGCGTATC
CGCGGGAGACCACCGAAGCTTTTCTCGACGGGCATGTCGCCGCCTTCGCCTTCTTCGGCGGCGTGCCTCTGTCGATCCTGTACGACAACACCAAGATCGC
AGTGGCCAAGATCTGCGGCGACGGACAGCGTGAGCGTACGCGCGCCTTCACCGAGTTGGTGAGCCACTGCCTGTTCCGGGATCGCTTCGGCCGTCCGGGC
AGGGGCAACGACAAGGGCAAGGTCGAGGGGCTGGTCAAGTTCGCCCGGTCCCACTTCATGACCCCGGCTCCGGAGGCGGCCTCGTTCGAGGCGCTGAACG
CTGACCTGGAGCGCCGCTGCCGAGCTCGGCAGAACGAGTGCGCCGGGCGGCATCCCGAGAGCATCGGAACGCGGCTCATGGCCGATCGAGTGGTTCTGCG
AGCCCTGCCGGCGGTGCCGCTGGAGCCGTGCGAGAAGAGGGCCGGGCGCGTCTCGTCGACCGCGCTGGTGCGCTATCGCGGCAACGACTACTCGGTGCCG
ACCACCTACGGCTTCCGGGACGTGCTGGTGAAGGGCTTCGTCGAGGAAGTCGTGATCCTGTGTGCGGGGGTCGAGATCGCCCGGCACCCGCGCAGCTACG
GCAGCGGCGTTTTCGTCGCCGAACCTCTGCACTACCTCGCGCTGATCGAGACCAAGCCGAACGCCCTCGACCAAGCCGCGGCACTCCAGGGCTGGGATCT
GCCCGAGGCGTTCCAGCACCTGCGCCACCTTCTGGAGGCGCGCATGGGCAACCGCGGCAAGCGCGAGTTCATCCAGGTGCTGCGCCTGATGGAGGCGATG
CCGAAGGACCTCGTGGCCTGGGCCGTCACCGAGGCGATCCGGCTCGGGGCGATTGGCTTCGATGCGGTCAAGTTGATCGCGCTGGCCCGTCTCGAACGGC
GGCCGCCTCGGCTCGACTTGTCGGCCTACCCGCATCTGCCCCGGCCTGCGGTGCGCGCGACGATGGCCGCCGACTACACGGTGCTGGTGCCGGAGGTGGC
GGCATGAGCGTGAGCGGCGATGAGACGACACCGGGCGTCCTGCTCGCCCATCACCTCAAGCAGTTGAAGTTGCCGACGGTCCTGCGCGAGTACGACAAGG
TCGCCCGGGACTGCGCCCGGAGCGGCCTCGACCACCCCCGCTATCTGCTGCGGCTGGTTGAGCTGGAACTGATCGACCGCGAACGGCGCATGGTCGAGCG
CCGGATCCGGGCGGCACGCTTCCCGGCGGTGAAGAGCCTCGACACCTTCGACTTCGCCGCCATCCCGAGCCTGAACAAGATGCTCGTGCTGGAGCTGGCG
CGCTGCGGCTACGTCCTCGGTCGGGAGAACGTCATCGCGCTCGGCAACTCCGGCACCGGCAAGACGCACATCGCCCTGGCTCTCGGACTGGCAGCCTGCC
AGAAGGGCTTCTCGGTCACGTTCACCACGGCGGCCTCGCTGGTCAACCAGCTCATGGAGGCGCGTGACGAGCGCCGCCTGCTCCGGCTTCAGAGGGAGCT
GGCCGCGGTGAAGTTGCTGATTGTCGATGAACTCGGCTACGTGCCGCTGTCGCCGACGGGGGCCGAGCTTTTGTTCGAGGTCCTGTCCCAGCGCTACGAG
CGCGGCTCGACGGTGGTGACCTCGAACCTTCCGTTCGAGGACTGGACCTCGGTCCTGGGCTCGGAGCGGCTGACCGGTGCGCTGCTGGATCGGCTGACCC
ATCACGTCAGCATCCTGAGCCTGAACGGAGACAGCTACCGCCTCAGATCCTCCCGCAGCCGGCGGGGCCGCACAGACAGGGCGGAGCAAAACCAGGCCAC
CCTTGATGACCCTGATCCCGCGACGGGCGAGATCCGGCCAGCCTGACCTCACGATCCGCGACAAACGATGAAGAGGCCCGATCGGGCCCCTTCATCGTTC
AGGCTGTCCGCAGTCCGTGGCCTGGTTTTACTCCGCCACGGCGGCCTAAAATTGGTCCGCCCTTTACA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1500 | 499 | 108 | 1607 | + | No |
Chemistry : DDE
ORF sequence :
MFVVEVYAAVRQFVFIEGQSRREAARVFGLSRETIAKMCRFSLPPGYTRSKPVEKPKLGPLLPVIAAILEADRTAPLKQRHTAKRIFERLRDEHGYAGGY
TVVKDHVRICRARGQETFVPLAHPPGHAQVDFGEAVATIAGVRRKIHFFCMDLPHSDACFVKAYPRETTEAFLDGHVAAFAFFGGVPLSILYDNTKIAVA
KICGDGQRERTRAFTELVSHCLFRDRFGRPGRGNDKGKVEGLVKFARSHFMTPAPEAASFEALNADLERRCRARQNECAGRHPESIGTRLMADRVVLRAL
PAVPLEPCEKRAGRVSSTALVRYRGNDYSVPTTYGFRDVLVKGFVEEVVILCAGVEIARHPRSYGSGVFVAEPLHYLALIETKPNALDQAAALQGWDLPE
AFQHLRHLLEARMGNRGKREFIQVLRLMEAMPKDLVAWAVTEAIRLGAIGFDAVKLIALARLERRPPRLDLSAYPHLPRPAVRATMAADYTVLVPEVAA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
843 | 280 | 1604 | 2446 | + | No |
AG : IS21 helper
ORF sequence :
MSVSGDETTPGVLLAHHLKQLKLPTVLREYDKVARDCARSGLDHPRYLLRLVELELIDRERRMVERRIRAARFPAVKSLDTFDFAAIPSLNKMLVLELAR
CGYVLGRENVIALGNSGTGKTHIALALGLAACQKGFSVTFTTAASLVNQLMEARDERRLLRLQRELAAVKLLIVDELGYVPLSPTGAELLFEVLSQRYER
GSTVVTSNLPFEDWTSVLGSERLTGALLDRLTHHVSILSLNGDSYRLRSSRSRRGRTDRAEQNQATLDDPDPATGEIRPA
Blast result :
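The ORF coordinates listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that each ORF's End includes its stop codon; both are inferred from the numbers in this record rather than stated in it. `JUNCTION` is positions 1601-1610 of the DNA sequence in this record, and the class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int  # 1-based, inclusive (assumed convention)
    end: int    # 1-based, inclusive, stop codon included (assumed)

    @property
    def length_bp(self) -> int:
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        # One codon is the stop, so it does not count as a residue.
        return self.length_bp // 3 - 1


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs on the same strand."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


orf1 = Orf("ORF 1", 108, 1607)
orf2 = Orf("ORF 2", 1604, 2446)

# Positions 1601-1610 of the DNA sequence in this record.
JUNCTION = "GGCATGAGCG"
JUNCTION_START = 1601


def bases(first: int, last: int) -> str:
    """Slice JUNCTION using 1-based inclusive sequence coordinates."""
    return JUNCTION[first - JUNCTION_START:last - JUNCTION_START + 1]


for orf in (orf1, orf2):
    print(orf.name, orf.length_bp, "bp", orf.length_aa, "aa")
print("overlap:", overlap_bp(orf1, orf2), "bp", bases(1604, 1607))
print("ORF 2 start codon:", bases(1604, 1606))
print("ORF 1 stop codon:", bases(1605, 1607))
```

With these conventions the lengths reproduce the 1500 bp / 499 aa and 843 bp / 280 aa values in the tables. The two reading frames share 4 bp (ATGA), in which the ATG start of ORF 2 overlaps the TGA stop of ORF 1.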
Comments
ISMdi7 has 2 intact copies in DM4 and 1 partial copy in AM1.
ISMdi7 is 94% and 92% aa similar to orfA and orfB of ISMex8.
References
[1] Stéphane Vuilleumier et al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE, submitted (2009).