ISMex8
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
 | ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 |
DNA section
IS Length : 2597 bp
Ends
IR Length : 10/12
IRL : TGTGTAGGGCGGTGACAAAACCATCCACTGGAGCGTCGGCGGCGTGCTGA
IRR : TGTAAAGGGCGGACCAATTCCAGGCCGCTGTGGCGGAGTAAAACCAGGCC
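The "IR Length : 10/12" value can be cross-checked against the two end sequences. The sketch below assumes that the notation means 10 identical positions over the first 12 bases of IRL and IRR as printed in this record; the helper name `ir_matches` is hypothetical and not part of any database tooling.

```python
# Terminal inverted repeat regions, copied from the IRL/IRR lines of this record.
IRL = "TGTGTAGGGCGGTGACAAAACCATCCACTGGAGCGTCGGCGGCGTGCTGA"
IRR = "TGTAAAGGGCGGACCAATTCCAGGCCGCTGTGGCGGAGTAAAACCAGGCC"


def ir_matches(left: str, right: str, span: int) -> int:
    """Count identical positions over the first `span` bases of two ends.

    Assumption: both ends are written 5'->3' reading inward from the
    IS terminus, so they can be compared position by position.
    """
    return sum(a == b for a, b in zip(left[:span], right[:span]))


matches = ir_matches(IRL, IRR, 12)
print(f"{matches}/12")  # prints "10/12"
```

Under this reading, the two mismatches fall at positions 4 and 5 of the repeat (GT in IRL, AA in IRR).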
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTCCGGCGTC | | CCCTCGGCTT | 0 |
TGGTCGAAAG | GG | CGTCATCCTT | 2 |
CTGAGCGAGC | | GGCGGTTTCG | 0 |
DNA sequence
TGTGTAGGGCGGTGACAAAACCATCCACTGGAGCGTCGGCGGCGTGCTGATGCGGGCGGCGTAAAAGTCGGCCAGTTTGAGAGGCTTCTCTTTCGGGAGG
GGCCCAGGATGTTTGCCGTGGAAGTCTACGCGGCCGTTCGGCAGTTTGTTTTCATCGACGGCCACTCTCGTCGTGAGGCAGCCCGGGTATTTGGGCTCAA
TCGCGAGACGATTGCCAAGATGTGCCGGTTCTCGCTGCCGCCGGGCTACACGCGCTCGAAGCCGATCGAGAAGCCGAAGCTGGGGCCGCTGCTGCCGGTG
ATCGATGCGATCCTGGAGGCGGACCGGATCGCACCCGTCAAGCAGCGTCACACGGCCAAGCGGATTTTCGAGCGTCTGCGCGACGAGCACGGCTATGGCG
GCGGCTACACGGTGGTGAAGGACCACGTCCGGATCGCACGCGCGAGCGGGAGGGAGACCTTCGTGCCGTTGGCGCACCCACCCGGCCACGCGCAGGTCGA
CTTCGGCGAAGCGATCGCCACCATCGGCGGCGTGCGCCGGAAGATCCACTTCTTCTGCATGGACCTGCCGCATTCCGACGCCTGCTTCGTGAAGGCGTAT
CCGCGGGAGACCACCGAGGCGTTCCTCGACGGGCACGTCGCCGCCTTCGATTTCTTCGCCGGCGTGCCGCTGTCGATTCTCTACGACAACACCAGGATCG
CGGTGGCCAAGATCTGCGGCGGAGGCCAGCGCGAACGTACCCGTGCCTTCACCGAGCTGGTTAGCCACTACCTGTTCCGAGATCGCTTCGGACGTCCCGG
CAGGGGCAACGACAAGGGCAAGGTCGAGGGGCTGGTCAAGTACGCCCGATCCAACTTCATGACCCCGGCCCCGGAGGCGGCCTCCTTCGAGGCGCTGAAC
GCGGATCTGGAGCGCCGCTGCCGGACCCGGCAGGATGAGCGGGCCGGACAGCATGCCGAGAGCATCGGCACGCGCTTGGTCGCCGATCGTGCCGTCCTGC
GTGCCCTGCCGGCGGTCCCGCTGGAGCCGTGCGAGAAGCGGGGTGGGCGGGTCTCCTCGACCGCGCTGGTTCGCTACCGCTGCAACGACTACTCGGTGCC
GACCGCCTACGGCTTCCGGGACGTGCTGGTGAAGGGCTTCGTCGAGGAGGTCGTGATCCTATGCGGCGGCATCGAGATCGCGCGGCATGAGCGCAGCTAC
GGCACCGGCGTGTTCGTGTCCGACCCGCTGCATTACCTCGCGCTGATCGAGATCAAACCGAACGCTCTCGACCAGGCGGCCGCGCTCCAGGGCTGGGATC
TGCCCGAGGCGTTCCAGCACCTACGCCACCTCCTGGAGGCGCGCATGGGTAACCGCGGCAAGCGGGAGTTCATCCAGGTGCTGCGCCTGATGGAGGCGAT
GCCCAAGGACGTCGTGGCCTCGTCCGTCACCGACGCGATCCGCCTCGGTGCGATCGGCTTCGACGCGGTCAAGCTGATCGCGCTGGCCCGGCTCGAACGA
CGACCGCCCCGGCTCGACCTGGCGGCCTACCCCCATCTGCCGAGGATGACGGTGAGGACGACCGCGGCCGCCGACTACACGGTGTTGGTGCCGGAGGTGG
CAGCATGAGCGCGGCACGTGACGAGACGATGCCGGGCGGCACCACCGGCGGCACGCCGGGGATCCTGCTCGCCCACCACCTCAAGCAGCTGAAGCTGCCC
ACGGTGCTGCGCGAGTACGACAAGGTCGCCCGGGAGTGCGCCCAGGGCGGCATCGACCACCCGCGCTACCTGCTGCGGCTCGTCGAGCTGGAACTGATCG
ATCGCGAACGGCGCATGGTCGAACGCCGGATCCGGTCGGCACGCTTCCCGGCGGTGAAGAGCCTCGACACCTTCGACTTCACCGCGATCCCGAGCCTGAA
CAAGATGCTCGTGCTGGAGCTGGCGCGCTGCGGCTACATCCTTGGCCGAGAGAACGTCATCGCACTCGGCAACTCCGGCACCGGCAAGACCCACATCGCC
TTGGCTCTCGGCCTGGCAGCTTGCCAGAAGGGCTTCTCGGTCGCGTTCACCACCGCGGCCTCTCTGGTCAACCAGCTCCTGGAGGCACGCGACGAGCGCC
GCCTGCTCCGGCTGCAGCGCGAACTGGCTTCGGTGAAGCTGCTGATCGTCGACGAACTCGGCTACGTGCCGCTGTCGGCCACAGGAGCCGAGTTGCTGTT
CGAGACCTTCTCGCAACGCTACGAGCGCGGCTCGACCGTGGTGACCTCGAACCTGCCGTTCGAGGATTGGACCTCGGTCCTGGGCTCGGAGCGGCTCACC
GGCGCGCTGCTCGATCGGCTGACCCACCACGTCAGCATCCTCAGCCTGAACGGCGACAGCTACCGCCTCAAGGCCTCCCGCAGCCGACGCGGCTGCGGCG
ATGGCAGGGCGGAGCAAAACCAGGCCACCATCGATCCCCACGATCCCGAGACGGGCGAGATCCTGCCGGCTTGAGCCTCAGGATCGGCGACGAACGATGA
AGGGGCCCGATCGGGCCCCTTCATCGTTCAAGCTGCCGGCCGCCACTGGCCTGGTTTTACTCCGCCACAGCGGCCTGGAATTGGTCCGCCCTTTACA
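The IRR line in the Ends section does not appear verbatim at the end of the DNA sequence, because it is written as the reverse complement of the right terminus. A minimal sketch of that check follows; `RIGHT_END` is the last 50 bases of the sequence above, copied by hand, and `reverse_complement` is a hypothetical helper written for this illustration.

```python
# IRR as printed in the Ends section of this record.
IRR = "TGTAAAGGGCGGACCAATTCCAGGCCGCTGTGGCGGAGTAAAACCAGGCC"

# Last 50 bases of the DNA sequence in this record (positions 2548-2597).
RIGHT_END = "GGCCTGGTTTTACTCCGCCACAGCGGCCTGGAATTGGTCCGCCCTTTACA"

# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


matches_irr = reverse_complement(RIGHT_END) == IRR
print(matches_irr)  # prints True
```

The left end needs no such transformation: the first 50 bases of the sequence are identical to the IRL line.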
Protein section
ORF number : 2
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1500 bp | 499 aa | 109 | 1608 | + | No |
Chemistry : DDE
ORF sequence :
MFAVEVYAAVRQFVFIDGHSRREAARVFGLNRETIAKMCRFSLPPGYTRSKPIEKPKLGPLLPVIDAILEADRIAPVKQRHTAKRIFERLRDEHGYGGGY
TVVKDHVRIARASGRETFVPLAHPPGHAQVDFGEAIATIGGVRRKIHFFCMDLPHSDACFVKAYPRETTEAFLDGHVAAFDFFAGVPLSILYDNTRIAVA
KICGGGQRERTRAFTELVSHYLFRDRFGRPGRGNDKGKVEGLVKYARSNFMTPAPEAASFEALNADLERRCRTRQDERAGQHAESIGTRLVADRAVLRAL
PAVPLEPCEKRGGRVSSTALVRYRCNDYSVPTAYGFRDVLVKGFVEEVVILCGGIEIARHERSYGTGVFVSDPLHYLALIEIKPNALDQAAALQGWDLPE
AFQHLRHLLEARMGNRGKREFIQVLRLMEAMPKDVVASSVTDAIRLGAIGFDAVKLIALARLERRPPRLDLAAYPHLPRMTVRTTAAADYTVLVPEVAA
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
870 bp | 289 aa | 1605 | 2474 | + | No |
AG : IS21 helper
ORF sequence :
MSAARDETMPGGTTGGTPGILLAHHLKQLKLPTVLREYDKVARECAQGGIDHPRYLLRLVELELIDRERRMVERRIRSARFPAVKSLDTFDFTAIPSLNK
MLVLELARCGYILGRENVIALGNSGTGKTHIALALGLAACQKGFSVAFTTAASLVNQLLEARDERRLLRLQRELASVKLLIVDELGYVPLSATGAELLFE
TFSQRYERGSTVVTSNLPFEDWTSVLGSERLTGALLDRLTHHVSILSLNGDSYRLKASRSRRGCGDGRAEQNQATIDPHDPETGEILPA
Blast result :
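The ORF coordinates in the Protein section can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based inclusive positions on the DNA sequence and that each ORF's coordinates include its stop codon; the `Orf` class is a hypothetical container used only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int  # 1-based, inclusive (assumed convention)
    end: int    # 1-based, inclusive (assumed convention)

    @property
    def length_bp(self) -> int:
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        # Assumes the stop codon is counted in the coordinates.
        return self.length_bp // 3 - 1


# Coordinates copied from the ORF tables in this record.
orf1 = Orf("ORF 1", 109, 1608)
orf2 = Orf("ORF 2", 1605, 2474)

for orf in (orf1, orf2):
    print(orf.name, orf.length_bp, "bp", orf.length_aa, "aa")

# Number of bases shared by the end of ORF 1 and the start of ORF 2.
overlap_bp = orf1.end - orf2.begin + 1
print("overlap:", overlap_bp, "bp")
```

Under these assumptions the computed lengths agree with the tables (1500 bp / 499 aa and 870 bp / 289 aa), and the two ORFs overlap by 4 bp. Positions 1605 to 1608 of the DNA sequence read ATGA, which is consistent with the ATG start of ORF 2 overlapping the TGA stop of ORF 1.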
Comments
The ORF A product of ISMex8 shows 69% amino acid similarity to that of ISFlsp1, and the ORF B product shows 86% amino acid similarity to that of ISSsp5.
References
[1] Stéphane Vuilleumier et al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE (submitted).
[2] Ming-Chun Lee (2009) Direct submission.