ISMex10
- Family ISL3
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium extorquens | Methylobacterium extorquens AM1, Methylobacterium dichloromethanicum DM4, Methylobacterium chloromethanicum CM4 |
DNA section
IS Length : 1754 bp
Ends
IR Length : 26/33
IRL : GGTTCATCGCCGAGTTTGGTGGATCGAGGTGCGTGACGGTTGGGGTCTTG
IRR : GGCTCTTCGTCGGGTTTGGTGGATCACGCTGCGAGGAGGAAGCGGCGACG
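The "26/33" figure can be reproduced from the two ends listed above. The sketch below assumes that 33 is the length of the compared span and 26 is the number of identical positions within it (the record does not spell this out), and that IRL and IRR are both written outside end first, so they can be compared position by position. The function name `count_matches` is illustrative only.

```python
# Terminal sequences copied from the record.
IRL = "GGTTCATCGCCGAGTTTGGTGGATCGAGGTGCGTGACGGTTGGGGTCTTG"
IRR = "GGCTCTTCGTCGGGTTTGGTGGATCACGCTGCGAGGAGGAAGCGGCGACG"

# Assumption: the denominator of "26/33" is the compared span in bp.
IR_SPAN = 33


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical bases at the same position over the first `span` bases."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


matches = count_matches(IRL, IRR, IR_SPAN)
print(f"{matches}/{IR_SPAN}")  # prints 26/33
```

IRR as listed is the reverse complement of the last 50 bases of the DNA sequence, which is why no strand conversion is needed before the comparison.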
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGCCCCC | ACTGAAGT | GGTCC | 8 |
AGGGCGGACC | ACTTCAGT | GGGGGCGGAT | 8 |
CGCCCCCACTGAAGT | | ACTTCAGTGGTCCGC | 0 |
CGCCCCCACTGAAGT | | ACTTCGAAGTGGTCC | 0 |
ACCACTTCAAGAAGT | | ACTTCAGTGGGGGCG | 0 |
CGCCCCCACTGAAGT | | ACTTCGAAGTGGTCC | 0 |
ACCACTCCGCGAAGT | | ACTTCCCGCTGGGGG | 0 |
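When comparing insertion sites, it can help to check whether two rows describe the same target on opposite strands. The sketch below does this for the first two rows of the table, the ones with an 8 bp direct repeat. The helper `reverse_complement` and the dictionary layout are illustrative; that the two rows are related in this way is an observation from the listed sequences, not a statement made by the record.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# First two rows of the insertion site table.
row1 = {"left": "CGCCCCC", "dr": "ACTGAAGT", "right": "GGTCC"}
row2 = {"left": "AGGGCGGACC", "dr": "ACTTCAGT", "right": "GGGGGCGGAT"}

# The direct repeat of row 2 is the reverse complement of that of row 1.
same_dr = reverse_complement(row1["dr"]) == row2["dr"]

# The flanks swap sides and strands: left of row 1 matches the start of the
# right flank of row 2, and right of row 1 matches the end of its left flank.
flanks_agree = (
    row2["right"].startswith(reverse_complement(row1["left"]))
    and row2["left"].endswith(reverse_complement(row1["right"]))
)

print(same_dr, flanks_agree)  # prints True True
```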
DNA sequence
GGTTCATCGCCGAGTTTGGTGGATCGAGGTGCGTGACGGTTGGGGTCTTGGCGTGGTCGGTTCAGCCTCGCAGATCGGGGCATGACCCAGCTCATCCCTT
TGCCTGGCTGCCGCCTCGTCCGCGTCGCGCGGGACGGCCCCACCGCCCTCACCCTCGTCGCCGAAGCGAAGCCCGACCACGCCCGCTGCCCGACGTGCCG
AGCCATCAGCACGTCGGTCCATAGCCGGTATCGACGACGGCCTGCCGATTTGCCGGCCAGCGGCAAGGCCATTCGGCTGCAGTTGGAGGTTCGGCGCTTC
TACTGCTGCGACCCAGCCTGCCCCCGCCGAACCTTCGCCGAACGGTTCCCGAAGCTACTCGCCCGCCATGCCCAACGCACCCGCCGGCTGGCCGGGGCGC
AGGCCCGGACCGGTCTCGCACTCGGCGGACAGCCAGCTGCCCGGTTGCTCGCACACCTGGCCATGCCGTCCAGCGCCACGACTCTGCTGCGGACGATCCG
GGGAGTGCCGCTGCCGAAGGCGCCTCGACCCTGTGTCGTCGGCGTCGATGACTGGGCGCTGCGCAAAGGGCGGACCTACGGCACGATCGTCGTCGACCTC
GAACGCCATCGCCCTCTCGATCTGCTCCCTGACCGCTCGGCCGAGACCTGGGCCGCATGGCTCCGCCGCCAGCCACAGATCCGGCTCGTGGCGCGCGATC
GTTCGACCGAGTATGCCCGCGGCACCACGCTTGGCGCGCCGGCGGCCGTGCAGGTCGCTGACCGGTGGCATCTCCTGCTCAACACCCGCCAGATGATCGA
GCGTTGGCTCGCCCGTGTCCACCCGCGCCTGAAGCTCTTGCCGCCGATCACGGCGCCAGCGCCTTCGACCCGGCGCACCAGAGCCTATCCGCGTGCACCC
GCCGAGACGCTCGCCCGCGCTGCAGCGGTCGGTCGGTGGGAGGAGCTCTACGACGATGTCCGTCGCCGCCGTGCGGCCGGGCAGTCGCTCCGGCTCATCA
ATCGCGAGACCGGCTTGGCTCGCGCCACGGTGCGCAAGTACGCCTTCGCCGAGGGCTTCCCACGCCATGACGGGCGCGGGCCGGGGCGCAGCATCCTTGA
TCCTCATCTCGATCACCTGCACGCTCGCCAGGCTGCGGGGTGCGAGAACGCCATGCAGCTCTGGCGAGAGCTGCGGGATCGCGGCTTTCCCGGCACCGTC
AAGCAGGTCAGACGCTGGCTGTCCGAGCGGCGCACGCGCCCCGCCAGGACCACGATCTGGCGCCTCAAGACGCCGTCGCCCATGGCGCAGGTCGCGCCCC
CGTCGCCGCCGCTGCCCTCGCCGAAGCAGTTGTCGTGGCACCTCCTGCGCGAGCCGGATGACCTCGACGCCGACGCCGCAGCAGTGGTCGCGCGCGTGCT
GCAGGATGACGAGGCTGCCAAGGTCGTCGGGCTCGGTCGGCGGTTCTGTCGAATCGTTCGCAGCCGTTGCGGCGCCGCGCCGGCCAAGCCCGGCATCACC
ACCGCCTTCGACGCTTGGTTGTCCGACGCCCGTGCCTGCGGTGTGCGGGTGGTCGAGAGCTTCGCCGTCAGCCTCGCTCAGGATGGAGCCGCTGTTCGTG
CAGGCCTTCGCCTGCCCTGGAGCAGCGGGCAGGACGAGGGGCAGGTCAACCGCCTCAAGCTGTTGAAGCGTTCGATGTATGGCCGCGCCAAGCTCGACCT
CCTGCGTCGCCGCTTCCTCCTCGCAGCGTGATCCACCAAACCCGACGAAGAGCC
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1650 bp | 549 aa | 82 | 1731 | + | No |
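The ORF coordinates can be cross-checked against the DNA and protein sequences given in this record. The sketch below is a minimal consistency check, assuming 1-based inclusive coordinates and the standard genetic code; it uses only the first and last lines of the DNA sequence (bases 1-100 and 1701-1754). The names `translate`, `FIRST_LINE` and `LAST_LINE` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# ORF coordinates from the table (assumed 1-based, inclusive).
BEGIN, END = 82, 1731

# First and last lines of the DNA sequence, copied from the record.
FIRST_LINE = (
    "GGTTCATCGCCGAGTTTGGTGGATCGAGGTGCGTGACGGTTGGGGTCTTG"
    "GCGTGGTCGGTTCAGCCTCGCAGATCGGGGCATGACCCAGCTCATCCCTT"
)
LAST_LINE = "CCTGCGTCGCCGCTTCCTCCTCGCAGCGTGATCCACCAAACCCGACGAAGAGCC"
LAST_LINE_OFFSET = 1700  # LAST_LINE starts at base 1701


def translate(dna: str) -> str:
    """Translate complete codons of `dna` with the standard genetic code."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


orf_bp = END - BEGIN + 1
n_codons = orf_bp // 3

# The last three bases of the ORF; a stop codon is not counted in the protein.
stop_codon = LAST_LINE[END - 3 - LAST_LINE_OFFSET:END - LAST_LINE_OFFSET]
protein_len = n_codons - 1 if CODON_TABLE[stop_codon] == "*" else n_codons

# First six codons of the ORF, starting at BEGIN.
n_term = translate(FIRST_LINE[BEGIN - 1:BEGIN - 1 + 18])

print(orf_bp, protein_len, stop_codon, n_term)  # prints 1650 549 TGA MTQLIP
```

The result agrees with the table (1650 bp, 549 aa) and with the first six residues of the ORF sequence.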
Chemistry : Unknown
ORF sequence :
MTQLIPLPGCRLVRVARDGPTALTLVAEAKPDHARCPTCRAISTSVHSRYRRRPADLPASGKAIRLQLEVRRFYCCDPACPRRTFAERFPKLLARHAQRT
RRLAGAQARTGLALGGQPAARLLAHLAMPSSATTLLRTIRGVPLPKAPRPCVVGVDDWALRKGRTYGTIVVDLERHRPLDLLPDRSAETWAAWLRRQPQI
RLVARDRSTEYARGTTLGAPAAVQVADRWHLLLNTRQMIERWLARVHPRLKLLPPITAPAPSTRRTRAYPRAPAETLARAAAVGRWEELYDDVRRRRAAG
QSLRLINRETGLARATVRKYAFAEGFPRHDGRGPGRSILDPHLDHLHARQAAGCENAMQLWRELRDRGFPGTVKQVRRWLSERRTRPARTTIWRLKTPSP
MAQVAPPSPPLPSPKQLSWHLLREPDDLDADAAAVVARVLQDDEAAKVVGLGRRFCRIVRSRCGAAPAKPGITTAFDAWLSDARACGVRVVESFAVSLAQ
DGAAVRAGLRLPWSSGQDEGQVNRLKLLKRSMYGRAKLDLLRRRFLLAA
Comments
ISMex10 shares 52% amino acid similarity with ISAli18.
References
[1] Stephane Vuilleumier et al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. (2009) PLoS ONE, submitted.