ISMdi18
- Family IS1595
- Group ISNwi1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium dichloromethanicum | Methylobacterium dichloromethanicum DM4 |
DNA section
IS Length : 3479 bp
Ends
IR Length : 23/24
IRL : GGCGATTATATCGTTGACACATACGGCAACCGCAGGTAGGTTTTAGCTCT
IRR : GGCGATTATATCGTTCACACATACACCTTGACCGACCACGACCGTTGGGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAGGGCCGCT | TAATTTTT | AGAGGCGGAT | 8 |
DNA sequence
GGCGATTATATCGTTGACACATACGGCAACCGCAGGTAGGTTTTAGCTCTAGGAGCTAAACCGATGTCCGCCCTTTCCGCTGCATTCTTCCACGACGAGG
CCGCCGCTTTCGCCAAGCTGGAAAGCCTTCTGTGGCCGGAAGGCCCGGTCTGCCCTCACTGCGGCGGCATGGGCCGGATCACGAACGTAAAGGGCGGTCG
CATGGGCCTGCGCCGCTGCGGCGACTGCAAGAAGCAGTTCACGGTGACGGTCGGCACGGTTTTCGAGTCCAGCCATGTGAAGCTGCACCTGTGGCTCCAG
GCTGCGCACCTTCTTGCGTCCAGCAAGAAGGGCTTCTCCGCGCACCAACTGCACCGGACCCTCGGCGTCACCTACAAGACCGCGTGGTTCATGTTCCATC
GCCTGCGTGAGGCGATGCGCGACGGCGCGCTTGCCCCGATGGGCGGTCTGGACGGCCCGGCCGGTGGCGCTGGCATCGTGGAAGCCGACGAGACGTTCAT
CGGCCGCGAGCCCGGCAAGCCGAAGAAGCGCGCCTATCACCACAAGATGAAGGTGCTGAGCCTTGTGGATCGCGACACGAAGCAGGTCCGCTCCGTCGTC
GTGGATGATCTCAAGCCCGACACGGTGAAGCCGATCCTCGCCGCGAACATCGCCAAGGAAGCCAGCCTCTTCACCGACGAGGCGGGCCACTACGTGAAGC
TCGGCAAGGACTTCATTGAGCATCAGATCACGTCTCACGGCAAAGGCGAGTACGTGCGCGGCATCGTCCATACGAACACCGTCGAAGGCTACTTCTCCGT
CTTCAAGCGCGGCATGAAGGGTGTCTATCAGCACTGCGGCAAGCAGCACTTACACCGCTACCTCGCGGAGTTCGATTTCCGGTACAACAACCGGGTGAAG
CTCGGTGTGGGCGATGTCGAGCGGGCCGAGCGGGCGCTTCGGGGCGTCGTTGGGAAGCGCCTTACGTACCAAACAACTCGTTGACCGGAGACGGCGTTAA
CCCCATATTCCCCCCTGTCCCCGAGCGCGAAAACCTGGGAGATATCCTGGACTGCCGCAGTAGCGGCTAGGCTCGGGGGCCGCGCTTTAAGTTGAACTTC
GGGCGCACATTCCAAACAGATAATCGGTGTTGCGAAATTGAAGTGCTGGCCTCTAAGCTATGCAGCTTCGTTTTTTCGAACGCATAGATCGCTAACTTGC
GTTTCATATCGCTCGTTTCTGGCAAAAGGTGCCTACCGTTTCGATATGCGGTCAATGCACCTAGCGTTACATCTAAGAACTGAAGAAATGGCGTTTGCTT
AGAGTCTCTGGGTTCAATTGTTCGGACACAATTAGGCGGATGATTGAACTCGCTTACAGCGCCTTCGTTCAGCTTTTGCTTATAGCCCGGCAATAGAGAC
GTGCAGTTGCCATTATCGGGGCGAATGTGGATGTCATACTCCGCGCCGTAGTACCTAACTGCGCGGTGCAGCATTAATTGATAATGGGCCTTACTAACAG
TGTCTACAAGCTTGCGCTCTCCACTGATACTATGATCATACTTGCCCATCTCTTGAAATCGAATGTGGAAGTTGACGATATTTGTCTCTATTAGCTTAAA
AAGCAAATCAATATAAGCAATATGGGCGCTATCTCGGCGTTTTTTGGCTGTAGCCCATTTGATCTCTGAAGTAATTTCATAATTGTAATTAATTTTCTGA
ATATCTGATAATATATAATCAGCCATTCCCAAATCGATCGCAAGACCGCCGATGCCCATAAACGCATCCTTCTGGCTTGTCTCGTCACAATAATATAAGA
TGCGTGGACGCGGCTCTTGCAAAGTGGGGTCTAGCAATGCTGGAAAAGACGCGTACGGAGGCGTACCGCCATGGGCTCAGGAAAATAATCGAGAAAAGCC
CCAAGGTGCAATCAGGAAATGCGCGGGCACCCGCAGGGGCGTGTGGCGACAAGCAAGCGGCGGAAACCCCTAGCACACAAATCGCGCGGAGATCTGACGC
TTCTCGCGAACGCTGACGCGAAGGAACATGCTGCGCGGCATGTATCAACTCCGAAGCGGAGGGCCTGGAGCCTCTTGGTTGATGGCGCCGCTTAGGCATC
TTCCGTGGACGGCGGCGGAAGCGCCTCGGCTGGCTTCACTCCACGTGCCAGCCGGTCAAGGAACAGGGGCGCGCTCGCACCTAGATAAAATGATGTGAGT
GCGTTGGGCGCCTCGGCTATGGCAGGGATAGTGCCGCTAGCCAGTGCTAAGAGCGCCCGCACACAAAGGTAGAATGGCTGTTTGTAGCGCTCAGGGCAGC
GCCCATTCAGATCGCTGCACTCCTTGAACGCCGCGCCTAGTTCGACAGCGACACTTCCAATGACGCCGAGCCAGTAGGGAGATAGCTCGCCTACGAACGG
AAGGAGAGCCACTTAGGCGTGCTTAGAACGCTCGCGCTGACTCCGAGTGCTCCCGCCCAGCCGCAGATAGGAAGCGAGAGGGCTGAGACGATGCACGACG
ATGCGCGCCTGGGTGGGCCGGTGCGACAGCCGCAAGGCGGCAACCACGACGTTGGCGGCAAAAAGCAGTACGCCCACGCCGACGAGGGCTGTGAACACAA
TTTGCGTCACGAGTTCCATCCTGTCCCTCCAGCTTGGCTGTAAGCGGAGGATGATGCTTTTTGTCTTTCTAGGATGTGAATAACTGGCACTGCCCCCTTG
CGGCGTCAATGCCGCGTGCCGCCGGATTTTTGCAGGCCGATGTGATGAGGCGTACGGTTGTCCGTTTTTGAAAGGCCGAACTGATGCCGACAGACACGCC
GACCAAGCCCAAACCTTCCAAGCTCGGCGCCGAGAAGCTGACGCCAGAGGAACAGAAGCGTCGCTTCATTGAAGCCGCGCGGGAAGCGGGTGTGAGCGAG
AACGAAGCGGACTTCGATGCGGCGTTGAAGCAGATCGCGAAGCCGAAGCAGCCCAACTCCAGCCAAGGGTAGAGTGTGGCACTCTCACGCTAGCCCACGT
CGCGCAGTCGCGTTAGCAGCGCCGTAGAAAGGGGCTGCGAAATGTTAACGAACTCGATCATCGGCATGATACGCCGCCGCTGGCGGACCGAGCGTCCTTT
CCAAGGGCACCCGAAGGAAAGCTTGCTGCGGAAGATCCAGAACGATGCGGTTCGCGAGGACGTCAAGGAGCACGCGCGTCGCGAGCTACAGCTTCGCAAA
GATCAGCGCCTAATCGGCTGAGCTGAGCACTGCCGCCATTTGTCGATGCAGGTCGTCGGCAACGGACTGGGGAAGCAGCACGACCGCCTCGGTTCCGTCC
GCCTTCTCCAAGCGAACGCCCAACGTCCGGCCGCCGTCGATCAACTCGACGCTGATAACCCGACCAACCGCAAACGCGTCGGGGGCGTCATCACTCAACA
TGCACTGAACTCCACGAAGGGCGAGACTCGCCCAACGGTCGTGGTCGGTCAAGGTGTATGTGTGAACGATATAATCGCC
CCGCCGCTTTCGCCAAGCTGGAAAGCCTTCTGTGGCCGGAAGGCCCGGTCTGCCCTCACTGCGGCGGCATGGGCCGGATCACGAACGTAAAGGGCGGTCG
CATGGGCCTGCGCCGCTGCGGCGACTGCAAGAAGCAGTTCACGGTGACGGTCGGCACGGTTTTCGAGTCCAGCCATGTGAAGCTGCACCTGTGGCTCCAG
GCTGCGCACCTTCTTGCGTCCAGCAAGAAGGGCTTCTCCGCGCACCAACTGCACCGGACCCTCGGCGTCACCTACAAGACCGCGTGGTTCATGTTCCATC
GCCTGCGTGAGGCGATGCGCGACGGCGCGCTTGCCCCGATGGGCGGTCTGGACGGCCCGGCCGGTGGCGCTGGCATCGTGGAAGCCGACGAGACGTTCAT
CGGCCGCGAGCCCGGCAAGCCGAAGAAGCGCGCCTATCACCACAAGATGAAGGTGCTGAGCCTTGTGGATCGCGACACGAAGCAGGTCCGCTCCGTCGTC
GTGGATGATCTCAAGCCCGACACGGTGAAGCCGATCCTCGCCGCGAACATCGCCAAGGAAGCCAGCCTCTTCACCGACGAGGCGGGCCACTACGTGAAGC
TCGGCAAGGACTTCATTGAGCATCAGATCACGTCTCACGGCAAAGGCGAGTACGTGCGCGGCATCGTCCATACGAACACCGTCGAAGGCTACTTCTCCGT
CTTCAAGCGCGGCATGAAGGGTGTCTATCAGCACTGCGGCAAGCAGCACTTACACCGCTACCTCGCGGAGTTCGATTTCCGGTACAACAACCGGGTGAAG
CTCGGTGTGGGCGATGTCGAGCGGGCCGAGCGGGCGCTTCGGGGCGTCGTTGGGAAGCGCCTTACGTACCAAACAACTCGTTGACCGGAGACGGCGTTAA
CCCCATATTCCCCCCTGTCCCCGAGCGCGAAAACCTGGGAGATATCCTGGACTGCCGCAGTAGCGGCTAGGCTCGGGGGCCGCGCTTTAAGTTGAACTTC
GGGCGCACATTCCAAACAGATAATCGGTGTTGCGAAATTGAAGTGCTGGCCTCTAAGCTATGCAGCTTCGTTTTTTCGAACGCATAGATCGCTAACTTGC
GTTTCATATCGCTCGTTTCTGGCAAAAGGTGCCTACCGTTTCGATATGCGGTCAATGCACCTAGCGTTACATCTAAGAACTGAAGAAATGGCGTTTGCTT
AGAGTCTCTGGGTTCAATTGTTCGGACACAATTAGGCGGATGATTGAACTCGCTTACAGCGCCTTCGTTCAGCTTTTGCTTATAGCCCGGCAATAGAGAC
GTGCAGTTGCCATTATCGGGGCGAATGTGGATGTCATACTCCGCGCCGTAGTACCTAACTGCGCGGTGCAGCATTAATTGATAATGGGCCTTACTAACAG
TGTCTACAAGCTTGCGCTCTCCACTGATACTATGATCATACTTGCCCATCTCTTGAAATCGAATGTGGAAGTTGACGATATTTGTCTCTATTAGCTTAAA
AAGCAAATCAATATAAGCAATATGGGCGCTATCTCGGCGTTTTTTGGCTGTAGCCCATTTGATCTCTGAAGTAATTTCATAATTGTAATTAATTTTCTGA
ATATCTGATAATATATAATCAGCCATTCCCAAATCGATCGCAAGACCGCCGATGCCCATAAACGCATCCTTCTGGCTTGTCTCGTCACAATAATATAAGA
TGCGTGGACGCGGCTCTTGCAAAGTGGGGTCTAGCAATGCTGGAAAAGACGCGTACGGAGGCGTACCGCCATGGGCTCAGGAAAATAATCGAGAAAAGCC
CCAAGGTGCAATCAGGAAATGCGCGGGCACCCGCAGGGGCGTGTGGCGACAAGCAAGCGGCGGAAACCCCTAGCACACAAATCGCGCGGAGATCTGACGC
TTCTCGCGAACGCTGACGCGAAGGAACATGCTGCGCGGCATGTATCAACTCCGAAGCGGAGGGCCTGGAGCCTCTTGGTTGATGGCGCCGCTTAGGCATC
TTCCGTGGACGGCGGCGGAAGCGCCTCGGCTGGCTTCACTCCACGTGCCAGCCGGTCAAGGAACAGGGGCGCGCTCGCACCTAGATAAAATGATGTGAGT
GCGTTGGGCGCCTCGGCTATGGCAGGGATAGTGCCGCTAGCCAGTGCTAAGAGCGCCCGCACACAAAGGTAGAATGGCTGTTTGTAGCGCTCAGGGCAGC
GCCCATTCAGATCGCTGCACTCCTTGAACGCCGCGCCTAGTTCGACAGCGACACTTCCAATGACGCCGAGCCAGTAGGGAGATAGCTCGCCTACGAACGG
AAGGAGAGCCACTTAGGCGTGCTTAGAACGCTCGCGCTGACTCCGAGTGCTCCCGCCCAGCCGCAGATAGGAAGCGAGAGGGCTGAGACGATGCACGACG
ATGCGCGCCTGGGTGGGCCGGTGCGACAGCCGCAAGGCGGCAACCACGACGTTGGCGGCAAAAAGCAGTACGCCCACGCCGACGAGGGCTGTGAACACAA
TTTGCGTCACGAGTTCCATCCTGTCCCTCCAGCTTGGCTGTAAGCGGAGGATGATGCTTTTTGTCTTTCTAGGATGTGAATAACTGGCACTGCCCCCTTG
CGGCGTCAATGCCGCGTGCCGCCGGATTTTTGCAGGCCGATGTGATGAGGCGTACGGTTGTCCGTTTTTGAAAGGCCGAACTGATGCCGACAGACACGCC
GACCAAGCCCAAACCTTCCAAGCTCGGCGCCGAGAAGCTGACGCCAGAGGAACAGAAGCGTCGCTTCATTGAAGCCGCGCGGGAAGCGGGTGTGAGCGAG
AACGAAGCGGACTTCGATGCGGCGTTGAAGCAGATCGCGAAGCCGAAGCAGCCCAACTCCAGCCAAGGGTAGAGTGTGGCACTCTCACGCTAGCCCACGT
CGCGCAGTCGCGTTAGCAGCGCCGTAGAAAGGGGCTGCGAAATGTTAACGAACTCGATCATCGGCATGATACGCCGCCGCTGGCGGACCGAGCGTCCTTT
CCAAGGGCACCCGAAGGAAAGCTTGCTGCGGAAGATCCAGAACGATGCGGTTCGCGAGGACGTCAAGGAGCACGCGCGTCGCGAGCTACAGCTTCGCAAA
GATCAGCGCCTAATCGGCTGAGCTGAGCACTGCCGCCATTTGTCGATGCAGGTCGTCGGCAACGGACTGGGGAAGCAGCACGACCGCCTCGGTTCCGTCC
GCCTTCTCCAAGCGAACGCCCAACGTCCGGCCGCCGTCGATCAACTCGACGCTGATAACCCGACCAACCGCAAACGCGTCGGGGGCGTCATCACTCAACA
TGCACTGAACTCCACGAAGGGCGAGACTCGCCCAACGGTCGTGGTCGGTCAAGGTGTATGTGTGAACGATATAATCGCC
Protein section
ORF number : 5
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
783 bp | 260 aa | 202 | 984 | + | No |
Chemistry : DDE
ORF sequence :
MGLRRCGDCKKQFTVTVGTVFESSHVKLHLWLQAAHLLASSKKGFSAHQLHRTLGVTYKTAWFMFHRLREAMRDGALAPMGGLDGPAGGAGIVEADETFI
GREPGKPKKRAYHHKMKVLSLVDRDTKQVRSVVVDDLKPDTVKPILAANIAKEASLFTDEAGHYVKLGKDFIEHQITSHGKGEYVRGIVHTNTVEGYFSV
FKRGMKGVYQHCGKQHLHRYLAEFDFRYNNRVKLGVGDVERAERALRGVVGKRLTYQTTR
GREPGKPKKRAYHHKMKVLSLVDRDTKQVRSVVVDDLKPDTVKPILAANIAKEASLFTDEAGHYVKLGKDFIEHQITSHGKGEYVRGIVHTNTVEGYFSV
FKRGMKGVYQHCGKQHLHRYLAEFDFRYNNRVKLGVGDVERAERALRGVVGKRLTYQTTR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
540 bp | 179 aa | 1606 | 1067 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
LLFKLIETNIVNFHIRFQEMGKYDHSISGERKLVDTVSKAHYQLMLHRAVRYYGAEYDIHIRPDNGNCTSLLPGYKQKLNEGAVSEFNHPPNCVRTIEPR
DSKQTPFLQFLDVTLGALTAYRNGRHLLPETSDMKRKLAIYAFEKTKLHSLEASTSISQHRLSVWNVRPKFNLKRGPRA
DSKQTPFLQFLDVTLGALTAYRNGRHLLPETSDMKRKLAIYAFEKTKLHSLEASTSISQHRLSVWNVRPKFNLKRGPRA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
186 bp | 61 aa | 2598 | 2413 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
VFTALVGVGVLLFAANVVVAALRLSHRPTQARIVVHRLSPLASYLRLGGSTRSQRERSKHA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
189 bp | 62 aa | 2784 | 2972 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MPTDTPTKPKPSKLGAEKLTPEEQKRRFIEAAREAGVSENEADFDAALKQIAKPKQPNSSQG
Blast result :ORF 5
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
240 bp | 80 aa | 3240 | 3479 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
LSMQVVGNGLGKQHDRLGSVRLLQANAQRPAAVDQLDADNPTNRKRVGGVITQHALNSTKGETRPTVVVGQGVCVNDIIA
Blast result :
Comments
ISMdi18 is 98% aa similar to ISMpo2.
The first ORF is the transposase, the others are passengers genes annotated as hypothetical protein.
The first ORF is the transposase, the others are passengers genes annotated as hypothetical protein.
References
1] Stephane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. (2009) PLoS ONE Submitted.