ISMdi27
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Methylobacterium dichloromethanicum | Methylobacterium dichloromethanicum DM4 |
DNA section
IS Length : 2476 bp
Ends
IR Length : 16/20
IRL : TGCAACTTTGACCCCGTGATGCGGGGGATCGGCGTCCAATTCTGACCCCC
IRR : TGGAATGTTGACCCCGGATCGGCGTCCAAAATTAACCCCCTGGATGGATG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTGTCAAACAG | CG | CCGAAACACAA | 2 |
DNA sequence
TGCAACTTTGACCCCGTGATGCGGGGGATCGGCGTCCAATTCTGACCCCCTGAAGCTTGCTCTCGCAGACTGCCATCCGGGACGACCGGACGGGTGGTGG
GGGATGAAGGGCGTGGACACGATTGCGCGCATCCGGCGCGAGTTCTTCATCCGCGGCCGGTCGATCAAGGACATCGTCCGCGACCTGCACGTCTCCCGTA
ACACCGTGCGCAAGGTCATCCGCTCGGGAGCCACCAGCTTTGCCTACGAGCGCGAGGTGCAGCCGCTGCCCAAGCTGGGACCATGGCGGGACGACCTCGA
CCGGATGCTGGCCACGAACGCCAGCCGGTCGCCGCGCGAGCGGCTGACCCTGATCCGGGTGTTCGAGGCGCTGCGCGGGCTCGGCTACGAGGGCGGCTAC
GACGCGGTGCGCCGCTACGCCAAGGGCTGGAAGCGGGAGCGCACCGCCATCACGGCGCCCGCCTTCGTACCGCTCTCCTTCGCGCCGGGCGAGGCCTACC
AGTTCGACTGGAGCCACGAGATCGTGCTGATCGGCGGCGTGACCACGACGGTCAAGGTCGCCCACGTCCGGCTCTGCCACTCGCGCATGTTGTTCGTGCG
CGCCTACCCGCGTGAGAGCCAGGAGATGGTCTTTGACGCCCATGACCGGGCGTTCGCCTTCTTCCGCGGCGCCTGCCAGCGCGGGATCTACGACAACATG
AAGACCGCGGTGGAGACGATCTTCGTCGGGCGCGAGCGCGCCTACAATCGCCGCTTCCTGCAGATGTGCTCGCACTATCTCGTCGAGCCGGTGGCCTGCA
CGCCGGCCTCGGGCTGGGAGAAGGGTCAGGTCGAGAACCAGGTCGGGCTGGTGCGCGAGCGCCTGTTCACCCCGCGCATCCGGGTGAAGAGCTACGACGA
ACTGAACGCCCTGCTGCTCGACGGGGTGATCGCCTACGCCAAGGCCCATCCTCATCCCGAGGAGCGCGAGCGCACCGTCTGGGAGCGGTTCGAGGCCGAG
CGGGCGGCGCTGGTGCCCTATGCAGGCCGCTTCGACGGCTTCCACGCCGTGCCGGCCGCCGTCTCCTCGACCTGCCTCGTGCGCTTCGACAACAACAAGT
ATTCGGTCGCCGCCTCGGCCATTGGCCGTCCGGTCGAGGTGCGCGCCTACGCCGAGCGTGTCGAGATCCGCCAGGACGGCCGCATCGTTGCCGAGCATGC
GCGCGCCTTCGGCCGTGGCCAGACGGTGTTCGATCCCTGGCACTACGTTCCCGTGCTCGCCCGCAAACCCGGCGCGCTGCGCAACGGGGCGCCGTTCAAG
GACTGGGTGCTGCCCGCCGCCCTTGACCGGATCCGGCGCAAGCTTACCGGCAGCGCCGACGGCGACCGCCAGATGGTCGAGATCCTCACCGCCGTGCTCG
GCGATGGCCTGCCCGCGGTCGAAGCCGCCTGTGCCGAGGCGCTGCGCGAGGGCGTCCACTCGGCCGACGTCGTCCTCAACATCCTGGCTCGCCAGCGCGA
GCCCGCTGCCGCCGTCACCCTCGCGACGCCCGAGGGTCTGCGGCTGCGCCACGAACCTGTTGCTGACTGCGCCCGCTACGACAGCCTGAGGAGAGCCCGA
TGATGGAACGCCAGCAGATCCTCGCCACGATGGGCGAGCTGAAGTTGTTCGGGATGAAGGCGGCCTACGACGAGATCATCAAGGTCGCGGTCAAGCGCAC
CCACGAGCCGCAGCAGATCGTCGGCGACCTGCTCCAGGCCGAGATCAGCGAGAAGCAGGCCCGCTCGATCCGCTACCAGATGACGATCGCCAAGCTGCCC
CTGGCCAAGGACCTCGCCGAGTTCGCCTTCGCCGGCACGCCGATCAACGAGGGGCTGGTGCGCGATCTCGCCGGCGGCGAGTTCCTGGCCCACCAGCGCA
ACGTCGTGCTGGTCGGCGGCACCGGTACCGGCAAGACGCATCTGGCCATCGCGATGGCCCGGGCCTGCATCCGAGACGGGGCACGAGCTCGGTTCTACAA
CGTGGTCGACCTCGTCAATCGGCTCGAAGCCGAGGCGCGGGCCGGCCGGCAGGGCCGCATCGCCGACCACCTCGCCCGCCTCGACCTCGTGGTGCTGGAC
GAGCTCGGCTACCTGCCGTTCGCACAGTCGGGCGGGCAACTCCTGTTCCACCTAATCAGCAAGCTCTACGAGACCACCTCGATCGTGGTGACCACCAACC
TCGCCTTCGGGGAGTGGCCGAGCGTGTTCGCCGGTGACGCCAAGATGACCACCGCGCTGCTCGATCGGCTCACACACCACTGCGAGATCGTCGAGACCGG
CAACGAGAGCTGGCGCTTCAAGAACCGCGCCTGAGGTCGGCTACCCGTCGCGCCTTCTGGATCCCTGATCAGCCCGGCTGCGCCACCTCGATCCGCTCCG
CCGGGCTGACCAGGGGCGTCACGACACATCCATCCAGGGGGTTAATTTTGGACGCCGATCCGGGGTCAACATTCCA
GGGATGAAGGGCGTGGACACGATTGCGCGCATCCGGCGCGAGTTCTTCATCCGCGGCCGGTCGATCAAGGACATCGTCCGCGACCTGCACGTCTCCCGTA
ACACCGTGCGCAAGGTCATCCGCTCGGGAGCCACCAGCTTTGCCTACGAGCGCGAGGTGCAGCCGCTGCCCAAGCTGGGACCATGGCGGGACGACCTCGA
CCGGATGCTGGCCACGAACGCCAGCCGGTCGCCGCGCGAGCGGCTGACCCTGATCCGGGTGTTCGAGGCGCTGCGCGGGCTCGGCTACGAGGGCGGCTAC
GACGCGGTGCGCCGCTACGCCAAGGGCTGGAAGCGGGAGCGCACCGCCATCACGGCGCCCGCCTTCGTACCGCTCTCCTTCGCGCCGGGCGAGGCCTACC
AGTTCGACTGGAGCCACGAGATCGTGCTGATCGGCGGCGTGACCACGACGGTCAAGGTCGCCCACGTCCGGCTCTGCCACTCGCGCATGTTGTTCGTGCG
CGCCTACCCGCGTGAGAGCCAGGAGATGGTCTTTGACGCCCATGACCGGGCGTTCGCCTTCTTCCGCGGCGCCTGCCAGCGCGGGATCTACGACAACATG
AAGACCGCGGTGGAGACGATCTTCGTCGGGCGCGAGCGCGCCTACAATCGCCGCTTCCTGCAGATGTGCTCGCACTATCTCGTCGAGCCGGTGGCCTGCA
CGCCGGCCTCGGGCTGGGAGAAGGGTCAGGTCGAGAACCAGGTCGGGCTGGTGCGCGAGCGCCTGTTCACCCCGCGCATCCGGGTGAAGAGCTACGACGA
ACTGAACGCCCTGCTGCTCGACGGGGTGATCGCCTACGCCAAGGCCCATCCTCATCCCGAGGAGCGCGAGCGCACCGTCTGGGAGCGGTTCGAGGCCGAG
CGGGCGGCGCTGGTGCCCTATGCAGGCCGCTTCGACGGCTTCCACGCCGTGCCGGCCGCCGTCTCCTCGACCTGCCTCGTGCGCTTCGACAACAACAAGT
ATTCGGTCGCCGCCTCGGCCATTGGCCGTCCGGTCGAGGTGCGCGCCTACGCCGAGCGTGTCGAGATCCGCCAGGACGGCCGCATCGTTGCCGAGCATGC
GCGCGCCTTCGGCCGTGGCCAGACGGTGTTCGATCCCTGGCACTACGTTCCCGTGCTCGCCCGCAAACCCGGCGCGCTGCGCAACGGGGCGCCGTTCAAG
GACTGGGTGCTGCCCGCCGCCCTTGACCGGATCCGGCGCAAGCTTACCGGCAGCGCCGACGGCGACCGCCAGATGGTCGAGATCCTCACCGCCGTGCTCG
GCGATGGCCTGCCCGCGGTCGAAGCCGCCTGTGCCGAGGCGCTGCGCGAGGGCGTCCACTCGGCCGACGTCGTCCTCAACATCCTGGCTCGCCAGCGCGA
GCCCGCTGCCGCCGTCACCCTCGCGACGCCCGAGGGTCTGCGGCTGCGCCACGAACCTGTTGCTGACTGCGCCCGCTACGACAGCCTGAGGAGAGCCCGA
TGATGGAACGCCAGCAGATCCTCGCCACGATGGGCGAGCTGAAGTTGTTCGGGATGAAGGCGGCCTACGACGAGATCATCAAGGTCGCGGTCAAGCGCAC
CCACGAGCCGCAGCAGATCGTCGGCGACCTGCTCCAGGCCGAGATCAGCGAGAAGCAGGCCCGCTCGATCCGCTACCAGATGACGATCGCCAAGCTGCCC
CTGGCCAAGGACCTCGCCGAGTTCGCCTTCGCCGGCACGCCGATCAACGAGGGGCTGGTGCGCGATCTCGCCGGCGGCGAGTTCCTGGCCCACCAGCGCA
ACGTCGTGCTGGTCGGCGGCACCGGTACCGGCAAGACGCATCTGGCCATCGCGATGGCCCGGGCCTGCATCCGAGACGGGGCACGAGCTCGGTTCTACAA
CGTGGTCGACCTCGTCAATCGGCTCGAAGCCGAGGCGCGGGCCGGCCGGCAGGGCCGCATCGCCGACCACCTCGCCCGCCTCGACCTCGTGGTGCTGGAC
GAGCTCGGCTACCTGCCGTTCGCACAGTCGGGCGGGCAACTCCTGTTCCACCTAATCAGCAAGCTCTACGAGACCACCTCGATCGTGGTGACCACCAACC
TCGCCTTCGGGGAGTGGCCGAGCGTGTTCGCCGGTGACGCCAAGATGACCACCGCGCTGCTCGATCGGCTCACACACCACTGCGAGATCGTCGAGACCGG
CAACGAGAGCTGGCGCTTCAAGAACCGCGCCTGAGGTCGGCTACCCGTCGCGCCTTCTGGATCCCTGATCAGCCCGGCTGCGCCACCTCGATCCGCTCCG
CCGGGCTGACCAGGGGCGTCACGACACATCCATCCAGGGGGTTAATTTTGGACGCCGATCCGGGGTCAACATTCCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1500 bp | 499 aa | 104 | 1603 | + | No |
Chemistry : DDE
ORF sequence :
MKGVDTIARIRREFFIRGRSIKDIVRDLHVSRNTVRKVIRSGATSFAYEREVQPLPKLGPWRDDLDRMLATNASRSPRERLTLIRVFEALRGLGYEGGYD
AVRRYAKGWKRERTAITAPAFVPLSFAPGEAYQFDWSHEIVLIGGVTTTVKVAHVRLCHSRMLFVRAYPRESQEMVFDAHDRAFAFFRGACQRGIYDNMK
TAVETIFVGRERAYNRRFLQMCSHYLVEPVACTPASGWEKGQVENQVGLVRERLFTPRIRVKSYDELNALLLDGVIAYAKAHPHPEERERTVWERFEAER
AALVPYAGRFDGFHAVPAAVSSTCLVRFDNNKYSVAASAIGRPVEVRAYAERVEIRQDGRIVAEHARAFGRGQTVFDPWHYVPVLARKPGALRNGAPFKD
WVLPAALDRIRRKLTGSADGDRQMVEILTAVLGDGLPAVEAACAEALREGVHSADVVLNILARQREPAAAVTLATPEGLRLRHEPVADCARYDSLRRAR
AVRRYAKGWKRERTAITAPAFVPLSFAPGEAYQFDWSHEIVLIGGVTTTVKVAHVRLCHSRMLFVRAYPRESQEMVFDAHDRAFAFFRGACQRGIYDNMK
TAVETIFVGRERAYNRRFLQMCSHYLVEPVACTPASGWEKGQVENQVGLVRERLFTPRIRVKSYDELNALLLDGVIAYAKAHPHPEERERTVWERFEAER
AALVPYAGRFDGFHAVPAAVSSTCLVRFDNNKYSVAASAIGRPVEVRAYAERVEIRQDGRIVAEHARAFGRGQTVFDPWHYVPVLARKPGALRNGAPFKD
WVLPAALDRIRRKLTGSADGDRQMVEILTAVLGDGLPAVEAACAEALREGVHSADVVLNILARQREPAAAVTLATPEGLRLRHEPVADCARYDSLRRAR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
735 bp | 244 aa | 1600 | 2334 | + | No |
AG : IS21 helper
ORF sequence :
MMERQQILATMGELKLFGMKAAYDEIIKVAVKRTHEPQQIVGDLLQAEISEKQARSIRYQMTIAKLPLAKDLAEFAFAGTPINEGLVRDLAGGEFLAHQR
NVVLVGGTGTGKTHLAIAMARACIRDGARARFYNVVDLVNRLEAEARAGRQGRIADHLARLDLVVLDELGYLPFAQSGGQLLFHLISKLYETTSIVVTTN
LAFGEWPSVFAGDAKMTTALLDRLTHHCEIVETGNESWRFKNRA
NVVLVGGTGTGKTHLAIAMARACIRDGARARFYNVVDLVNRLEAEARAGRQGRIADHLARLDLVVLDELGYLPFAQSGGQLLFHLISKLYETTSIVVTTN
LAFGEWPSVFAGDAKMTTALLDRLTHHCEIVETGNESWRFKNRA
Blast result :
Comments
ISMdi27 is 95% (istA, the transposase) and 97% (istB, the helper of transposition) aa similar to ISMex39.
References
1] Stéphane Vuilleumier et.al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. PLoS ONE Submitted. (2009)