ISMex13
- Family IS21
- Group
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| | ND | Methylobacterium extorquens | Methylobacterium extorquens AM1 |
DNA section
IS Length : 2359 bp
Ends
IR Length : 38/50
IRL : TGTCAGCGGGCGTTGAATACTCCCTGATTGTGGGCATCGAAAATTCCCTG
IRR : TGTCAACGGGCACCGAAGTCTCCGCAATTGTGGGCTTTCAAAATTCCCTG
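The IR figures above can be rechecked with a short script. This is a minimal sketch, assuming that "38/50" means 38 identical positions when the two 50-base inverted repeats are aligned without gaps, and that IRR is written as the reverse complement of the right end of the element. The helper names `reverse_complement` and `count_matches` are illustrative and not part of the record.

```python
# Inverted repeats copied from this record.
IRL = "TGTCAGCGGGCGTTGAATACTCCCTGATTGTGGGCATCGAAAATTCCCTG"
IRR = "TGTCAACGGGCACCGAAGTCTCCGCAATTGTGGGCTTTCAAAATTCCCTG"

# Last 50 bases of the DNA sequence listed in this record (top strand).
RIGHT_END = "CAGGGAATTTTGAAAGCCCACAATTGCGGAGACTTCGGTGCCCGTTGACA"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a: str, b: str) -> int:
    """Count identical positions in an ungapped, same-length comparison."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x == y for x, y in zip(a, b))


matches = count_matches(IRL, IRR)
print(f"IR identity: {matches}/{len(IRL)}")

# Assumption being checked: IRR is the right end read on the opposite strand.
print("IRR matches right end:", reverse_complement(RIGHT_END) == IRR)
```

Under these assumptions the ungapped comparison reproduces the 38/50 value given in the IR Length field.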
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCAGAACTTC | GCAGG | CACGCTTGCG | 5 |
TCCGCCAATA | CCTGC | TGTGCACCCG | 5 |
TAGTGGGTCA | GGATCC | GATGCAGCAC | 6 |
GCGGCGAGTG | ACCTT | GCCCCCTGGC | 5 |
DNA sequence
TGTCAGCGGGCGTTGAATACTCCCTGATTGTGGGCATCGAAAATTCCCTGGTCGGTGCCCTGGCCCGTAGGGTCGCGACGCCCCGCGCCCTCTCGGGCTT
TTCCATCCTCAGCCACCATGCCTGCCGATACCGATCGGCTGGGAGCGACGAGGATGGTTCTGCTGGGAGAACTCGTCATGATCTTGGACCTGCACCGACA
GGGCCTGTCCGTCTCCGCCATCGCCCGCCGGACCGGCCGCGATCCGAAGACGATCCGCAAGTACATCGAGCGCGGCCTCGAGCCGCCGGCCTACGGCCCG
CGTCAGCCCGGCCGCCCGAGCAAGCTCGCGCCCTATCTCGATTATCTGCGCGAGCGGATCACCGCCTTCCCCGACCTGAGTGCCGTGCGCCTGACCCGCG
AGTTGCGCGAGCGCGGCTACACCGGTGCCTACACCGCGGTGAAGCGGTTCGCCGCCGCGATCCGGCCGCCCGAGGCCAAGCCCTACGAGGTCCGCTTCGA
GACCCCGGCCGGCCAGCAGGCGCAGGTCGACTTCGCCCGCTTCCTCGTCACCTTCACGGATGCGCCGGACACGACCTGCATCGTCTGGCTGTTCTCGCTG
GTGCTCGGCCATTCCCGGCACATCGAGGCGCGCTTCGTCCTGCATCAGGACCTGCAAACGCTGCTGCGCTGTCACATGCAGGCCTTCACCGCGATCGGCG
GCGTGCCGATCGAGATCCTCTACGATCGCATGAAGACGGCGGTCACCGGCGAGGATGCGGACGGCCACATCGTCTACAACCGATCCCTGCTGGCACTCGC
CCAGCACTACGGATTCCTGCCGCGCGCCTGCCGCCCGTACCGGGCCAAGACCAAGGGGAAGGTCGAGAGACCGTTCTCCTACATCCGCCAGGACTTCTTC
CTCGCACGTTCCTTCCGCGACCTCGACGACCTCAACCGCCAGCTTCGGAGCTGGCTCGATACCGTCGCCAACGTCCGCTTGCACGGCACCACGCAGCGGA
TCGTCTCGGAAGCCTTCGCCGCCGAGCGGCCCGAGTTGCAGCCCTTGCCGGCTCTGCCCTTCGACGCTCTGCTCACGCTGGAGCGGCGCGTCAGCCACGA
TGGCCTCGTCTCGATCGGTGGCAACTATTACAGCGTACCGGATCGGACCCGGCGCGTCGTCGAGGTGCATCAGTTGCCCGACACGATCCGCATCCTCGAT
GGTGGCCGGCTCGTCGCGAGCCATCCGATCCTGGAGGGACGACGGCAGTACCGCATCGACCCCGACCATCGGCAAGGCACGGCCGCTCGGGCCATGCGCC
GCGGCCATCCCGACGGTCTGCCGATCGGCCGCCATGGCGATCACGTCGCCCGGCGCTCGCTGGCTGTCTACCAGGCAGTCGGCGAACGGCTCGCCGGCGG
GATCGGAGGCCAGCGATGAGCCGCGCCGCCCCCTGTGTCGCCACGACCCTCGACAGCATCAAGCGCAGCTTGGTCGGCCTGCGCATGCCGCGCGCCCTGG
AGGTGCTCGACGCGACGGTCCGGCGCATCGAGCAGGGCGAGATCGACGGCTTGGCCGCCCTCGACGTGATCCTGACCGAGGAACTGACGCTGCGCGAGAA
CCGCCGCGTGAAGACCGCCCTGCTGGTCGCGCGCCTGACCACGATCAAGACGCTGTCCGGGTTCGACTTTGCCTTCCAGCCCTCGCTCGACCGCGAGCGC
GTCCTGGCGCTGGCGGAACTGACCTTCATCGACCGGGCCGAGGTCGTCCATCTGCTCGGACCACCCGGCACCGGCAAGAGCCATCTGGCGATCGCGCTCG
CCGTCGAGGCGGTCAAGGCCGGGCGCAGCGTCGTGTTCTCGACGCTGGCCGACCTCGTGACCTCGCTGGCCAAGGCCGAGCGCGACGGCTCCCTGCGCGA
GCGCATCCGCTATCTCTGCCGGGCCTCGCTGCTGGTCGTGGACGAGATCGGCTACCTCCCCGTCGTCCCCGGTGGCGGCAACCTGTTCTTCCAACTCGTC
AACGCGCGCTACGAGCGCGGGGCGATGATCCTGACTTCGAACCGCGGCTTCGCCGAGTGGGGCGAAGTGTTCGGTGATCCGGTCGTGGCGACAGCCCTGC
TCGACCGACTCCTCCACCATGCCGTGGTGATCCAGATCGAGGGGGCGAGCTACCGCCTGCGCCAGCACGCCGACCTCGTCCCCGAGCACGTCCGCTCCAA
GGCCCTGATCGCTCCGCCGCCCGCACCCAGGCGTCGCGGTCGTCCACCCGGAAAGGCTGCCTCCGATCACACGGCCGGCTGATCACCGATCCGCACCGAA
CCCGCCGGCCAGGGAATTTTGAAAGCCCACAATTGCGGAGACTTCGGTGCCCGTTGACA
Protein section
ORF number : 2
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1392 bp | 463 aa | 28 | 1419 | + | No |
Chemistry : DDE
ORF sequence :
LWASKIPWSVPWPVGSRRPAPSRAFPSSATMPADTDRLGATRMVLLGELVMILDLHRQGLSVSAIARRTGRDPKTIRKYIERGLEPPAYGPRQPGRPSKL
APYLDYLRERITAFPDLSAVRLTRELRERGYTGAYTAVKRFAAAIRPPEAKPYEVRFETPAGQQAQVDFARFLVTFTDAPDTTCIVWLFSLVLGHSRHIE
ARFVLHQDLQTLLRCHMQAFTAIGGVPIEILYDRMKTAVTGEDADGHIVYNRSLLALAQHYGFLPRACRPYRAKTKGKVERPFSYIRQDFFLARSFRDLD
DLNRQLRSWLDTVANVRLHGTTQRIVSEAFAAERPELQPLPALPFDALLTLERRVSHDGLVSIGGNYYSVPDRTRRVVEVHQLPDTIRILDGGRLVASHP
ILEGRRQYRIDPDHRQGTAARAMRRGHPDGLPIGRHGDHVARRSLAVYQAVGERLAGGIGGQR
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
867 bp | 288 aa | 1416 | 2282 | + | No |
AG : IS21 helper
ORF sequence :
MSRAAPCVATTLDSIKRSLVGLRMPRALEVLDATVRRIEQGEIDGLAALDVILTEELTLRENRRVKTALLVARLTTIKTLSGFDFAFQPSLDRERVLALA
ELTFIDRAEVVHLLGPPGTGKSHLAIALAVEAVKAGRSVVFSTLADLVTSLAKAERDGSLRERIRYLCRASLLVVDEIGYLPVVPGGGNLFFQLVNARYE
RGAMILTSNRGFAEWGEVFGDPVVATALLDRLLHHAVVIQIEGASYRLRQHADLVPEHVRSKALIAPPPAPRRRGRPPGKAASDHTAG
Blast result :
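The ORF coordinates in this section can be cross-checked against the stated lengths. The following is a minimal sketch, assuming 1-based inclusive Begin/End coordinates and that the nucleotide length includes the stop codon (both assumptions are consistent with the values in the two ORF tables). The function name `orf_stats` is illustrative.

```python
def orf_stats(begin: int, end: int) -> tuple[int, int]:
    """Return (length in bp, length in aa) for 1-based inclusive coordinates.

    Assumes the interval covers whole codons and ends with a stop codon,
    so the protein length is the codon count minus one.
    """
    length_bp = end - begin + 1
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("ORF length is not a multiple of 3")
    return length_bp, codons - 1


# Coordinates copied from the ORF tables in this record.
ORF1 = (28, 1419)
ORF2 = (1416, 2282)

print("ORF 1:", orf_stats(*ORF1))
print("ORF 2:", orf_stats(*ORF2))

# Bases shared by the end of ORF 1 and the start of ORF 2.
overlap_bp = ORF1[1] - ORF2[0] + 1
print("Overlap (bp):", overlap_bp)
```

With the coordinates listed here, the two ORFs overlap by 4 bp (positions 1416 to 1419), which in the DNA sequence above read ATGA: the ATG start of ORF 2 overlapping the TGA stop of ORF 1.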
Comments
The ISMex13 transposase (IstA) shares 72% amino acid similarity with that of ISBcen28, and its transposition helper (IstB) shares 84% amino acid similarity with that of ISRe13.
References
[1] Stephane Vuilleumier et al. Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources. (2009) PLoS ONE. Submitted.