ISMsm1
- Family IS3
- Group IS51
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_008596 | ND | Mycobacterium smegmatis | Mycobacterium smegmatis MC2 155 |
DNA section
IS Length : 1345 bp
Ends
IR Length : 26/28
IRL : TGAACCGTCCTGGATCTGGTGGAGACCGTGATCGCACCAGGAGGATGTGG
IRR : TGAACCGTCCTGGTCTGGATGGAGACCTCGGTTAGGCCACCTCCGGTGTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AAATGCAGTG | TCA | CGCGGGGATT | 3 |
CGCTCATCAT | CTGATGGGCG | 0 |
DNA sequence
TGAACCGTCCTGGATCTGGTGGAGACCGTGATCGCACCAGGAGGATGTGGTTGTGGCTGCGCCGAGAAAGTTCGACCAAGAGACCCGTGAGCGGGCAGTT
CGGATGTATGAGGACCGGATCGCGGAGTTCGGTGGTTCCAAGCGGGAAGCTCGTCGCCATGTGGGCGAGCTGCTGGGCATCAATGAGGCGACGTTGCGCA
ATTGGGTCGAGAACCGCCACGGTAGCGGGAGGCCTGCAGCGGGCGCGGTGATGTCTTCGGCAGGCGATAAGGACGCGGAGCTGGCCGCGCTGCGCAAGGA
GAATGCTGAATTGCGCAGAGCCAACGAGATTCTCAAGACAGCATCGGCGTTTTTCGCCGCGGCGGAGGTCGACCGCCGACTGCGGTGATCGTCGACTACA
TCGACGCTTACCGCCACCGGTTCGGGGTCGACCCGATCTGCGCCGTGCTCTCCGAGCACGACATGCCGATCGCCCCGTCCACCTACTACGCGGCCAAGGC
TCGCGGTCCGGTCAGCGACGCGGCCTGGGCCGAGGCGCACGCCGCCAACGCCGTCCATACGGTGTTCTGGGCCAACCGGGGTCTGTATGGGGTCCGCAAG
ATGTGGCATGCGATGCGACACGCTGGTCACGACATGGGCCGCGATCAGGTGGCCCGGCTGATGCGCATCTGTGGAATCTCGGGCGCGGTGCGCGGCAAAC
GCCGCACCATCACCACCGCCTCCGAGCAGGGTGCACCACGGCACCCGGATTTGATCGAGCGCAAGTGGGGCCTGCCGGTGCGCCCGGATCAGTGGTGGGT
GGCTGATTTCACCTACACCTGGACGCTGGCGGGGTTCGTCTACACGGCGTTCTGTGTCGACGTGTTCTCCCGACGCATTCTGGGCTGGCGGGTGATGTCG
ACCAAGGCCACTCCGCTGGTGACCAGTGTGCTCGAGCAGGCGGTGTTCACTCGTCGTAGGACTGATTTCCGTTTCACTACAACGGGTTTGGTGCACCACT
CGGATGCGGGAAGCCAATACACATCGCTGGCTTTCACCGACGCACTGCGCGACTCAGGGATCGCCGGCTCGATCGGATCGGTCGGCGACGCGCTCGACAA
CGCGTTGATGGAATCGGCGATCGGCCTCTACAAGACCGAGCTGATCGACCGCCACCAATCCTGGACCGGGCGTGCCGAAGTCGAACGCGAAACCGCGGCC
TGGGTGCACTGGTACAACACCGACCGCCTGCACTCCTCCCTGAGCTACCAGTCGCCGATCGACTACGAGGCCCAGTACCGTAATGACGCCGCCTCAACAC
CGGAGGTGGCCTAACCGAGGTCTCCATCCAGACCAGGACGGTTCA
CGGATGTATGAGGACCGGATCGCGGAGTTCGGTGGTTCCAAGCGGGAAGCTCGTCGCCATGTGGGCGAGCTGCTGGGCATCAATGAGGCGACGTTGCGCA
ATTGGGTCGAGAACCGCCACGGTAGCGGGAGGCCTGCAGCGGGCGCGGTGATGTCTTCGGCAGGCGATAAGGACGCGGAGCTGGCCGCGCTGCGCAAGGA
GAATGCTGAATTGCGCAGAGCCAACGAGATTCTCAAGACAGCATCGGCGTTTTTCGCCGCGGCGGAGGTCGACCGCCGACTGCGGTGATCGTCGACTACA
TCGACGCTTACCGCCACCGGTTCGGGGTCGACCCGATCTGCGCCGTGCTCTCCGAGCACGACATGCCGATCGCCCCGTCCACCTACTACGCGGCCAAGGC
TCGCGGTCCGGTCAGCGACGCGGCCTGGGCCGAGGCGCACGCCGCCAACGCCGTCCATACGGTGTTCTGGGCCAACCGGGGTCTGTATGGGGTCCGCAAG
ATGTGGCATGCGATGCGACACGCTGGTCACGACATGGGCCGCGATCAGGTGGCCCGGCTGATGCGCATCTGTGGAATCTCGGGCGCGGTGCGCGGCAAAC
GCCGCACCATCACCACCGCCTCCGAGCAGGGTGCACCACGGCACCCGGATTTGATCGAGCGCAAGTGGGGCCTGCCGGTGCGCCCGGATCAGTGGTGGGT
GGCTGATTTCACCTACACCTGGACGCTGGCGGGGTTCGTCTACACGGCGTTCTGTGTCGACGTGTTCTCCCGACGCATTCTGGGCTGGCGGGTGATGTCG
ACCAAGGCCACTCCGCTGGTGACCAGTGTGCTCGAGCAGGCGGTGTTCACTCGTCGTAGGACTGATTTCCGTTTCACTACAACGGGTTTGGTGCACCACT
CGGATGCGGGAAGCCAATACACATCGCTGGCTTTCACCGACGCACTGCGCGACTCAGGGATCGCCGGCTCGATCGGATCGGTCGGCGACGCGCTCGACAA
CGCGTTGATGGAATCGGCGATCGGCCTCTACAAGACCGAGCTGATCGACCGCCACCAATCCTGGACCGGGCGTGCCGAAGTCGAACGCGAAACCGCGGCC
TGGGTGCACTGGTACAACACCGACCGCCTGCACTCCTCCCTGAGCTACCAGTCGCCGATCGACTACGAGGCCCAGTACCGTAATGACGCCGCCTCAACAC
CGGAGGTGGCCTAACCGAGGTCTCCATCCAGACCAGGACGGTTCA
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
336 bp | 111 aa | 53 | 388 | + | No |
Description : First part of the transposase
ORF sequence :
VAAPRKFDQETRERAVRMYEDRIAEFGGSKREARRHVGELLGINEATLRNWVENRHGSGRPAAGAVMSSAGDKDAELAALRKENAELRRANEILKTASAF
FAAAEVDRRLR
FAAAEVDRRLR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1005 bp | 334 aa | 310 | 1314 | + | No |
Description : Second part of the transposase
ORF sequence :
IAQSQRDSQDSIGVFRRGGGRPPTAVIVDYIDAYRHRFGVDPICAVLSEHDMPIAPSTYYAAKARGPVSDAAWAEAHAANAVHTVFWANRGLYGVRKMWH
AMRHAGHDMGRDQVARLMRICGISGAVRGKRRTITTASEQGAPRHPDLIERKWGLPVRPDQWWVADFTYTWTLAGFVYTAFCVDVFSRRILGWRVMSTKA
TPLVTSVLEQAVFTRRRTDFRFTTTGLVHHSDAGSQYTSLAFTDALRDSGIAGSIGSVGDALDNALMESAIGLYKTELIDRHQSWTGRAEVERETAAWVH
WYNTDRLHSSLSYQSPIDYEAQYRNDAASTPEVA
AMRHAGHDMGRDQVARLMRICGISGAVRGKRRTITTASEQGAPRHPDLIERKWGLPVRPDQWWVADFTYTWTLAGFVYTAFCVDVFSRRILGWRVMSTKA
TPLVTSVLEQAVFTRRRTDFRFTTTGLVHHSDAGSQYTSLAFTDALRDSGIAGSIGSVGDALDNALMESAIGLYKTELIDRHQSWTGRAEVERETAAWVH
WYNTDRLHSSLSYQSPIDYEAQYRNDAASTPEVA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1262 bp | 420 aa | 53 | 1314 | + | Yes |
Chemistry : DDE
ORF sequence :
VAAPRKFDQETRERAVRMYEDRIAEFGGSKREARRHVGELLGINEATLRNWVENRHGSGRPAAGAVMSSAGDKDAELAALRKENAELRRANEILKTASAF
FRRGGGRPPTAVIVDYIDAYRHRFGVDPICAVLSEHDMPIAPSTYYAAKARGPVSDAAWAEAHAANAVHTVFWANRGLYGVRKMWHAMRHAGHDMGRDQV
ARLMRICGISGAVRGKRRTITTASEQGAPRHPDLIERKWGLPVRPDQWWVADFTYTWTLAGFVYTAFCVDVFSRRILGWRVMSTKATPLVTSVLEQAVFT
RRRTDFRFTTTGLVHHSDAGSQYTSLAFTDALRDSGIAGSIGSVGDALDNALMESAIGLYKTELIDRHQSWTGRAEVERETAAWVHWYNTDRLHSSLSYQ
SPIDYEAQYRNDAASTPEVA
FRRGGGRPPTAVIVDYIDAYRHRFGVDPICAVLSEHDMPIAPSTYYAAKARGPVSDAAWAEAHAANAVHTVFWANRGLYGVRKMWHAMRHAGHDMGRDQV
ARLMRICGISGAVRGKRRTITTASEQGAPRHPDLIERKWGLPVRPDQWWVADFTYTWTLAGFVYTAFCVDVFSRRILGWRVMSTKATPLVTSVLEQAVFT
RRRTDFRFTTTGLVHHSDAGSQYTSLAFTDALRDSGIAGSIGSVGDALDNALMESAIGLYKTELIDRHQSWTGRAEVERETAAWVHWYNTDRLHSSLSYQ
SPIDYEAQYRNDAASTPEVA
Blast result :
Comments
ISMsm1 is 82% aa similar to ISRru1. The third ORF is the putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
References
1] Fleischmann,R.D., Dodson,R.J., Haft,D.H., Merkel,J.S., Nelson,W.C. and Fraser,C.M. (2006) Direct Submission GenBank.