ISMt3
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
Z83858 | ND | Mycobacterium tuberculosis | Mycobacterium tuberculosis H37Rv |
DNA section
IS Length : 2213 bp
Ends
IR Length : 40/50
IRL : TGTCGACGGCACGTGAAAACTGACCCCGGCGCGGCACCCGAATTTTGACC
IRR : TGTCAACGGCACCCGAAAACTGACCCCCTGACGGCATCTGAAAATTGACC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGCCCTTGTG | AACGG | TAAGCAGCAG | 5 |
DNA sequence
TGTCGACGGCACGTGAAAACTGACCCCGGCGCGGCACCCGAATTTTGACCCCCTGGTCGGGTGGACTGGCTCTACCCGAGCCAGGAGGACCGAAGGGAAT
GTTGACTGTGGAAGATTGGGCTGAGATTCGCCGATTGCATCGCGCGGAGGGTTTGCCGATCAAGATGATCGCCCGGGTGCTGGGGATTTCCAAGAACACG
GTGAAGTCAGCGTTGGAATCAAACCAGCAGCCGAAATATGAACGGGCACCGCAGGGTTCGATCGTTGATGCGGTTGAGCCGCGGATCCGGGAGTTGTTGC
AGGCCTATCCGACGATGCCGGCGACGGTGATCGCCGAGCGGATCGGCTGGGAGCGCTCGATTCGGGTGCTCTCGGCGCGGGTGGCCGAGCTGCGCCCGGT
GTATCTGCCGCCGGACCCGGCGTCGCGCACCACGTATGTGGCAGGCGAAATTGCCCAGTGCGACTTCTGGTTTCCGCCGATCGAGTTGCCGGTAGGGTTC
GGGCAGACCCGCACGGCCAAACAGTTGCCGGTGCTGACCATGGTGTGCGCCTATTCGCGCTGGCTGTTGGCGATGCTGCTGCCCAGCAGGTGTGCCGAGG
ACCTGTTCGCCGGCTGGTGGCGGCTGATCGAGGCGTTGGGGGCGGTGCCGCGGGTGTTGGTGTGGGATGGCGAGGGCGCGATCGGGCGCTGGCGCGGCGG
GCGGTCGGAGTTGACCACTGAGTGTCAGGCGTTCCGCGGCACGCTGGCGGCCAAGGTGCTCATCTGCCGGCCGGCCGACCCGGAGGCCAAGGGCCTCATT
GAACGGGCCCACGACTACCTGGAGCGCTCGTTTTTGCCCGGGCGGGTGTTTGCCTCGCCGGCCGATTTCAACGCCCAACTGGGCGCCTGGCTGGCGCTGG
TGAACACCCGCACCCGCCGGGCGCTGGGTTGTGCGCCCACCGATCGCATCGGCGCGGATCGGGCCGCGATGCTGAGCTTGCCGCCGGTGGCGCCGGCCAC
CGGGTGGTGCACCTCGCTGCGGCTGCCCCGGGATCACTATGTGCGCTGCGATTCCAACGACTACTCGGTGCACCCGGGTGTGATCGGGCATCGGGTGCTG
GTGCGCGCCGACCTGGAGCGGGTGCATGTGTTCTGCGACGGTGAGCTGGTCGCCGACCACGAGCGGATCTGGGCGGTCCATCAGACGGTCTCCGATCCCG
CACATGTGGAGGCGGCGAAGGTGTTGCGCCGCCGGCACTTCAGTGCAGCATCACCGGTAGTTGAGCCGCAGGTGCAGGTCCGCTCACTGAGCGACTACGA
TGACGCGCTGGGAGTCGACATCGATGGCGGGGTGGCCTGATGCCCACCACCAAAGCCACCCAGCGCCGTGATGTTTCCACCGAGATCGCTTACCTGACAA
GAGCATTGAAAGCTCCCACCCTGCGTGAGTCAGTGTCCCGGCTGGCCGATCGCGCCCGCGCCGAGAACTGGAGCCACGAAGAATACCTGGCCGCCTGCCT
GCAGCGGGAAGTGTCAGCCCGGGAGTCCCATGGTGGTGAGGGCCGCATCCGCGCCGCCCGCTTCCCGGCTCGGAAGTCGTTGGAAGAGTTCGACTTTGAG
CATGCTCGTGGCCTCAAACGCGACACCATCGCACATCTGGGCACCCTGGATTTCATCACCGCCCGCGATAACGTCGTGTTTTTGGGCCCCGCCTGGCACC
GGGAAGACTCATCTTGCGGTCGGCCTGGCGATACGCGCGTGTCAGGCCGGTCATCGGGTGCTGTTCGCCACCGCCGCCGAATGGGTAGCACGGCTCGCCG
AGGCTCACCACGCCGGGCGCATCTACGCCGAACTCACCCGGCTTTGCCGCTATCCGCTCCTGGTGGTTGACGAAGTCGGCTACATTCCGTTTGAGCCCGA
GGCCGCCAACCTCTTCTTCCAGCTGGTGTCCTCCCGGTATGAGCGGGCCAGCTTGATCGTCACGTCCAATAAGGCCTTCGGCCGGTGGGGCGAGGTTTTC
GGCGGCGACGACGTCGTTGCTGCCGCCATGATCGACCGCCTCGTCCACCATGCTGAAGTCGTCGCCCTCAAAGGCGACAGCTACCGGCTCAAAGACCGCG
ACCTCGGCCGCGTCCCACCAGCCGGAACCACCGAAGAATAACCACCAACCGCCCGGTCTAGGGGGTCAATTTTCAGATGCCGTCAGGGGGTCAGTTTTCG
GGTGCCGTTGACA
GTTGACTGTGGAAGATTGGGCTGAGATTCGCCGATTGCATCGCGCGGAGGGTTTGCCGATCAAGATGATCGCCCGGGTGCTGGGGATTTCCAAGAACACG
GTGAAGTCAGCGTTGGAATCAAACCAGCAGCCGAAATATGAACGGGCACCGCAGGGTTCGATCGTTGATGCGGTTGAGCCGCGGATCCGGGAGTTGTTGC
AGGCCTATCCGACGATGCCGGCGACGGTGATCGCCGAGCGGATCGGCTGGGAGCGCTCGATTCGGGTGCTCTCGGCGCGGGTGGCCGAGCTGCGCCCGGT
GTATCTGCCGCCGGACCCGGCGTCGCGCACCACGTATGTGGCAGGCGAAATTGCCCAGTGCGACTTCTGGTTTCCGCCGATCGAGTTGCCGGTAGGGTTC
GGGCAGACCCGCACGGCCAAACAGTTGCCGGTGCTGACCATGGTGTGCGCCTATTCGCGCTGGCTGTTGGCGATGCTGCTGCCCAGCAGGTGTGCCGAGG
ACCTGTTCGCCGGCTGGTGGCGGCTGATCGAGGCGTTGGGGGCGGTGCCGCGGGTGTTGGTGTGGGATGGCGAGGGCGCGATCGGGCGCTGGCGCGGCGG
GCGGTCGGAGTTGACCACTGAGTGTCAGGCGTTCCGCGGCACGCTGGCGGCCAAGGTGCTCATCTGCCGGCCGGCCGACCCGGAGGCCAAGGGCCTCATT
GAACGGGCCCACGACTACCTGGAGCGCTCGTTTTTGCCCGGGCGGGTGTTTGCCTCGCCGGCCGATTTCAACGCCCAACTGGGCGCCTGGCTGGCGCTGG
TGAACACCCGCACCCGCCGGGCGCTGGGTTGTGCGCCCACCGATCGCATCGGCGCGGATCGGGCCGCGATGCTGAGCTTGCCGCCGGTGGCGCCGGCCAC
CGGGTGGTGCACCTCGCTGCGGCTGCCCCGGGATCACTATGTGCGCTGCGATTCCAACGACTACTCGGTGCACCCGGGTGTGATCGGGCATCGGGTGCTG
GTGCGCGCCGACCTGGAGCGGGTGCATGTGTTCTGCGACGGTGAGCTGGTCGCCGACCACGAGCGGATCTGGGCGGTCCATCAGACGGTCTCCGATCCCG
CACATGTGGAGGCGGCGAAGGTGTTGCGCCGCCGGCACTTCAGTGCAGCATCACCGGTAGTTGAGCCGCAGGTGCAGGTCCGCTCACTGAGCGACTACGA
TGACGCGCTGGGAGTCGACATCGATGGCGGGGTGGCCTGATGCCCACCACCAAAGCCACCCAGCGCCGTGATGTTTCCACCGAGATCGCTTACCTGACAA
GAGCATTGAAAGCTCCCACCCTGCGTGAGTCAGTGTCCCGGCTGGCCGATCGCGCCCGCGCCGAGAACTGGAGCCACGAAGAATACCTGGCCGCCTGCCT
GCAGCGGGAAGTGTCAGCCCGGGAGTCCCATGGTGGTGAGGGCCGCATCCGCGCCGCCCGCTTCCCGGCTCGGAAGTCGTTGGAAGAGTTCGACTTTGAG
CATGCTCGTGGCCTCAAACGCGACACCATCGCACATCTGGGCACCCTGGATTTCATCACCGCCCGCGATAACGTCGTGTTTTTGGGCCCCGCCTGGCACC
GGGAAGACTCATCTTGCGGTCGGCCTGGCGATACGCGCGTGTCAGGCCGGTCATCGGGTGCTGTTCGCCACCGCCGCCGAATGGGTAGCACGGCTCGCCG
AGGCTCACCACGCCGGGCGCATCTACGCCGAACTCACCCGGCTTTGCCGCTATCCGCTCCTGGTGGTTGACGAAGTCGGCTACATTCCGTTTGAGCCCGA
GGCCGCCAACCTCTTCTTCCAGCTGGTGTCCTCCCGGTATGAGCGGGCCAGCTTGATCGTCACGTCCAATAAGGCCTTCGGCCGGTGGGGCGAGGTTTTC
GGCGGCGACGACGTCGTTGCTGCCGCCATGATCGACCGCCTCGTCCACCATGCTGAAGTCGTCGCCCTCAAAGGCGACAGCTACCGGCTCAAAGACCGCG
ACCTCGGCCGCGTCCCACCAGCCGGAACCACCGAAGAATAACCACCAACCGCCCGGTCTAGGGGGTCAATTTTCAGATGCCGTCAGGGGGTCAGTTTTCG
GGTGCCGTTGACA
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1242 bp | 413 aa | 99 | 1340 | + | No |
Chemistry : DDE
ORF sequence :
MLTVEDWAEIRRLHRAEGLPIKMIARVLGISKNTVKSALESNQQPKYERAPQGSIVDAVEPRIRELLQAYPTMPATVIAERIGWERSIRVLSARVAELRP
VYLPPDPASRTTYVAGEIAQCDFWFPPIELPVGFGQTRTAKQLPVLTMVCAYSRWLLAMLLPSRCAEDLFAGWWRLIEALGAVPRVLVWDGEGAIGRWRG
GRSELTTECQAFRGTLAAKVLICRPADPEAKGLIERAHDYLERSFLPGRVFASPADFNAQLGAWLALVNTRTRRALGCAPTDRIGADRAAMLSLPPVAPA
TGWCTSLRLPRDHYVRCDSNDYSVHPGVIGHRVLVRADLERVHVFCDGELVADHERIWAVHQTVSDPAHVEAAKVLRRRHFSAASPVVEPQVQVRSLSDY
DDALGVDIDGGVA
VYLPPDPASRTTYVAGEIAQCDFWFPPIELPVGFGQTRTAKQLPVLTMVCAYSRWLLAMLLPSRCAEDLFAGWWRLIEALGAVPRVLVWDGEGAIGRWRG
GRSELTTECQAFRGTLAAKVLICRPADPEAKGLIERAHDYLERSFLPGRVFASPADFNAQLGAWLALVNTRTRRALGCAPTDRIGADRAAMLSLPPVAPA
TGWCTSLRLPRDHYVRCDSNDYSVHPGVIGHRVLVRADLERVHVFCDGELVADHERIWAVHQTVSDPAHVEAAKVLRRRHFSAASPVVEPQVQVRSLSDY
DDALGVDIDGGVA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
802 bp | 266 aa | 1340 | 2141 | + | Yes |
AG : IS21 helper
ORF sequence :
MPTTKATQRRDVSTEIAYLTRALKAPTLRESVSRLADRARAENWSHEEYLAACLQREVSARESHGGEGRIRAARFPARKSLEEFDFEHARGLKRDTIAHL
GTLDFITARDNVVFLGPPGTGKTHLAVGLAIRACQAGHRVLFATAAEWVARLAEAHHAGRIYAELTRLCRYPLLVVDEVGYIPFEPEAANLFFQLVSSRY
ERASLIVTSNKAFGRWGEVFGGDDVVAAAMIDRLVHHAEVVALKGDSYRLKDRDLGRVPPAGTTEE
GTLDFITARDNVVFLGPPGTGKTHLAVGLAIRACQAGHRVLFATAAEWVARLAEAHHAGRIYAELTRLCRYPLLVVDEVGYIPFEPEAANLFFQLVSSRY
ERASLIVTSNKAFGRWGEVFGGDDVVAAAMIDRLVHHAEVVALKGDSYRLKDRDLGRVPPAGTTEE
Blast result :
Comments
ISMt3 was found by its similarity to other IS21 family members (Nagy et al., 1997). It is located on the Mycobacterium tuberculosis H37Rv chromosome (Oliver and Harris, 1997). ISMt3 ORF2 (IstB) requires a frameshift around position 1750 (sequencing error ?). The "corrected" putative protein corresponds to the fusion between 1340-1690 and 1692-2141. ISMt3 IstA and IstB are 29 % and 40 % identical to those of IS100.
References
1] Nagy, I., Schoofs, G., Vanderleyden, J., and De Mot, R. (1997) J. Bacteriol. 179, 4635-4638.
2] Oliver, K., and Harris, D. (1997) EMBL Data Library, Rel. 52.
2] Oliver, K., and Harris, D. (1997) EMBL Data Library, Rel. 52.