ISMyca1
- Family IS1634
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AJ619854 | ND | Mycobacterium canettii | Mycobacterium canettii CIP 14000059 |
DNA section
IS Length : 1968 bp
Ends
IR Length : 19/26
IRL : CCGAAGTTCCCCCTTGTAGGGGCGGGCTGAGTTTCGATCTGTTTCGTGAG
IRR : CCGAAGTTCCCCCTGATCCGGCGCGGTTTGTGTCGTTGACCTGGGCATTT
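The "19/26" figure can be reproduced from the two terminal sequences above. The following is a minimal sketch, assuming that "19/26" means 19 identical positions over the first 26 bp of each end, and that IRR is listed as the reverse complement of the right end of the element so that it aligns directly with IRL. The helper names `reverse_complement` and `count_matches` are illustrative and not part of the record.

```python
# Terminal sequences as listed in the record (50 bp each).
IRL = "CCGAAGTTCCCCCTTGTAGGGGCGGGCTGAGTTTCGATCTGTTTCGTGAG"
IRR = "CCGAAGTTCCCCCTGATCCGGCGCGGTTTGTGTCGTTGACCTGGGCATTT"

# Last 50 bp of the element, copied from the end of the DNA sequence.
RIGHT_END = "AAATGCCCAGGTCAACGACACAAACCGCGCCGGATCAGGGGGAACTTCGG"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` bases of a and b."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


# Assumption: IRR is the right end read inward (reverse complement).
irr_from_sequence = reverse_complement(RIGHT_END)
matches = count_matches(IRL, IRR, 26)

print(irr_from_sequence == IRR)  # True
print(f"{matches}/26")           # 19/26
```

Under these assumptions the two ends are identical over their first 14 bp and diverge at several positions after that.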
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
NNNNNNNNNN | GCTAGC | NNNNNNNNNN | 6 |
DNA sequence
CCGAAGTTCCCCCTTGTAGGGGCGGGCTGAGTTTCGATCTGTTTCGTGAGCAGGTGTTTCTGTGTTCAACTTCCCTCAACATGTACTCATGTATTATTGA
GAATAGCTCGGCGTGTCATCCTCTGATGACGCTATTATCGCGCTGACCGCGTGTTATAAAGTAATCATGTACATTACCCGGGTACCCAACCGGGGATCCC
CGCCGGCGGTGCTGTTGCGGGAAAGCTTCCGCGAAAACGGCAAGGTCAAGACGCGTACCCTGGCCAACCTCTCACGCTGGCCCGAGCACAAGCTGGACAG
ACTGGACCGGGCGCTTAAGGGCTTGCCGCCCGCGGACTGGGATCTAGCCGAGGCCTTCGATATCACCCGCAGCCTGCCGCACGGGCATGTGGCCGCGGTG
GCCGGCACCGCCGAGAAGCTGGGCATACCCGAGCTGATCGACCCCACCCCGTCGCGGCGGCGCAACCTGGTGCTGGCCATGCTGATCGGGCAGATCATCG
AGCCCGGATCGAAACTGGCGATCGCGCGCGGGCTGCGCGCCCAGACCGCCACCAGCACGCTGGGTGCGGTGCTGGGTGTCTCGGGCGCCGATGAGGACGA
CCTGTATGACGCGATGGACTGGGCGCTGGAGCGCAAAGACGGCATCGAAAACGCCTTGGCCGCACGGCATCTGACCAACGGCACCCTGGTGCTCTATGAC
GTATCCTCGGCGGCGTTCGAGGGCCACACCTGCCCGCTGGGAGCGATCGGGCACGCCCGCGACGGGGTCAAAGGCCGGCTGCAGATCGTCTACGGGCTGC
TGTGCTCACCCAAGGGAGCGCCGGTGGCCATCGAGGTGTTCAAGGGCAACACCGCCGACCCGAAAACTCTGAAAGCTCAAATCGACAAGCTCAAAACCCG
GTTCGGGTTGACCCGCATCGCCCTGGTGGGCGATCGGGGCATGCTCACTTCCGCGCGCATCCGTGACGAGCTGCGTCCGGCGCACCTGGATTGGATCAGC
GCGCTGCGCGCCCCGCAGATCAAGATCCTGCTCGAGGACGGGGCGCTGCAGCTGTCGCTGTTCGATGAGCAGAACCTGTTCGAGATCACTCACCCCGACT
ATCCCGGTGAGCGGCTGGTGTGCTGCCACAACCCCGCCCTGGCCGACGAGCGCGCCCGCAAACGCGCCGAGCTGCTGGCGGCCACCGAAAAGGAGCTGCA
GGCCATCGCCGAAGCCACCCGCCGCCAACGCCGGCCGTTACGCGGTACAGACAAGATCGGCCTGCGGGTGGGCAAGGTGCGCAACAAGTTCAAGATGGCC
AAGCACTTTGACCTGCACATCACCGATGAGGCCTTCAGCTTCACCCGCAACCAGAACAGTATCGCCGCCGAGGCCGCCCTCGACGGCATCTACGTGCTAC
GCACCAGCCTGCCCGACAACGCCCTGGGCCGCGACGACGTGGTGGGCCGCTACAAAGACCTCGCCGACGTCGAACGCTTCTTCCGCACCCTCAACAGCGA
ACTGGACGTACGCCCCATCCGGCATCGGCTGGCCGACCGGGTCCGCGCCCACATGTTCTTGCACATGCTCTCCTACTACATCAGCTGGCACATGAAACAA
GCCCTGGCCCCAATCCTGTTCACCGACAACGACAAACCCGCCGCCGCCGCCAAACGCGCCGACCCCGTCGCGCCAGCCCAACGCTCCGACGAAGCGCTGA
ACAAGGCAGCACGCAAACGCACCGAAGACAACCAACCGGTGCACAGCTTCACCAGCCTGCTCACCGACCTGGCCACCATCTGCGCCAACTACATCCAACC
CACAGACGACCTGCCAGCATTCACCAAAACCACCACCCCCACCCCCACACAACGGCGCGCCTTCGACCTACTGGCCGTTTCCCACCGCCACGGCCTGGCG
TAGTCAGTACCGAACCACAAATGCCCAGGTCAACGACACAAACCGCGCCGGATCAGGGGGAACTTCGG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1756 | 596 | 113 | 1868 | + | No |
Chemistry : DDE
ORF sequence :
MSSSDDAIIALTACYKVIMYITRVPNRGSPPAVLLRESFRENGKVKTRTLANLSRWPEHKLDRLDRALKGLPPADWDLAEAFDITRSLPHGHVAAVAGTA
EKLGIPELIDPTPSRRRNLVLAMLIGQIIEPGSKLAIARGLRAQTATSTLGAVLGVSGADEDDLYDAMDWALERKDGIENALAARHLTNGTLVLYDVSSA
AFEGHTCPLGAIGHARDGVKGRLQIVYGLLCSPKGAPVAIEVFKGNTADPKTLKAQIDKLKTRFGLTRIALVGDRGMLTSARIRDELRPAHLDWISALRA
PQIKILLEDGALQLSLFDEQNLFEITHPDYPGERLVCCHNPALADERARKRAELLAATEKELQAIAEATRRQRRPLRGTDKIGLRVGKVRNKFKMAKHFD
LHITDEAFSFTRNQNSIAAEAALDGIYVLRTSLPDNALGRDDVVGRYKDLADVERFFRTLNSELDVRPIRHRLADRVRAHMFLHMLSYYISWHMKQALAP
ILFTDNDKPAAAAKRADPVAPAQRSDEALNKAARKRTEDNQPVHSFTSLLTDLATICANYIQPTDDLPAFTKTTTPTPTQRRAFDLLAVSHRHGLA
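As a quick consistency check between the DNA and protein sections, the start of ORF 1 can be translated directly. The following is a minimal sketch, assuming the standard codon table and that the GTG codon at position 113 acts as the initiation codon and is therefore read as methionine; the record itself does not state the start codon. `ORF_START` holds nucleotides 113-172 copied from the DNA sequence, and `translate` is an illustrative helper.

```python
from itertools import product

# Standard codon table, built in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Nucleotides 113-172 of the element (first 20 codons of ORF 1).
ORF_START = "GTGTCATCCTCTGATGACGCTATTATCGCGCTGACCGCGTGTTATAAAGTAATCATGTAC"


def translate(dna: str, start_as_met: bool = True) -> str:
    """Translate complete codons; optionally read the first codon as Met."""
    usable = len(dna) - len(dna) % 3
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if start_as_met and protein:
        # Assumption: an alternative start codon (here GTG) initiates with Met.
        protein[0] = "M"
    return "".join(protein)


peptide = translate(ORF_START)
print(peptide)  # MSSSDDAIIALTACYKVIMY
```

The result matches the first 20 residues of the ORF sequence listed above.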
Comments
ISMyca1 shares 47% amino acid similarity with ISMhp1.
There are at least 2 copies in the genome.
This IS element is present in most Mycobacterium canettii strains (members of the Mycobacterium tuberculosis complex). The sequence from inverted repeat to inverted repeat is specific to M. canettii and has not been found in any other member of the M. tuberculosis complex or in any other known organism.
References
[1] Brosch, R. (2004) Direct submission, GenBank.