ISMyma9
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Mycobacterium marinum | Mycobacterium marinum 324-958 |
DNA section
IS Length : 2162 bp
Ends
IR Length : 44/51
IRL : TGTCGGTTCCGGTCGAAAAGGGAGCAGGTGATTCCGGTCGAAAAGTGAGC
IRR : TGTCGGTTCCGGTCGAATCCTGAGCAGTTTCTTCCGGTCGAAAAGTGAGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACGCAGCGGC | GCAAG | GGCTGTGCGG | 5 |
TGGCGGCGTG | CCGGG | CTGATCCGAG | 5 |
CGGAGCGTTT | CGCTG | TGCGGTTGAC | 5 |
GACCGTCGAA | GGTGC | TATAAATCCA | 5 |
CAGAATCGAG | CCGATAT | GTCGTCAGCC | 7 |
DNA sequence
TGTCGGTTCCGGTCGAAAAGGGAGCAGGTGATTCCGGTCGAAAAGTGAGCACCCTTCGATTGGAGAGTGATCACTGTGGAGGATTGGGCCGAGATTCGTC
GGTTGTATCGGTCGGAGAAGTTGTCGCAGGCTGCGATCGCCCGACGGCTGGGTTTGTCGCGGAACACCGTGGCCAAGGCGCTGGGTTCGGATGGCCCGCC
GCGTTATGAGCGGGCGCCGGTGACGACCTCGGCGTGGGCGCAGGTCGAGCCAGCGGTGCGGGCGGTGCTTAGCCAGTATCCGAGGATGCCGGCGACGGTG
ATCGCCGAGCGGGTGGGCTGGATGGGTGGGCATTCGTGGTTTGGGGAAAACGTCGCACGAATCCGCCCGGAGTATGCCCCTGCTGATCCGTGCGACCGGC
TGGTGCATTTGCCCGGTGAGCAAGTGCAGTGCGATTTGTGGTTGCCCGGCCGGGTAGTGCCCGACCATGCCGGAGTGCTGCGGTCGTTTCCGGTGTTGGT
GATGGTGGCTGCCTACTCGCGGTTTGTCGCCGCGATGATGATCCCGTCGCGGGTCACCGGGGATCTGCTGGCCGGCATGTGGCAGCTGCTCTCTGGGTAC
ATCGGCGGGGTGCCTCGAACCCTGTTGTGGGACAACGAGGCCGGTATCGGTCAACGTGGCCGGCTGGCCGAAGGGGTGGCCGGGTTCTGCGGAGTACTGG
GCACCCGGCTTGTTCAGGCCCGACCCTATGATCCTGAAACCAAGGGGCTAGTCGAGCGAGCCAACGGGTATTTGCAGACGTCGTTTCTGCCGGGCCGGCG
CTTTTGCTCGGCTGCAGATTTCAACACCCAGCTCGCGCAGTGGCTGGCGCAGGTGGCCAACCCCCGCCGGCACGCCACGACCAAGACAATTCCCGCTCAG
GCCCTGGGCGCTGATTTGCAGGTGATGGCTGCGCTGCCGCCGGTGGCCCCGACCACGGGCACCACACTGACCACCCGGTTGGGTCGGGACTACTACATCA
GCATGGGCGGCAACGCCTATTCGGTGCATCCTGAGGTGATTGGTCGGATGATCACCGCGACGGTATCGCTGGATCGGGTGATGGCCCGCTGTGGTGAGCG
TGTCGTCGCTGACCACGAACGGCTTTGGGGTAGTGCGGGACTGGCTTGCGATCCTGAGCATCTGGCCGCGGCGGCGATCATGCGCGAGAACTTCCGGGAA
CGCTCGGGCGTCGGCGCTCACCTGGACGTGGAGGTCCAAGTCGCTGACCTGGGCGCCTATGACGCGGTGTTTGGGACCGGGGAGGTCGCCTGATGGCCAC
CAAACGCGCCCCCGCCCCCGGTGAGGCCGACAAGCTGATCGCCCACCAATCCCGGCTGCTCAAAGCCCCGCGGATCGCCGCCCACTACCATCGACTGGCC
GAACAAGGCCGCGCCGCGGGCTGGTCGCTGGAGGACTACCTGGCCGCCGTCTTGGCCGTGGAATCCAACGCCCGCGCCGAATCGGGGGCCCGCCAACGCA
TCCGCTATGCCGGGTTCCCGGCGATCAAGACGATCACCGACTTCGACTTCACCGCCCAACCCCACCTTGACCGAGCCCAGATCGCCCGCCTGGAAGCCGG
CGGCTGGCTGGCCGAAGCCCGCAACATCGTGCTGCTGGGCCCCCCGGGCACCGGCAAAACACATTTGGCGACCGCGCTGGCGATCGCAGCCGCCCAAGCC
GGACACCGGGTCGCGTTCGCCCCAGCCACCGGGTGGATCACCCGCCTGGCCGAAGCCCACCGCATCGGCAGCCTGGATGGCGAACTGCGCAAAATCTCGC
GCATCGGGCTCATCGTCATCGACGAGGTTGGATACATCCCCTTTGACACCGAAGCGGCCAACCTGTTCTTCCAACTGGTGTCGACCCGATACGAGAAATC
GTCGATCATCTTGACCTCCAACCTGCCGTTTTCCCGGTGGGGCCAGGTCTTCGGCGAAGCCACCATCGCCTCGGCGATGATCGACCGCATCGTGCACCAC
GCCGACGTCATCGCCCTCAAAGGCGCCAGCTACCGCATCAAACACACCGCAATCGAGTCCCTGCCCTCCGTCGAAGCCGACCGTCAGGCAGACTCAACCC
CGTAACACAACTGCTCACTTTTCGACCGGAAGAAACTGCTCAGGATTCGACCGGAACCGACA
GGTTGTATCGGTCGGAGAAGTTGTCGCAGGCTGCGATCGCCCGACGGCTGGGTTTGTCGCGGAACACCGTGGCCAAGGCGCTGGGTTCGGATGGCCCGCC
GCGTTATGAGCGGGCGCCGGTGACGACCTCGGCGTGGGCGCAGGTCGAGCCAGCGGTGCGGGCGGTGCTTAGCCAGTATCCGAGGATGCCGGCGACGGTG
ATCGCCGAGCGGGTGGGCTGGATGGGTGGGCATTCGTGGTTTGGGGAAAACGTCGCACGAATCCGCCCGGAGTATGCCCCTGCTGATCCGTGCGACCGGC
TGGTGCATTTGCCCGGTGAGCAAGTGCAGTGCGATTTGTGGTTGCCCGGCCGGGTAGTGCCCGACCATGCCGGAGTGCTGCGGTCGTTTCCGGTGTTGGT
GATGGTGGCTGCCTACTCGCGGTTTGTCGCCGCGATGATGATCCCGTCGCGGGTCACCGGGGATCTGCTGGCCGGCATGTGGCAGCTGCTCTCTGGGTAC
ATCGGCGGGGTGCCTCGAACCCTGTTGTGGGACAACGAGGCCGGTATCGGTCAACGTGGCCGGCTGGCCGAAGGGGTGGCCGGGTTCTGCGGAGTACTGG
GCACCCGGCTTGTTCAGGCCCGACCCTATGATCCTGAAACCAAGGGGCTAGTCGAGCGAGCCAACGGGTATTTGCAGACGTCGTTTCTGCCGGGCCGGCG
CTTTTGCTCGGCTGCAGATTTCAACACCCAGCTCGCGCAGTGGCTGGCGCAGGTGGCCAACCCCCGCCGGCACGCCACGACCAAGACAATTCCCGCTCAG
GCCCTGGGCGCTGATTTGCAGGTGATGGCTGCGCTGCCGCCGGTGGCCCCGACCACGGGCACCACACTGACCACCCGGTTGGGTCGGGACTACTACATCA
GCATGGGCGGCAACGCCTATTCGGTGCATCCTGAGGTGATTGGTCGGATGATCACCGCGACGGTATCGCTGGATCGGGTGATGGCCCGCTGTGGTGAGCG
TGTCGTCGCTGACCACGAACGGCTTTGGGGTAGTGCGGGACTGGCTTGCGATCCTGAGCATCTGGCCGCGGCGGCGATCATGCGCGAGAACTTCCGGGAA
CGCTCGGGCGTCGGCGCTCACCTGGACGTGGAGGTCCAAGTCGCTGACCTGGGCGCCTATGACGCGGTGTTTGGGACCGGGGAGGTCGCCTGATGGCCAC
CAAACGCGCCCCCGCCCCCGGTGAGGCCGACAAGCTGATCGCCCACCAATCCCGGCTGCTCAAAGCCCCGCGGATCGCCGCCCACTACCATCGACTGGCC
GAACAAGGCCGCGCCGCGGGCTGGTCGCTGGAGGACTACCTGGCCGCCGTCTTGGCCGTGGAATCCAACGCCCGCGCCGAATCGGGGGCCCGCCAACGCA
TCCGCTATGCCGGGTTCCCGGCGATCAAGACGATCACCGACTTCGACTTCACCGCCCAACCCCACCTTGACCGAGCCCAGATCGCCCGCCTGGAAGCCGG
CGGCTGGCTGGCCGAAGCCCGCAACATCGTGCTGCTGGGCCCCCCGGGCACCGGCAAAACACATTTGGCGACCGCGCTGGCGATCGCAGCCGCCCAAGCC
GGACACCGGGTCGCGTTCGCCCCAGCCACCGGGTGGATCACCCGCCTGGCCGAAGCCCACCGCATCGGCAGCCTGGATGGCGAACTGCGCAAAATCTCGC
GCATCGGGCTCATCGTCATCGACGAGGTTGGATACATCCCCTTTGACACCGAAGCGGCCAACCTGTTCTTCCAACTGGTGTCGACCCGATACGAGAAATC
GTCGATCATCTTGACCTCCAACCTGCCGTTTTCCCGGTGGGGCCAGGTCTTCGGCGAAGCCACCATCGCCTCGGCGATGATCGACCGCATCGTGCACCAC
GCCGACGTCATCGCCCTCAAAGGCGCCAGCTACCGCATCAAACACACCGCAATCGAGTCCCTGCCCTCCGTCGAAGCCGACCGTCAGGCAGACTCAACCC
CGTAACACAACTGCTCACTTTTCGACCGGAAGAAACTGCTCAGGATTCGACCGGAACCGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1218 bp | 405 aa | 76 | 1293 | + | No |
Chemistry : DDE
ORF sequence :
VEDWAEIRRLYRSEKLSQAAIARRLGLSRNTVAKALGSDGPPRYERAPVTTSAWAQVEPAVRAVLSQYPRMPATVIAERVGWMGGHSWFGENVARIRPEY
APADPCDRLVHLPGEQVQCDLWLPGRVVPDHAGVLRSFPVLVMVAAYSRFVAAMMIPSRVTGDLLAGMWQLLSGYIGGVPRTLLWDNEAGIGQRGRLAEG
VAGFCGVLGTRLVQARPYDPETKGLVERANGYLQTSFLPGRRFCSAADFNTQLAQWLAQVANPRRHATTKTIPAQALGADLQVMAALPPVAPTTGTTLTT
RLGRDYYISMGGNAYSVHPEVIGRMITATVSLDRVMARCGERVVADHERLWGSAGLACDPEHLAAAAIMRENFRERSGVGAHLDVEVQVADLGAYDAVFG
TGEVA
APADPCDRLVHLPGEQVQCDLWLPGRVVPDHAGVLRSFPVLVMVAAYSRFVAAMMIPSRVTGDLLAGMWQLLSGYIGGVPRTLLWDNEAGIGQRGRLAEG
VAGFCGVLGTRLVQARPYDPETKGLVERANGYLQTSFLPGRRFCSAADFNTQLAQWLAQVANPRRHATTKTIPAQALGADLQVMAALPPVAPTTGTTLTT
RLGRDYYISMGGNAYSVHPEVIGRMITATVSLDRVMARCGERVVADHERLWGSAGLACDPEHLAAAAIMRENFRERSGVGAHLDVEVQVADLGAYDAVFG
TGEVA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
813 bp | 270 aa | 1293 | 2105 | + | No |
AG : IS21 helper
ORF sequence :
MATKRAPAPGEADKLIAHQSRLLKAPRIAAHYHRLAEQGRAAGWSLEDYLAAVLAVESNARAESGARQRIRYAGFPAIKTITDFDFTAQPHLDRAQIARL
EAGGWLAEARNIVLLGPPGTGKTHLATALAIAAAQAGHRVAFAPATGWITRLAEAHRIGSLDGELRKISRIGLIVIDEVGYIPFDTEAANLFFQLVSTRY
EKSSIILTSNLPFSRWGQVFGEATIASAMIDRIVHHADVIALKGASYRIKHTAIESLPSVEADRQADSTP
EAGGWLAEARNIVLLGPPGTGKTHLATALAIAAAQAGHRVAFAPATGWITRLAEAHRIGSLDGELRKISRIGLIVIDEVGYIPFDTEAANLFFQLVSTRY
EKSSIILTSNLPFSRWGQVFGEATIASAMIDRIVHHADVIALKGASYRIKHTAIESLPSVEADRQADSTP
Blast result :
Comments
ISMyma9 has 81% pairwise identity to an integrase in Mycobacterium chubuense (Genbank CP003054) and an 81.5% pairwise identity to a transposase in Mycobacterium abcessus subspecies bolletii (CP004375).
The ORF1 of ISMyma9 has a 78.6% pairwise identity to a Mycobacterium abscessus subspecies bolletii integrase (CP004375).
ORF2 of ISMyma9 has an 85.4% pairwise identity to an IstB domain protein in Mycobacterium abscessus (CP004375).
ISMyma9 displays characteristics of insertion sequences in the IS21 family due to its 50bp inverted repeats; 5-8bp direct repeats; two consecutive open reading frames; and terminating with 5'-CA-3' (Mahillon & Chandler, 1998; Xu et. al. 1993).
The ORF1 of ISMyma9 has a 78.6% pairwise identity to a Mycobacterium abscessus subspecies bolletii integrase (CP004375).
ORF2 of ISMyma9 has an 85.4% pairwise identity to an IstB domain protein in Mycobacterium abscessus (CP004375).
ISMyma9 displays characteristics of insertion sequences in the IS21 family due to its 50bp inverted repeats; 5-8bp direct repeats; two consecutive open reading frames; and terminating with 5'-CA-3' (Mahillon & Chandler, 1998; Xu et. al. 1993).
References
1] Gauthier, D.T., Helenthal, A.M., Rhodes, M.W., Vogelbein, W.K., Kator, H.I. (2011) Dis. Aquat. Org. 95: 113-124