ISMlu14
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP001628 | ND | Micrococcus luteus | Micrococcus luteus NCTC 2665 |
DNA section
IS Length : 2646 bp
Ends
IR Length : 29/38
IRL : TGTAAGCGGCCATGAAAAGCTGCCCATAGGTGGCCAGGGAATGACCCGCT
IRR : TGTCAAGGGCCATTGAGTTCTGCCCATAGGCGGCCAGTGTTTCTTCCCGC
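The orientation of the two ends can be cross-checked against the DNA sequence. The sketch below assumes that IRR is written as the reverse complement of the element's 3' end, and that the final 46 nt line of the DNA sequence is that end. The helper names `reverse_complement` and `ungapped_matches` are illustrative and not part of the record. The ungapped count is only a rough comparison: the 29/38 figure may come from a gapped alignment, so the two numbers need not agree.

```python
# Sequences copied from this record.
IRL = "TGTAAGCGGCCATGAAAAGCTGCCCATAGGTGGCCAGGGAATGACCCGCT"
IRR = "TGTCAAGGGCCATTGAGTTCTGCCCATAGGCGGCCAGTGTTTCTTCCCGC"
# Last (46 nt) line of the DNA sequence, i.e. the 3' end of the element.
RIGHT_END = "GAAGAAACACTGGCCGCCTATGGGCAGAACTCAATGGCCCTTGACA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def ungapped_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` bases, without gaps."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


# Assumption being checked: IRR is the 3' end read on the opposite strand.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR[:len(RIGHT_END)]

# Position-by-position identity over the 38 bp span given under "IR Length".
identical_positions = ungapped_matches(IRL, IRR, 38)
```

If `irr_matches_right_end` is true, the IRR line and the tail of the DNA sequence describe the same terminus.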
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGTCCACTCCCTGG | CCGTGC | AGGGGTGAGGGCGTC | 6 |
DNA sequence
TGTAAGCGGCCATGAAAAGCTGCCCATAGGTGGCCAGGGAATGACCCGCTGACGGCCATGAGGACTGCCCAGTGATGGCCACGGGAACTGCCCGGTGCTG
GCCATCTGGTTTGCCCGCGTTGCTCGTTGCCCGCCGCGTCGTGAGCCGGCGGGGGCCTCCTCCCCGGGTTGGATATCGGTGTCGAATCGAATCCAGCGAG
GAAGAGGCCCTGAATGAAGGATGCCGCAGAAACCATGAAAATACTCAGTGCCTATGACCTGACCAAGTCATTGCGGGGCGCCGCCGAGCTGGCCGGTTGC
TCCCATCACACCGTGGCCCGGCTGGTGCGGGCCCGCGACGCCGGCCAGACCCCTGGATCCGGAGCCAGCCGCCCGAAGGTCACCGATGTCTGGCTGCCCA
AGATCGAGGAATGGGTCGAGGCCTCCACGGGCAGGATCCGCGCCGATGTCGTGCACGCCAAGCTCACCGCAATGGGCTACACCGGCTCTGAGCGCTCCAC
CCGGCGGGCGGTGGCCGAGGTGAAGGCCGCCTGGCGGGCCGGGAAGCGCCGGGTGCACCGGCCCTGGATCACCGAGCCGGGCGCATGGCTGCAGTACGAC
TTCGGCGATGGGCCGGCGATCGACGGGGCCAAGACGGTGCTGTTCGTGGCCTGGCTGGCCTGGTCCCGCTACCGGATCGTCATCGCCCTGCGTGATAGGA
CGGCTCCGAGTGTGTTCGCCGCCCTGGACCGGTGCTTCCGGCTGATCGGGGGCGCGCCGACCTATGTGCTGACGGATAACGAGAAGACAGTGACGGTCTC
TCATGTGGCCGGGGTGCCGGTGCGTAACCAGGCCACCGTGGCGTTCGCTCGGCATTACGGCGTGGAGGTGCTGACCTGCCAGCCAGCGGATCCGGCCGCC
AAGGGCGGGGTGGAGAACGCGGTGAAGCTGGCCAAGGCCGACCTGGTCCCGAAGGACACCAACCTGCGGGACCAGTACGCCTCGTTCGCTGAGCTCGAGG
CCGCCTGCGCGGGGTTCATGGCCATGGTGAACTCCCGGGAGCACCGGGTGACCCGGCGCCGCCCGGACCACATGCTCGCCGAGGAGACCAGCGCGTTGCA
TCGGATCCCCGCCACGGCGCACACCGTGGCCTACGGGGTGGGACGGAAGGTGCCGGAGAACACGCCGATGGTCTCCTTCGAAAACGCCCAGTACTCGGTG
CCCGCGCACCTGCTCGGGGCCGAGGTGTTCGTGCGCCATCACGGCACCGGACCGGACTCGATGGTGGTGATCATGCACGCCGGTGCGCAGGGGCCGGTGG
AGGTCGCCCGGCACCGGGTGGCCCGGCCCGGGTCCCCGGCCATCGACGACGCGCACTTCCCCGGCCACGAGACGGACAAGGTGCCCAGGGACTACACCCC
GGTGCCCCGTTCGGCGGCGGAGGCCGAGTTCCTGGCCATCGGGGCCGGAGCGAGGACCTGGCTGGTGGAGGCCGCCGCGGCCGGCACCAGCCGGATCGGT
CAGAAGATGGCCGAGGCCGTCACCCTGGCCAAGCTCGCCGGCACCGACCAGGTCGACCGCGCCCTCGGGGTGGCCGCGGTGCACCAGCGCTTCGCCCACG
GCGACCTGGCCTCCCTGCTCACCGCCGCCGGCCACCGCACCGGGATGCACACCGCCACCGAGGAACGGTCCCTGACCCAGGGCACCGCCGGCTGGGCCGG
CCTCGGTACCTCCAACACCGAGGGGGCAGCCCGGTGAGCATCCACACCCCGAACACCGCCGCACCGGCGCTGCCGGCCGATTTGGAGGCGCTGTTGCGGC
AGCTGAAGATGCCCCACGCCCGCGGTATCGCCGCCGAGGTACTGGCCACCGCCCGGGCCCAACGCTGGGAGCCGGCCGAGGTCCTCAAGGCCCTGCTGGT
CGAGGAGACCACCGGTCGGGCCCGATCCATGCTCGCCGCCCGGCGCAAGGCCGCCGGGTTCCCCACCGGCAAGACCTTCACGACCTGGGATCCGGGCGCC
TCCTCGATCCCGGCCCCGACCCAGCAGTCCCTGCGCACCCTGGAATGGCTCTCGCGCAGAGAGAACCTCGTGGTCTGCGGGCCCTCCGGCACCGGCAAGA
CCTTCTTCCTCGAGGCCCTGGGCCAGCAGGCCGTGGAGGAGGGCAAGCGGGTGGCCTGGTTCACCCTGGAGGACCTCGGGGCGATGATCCGGGCCCACCG
GCCCGATGACACCATCACCAAAGCCGTCGCCCGAATCCTGCGCGCCGACCTGATCGTCATCGACGACATCGGCCTGCTGCCGGTCGCCGACGATGCCGCC
GAGGGCCTCTACCGCATCGTGGACGCCGCCTACGAGAAACGCTCCGTAGCGATCAGCTCGAACCTGCATCCGGCCGGCTTCGACGAGCTGATGCCCAAGA
CCCTGGCCACCGCCACCGTGGACCGGCTCCTGCACCACGCCCACGTCTGCCAGACCACCGGAGACTCCGTACGGATGACCCAGGCAATGGCCGGGAAGGG
AGTCATGCCACTGAACTGATCCCATCACGGTGGCCAGCACCACCACCCAAACGGGCAAACCAGCTGGCCACCAGTGGGCAGTTCTGATGGCCACCAGCGG
GAAGAAACACTGGCCGCCTATGGGCAGAACTCAATGGCCCTTGACA
Protein section
ORF number : 2
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1467 bp | 488 aa | 271 | 1737 | + | No |
Chemistry : DDE
ORF sequence :
MRGAAELAGCSHHTVARLVRARDAGQTPGSGASRPKVTDVWLPKIEEWVEASTGRIRADVVHAKLTAMGYTGSERSTRRAVAEVKAAWRAGKRRVHRPWI
TEPGAWLQYDFGDGPAIDGAKTVLFVAWLAWSRYRIVIALRDRTAPSVFAALDRCFRLIGGAPTYVLTDNEKTVTVSHVAGVPVRNQATVAFARHYGVEV
LTCQPADPAAKGGVENAVKLAKADLVPKDTNLRDQYASFAELEAACAGFMAMVNSREHRVTRRRPDHMLAEETSALHRIPATAHTVAYGVGRKVPENTPM
VSFENAQYSVPAHLLGAEVFVRHHGTGPDSMVVIMHAGAQGPVEVARHRVARPGSPAIDDAHFPGHETDKVPRDYTPVPRSAAEAEFLAIGAGARTWLVE
AAAAGTSRIGQKMAEAVTLAKLAGTDQVDRALGVAAVHQRFAHGDLASLLTAAGHRTGMHTATEERSLTQGTAGWAGLGTSNTEGAAR
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
786 bp | 261 aa | 1734 | 2519 | + | No |
AG : IS21 helper
ORF sequence :
MSIHTPNTAAPALPADLEALLRQLKMPHARGIAAEVLATARAQRWEPAEVLKALLVEETTGRARSMLAARRKAAGFPTGKTFTTWDPGASSIPAPTQQSL
RTLEWLSRRENLVVCGPSGTGKTFFLEALGQQAVEEGKRVAWFTLEDLGAMIRAHRPDDTITKAVARILRADLIVIDDIGLLPVADDAAEGLYRIVDAAY
EKRSVAISSNLHPAGFDELMPKTLATATVDRLLHHAHVCQTTGDSVRMTQAMAGKGVMPLN
Blast result :
Comments : 81% aa similar to IS1415.
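The ORF coordinates listed in this section can be checked for internal consistency. The sketch below assumes that Begin and End are 1-based and inclusive, and that each nucleotide length includes the stop codon, so the protein length is one less than the codon count. The `Orf` class and both helper functions are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    length_bp: int
    length_aa: int
    begin: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


# Values copied from the ORF tables and "IS Length" in this record.
ORFS = [
    Orf("ORF 1", 1467, 488, 271, 1737),
    Orf("ORF 2", 786, 261, 1734, 2519),
]
IS_LENGTH = 2646


def is_consistent(orf: Orf, is_length: int) -> bool:
    """Check span, codon count (stop codon assumed included) and bounds."""
    span_ok = orf.end - orf.begin + 1 == orf.length_bp
    codons_ok = orf.length_bp % 3 == 0 and orf.length_bp // 3 - 1 == orf.length_aa
    inside = 1 <= orf.begin <= orf.end <= is_length
    return span_ok and codons_ok and inside


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs on the same strand."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


all_consistent = all(is_consistent(orf, IS_LENGTH) for orf in ORFS)
shared = overlap_bp(ORFS[0], ORFS[1])
```

Under these assumptions, `shared` gives the number of positions at which the end of ORF 1 overlaps the start of ORF 2.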
Comments
The ISMlu14 transposase is 66% aa similar to the IS1415 transposase.
References
[1] ISfinder annotation (2018)
[2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Lowry,S., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Lykidis,A., Young,M. and Greenblatt,C. (2009) Direct GenBank submission.