ISMasa1
- Family IS1182
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP017715 | ND | Marinobacter salinus | Marinobacter salinus Hb8 |
DNA section
IS Length : 1830 bp
Ends
IR Length : 18
IRL : GAGGCTGTAGAAAAACCCCGAAATCTCCACCCCGGTTTGGTAGGATCAGG
IRR : GAGGCTGTAGAAAAACCCGATTGCGAGTCGCTCCCAGCGCCTGGTTCCTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCATGCTGGCTAGC | GTT | AGTTGCCATCCT | 3 |
CGCATGCTGGCGGC | gtt | ATAAATCGGTGG | 3 |
GCATGCTGGCTAGC | GTT | AGTTGCCATCCT | 3 |
GCATGCTGGCTGGC | GTT | ATACGTAAAGGA | 3 |
DNA sequence
GAGGCTGTAGAAAAACCCCGAAATCTCCACCCCGGTTTGGTAGGATCAGGCAAACGGACAAGGAGTTGTTGGCATGCCCCGCTTCAAGCATTACAACTAC
GACCAGGACGCCATGGTCGTGATCAACTACCAGGAACAGCTTCAACCCGGCACCTTTGAATACGCGGTGCACTACCTGATCGAACATAAGCTAGACCTGT
CAGTTTTCCATCCCAAATACCGCAACGAAGACACTGGCCGTTTGGCCTACGATCCGGCCATCCTGCTGAAGATCATTCTGTTTGCCTATTCCAAGGGCAT
CACCTCCAGCCGTGAGATGCAGTGGTGCTGCGAGACCAACATCATCTTCAAAGCCTTGTCCTGCGATACGGTTCCCCACTTCACCACCCTGGCGAAGTTC
GTCAGCAGCCATGCCGAGGAAATCGAAGAGCTGTTCGAACAGGTACTGCTGGTGTGCCACGAACAGGGATTGCTGGGCAACGAACTCTTCGCTATCGACG
GCTGCAAGATGTCCTCCAACGCCGCCAAAGAATGGTCGGGCACCTTTAAGGAACTGGGCGAAAAGCGAGAGAAACTGCGACGACTGATCCGCCACCACTT
GAAGGAGCATTACGAACGGGACGAGGCGGAGACCGAAGCCGAACTGGATCGGGATATCCGTCGGGCAAACACCATCCTCTCGCTGGATGAAGCCATGAAC
AAAGTGGACCGCTTCCTAAAGACGAACCGCCCAAGGATGGGCCGTGGGAAGCGGAGCAAGGAAGTGAAGAGCAACCTGACTGACAACGAAAGTGCCAAGA
TGACCACCAGCAAAGGCACGATCCAGGGCTATAACGGCGTAGCCACGGTGGATAAGAAACACCAGATCGTCATCGACGCCCAGGCCTTCGGCGAAGGCCA
GGAACACCACACCCTGCAACCGGTACTGGAAACCGTCGAATCCCGATTCCGGAAACTGGGCATCGCTGACAGTATTTACCAGCAAGGCACGGTCGTCACC
GCCGACACCGGCTTCGCTAACGAAGCCAACATGAAGTACCTGCACGAACGACAGATCAACGGCTACGTGCCAGACAATCGGTTCCGCAGCCGGGATCCGA
AGTTCCAGAACCAGAAAGACAAATACGGCAAGCGTCATCAGAATCTGCCGGATACAGGCTGGAAGCAGACCATCCCGGCCAGTGAGTTCCAGTTCGATCC
GGTAACCATGACGTGCATCTGCCCGGCAGGTAATTCCATCAGCTACCAAGCCACACGCGAAGCCGAGAATGGCAAAATGCGGGTGCATTTCGAAGGGCGT
TTGCTGCAATGCCGGCACTGCCCGAAGAAGCATCAGTGTATGCAGAACCCCGCGTCCGCCAACCATCGCAAAGGCTCCGGAAGACAGGTCTCGTTTACCA
TCGAAAACAAGCGCCTGCCAAACTACACCGACTGGATGAAACACCGGGTGGACAGTCCGCAGGGCAAGGAAATATACAGTCACCGGATGTCGGTGGTAGA
GCCGGTATTTGGCAACATTGGTACCACGAAGCGACTGAACCGTTTCAGCTTGCGAGGCAAGAAGAAGGTGCAGGGCCAGTGGCAGCTGTACTGCCTGGTG
CACAATATTGAGAAGTTAGCGAATTACGGGCGGTTAGCGGCGTAATGCCGAGGGTGGAAGGCCAGAAATGTGGCCGCTGAATGCTAGTAATACCGGCACT
AAGACCGTCAACGGGCCGAAACTGATGGCTGAACATCGCAATGGCAATAATCGGCTAGCTGATGGCTGCGACAAAAATTTTAGGAACCAGGCGCTGGGAG
CGACTCGCAATCGGGTTTTTCTACAGCCTC
GACCAGGACGCCATGGTCGTGATCAACTACCAGGAACAGCTTCAACCCGGCACCTTTGAATACGCGGTGCACTACCTGATCGAACATAAGCTAGACCTGT
CAGTTTTCCATCCCAAATACCGCAACGAAGACACTGGCCGTTTGGCCTACGATCCGGCCATCCTGCTGAAGATCATTCTGTTTGCCTATTCCAAGGGCAT
CACCTCCAGCCGTGAGATGCAGTGGTGCTGCGAGACCAACATCATCTTCAAAGCCTTGTCCTGCGATACGGTTCCCCACTTCACCACCCTGGCGAAGTTC
GTCAGCAGCCATGCCGAGGAAATCGAAGAGCTGTTCGAACAGGTACTGCTGGTGTGCCACGAACAGGGATTGCTGGGCAACGAACTCTTCGCTATCGACG
GCTGCAAGATGTCCTCCAACGCCGCCAAAGAATGGTCGGGCACCTTTAAGGAACTGGGCGAAAAGCGAGAGAAACTGCGACGACTGATCCGCCACCACTT
GAAGGAGCATTACGAACGGGACGAGGCGGAGACCGAAGCCGAACTGGATCGGGATATCCGTCGGGCAAACACCATCCTCTCGCTGGATGAAGCCATGAAC
AAAGTGGACCGCTTCCTAAAGACGAACCGCCCAAGGATGGGCCGTGGGAAGCGGAGCAAGGAAGTGAAGAGCAACCTGACTGACAACGAAAGTGCCAAGA
TGACCACCAGCAAAGGCACGATCCAGGGCTATAACGGCGTAGCCACGGTGGATAAGAAACACCAGATCGTCATCGACGCCCAGGCCTTCGGCGAAGGCCA
GGAACACCACACCCTGCAACCGGTACTGGAAACCGTCGAATCCCGATTCCGGAAACTGGGCATCGCTGACAGTATTTACCAGCAAGGCACGGTCGTCACC
GCCGACACCGGCTTCGCTAACGAAGCCAACATGAAGTACCTGCACGAACGACAGATCAACGGCTACGTGCCAGACAATCGGTTCCGCAGCCGGGATCCGA
AGTTCCAGAACCAGAAAGACAAATACGGCAAGCGTCATCAGAATCTGCCGGATACAGGCTGGAAGCAGACCATCCCGGCCAGTGAGTTCCAGTTCGATCC
GGTAACCATGACGTGCATCTGCCCGGCAGGTAATTCCATCAGCTACCAAGCCACACGCGAAGCCGAGAATGGCAAAATGCGGGTGCATTTCGAAGGGCGT
TTGCTGCAATGCCGGCACTGCCCGAAGAAGCATCAGTGTATGCAGAACCCCGCGTCCGCCAACCATCGCAAAGGCTCCGGAAGACAGGTCTCGTTTACCA
TCGAAAACAAGCGCCTGCCAAACTACACCGACTGGATGAAACACCGGGTGGACAGTCCGCAGGGCAAGGAAATATACAGTCACCGGATGTCGGTGGTAGA
GCCGGTATTTGGCAACATTGGTACCACGAAGCGACTGAACCGTTTCAGCTTGCGAGGCAAGAAGAAGGTGCAGGGCCAGTGGCAGCTGTACTGCCTGGTG
CACAATATTGAGAAGTTAGCGAATTACGGGCGGTTAGCGGCGTAATGCCGAGGGTGGAAGGCCAGAAATGTGGCCGCTGAATGCTAGTAATACCGGCACT
AAGACCGTCAACGGGCCGAAACTGATGGCTGAACATCGCAATGGCAATAATCGGCTAGCTGATGGCTGCGACAAAAATTTTAGGAACCAGGCGCTGGGAG
CGACTCGCAATCGGGTTTTTCTACAGCCTC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1572 bp | 523 aa | 74 | 1645 | + | No |
Chemistry : DDE
ORF sequence :
MPRFKHYNYDQDAMVVINYQEQLQPGTFEYAVHYLIEHKLDLSVFHPKYRNEDTGRLAYDPAILLKIILFAYSKGITSSREMQWCCETNIIFKALSCDTV
PHFTTLAKFVSSHAEEIEELFEQVLLVCHEQGLLGNELFAIDGCKMSSNAAKEWSGTFKELGEKREKLRRLIRHHLKEHYERDEAETEAELDRDIRRANT
ILSLDEAMNKVDRFLKTNRPRMGRGKRSKEVKSNLTDNESAKMTTSKGTIQGYNGVATVDKKHQIVIDAQAFGEGQEHHTLQPVLETVESRFRKLGIADS
IYQQGTVVTADTGFANEANMKYLHERQINGYVPDNRFRSRDPKFQNQKDKYGKRHQNLPDTGWKQTIPASEFQFDPVTMTCICPAGNSISYQATREAENG
KMRVHFEGRLLQCRHCPKKHQCMQNPASANHRKGSGRQVSFTIENKRLPNYTDWMKHRVDSPQGKEIYSHRMSVVEPVFGNIGTTKRLNRFSLRGKKKVQ
GQWQLYCLVHNIEKLANYGRLAA*
PHFTTLAKFVSSHAEEIEELFEQVLLVCHEQGLLGNELFAIDGCKMSSNAAKEWSGTFKELGEKREKLRRLIRHHLKEHYERDEAETEAELDRDIRRANT
ILSLDEAMNKVDRFLKTNRPRMGRGKRSKEVKSNLTDNESAKMTTSKGTIQGYNGVATVDKKHQIVIDAQAFGEGQEHHTLQPVLETVESRFRKLGIADS
IYQQGTVVTADTGFANEANMKYLHERQINGYVPDNRFRSRDPKFQNQKDKYGKRHQNLPDTGWKQTIPASEFQFDPVTMTCICPAGNSISYQATREAENG
KMRVHFEGRLLQCRHCPKKHQCMQNPASANHRKGSGRQVSFTIENKRLPNYTDWMKHRVDSPQGKEIYSHRMSVVEPVFGNIGTTKRLNRFSLRGKKKVQ
GQWQLYCLVHNIEKLANYGRLAA*
Blast result :
Comments
ISMasa1 is 48% aa similar to ISMac1.
Identified within an integron. 2 other isoforms downstream the same integron showed greater than 99% identity
Identified within an integron. 2 other isoforms downstream the same integron showed greater than 99% identity
References
1] Sarah Sonbol (2021) Direct submission.
2] Park,S.-J. (2016) Direct GenBank submission.
2] Park,S.-J. (2016) Direct GenBank submission.