ISMahy2
- Family IS91
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_008740.1 | ND | Marinobacter hydrocarbonoclasticus | Marinobacter hydrocarbonoclasticus VT8 |
DNA section
IS Length : 2340 bp
Ends
oriIS : TAAACCGCTCCGCGGTAGCGGTGATACTGAGAATCTCCCCGGAATGGAGGTGATTGGAGGATACTTTTGAATGGCCCAAGTGGGCTTTTCAAAGAGGAGA II struct. : No
terIS : CCGCAGAGTACCTTTGCGGGCCTGAACCGCCATCCCGGTATCCGTTCAAAATTTACTTTACCCTTATACATTGTGTGCTGGCATTGGTGGCAGCGGCTTA II struct. : No
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GCTGTT GAAC | GTCC AACCCA |
DNA sequence
TAAACCGCTCCGCGGTAGCGGTGATACTGAGAATCTCCCCGGAATGGAGGTGATTGGAGGATACTTTTGAATGGCCCAAGTGGGCTTTTCAAAGAGGAGA
TGCCGCCATGACCCCCTTACGTCAGGCTATGATCCAGACCATGCGTCAGCATGGTTTTTCCCCGCAGACCCAAAAAAGTTATCTCTATGTAGTCGCGGAT
CTGGCTCGATATTATCGTCGATCCCCGGACCAGTTGGAGGTCGGGGATCTGCAGACCTACTTTAATTACCTCGTCCAGGAACGTTCCCTGGCGCCGGCCA
GTTGTCGGCTTTACCTGCATGGCATTCGGTTTTTGTATCTGCAGGTGCTGCATTGGGCAGACTTCGACGTGAACCTGGTGCTGCCCAAGCGACCACAGCG
CATTCCTGAGTTACTGACTCGTCAGGACGTTGCAAGGATAGTGGGAGCCACCAAGAACCCCAAGCATCGGGCCTTGCTGTTGATCACGTATGGCTGCGGC
TTGCGAGTGAGTGAAGCCGTGTCATTGCAAGTGAAGAACATCGACGGGGAGCGCCACCTGCTTCACATCCATCAGGGCAAGGGCGGTAAGGATCGCATGG
TGCCCCTGACTGACGTGCTACTCGATGTATTGCGAGCGTATTGGTGCCTTGGCCGCCCCGTACTCTGGCTGTTTCCGAGTGAAGTACTGGTTGGTCAGCA
CCTGACCGCCACCACGGCCCAGAAGGTTTATCGTCACGCCAAGCGACTCAGCGGTGTGCAAAGACGCGGAGGGATACACGCACTGCGCCACGCCTATGCG
ACCCATCAGCTGGAACATGGTGTACCGCTCAACGAGTTACAGAAATATCTTGGGCACAGTGATCTGCGCACCACGGAGCGCTACCTGCACTGGTTGCCGG
GAATATCCTCACAAAACCGTTCGCCTGCGGACTTGATCGCGGATCTGGGAGGTGCGTGATGGCCGGTTCAGTTACGGTCCAGTCGGCACTGACCCAGTTC
CTCTCGACCGACTCATTGGACCGCCACCGTCGCAAGGTGTGCAGCCGACTGATGGATTGCCGCACAGCCCGGATGGGCGGTATGGAAATGCGTTGTGATC
ACTGCAAAGCACGAACCGTGCATTACTACGGTTGCCGGGACCGGCATTGTCCGCAATGCCAGAGTCGGGCTAGCCAACAGTGGTGTGAGCGCCAGCGTCG
ATCGCTACTGCCGGTACCCTACTTCCACCTGGTGTTCACCGTGCCCCACGCCCTGAATGGCTGGGTTCAGGTGCACCCGGAAGTGGTTTACCGTCGCCTG
TTTGAGTCAGTTTGGAATACGCTCAGTCAGTTTGGCCATACAACCAAACACTTGCAGGGTGAGTTGGGCATGACAGCCGTGCTCCATACCTGGGGGCAGA
ATCTGAGTCGCCACGTTCACGTGCATTGCCTGATTCCGGGCGGGGTGCTCACAGAGGCCGGAGAATGGCATGAGGCTAAACATCAATATCTGTTCCCCGT
CAGGGCCTTGTCCCGCCGCTTTCGCGGCCGCATGGTATCGTCCCTCAGACGATCGGTCAGAACCGGCGAATTGCACAGACTCACGGAGCCTCAAGCCATC
GACGACGTACTCAAAAAAGTGATGCAGCAGGATTGGGTGGTTTACGCCCGCCCTTGTCTGAATCAGGCGGAGACGGTGGTGGATTACCTGGCTCGCTACA
CCCACCGCATTGCTATCAGTAACGGTCGTTTGTTGTCGCAGGACGGCGGCCGAATCATGATCCGCTATGTGGACTACCGGGAAGGAGGGCGCCAGAAGAG
CCTGCAACTGGAAGGGGCGGAATTTGTTCGACGGTTCCTCATGCATATCCTGCCCAAGGGCTTTATGCGGATTCGTCACTTTGGTTATCTGAGTAACCGA
ACCCGACGTCGCAAGCTGACGGCCATTCGTCAGGCCCTTCAAAAGCCACCCGAGGTCGATGTTGAAGTCAAGAATGGCTCGGAGTCCCAACGCAGCTGGC
CGTGCCCTCGATGCGAGGATGGTGTGGTTTATATGGTTCGGCAGATTCCCCGGTTTAAAACCGTGGGTAGAGTGACGGGTTAGCCGCTTTCCCTGGCGGC
TTCCCGTGAGCGCTACAGTTCCTGAGGAGCTGGGCTTCGACTCGGGCTGAGGCTGGCAATGAAAGAAACAGAGGGGTAGGCTGATCCAAATCACCCAGGG
ATGAGGCATTTTATGGGGACCATCAGGACGATACGCACTTCCGCAGAGTACCTTTGCGGGCCTGAACCGCCATCCCGGTATCCGTTCAAAATTTACTTTA
CCCTTATACATTGTGTGCTGGCATTGGTGGCAGCGGCTTA
TGCCGCCATGACCCCCTTACGTCAGGCTATGATCCAGACCATGCGTCAGCATGGTTTTTCCCCGCAGACCCAAAAAAGTTATCTCTATGTAGTCGCGGAT
CTGGCTCGATATTATCGTCGATCCCCGGACCAGTTGGAGGTCGGGGATCTGCAGACCTACTTTAATTACCTCGTCCAGGAACGTTCCCTGGCGCCGGCCA
GTTGTCGGCTTTACCTGCATGGCATTCGGTTTTTGTATCTGCAGGTGCTGCATTGGGCAGACTTCGACGTGAACCTGGTGCTGCCCAAGCGACCACAGCG
CATTCCTGAGTTACTGACTCGTCAGGACGTTGCAAGGATAGTGGGAGCCACCAAGAACCCCAAGCATCGGGCCTTGCTGTTGATCACGTATGGCTGCGGC
TTGCGAGTGAGTGAAGCCGTGTCATTGCAAGTGAAGAACATCGACGGGGAGCGCCACCTGCTTCACATCCATCAGGGCAAGGGCGGTAAGGATCGCATGG
TGCCCCTGACTGACGTGCTACTCGATGTATTGCGAGCGTATTGGTGCCTTGGCCGCCCCGTACTCTGGCTGTTTCCGAGTGAAGTACTGGTTGGTCAGCA
CCTGACCGCCACCACGGCCCAGAAGGTTTATCGTCACGCCAAGCGACTCAGCGGTGTGCAAAGACGCGGAGGGATACACGCACTGCGCCACGCCTATGCG
ACCCATCAGCTGGAACATGGTGTACCGCTCAACGAGTTACAGAAATATCTTGGGCACAGTGATCTGCGCACCACGGAGCGCTACCTGCACTGGTTGCCGG
GAATATCCTCACAAAACCGTTCGCCTGCGGACTTGATCGCGGATCTGGGAGGTGCGTGATGGCCGGTTCAGTTACGGTCCAGTCGGCACTGACCCAGTTC
CTCTCGACCGACTCATTGGACCGCCACCGTCGCAAGGTGTGCAGCCGACTGATGGATTGCCGCACAGCCCGGATGGGCGGTATGGAAATGCGTTGTGATC
ACTGCAAAGCACGAACCGTGCATTACTACGGTTGCCGGGACCGGCATTGTCCGCAATGCCAGAGTCGGGCTAGCCAACAGTGGTGTGAGCGCCAGCGTCG
ATCGCTACTGCCGGTACCCTACTTCCACCTGGTGTTCACCGTGCCCCACGCCCTGAATGGCTGGGTTCAGGTGCACCCGGAAGTGGTTTACCGTCGCCTG
TTTGAGTCAGTTTGGAATACGCTCAGTCAGTTTGGCCATACAACCAAACACTTGCAGGGTGAGTTGGGCATGACAGCCGTGCTCCATACCTGGGGGCAGA
ATCTGAGTCGCCACGTTCACGTGCATTGCCTGATTCCGGGCGGGGTGCTCACAGAGGCCGGAGAATGGCATGAGGCTAAACATCAATATCTGTTCCCCGT
CAGGGCCTTGTCCCGCCGCTTTCGCGGCCGCATGGTATCGTCCCTCAGACGATCGGTCAGAACCGGCGAATTGCACAGACTCACGGAGCCTCAAGCCATC
GACGACGTACTCAAAAAAGTGATGCAGCAGGATTGGGTGGTTTACGCCCGCCCTTGTCTGAATCAGGCGGAGACGGTGGTGGATTACCTGGCTCGCTACA
CCCACCGCATTGCTATCAGTAACGGTCGTTTGTTGTCGCAGGACGGCGGCCGAATCATGATCCGCTATGTGGACTACCGGGAAGGAGGGCGCCAGAAGAG
CCTGCAACTGGAAGGGGCGGAATTTGTTCGACGGTTCCTCATGCATATCCTGCCCAAGGGCTTTATGCGGATTCGTCACTTTGGTTATCTGAGTAACCGA
ACCCGACGTCGCAAGCTGACGGCCATTCGTCAGGCCCTTCAAAAGCCACCCGAGGTCGATGTTGAAGTCAAGAATGGCTCGGAGTCCCAACGCAGCTGGC
CGTGCCCTCGATGCGAGGATGGTGTGGTTTATATGGTTCGGCAGATTCCCCGGTTTAAAACCGTGGGTAGAGTGACGGGTTAGCCGCTTTCCCTGGCGGC
TTCCCGTGAGCGCTACAGTTCCTGAGGAGCTGGGCTTCGACTCGGGCTGAGGCTGGCAATGAAAGAAACAGAGGGGTAGGCTGATCCAAATCACCCAGGG
ATGAGGCATTTTATGGGGACCATCAGGACGATACGCACTTCCGCAGAGTACCTTTGCGGGCCTGAACCGCCATCCCGGTATCCGTTCAAAATTTACTTTA
CCCTTATACATTGTGTGCTGGCATTGGTGGCAGCGGCTTA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
852 bp | 283 aa | 108 | 959 | + | No |
AG : IS91 integrase/resolvase
ORF sequence :
MTPLRQAMIQTMRQHGFSPQTQKSYLYVVADLARYYRRSPDQLEVGDLQTYFNYLVQERSLAPASCRLYLHGIRFLYLQVLHWADFDVNLVLPKRPQRIP
ELLTRQDVARIVGATKNPKHRALLLITYGCGLRVSEAVSLQVKNIDGERHLLHIHQGKGGKDRMVPLTDVLLDVLRAYWCLGRPVLWLFPSEVLVGQHLT
ATTAQKVYRHAKRLSGVQRRGGIHALRHAYATHQLEHGVPLNELQKYLGHSDLRTTERYLHWLPGISSQNRSPADLIADLGGA
ELLTRQDVARIVGATKNPKHRALLLITYGCGLRVSEAVSLQVKNIDGERHLLHIHQGKGGKDRMVPLTDVLLDVLRAYWCLGRPVLWLFPSEVLVGQHLT
ATTAQKVYRHAKRLSGVQRRGGIHALRHAYATHQLEHGVPLNELQKYLGHSDLRTTERYLHWLPGISSQNRSPADLIADLGGA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1125 bp | 374 aa | 959 | 2083 | + | No |
Chemistry : Y2
ORF sequence :
MAGSVTVQSALTQFLSTDSLDRHRRKVCSRLMDCRTARMGGMEMRCDHCKARTVHYYGCRDRHCPQCQSRASQQWCERQRRSLLPVPYFHLVFTVPHALN
GWVQVHPEVVYRRLFESVWNTLSQFGHTTKHLQGELGMTAVLHTWGQNLSRHVHVHCLIPGGVLTEAGEWHEAKHQYLFPVRALSRRFRGRMVSSLRRSV
RTGELHRLTEPQAIDDVLKKVMQQDWVVYARPCLNQAETVVDYLARYTHRIAISNGRLLSQDGGRIMIRYVDYREGGRQKSLQLEGAEFVRRFLMHILPK
GFMRIRHFGYLSNRTRRRKLTAIRQALQKPPEVDVEVKNGSESQRSWPCPRCEDGVVYMVRQIPRFKTVGRVTG
GWVQVHPEVVYRRLFESVWNTLSQFGHTTKHLQGELGMTAVLHTWGQNLSRHVHVHCLIPGGVLTEAGEWHEAKHQYLFPVRALSRRFRGRMVSSLRRSV
RTGELHRLTEPQAIDDVLKKVMQQDWVVYARPCLNQAETVVDYLARYTHRIAISNGRLLSQDGGRIMIRYVDYREGGRQKSLQLEGAEFVRRFLMHILPK
GFMRIRHFGYLSNRTRRRKLTAIRQALQKPPEVDVEVKNGSESQRSWPCPRCEDGVVYMVRQIPRFKTVGRVTG
Blast result :
Comments
ISMahy2 is 72% aa (transposase) similar o ISHati3.
References
1] Sarah Sonbol (2020) Direct submission.
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Kiss,H., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Kim,E., Edwards,K. and Richardson,P. (200-) Direct submission GenBank.
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Kiss,H., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Kim,E., Edwards,K. and Richardson,P. (200-) Direct submission GenBank.