ISMpr2
- Family ISKra4
- Group ISKra4
Isoform
Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_GL890840 | ND | Moorea producens | Moorea producens 3L |
DNA section
IS Length : 1735 bp
Ends
IR Length : 19/24
IRL : GGGTGCGGCTCTTTTAAAAGTACGGTAGACTAGATCTCAGTAAAACTGAC
IRR : GGGTGTGGCTCTTTTGGAAAGACGTTTGGTCAGGTTTAGAGAGGTATCGG
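The "19/24" figure can be reproduced from the two terminal sequences listed above. The following is a minimal sketch, assuming that IRR is displayed as the reverse complement of the right end of the element and that the ratio counts identical positions over the first 24 nt. The helper names (`reverse_complement`, `count_matches`) and the `RIGHT_END` constant (the last 50 nt copied from the DNA sequence section) are introduced here for illustration and are not part of the record.

```python
# Terminal sequences as listed in the Ends section.
IRL = "GGGTGCGGCTCTTTTAAAAGTACGGTAGACTAGATCTCAGTAAAACTGAC"
IRR = "GGGTGTGGCTCTTTTGGAAAGACGTTTGGTCAGGTTTAGAGAGGTATCGG"

# Last 50 nt of the element, copied from the DNA sequence section.
RIGHT_END = "CCGATACCTCTCTAAACCTGACCAAACGTCTTTCCAAAAGAGCCACACCC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_matches(a: str, b: str, n: int) -> int:
    """Count identical positions in the first n nucleotides of a and b."""
    return sum(x == y for x, y in zip(a[:n], b[:n]))


# Assumption: IRR is the right end shown as its reverse complement.
irr_is_revcomp_of_right_end = reverse_complement(RIGHT_END) == IRR

# Assumption: "19/24" means 19 identical positions over a 24-nt window.
matches = count_matches(IRL, IRR, 24)

print(irr_is_revcomp_of_right_end)  # True
print(f"{matches}/24")              # 19/24
```

Comparing position by position without gaps is the simplest reading of the ratio; an alignment that allows gaps could give a different count.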
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCTTATTCAC | AAATCTTTT | CTGGAAGTTC | 9 |
CCTTATTCAC | AAATCTTTT | CTGGAAGTTC | 9 |
CAGCTATTCT | AAATCATTT | GTAAACTATG | 9 |
GAGCAATCCT | AAATCATTT | GTCAAATCAC | 9 |
ACTGCTCTAC | ATATTATTT | AGCGGTTTTT | 9 |
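The direct repeats in the table above can be summarised with a short script. This is a sketch only: it checks that every listed repeat has the stated length of 9 and derives a simple majority-rule consensus and the overall A+T fraction. The `consensus` helper and the majority rule are illustrative choices, not something the record defines.

```python
from collections import Counter

# Direct repeats copied from the Insertion site table (one per row).
direct_repeats = [
    "AAATCTTTT",
    "AAATCTTTT",
    "AAATCATTT",
    "AAATCATTT",
    "ATATTATTT",
]


def consensus(seqs):
    """Majority-rule consensus over equal-length sequences."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*seqs))


# All repeats should share the DR length given in the table.
lengths = {len(dr) for dr in direct_repeats}

dr_consensus = consensus(direct_repeats)

# Fraction of A or T over all listed repeat positions.
all_bases = "".join(direct_repeats)
at_fraction = sum(base in "AT" for base in all_bases) / len(all_bases)

print(lengths)                # {9}
print(dr_consensus)           # AAATCATTT
print(round(at_fraction, 3))  # 0.911
```

With only five target sites the consensus is indicative at best, but the listed repeats are clearly A+T-rich.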
DNA sequence
GGGTGCGGCTCTTTTAAAAGTACGGTAGACTAGATCTCAGTAAAACTGACAAATTAGTCAGAGTAATAGTTTGAGTCAATACCTCCTGGTGGGTGACCAG
CTGAAAAAGTCAGCAGTAGTTCAGGAAAAGTTTGGACCTATGCAACCATCGATCTCAAATGAAGACATGTTTTTCTCACAAGCGCGAGAGCTATTTGAAG
AAATAGTGGACTGGCTGGGTTCAGATAGTGTTTGCGGTTTACAACATGGGGAATTAGAGAGTAAACTGCTTGAAAACGGATACGAACTATTAAGACGACT
GCTGCAAGGTTACTTCGACAGACGAAGTGATGACGAGATCGAAGGAGAATGCTTAGGTAACGACGAAGCCAAGAGAACCCATAAGAAGAGATTTACAAGG
AAGCTGACAACAATATTTGGCACGATCATCGCCAATCGAATCGGTTATGGTGGACGAAAAATAACTTCCCTCAACCCACTCGATGCCGAGTTAAACCTAC
CAGTAGAAAAATACTCCCACGGACTGAGAGAGCGAGTGGCTGTCGAAGTAGCTCGTTCAGGGTTTGGCGAAACAGTCGAAATCATACAGAAAACAACAGC
CGCAAAAGTTGGTAAGCGACAAGTTGAACAATTGGCTGATCAAAGTGCTTGTGACTTTGATCAATTCTATGCTCATCAACAAGCACAATCGGTTGAATTG
AAGGAGACGGGGGAAATAGTAGTTATTAGTGTGGATGGCAAGGGTGTTGTCATGCGTACAGAAGACTTACGTGCTCCCACGCAAAAAAGAGCGCTGGCAA
ATAGCAAGAAGTTAAATAAACGATTAACTAAGGGGGAAAAACGCAATTCAAAGCGAATGGCAACAGTGGCTTCAGTTTATACCATCAACCCTTTTGTTCG
TACACCTCAACAAATTGTCAGCACAGAAGAAGAACACAAAAAAATCAAACGACCCAAACCAATCGGTAAGCGTGTTTGGGCAAGTTTAGTTAAAGAACCA
TCACTTGTTATAAAAGAAGCTTTTGACGAGGCATTACACCGCGACCCCAATCAACACAAGCGTTTTTGCGCTCTGGTCGATGGTAACAAGACACAATTAT
CGCGCTTTGAAAAAATTTGCTCGCGAACACCATCTCAAGTTAACGATTGTCCTGGATATTATTCATGTGATTGAGTATCTGTGGAAGGCTGCATTTGCCT
TCTATTCGGACACTAGTCAACAAGCCGAAGTTTGGGTTAGTAAACGTTTACTGCTGATCCTTGAAGGTAAATCGAGTACAGTTGCTGGGGGTATGCGTGG
TAGCGCTACAAAGCGCCAACTTTCTGCATCACAACGTCAGCCAGTGGATAAGTGTGCCAGATATTTACTCAACAACGCTGCTTACCTCAAATACCACGAT
TACTTGAAAGCTGGTCTTCCCATTGCCACTGGGGTTATTGAGGGCGCTTGTCGTCACTTAATCAAAGACAGGATGGACATTACCGGGGCTAGATGGAGTC
TTATGGGTGCTGAGGCGGTTTTACGTCTGCGCTCGCTTTATGTTAGCGGTGATTGGCAAGAGTATTGGCCCTTTCACTTAAAGCAAGAGCATAAACGTAA
TCACTTGTCTTTGTACAAAGATGGCCTTCCTTTGATGAAACGACTAATTCAAGCTCGTTGCTCTATTACTGCTCCTCCTACTCTTCCGATACCTCTCTAA
ACCTGACCAAACGTCTTTCCAAAAGAGCCACACCC
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1104 bp | 367 aa | 71 | 1174 | + | No |
Description : First part of the transposase
ORF sequence :
MSQYLLVGDQLKKSAVVQEKFGPMQPSISNEDMFFSQARELFEEIVDWLGSDSVCGLQHGELESKLLENGYELLRRLLQGYFDRRSDDEIEGECLGNDEA
KRTHKKRFTRKLTTIFGTIIANRIGYGGRKITSLNPLDAELNLPVEKYSHGLRERVAVEVARSGFGETVEIIQKTTAAKVGKRQVEQLADQSACDFDQFY
AHQQAQSVELKETGEIVVISVDGKGVVMRTEDLRAPTQKRALANSKKLNKRLTKGEKRNSKRMATVASVYTINPFVRTPQQIVSTEEEHKKIKRPKPIGK
RVWASLVKEPSLVIKEAFDEALHRDPNQHKRFCALVDGNKTQLSRFEKICSRTPSQVNDCPGYYSCD
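The start of the ORF 1 protein can be reproduced from the DNA sequence. The sketch below translates nucleotides 71 to 100 of the element with the standard genetic code. It assumes that the TTG codon at position 71 serves as the initiation codon and is read as methionine, which is what the listed protein sequence implies. The `translate` function and its `alt_start_as_met` parameter are hypothetical helpers written for this example.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, alt_start_as_met: bool = True) -> str:
    """Translate DNA in frame 0; optionally read the first codon as Met."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if alt_start_as_met and protein:
        # Assumption: a non-ATG initiator (here TTG) is decoded as Met.
        protein[0] = "M"
    return "".join(protein)


# Nucleotides 71-100 of the element, copied from the DNA sequence section.
orf1_start = "TTGAGTCAATACCTCCTGGTGGGTGACCAG"

print(translate(orf1_start))  # MSQYLLVGDQ
```

The result matches the first ten residues of the ORF 1 sequence given above.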
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
687 bp | 228 aa | 1014 | 1700 | + | No |
Description : Second part of the transposase
ORF sequence :
KKLLTRHYTATPINTSVFALWSMVTRHNYRALKKFAREHHLKLTIVLDIIHVIEYLWKAAFAFYSDTSQQAEVWVSKRLLLILEGKSSTVAGGMRGSATK
RQLSASQRQPVDKCARYLLNNAAYLKYHDYLKAGLPIATGVIEGACRHLIKDRMDITGARWSLMGAEAVLRLRSLYVSGDWQEYWPFHLKQEHKRNHLSL
YKDGLPLMKRLIQARCSITAPPTLPIPL
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1630 bp | 543 aa | 71 | 1700 | + | Yes |
Chemistry : DDE
ORF sequence :
MSQYLLVGDQLKKSAVVQEKFGPMQPSISNEDMFFSQARELFEEIVDWLGSDSVCGLQHGELESKLLENGYELLRRLLQGYFDRRSDDEIEGECLGNDEA
KRTHKKRFTRKLTTIFGTIIANRIGYGGRKITSLNPLDAELNLPVEKYSHGLRERVAVEVARSGFGETVEIIQKTTAAKVGKRQVEQLADQSACDFDQFY
AHQQAQSVELKETGEIVVISVDGKGVVMRTEDLRAPTQKRALANSKKLNKRLTKGEKRNSKRMATVASVYTINPFVRTPQQIVSTEEEHKKIKRPKPIGK
RVWASLVKEPSLVIKEAFDEALHRDPNQHKRFCALVDGNKTQLSRFEKKFAREHHLKLTIVLDIIHVIEYLWKAAFAFYSDTSQQAEVWVSKRLLLILEG
KSSTVAGGMRGSATKRQLSASQRQPVDKCARYLLNNAAYLKYHDYLKAGLPIATGVIEGACRHLIKDRMDITGARWSLMGAEAVLRLRSLYVSGDWQEYW
PFHLKQEHKRNHLSLYKDGLPLMKRLIQARCSITAPPTLPIPL
Blast result :
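The ORF coordinates given in the Protein section can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that each non-fusion ORF length includes its stop codon. The function names are illustrative. The protein length of the fusion ORF is deliberately not recomputed, since it depends on how the frameshift is modelled.

```python
# (begin, end) coordinates copied from the ORF tables.
ORF1 = (71, 1174)
ORF2 = (1014, 1700)
ORF3 = (71, 1700)  # fusion ORF


def span_bp(begin: int, end: int) -> int:
    """Length in bp for 1-based, inclusive coordinates."""
    return end - begin + 1


def protein_length(bp: int) -> int:
    """Residues encoded by an ORF whose length includes the stop codon."""
    return bp // 3 - 1


def overlap_bp(a, b) -> int:
    """Number of positions shared by two inclusive intervals (0 if none)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


print(span_bp(*ORF1), protein_length(span_bp(*ORF1)))  # 1104 367
print(span_bp(*ORF2), protein_length(span_bp(*ORF2)))  # 687 228
print(span_bp(*ORF3))                                  # 1630
print(overlap_bp(ORF1, ORF2))                          # 161
```

The 161 bp shared by ORF 1 and ORF 2 is the region in which any frameshift joining the two reading frames would have to occur.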
Comments
ISMpr2 shares 88% amino acid similarity with ISCasp3.
ORF 3 is the putative ORFAB transposase, reconstructed in silico by assuming a possible -1 frameshift.
References
[1] ISfinder annotation (2013)
[2] Jones, A.C., Monroe, E.A., Podell, S., Hess, W.R., Klages, S., Esquenazi, E., Niessen, S., Hoover, H., Rothmann, M., Lasken, R.S., Yates, J.R. III, Reinhardt, R., Kube, M., Burkart, M.D., Allen, E.E., Dorrestein, P.C., Gerwick, W.H. and Gerwick, L. (2011) Proc. Natl. Acad. Sci. U.S.A. 108 (21), 8815-8820.