ISMesp4
- Family ISNCY
- Group ISDol1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
LMHO01000023 | ND | Mesorhizobium sp. | Mesorhizobium sp. Root695 |
DNA section
IS Length : 1663 bp
Ends
IR Length : 15/16
IRL : GTTTCCGTCCACAAACAGCAACTTATTGATTTTATTGTCTGAATCGTTCC
IRR : GTTTCCGTCCATAAACGTGCGTTTTCGCCAGCATTTACAGGCATTTGGCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTGAACTGCTC | TAGTCTA | GACAGTTCCAAT | 7 |
DNA sequence
GTTTCCGTCCACAAACAGCAACTTATTGATTTTATTGTCTGAATCGTTCCTTTGCGCGAGCGAGGCCCGGCGGTTTCAGGTATCTTGGATCATGCCACTG
ATTCCAAGTCCCTGCCACCGCCGGAGCCGACAATGCGCCAAGAACGCACCGTCCAAGCCAGCATATTCGATGTTTTCGCCACACACGAGATCGGCCACGA
GTTGAAGGCGATGTCGCAATGGCTGGACGGGCATCGCGATCTGCTTGCTTTGGCGACGCGAGACCTATGCCGAGACGGCGCCAAGGCCACCGGGCGGCAA
GGGCTGCCGGCCGAAGCAGTGTTGCGTTGTGCGTTGCTCAAACAATATCGCCAGCTGAGCTACCAGGAGCTGGCGTTCCATCTCGAGGACTCCGCCTCGT
TTCGCGCCTTTGCCCGGCTGCCGTGGCCATGGAGCCCGCAGAAGTCGGTATTGCAAAGGACGATCAGCGCGATCCGGCCCGAGACCTGGGAACAGATCAA
TCGGACACTGCTGTCAGGCGCGCGGCAGGCGAAGCTCGAAGACGGCGCGGTCGTGCGGCTGGACAGCACCGTGACCTCGGCACTCATACACGAGCCGAGC
GACAGCAGCCTGCTTTGGGATGCAGTGCGGGTCATGGTGCGCCTGCTGAAGCGGGCCGATACCTTGGTCGGCGGCGCAGGCTCATCATGGCGCGATCGTT
GCCGCGCAGCCAAGAAGCGCGCCCGCAAGATCCAGTTCACGCGCGGTCGACCCAATCGGGTCCAACTCTACCGCGAACTGATCGCAATCACCCGTGCGAC
CTTGGCCTATCTGAAGCAGGCGACCGATCGACTAGCCGTGGCGAGCACTCCACTCATCGCACTGTGGCAGGTCCAGGTTCGGCACTATCGGCCGTTGGCG
GAACGCATCATCAGACAGAGCGAGCGGCGGGTGTTGGCTGGCGAGCCGGTGCCAGCCGGCGAGAAGTTGGTCAGCCTGTTCGAACAGCATGCCGATATCA
TCGTCAAAGGCAGCCGCGATACCGAATACGGCCACAAGCTCAACCTCACCACCGGCAGGAGCGGCATGATCCTAGATCTGGTGATAGAAGCAGGCAATCC
GGCCGACAGCGACCGCTTGCTGCCGATGCTGGAGCGCCACATCAACCTCTATGGTCAAGCACCGCGGCAAGCGGCCGCCGATGGCGGCTTCGCCACCCGC
AACAACTTGGCCACAGCGAAGGCGTGGGGCGTCTGCGACATGGCCTTCCACAAGAAGGCCGGTCTCAGGATCGAGGACATGGTCAGAAGCAAGTGGGTCT
ACCGCAAGCTGCGCAACTTCCGCGCCGGTATCGAAGCCGGCATCTCGTGCCTCAAGCGCGCTTACGGTTTGGCGCGCTGCACTTGGCGCGGCCTTGATCA
CTTCAAGGCCTACGTCTGGTCCTCGGTCGTGGCCTACAACCTGGTTCTCTTCACGCGCCTCAAAGCGACCTGACAGCGACACCCATCACCGTTGCGCAAC
CACCGGCCGACGCCGATCTTCTGGCCTGCGCGGGATCATGCCCACAACGTCTTTCTCGACCGACCAAACGCTCTGCTACCACGCCATCCCCTCTACAGGT
CGCTGACGACAACAGCCAAATGCCTGTAAATGCTGGCGAAAACGCACGTTTATGGACGGAAAC
ATTCCAAGTCCCTGCCACCGCCGGAGCCGACAATGCGCCAAGAACGCACCGTCCAAGCCAGCATATTCGATGTTTTCGCCACACACGAGATCGGCCACGA
GTTGAAGGCGATGTCGCAATGGCTGGACGGGCATCGCGATCTGCTTGCTTTGGCGACGCGAGACCTATGCCGAGACGGCGCCAAGGCCACCGGGCGGCAA
GGGCTGCCGGCCGAAGCAGTGTTGCGTTGTGCGTTGCTCAAACAATATCGCCAGCTGAGCTACCAGGAGCTGGCGTTCCATCTCGAGGACTCCGCCTCGT
TTCGCGCCTTTGCCCGGCTGCCGTGGCCATGGAGCCCGCAGAAGTCGGTATTGCAAAGGACGATCAGCGCGATCCGGCCCGAGACCTGGGAACAGATCAA
TCGGACACTGCTGTCAGGCGCGCGGCAGGCGAAGCTCGAAGACGGCGCGGTCGTGCGGCTGGACAGCACCGTGACCTCGGCACTCATACACGAGCCGAGC
GACAGCAGCCTGCTTTGGGATGCAGTGCGGGTCATGGTGCGCCTGCTGAAGCGGGCCGATACCTTGGTCGGCGGCGCAGGCTCATCATGGCGCGATCGTT
GCCGCGCAGCCAAGAAGCGCGCCCGCAAGATCCAGTTCACGCGCGGTCGACCCAATCGGGTCCAACTCTACCGCGAACTGATCGCAATCACCCGTGCGAC
CTTGGCCTATCTGAAGCAGGCGACCGATCGACTAGCCGTGGCGAGCACTCCACTCATCGCACTGTGGCAGGTCCAGGTTCGGCACTATCGGCCGTTGGCG
GAACGCATCATCAGACAGAGCGAGCGGCGGGTGTTGGCTGGCGAGCCGGTGCCAGCCGGCGAGAAGTTGGTCAGCCTGTTCGAACAGCATGCCGATATCA
TCGTCAAAGGCAGCCGCGATACCGAATACGGCCACAAGCTCAACCTCACCACCGGCAGGAGCGGCATGATCCTAGATCTGGTGATAGAAGCAGGCAATCC
GGCCGACAGCGACCGCTTGCTGCCGATGCTGGAGCGCCACATCAACCTCTATGGTCAAGCACCGCGGCAAGCGGCCGCCGATGGCGGCTTCGCCACCCGC
AACAACTTGGCCACAGCGAAGGCGTGGGGCGTCTGCGACATGGCCTTCCACAAGAAGGCCGGTCTCAGGATCGAGGACATGGTCAGAAGCAAGTGGGTCT
ACCGCAAGCTGCGCAACTTCCGCGCCGGTATCGAAGCCGGCATCTCGTGCCTCAAGCGCGCTTACGGTTTGGCGCGCTGCACTTGGCGCGGCCTTGATCA
CTTCAAGGCCTACGTCTGGTCCTCGGTCGTGGCCTACAACCTGGTTCTCTTCACGCGCCTCAAAGCGACCTGACAGCGACACCCATCACCGTTGCGCAAC
CACCGGCCGACGCCGATCTTCTGGCCTGCGCGGGATCATGCCCACAACGTCTTTCTCGACCGACCAAACGCTCTGCTACCACGCCATCCCCTCTACAGGT
CGCTGACGACAACAGCCAAATGCCTGTAAATGCTGGCGAAAACGCACGTTTATGGACGGAAAC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1344 bp | 446 aa | 133 | 1476 | + | No |
Chemistry : DDE
ORF sequence :
MRQERTVQASIFDVFATHEIGHELKAMSQWLDGHRDLLALATRDLCRDGAKATGRQGLPAEAVLRCALLKQYRQLSYQELAFHLEDSASFRAFARLPWPW
SPQKSVLQRTISAIRPETWEQINRTLLSGARQAKLEDGAVVRLDSTVTSALIHEPSDSSLLWDAVRVMVRLLKRADTLVGGAGSSWRDRCRAAKKRARKI
QFTRGRPNRVQLYRELIAITRATLAYLKQATDRLAVASTPLIALWQVQVRHYRPLAERIIRQSERRVLAGEPVPAGEKLVSLFEQHADIIVKGSRDTEYG
HKLNLTTGRSGMILDLVIEAGNPADSDRLLPMLERHINLYGQAPRQAAADGGFATRNNLATAKAWGVCDMAFHKKAGLRIEDMVRSKWVYRKLRNFRAGI
EAGISCLKRAYGLARCTWRGLDHFKAYVWSSVVAYNLVLFTRLKAT
SPQKSVLQRTISAIRPETWEQINRTLLSGARQAKLEDGAVVRLDSTVTSALIHEPSDSSLLWDAVRVMVRLLKRADTLVGGAGSSWRDRCRAAKKRARKI
QFTRGRPNRVQLYRELIAITRATLAYLKQATDRLAVASTPLIALWQVQVRHYRPLAERIIRQSERRVLAGEPVPAGEKLVSLFEQHADIIVKGSRDTEYG
HKLNLTTGRSGMILDLVIEAGNPADSDRLLPMLERHINLYGQAPRQAAADGGFATRNNLATAKAWGVCDMAFHKKAGLRIEDMVRSKWVYRKLRNFRAGI
EAGISCLKRAYGLARCTWRGLDHFKAYVWSSVVAYNLVLFTRLKAT
Blast result :
Comments
ISMesp4 is 94% aa similar to ISMesp3.
References
1] ISfinder annotation (2016)
2] Bai,Y., Muller,D.B., Srinivas,G., Garrido-Oter,R., Potthoff,E., Rott,M., Dombrowski,N., Munch,P.C., Spaepen,S., Remus-Emsermann,M., Huttel,B., McHardy,A.C., Vorholt,J.A. and Schulze-Lefert,P. (2015) Nature 528 (7582), 364-369.
2] Bai,Y., Muller,D.B., Srinivas,G., Garrido-Oter,R., Potthoff,E., Rott,M., Dombrowski,N., Munch,P.C., Spaepen,S., Remus-Emsermann,M., Huttel,B., McHardy,A.C., Vorholt,J.A. and Schulze-Lefert,P. (2015) Nature 528 (7582), 364-369.