ISMesp1
- Family IS1595
- Group ISNwi1
Isoform :
Synonym(s) :
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_008254 | ND | Mesorhizobium sp. | Mesorhizobium sp. BNC1 |
DNA section
IS Length : 2601 bp
Ends
IR Length : 30/33
IRL : CCTAATTATATACTTGACAAAGTACGGCCGATTTCATAGGGTTGGCCATA
IRR : CCTAATTATATACATGACAAAGTAAGGTCGATTCGAGTATTGTGCTTGGT
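The 30/33 figure can be checked by comparing the two listed ends position by position. The following is a minimal Python sketch, assuming that IRL and IRR are both written reading inward from their respective termini (as they appear above), that the repeat covers the first 33 positions, and that no gaps are allowed. The name `count_matches` is illustrative only.

```python
IRL = "CCTAATTATATACTTGACAAAGTACGGCCGATTTCATAGGGTTGGCCATA"
IRR = "CCTAATTATATACATGACAAAGTAAGGTCGATTCGAGTATTGTGCTTGGT"

def count_matches(left, right, span):
    """Count identical positions over the first `span` bases (ungapped)."""
    return sum(a == b for a, b in zip(left[:span], right[:span]))

IR_SPAN = 33  # assumption: the second number in "30/33" is the repeat length
matches = count_matches(IRL, IRR, IR_SPAN)
# 1-based positions where the two ends differ within the repeat
mismatch_positions = [i + 1 for i in range(IR_SPAN) if IRL[i] != IRR[i]]

print(f"{matches}/{IR_SPAN} identical; mismatches at {mismatch_positions}")
```

Under these assumptions the comparison gives 30 identical positions out of 33, with mismatches at positions 14, 25 and 28.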
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GACCGCCGTCACGGCGTCAAGG | ACTTTTTT | CCTAACAACGAAAGTCCG | 8 |
DNA sequence
CCTAATTATATACTTGACAAAGTACGGCCGATTTCATAGGGTTGGCCATAGAGAGGGCAGACCCATGAAACTTGACCATCCGATCTTCACCAACGTTGAC
GCAGCCCGCGAGCACCTTAAGGCGCAGCGTTGGCCGCATGGTCCCGTCTGCCCGCATTGTGGCAACACCGACCCTGCCCGCATCACGAAGATGGAAGGCA
AGGCGCATCGCCCCGGCCTCTACAACTGCATGGAATGCCGCGAACAGTTCACCGTCACCGTGGGAACCGTATTCGAGCGTTCCAAGATCGCTCTTAACAA
ATGGCTGCTGGCAACGTTCCTCATGGCTTCCTCTAAGAAGGGCATGAGCGCGCACCAGCTTCACCGTATGCTCGGCGTCACCTACAAGACCGCATGGTTC
ATGGCGCATCGCATCCGTGAGGCCATGAAGGAAGACGTGAAGTCCTCCGGCCCGCTCGGCGGCGAAGGCAAGACCGTTGAGGCCGACGAGATGTACATCG
GCAAGCGTGAGACGCCTCGCAAGCTCGCTCGTGGCCGCATCGCCAACCCCACCAAATCAGGCAAGGCAGGCGGCGCACAGAAGCGCATCGTGGTTGGCCT
TGTGGAGCGCGGCGGCAAGTCCCGTATGTTCCACCTGAACGATGCCACGAAAGAGACCGTGCGCGACGTGCTGGTGCGTAATGCAGACCGCAAGTCCATC
CTCTACACTGACGAAAGCCGTCTCTACACTCGCACTGGTGAGGAATATGCCGGTCACAAGACCACCAAGCATTCCGCTGGCGAGTACGCCCGCCGCGAAG
GCGACGTGGTTGTTCACTCCAAATACGATTGAAAGCGTGTTCTCAGTGATCAAGCGTGGCATGGTTGGCGTCTACCAGCATTGCGGCGAAGCCCATCTGC
ACCGGTATCTCGCTGAGTTCGATTTCCGTTACAATCGCCGTACGGCGCTCAAGATCACCGATGCGGAGCGCACCGATCAGTTGCTTGCCATGATCGAAGG
CAAGCGCCTCACCTATCGGCGGATTGGTGAAGCCGATCACGCCTAAGCAGAAGGCGCGAAGGCTTCTCCGGCTAAGGAAGCGATCAAAGCTCTAGCCGGG
GCCAAATTCCTCAGCCGCTGTCGGATAAGGCTCAAGGCCCAGCCTTATTCGCATGGCAGGTAGAGAGACTTGAAGGGCATCCGCCATAGCTTTCGGCGTG
GTTTTCCCTTCCGCCCTCAGGCGACTGATCAGATCGTAGGGCATTAGCAGGTTGGCCGCGAAGCGGTTTGCTTCCGTCTCCTCGCGCGGGCCTATACTCG
TGTTGTGGTAGCGCCCCGCGCTCGTGCTGCGATAAGCCCGATCATCGTCAATCCCGGTGCCAATCAGGTGGCGGTGATGGAAGAAATGCCCCAACTCATG
TGCTATCGTAAACCTACGGCGGCGCTCACCATGGGTGGCGTTCACATTGATCTGGTAACGGCCCTCACTCACGGGGACCAATTCACCGGAAACGTCATCA
GCCAACCATGCAGAATGAACGCGGATGCCGAGTTCACGGCAAAGGCCCTCCAGATCTACCGGATAGCTACTCTGCGCCCTCTTTACGGCGCTTATCATCG
TCGTCACTGACAACCTCCTTCGCCGTCACTTCTGCGATCTCGTTGGCCTGCTCCGAGCTAACAGAGTTCTTCACAAGCTCCATGTATTCGCCAATTAATG
TAGGTAGCAGCGTCTCCATTTTGGAGATCGCCAAATCTTCGACTCTCCGAGCGGCGTCCTTAAGCGCTTCCTTGGCCTCTTTCTCTGCTATCGTTTTTGC
CCGATGCTGCACAAAGAAAAAGAGCGGGAACGCAGCTAACGCGAGAATCAGCGCCAACACAGCTAGCACGGACGATAGAAAATCGAGGCGTCCTATTTGT
GACGCATATTCGATCACGTCTCTTGCAGCTTCCTCTGCGCCCATCCCGATTACCCCCATCTATCGGTTATGACGCTGGAAAGGTTACTATTCAACTACGC
GAAGCGGACGGGTTGCTTTTGGAGTCCTGTTTGGCGCGGTTGCTGCGGGGTTTATCCTTCGCCGCCTTCTCGTGCGGCTTCGGTGGCGTCTTGAGCATCC
GCTTTAGCGTCTCGTTGAAACGCTTCTCATCGCTCTGGGGGCGCTCTTTATCTTCGGCCATTCCCCGATATCCACATTAGTGACACACAATATGCAGCGT
TTCTGGAAGCAATCGGGAACGACTGCTTGACGCCCGAAAGCGAGTCGCCTTTTAGTTGATAGGCTTGGCGCGACGGGACCGGGATTTTGCGGGTGCATCC
GCAGAAATCTCCCGGTCGCCATCTTGGCCCGCTAGCTGGCGCTACCCGCTACGGGTGCCTTCGGAGATGCCGTCGATCTCCGTCGTATCGGTGGGGCCGT
AAGGTTGGTACCTTGGTCGCGGCTTGTCATCCTGACTTTCGGGTTCGATTCCCGACGGCTCCACCATCTCTACAATAGGCGTTTTCGCCTTTTTGCCAAT
GGGGCAAATGCGCTCCATCTTAAAAGTTCCACACATTCCGGCCTGTCACGCACCAAGCACAATACTCGAATCGACCTTACTTTGTCATGTATATAATTAG
G
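The IRR string listed under Ends appears to be the reverse complement of the right terminus of the sequence above, so that both ends read inward. A short Python sketch of that check follows; `RIGHT_END` is the last 50 bases copied by hand from the DNA sequence, and the helper name `reverse_complement` is illustrative.

```python
# IRR as listed in the Ends section
IRR = "CCTAATTATATACATGACAAAGTAAGGTCGATTCGAGTATTGTGCTTGGT"
# Last 50 bases of the IS (positions 2552-2601), copied from the DNA sequence
RIGHT_END = "ACCAAGCACAATACTCGAATCGACCTTACTTTGTCATGTATATAATTAGG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]

print(reverse_complement(RIGHT_END) == IRR)
```

The comparison evaluates to `True` for the strings shown.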
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
982 bp | 327 aa | 65 | 1046 | + | No |
Description :
ORF sequence :
MKLDHPIFTNVDAAREHLKAQRWPHGPVCPHCGNTDPARITKMEGKAHRPGLYNCMECREQFTVTVGTVFERSKIALNKWLLATFLMASSKKGMSAHQLH
RMLGVTYKTAWFMAHRIREAMKEDVKSSGPLGGEGKTVEADEMYIGKRETPRKLARGRIANPTKSGKAGGAQKRIVVGLVERGGKSRMFHLNDATKETVR
DVLVRNADRKSILYTDESRLYTRTGEEYAGHKTTKHSAGEYARREGDVVVHTPNTIESVFSVIKRGMVGVYQHCGEAHLHRYLAEFDFRYNRRTALKITD
AERTDQLLAMIEGKRLTYRRIGEADHA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
507 bp | 168 aa | 1092 | 1598 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MISAVKRAQSSYPVDLEGLCRELGIRVHSAWLADDVSGELVPVSEGRYQINVNATHGERRRRFTIAHELGHFFHHRHLIGTGIDDDRAYRSTSAGRYHNT
SIGPREETEANRFAANLLMPYDLISRLRAEGKTTPKAMADALQVSLPAMRIRLGLEPYPTAAEEFGPG
Blast result :
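The ORF coordinates can be spot-checked by translating short stretches of the DNA sequence and comparing them with the start of each listed protein. The sketch below assumes the standard genetic code and uses two segments copied by hand from the DNA sequence section: positions 65-100 for ORF 1 and positions 1572-1598 for ORF 2. The helper names are illustrative. In this check, the ORF 2 segment matches the listed protein only when read as its reverse complement, which suggests that ORF 2 is encoded on the strand complementary to the sequence as displayed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

ORF1_START = "ATGAAACTTGACCATCCGATCTTCACCAACGTTGAC"  # positions 65-100
ORF2_REGION = "CTGCGCCCTCTTTACGGCGCTTATCAT"          # positions 1572-1598

orf1_nterm = translate(ORF1_START)
orf2_nterm = translate(reverse_complement(ORF2_REGION))
print(orf1_nterm)  # MKLDHPIFTNVD
print(orf2_nterm)  # MISAVKRAQ
```

Both peptides agree with the first residues of the ORF sequences listed above.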
Comments
The transposase was reconstructed in silico by joining positions 65-817 and 818-1046.
The second ORF is a passenger ORF annotated as "Protein of unknown function DUF955".
ISMesp1 shares 68% amino acid similarity with ISNwi1.
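The lengths quoted in the ORF tables follow from the coordinates if these are read as 1-based and inclusive. That convention is an assumption here, as is the idea that each span includes the stop codon. A small Python sketch of the arithmetic follows; `span_length` is an illustrative helper.

```python
def span_length(begin, end):
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - begin + 1

orf1_bp = span_length(65, 1046)                            # ORF 1 as tabulated
join_bp = [span_length(65, 817), span_length(818, 1046)]   # the two joined parts
orf2_bp = span_length(1092, 1598)                          # ORF 2 as tabulated

print(orf1_bp, join_bp, sum(join_bp))  # 982 [753, 229] 982
print(orf1_bp % 3)                     # 1
print(orf2_bp, orf2_bp // 3 - 1)       # 507 168
```

ORF 2 spans 169 codons, which matches 168 residues plus a stop codon. ORF 1 spans one base more than a whole number of codons (982 = 3 × 327 + 1). That fits the comment that the transposase was reconstructed rather than read as one continuous frame, but the record does not say how the join is handled, so this reading is an interpretation.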
References
[1] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Tapia,R., Gilna,P., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N. and Richardson,P. US DOE Joint Genome Institute (2006) Direct submission (GenBank)