ISMlo1
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AP003008 | ND | Mesorhizobium loti | Mesorhizobium loti MAFF303099 |
DNA section
IS Length : 1405 bp
Ends
IR Length : 14
IRL : tgttggtGTGGACGGCCTCCTGCACGGCATCAATGTGCCAGAATGAGGTC
IRR : ----tatGTGGACGGCCTCCTCCCTGCAAGATGTTGGGCAGCGGTTTGAT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
TGTTGGTGTGGACGGCCTCCTGCACGGCATCAATGTGCCAGAATGAGGTCATCGGAGACCTCAAGCAAAGGAGACCGCCCCGTGGAGCAAATTATTCGAA
TTGGCATGGACACGTCAAAGCACGTCTTTCAACTGCATGGTGTGAATGCAGCCGAAGAGGTCGTGCTGCGCAAAAAAATGCGGCGTAAGGAGATGGTGGC
TTTCTTCGAGAGGGCGGCGCCAACGGTGGTCGCGCTCGAAGCGTGCGGCGGATCGCATCATTGGGCCCGCCTGCTTGAGGCGCTTGGCCATGACGTGAAA
CTGATCCCGCCGCAATATGTCAAACCGTACGTAAAGCGCGGCAAGAACGACGCGGCCGATGCGGAGGCTCTTTGCGAAGCGGTAAGCCGCCCGACGATGC
GCTTCGTTCGGGTAAAAACCACCGACAACCAGGCGGCTTTGATGCTGGTCGGCATGCGTGACCGACTGGTTCGCAATCGCACGCAACTTGCCAACGCCAT
CCGCGGATACGCTGCCGAGTTCGGATTGATCGCTGCCACAGGTACATCGAAGGTCAGATCATTGCTCGAGCGTATCGCGACTGATGAAACGCTGCCTGAA
CTGGCACGTGAACTGTTTGGGCTGCACGGCCGCGAATATGATCGGCTGGAGGACGAAATCGAAAAGGTCGAAGCACGGCTGATGGCGTGGCATCGCACAA
ACGAATGCAGTCGACGCCTGACACGGATTCCTGGCGTCGGACCGATCGGCGCATTGATGATGGTCATGAAGACCCCGGCGCCCGAGACGTTCCGATCAGG
GCGGCACTTCGCGGCTTGGCTTGGCCTGACGCCGAAGGATCACTCGACAGCTGGCAGGGTCAGGCTTGGCGTGATCACCCGAGCTGGCGATGAAAATCTG
CGAAGCGTGCTCGTCGCGGGCGCCACCGCTGTCCTCCGGCATGTTCGGGAGGACCGCGGGAAGAATGCGTCCCCATGGCTGCTGGACATGCTCAACCGCA
AACCGCCGAAGCTCGTCGCTGTGGCGCTAGCCAATAAAATCGCCCGCGTCGCCTGGAAGCTGATGGTCACCGGCGAACGATACAATCCCAACGCTGCATT
GTCGGCTTATGGCCAGGCAGCATAGAGATCAGTCATCCAGGGGAGCACAATGTGACGCGACCTTGGTAGGCTGAACCCGAACTTGCGGGAAACGAGCAGT
TGGTGTGATCGATCGATCCGAAACGTGAGACAATCCGTACGATGCAAGGCTCTCAAAAGGCCGCAAAACCTGTTTGGAACTCATGTTGCGGAAACCATCT
TGGCCCGCGGTCATGTACGACCGCAATCAGAGGCCGGACATATGATCGCAAGCGATGGGATCAAACCGCTGCCCAACATCTTGCAGGGAGGAGGCCGTCC
ACATA
TTGGCATGGACACGTCAAAGCACGTCTTTCAACTGCATGGTGTGAATGCAGCCGAAGAGGTCGTGCTGCGCAAAAAAATGCGGCGTAAGGAGATGGTGGC
TTTCTTCGAGAGGGCGGCGCCAACGGTGGTCGCGCTCGAAGCGTGCGGCGGATCGCATCATTGGGCCCGCCTGCTTGAGGCGCTTGGCCATGACGTGAAA
CTGATCCCGCCGCAATATGTCAAACCGTACGTAAAGCGCGGCAAGAACGACGCGGCCGATGCGGAGGCTCTTTGCGAAGCGGTAAGCCGCCCGACGATGC
GCTTCGTTCGGGTAAAAACCACCGACAACCAGGCGGCTTTGATGCTGGTCGGCATGCGTGACCGACTGGTTCGCAATCGCACGCAACTTGCCAACGCCAT
CCGCGGATACGCTGCCGAGTTCGGATTGATCGCTGCCACAGGTACATCGAAGGTCAGATCATTGCTCGAGCGTATCGCGACTGATGAAACGCTGCCTGAA
CTGGCACGTGAACTGTTTGGGCTGCACGGCCGCGAATATGATCGGCTGGAGGACGAAATCGAAAAGGTCGAAGCACGGCTGATGGCGTGGCATCGCACAA
ACGAATGCAGTCGACGCCTGACACGGATTCCTGGCGTCGGACCGATCGGCGCATTGATGATGGTCATGAAGACCCCGGCGCCCGAGACGTTCCGATCAGG
GCGGCACTTCGCGGCTTGGCTTGGCCTGACGCCGAAGGATCACTCGACAGCTGGCAGGGTCAGGCTTGGCGTGATCACCCGAGCTGGCGATGAAAATCTG
CGAAGCGTGCTCGTCGCGGGCGCCACCGCTGTCCTCCGGCATGTTCGGGAGGACCGCGGGAAGAATGCGTCCCCATGGCTGCTGGACATGCTCAACCGCA
AACCGCCGAAGCTCGTCGCTGTGGCGCTAGCCAATAAAATCGCCCGCGTCGCCTGGAAGCTGATGGTCACCGGCGAACGATACAATCCCAACGCTGCATT
GTCGGCTTATGGCCAGGCAGCATAGAGATCAGTCATCCAGGGGAGCACAATGTGACGCGACCTTGGTAGGCTGAACCCGAACTTGCGGGAAACGAGCAGT
TGGTGTGATCGATCGATCCGAAACGTGAGACAATCCGTACGATGCAAGGCTCTCAAAAGGCCGCAAAACCTGTTTGGAACTCATGTTGCGGAAACCATCT
TGGCCCGCGGTCATGTACGACCGCAATCAGAGGCCGGACATATGATCGCAAGCGATGGGATCAAACCGCTGCCCAACATCTTGCAGGGAGGAGGCCGTCC
ACATA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1044 bp | 347 aa | 82 | 1125 | + | No |
Chemistry : DEDD
ORF sequence :
MEQIIRIGMDTSKHVFQLHGVNAAEEVVLRKKMRRKEMVAFFERAAPTVVALEACGGSHHWARLLEALGHDVKLIPPQYVKPYVKRGKNDAADAEALCEA
VSRPTMRFVRVKTTDNQAALMLVGMRDRLVRNRTQLANAIRGYAAEFGLIAATGTSKVRSLLERIATDETLPELARELFGLHGREYDRLEDEIEKVEARL
MAWHRTNECSRRLTRIPGVGPIGALMMVMKTPAPETFRSGRHFAAWLGLTPKDHSTAGRVRLGVITRAGDENLRSVLVAGATAVLRHVREDRGKNASPWL
LDMLNRKPPKLVAVALANKIARVAWKLMVTGERYNPNAALSAYGQAA
VSRPTMRFVRVKTTDNQAALMLVGMRDRLVRNRTQLANAIRGYAAEFGLIAATGTSKVRSLLERIATDETLPELARELFGLHGREYDRLEDEIEKVEARL
MAWHRTNECSRRLTRIPGVGPIGALMMVMKTPAPETFRSGRHFAAWLGLTPKDHSTAGRVRLGVITRAGDENLRSVLVAGATAVLRHVREDRGKNASPWL
LDMLNRKPPKLVAVALANKIARVAWKLMVTGERYNPNAALSAYGQAA
Blast result :
Comments
The IR of this IS are not at its termini. In the IS sequence as given 7 nt separate IRl from the left-hand end of the element and 3 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.
The transposase protein corresponds to orf mll6112 and is 78, 71, 70 and 38% identical to those of ISMlo2, ISShsp1, ISRsp4 and IS1111, respectively.
By analogy with IS4321, ISMlo1 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
The transposase protein corresponds to orf mll6112 and is 78, 71, 70 and 38% identical to those of ISMlo2, ISShsp1, ISRsp4 and IS1111, respectively.
By analogy with IS4321, ISMlo1 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
References
1] Kaneko et al. (2000) DNA Res. 7, 331-338
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384