ISMlo2
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AP003008 | ND | Mesorhizobium loti | Mesorhizobium loti MAFF303099 |
DNA section
IS Length : 1402 bp
Ends
IR Length : 12/13
IRL : tgttgggGTGGACGGCCCCCGCACGGCATCGATGTGCCAGAATGAGGTCG
IRR : ----tatGTGGACGGCACCCTTCTCGCAAGATTGTTGAGCAATGATTTGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
TGTTGGGGTGGACGGCCCCCGCACGGCATCGATGTGCCAGAATGAGGTCGTTGCAGATCTCAAGCAAAGGAGACCGTCCGTGGAACAAATTATTCGAATT
GGCATCGACACGTCAAAGCACGTCTTCCAACTCCATGGTGTGAACGCGGCTGAGGAGCCGGTGCTGCGCAAGAAGCTGAGGCGCAAGGAGGTGGTAGCGT
TGTTCGAGAAGCTGTCGCCGACGGTGGTAGCGATCGAAGCGTGCGGGGGCTCGCACCACTGGGCGCGCCTGCTGCAGTCGTTCGGGCACGAGGTGAAGCT
GATCCCTCCGCAATATGTCAAACCGTACGTCAAGCGCGGCAAGAACGATACGGCCGATGCCGAAGCGCTGTGCGAGGCGATGAGCCGGCCGACGATGCGG
TTTGTGCCGGTCAAGACCGCTAAACAACAGGCGGCATTGATGCTGGTTGGCCTGCGCGACCGGCTGATCCGCAATCGCACGCAGCTCGCCAATGCGATCC
GCGGCTATGCGGCAGAATACGGGCTGATCGCTGCCAGGGGGATGTGCAAGATCGAACCGCTGCTCGAGCGTATCGCGGCAGATAAGATGCTACCGGATTT
GGCGCGGGAGCTGTTCGCGCTCCATGCCAAGGAATATGCGCAGCTGCAAACACAACTGAAAGACGTCGACGCCAAATTGATGGCTTGGCATCGGGCCGAG
GCGTGCAGCCAGCGCCTGGCGCAAATCCCCGGTGTCGGCCCGATCGGTGCGTCGCTACTGGTGATGAAGACGCCGGCACCTGAGACCTTTCGATCGGCCC
GACACTTTGCCGCCTGGCTTGGCCTAACGCCGAAGGATCATTCAACTGCCGGCAGGGTCAGGCTCGGTGTGATCACGCGCGCCGGCGACGAAGCCTTGCG
CAGCGTGCTGGTGGCCGGGGCTACTGCTGTCATCCGGCATGTCCGACGCGGCCGAGGCGCCGTCTCCCCCTGGCTCGTCGACCTACTCAAGCGCAAGCCG
CCGAAACTCGCTGCCGTGGCGCTAGCGAACAAGATCGCGCGCATCGCCTGGAAGCTAATGGTAAGCGGCGAGATCTACGGAGCGAAGCCTATGCCGCCAG
CCTCGGCGCGCGCCGCATAGAGATCGGCCAGACACGGGGAACATCTCTACTGTTACCGTGCTGAGCTGGTGCCGGAACTTGCAAGAGAATGAGCAGTTGG
TGTGATCGATCGATCCGAGACGCGAGACACTCCGTTGACCCCATTGGTCGTTGTAGACCGCCATGATGTTGGGAACTCGCGTCGCGGAAACCATCTTGGC
CAGCGGTTATGTGCGACCGTACACAAAGGCCGGACATATGAGCGCAAGCGATCCGATCAAATCATTGCTCAACAATCTTGCGAGAAGGGTGCCGTCCACA
TA
GGCATCGACACGTCAAAGCACGTCTTCCAACTCCATGGTGTGAACGCGGCTGAGGAGCCGGTGCTGCGCAAGAAGCTGAGGCGCAAGGAGGTGGTAGCGT
TGTTCGAGAAGCTGTCGCCGACGGTGGTAGCGATCGAAGCGTGCGGGGGCTCGCACCACTGGGCGCGCCTGCTGCAGTCGTTCGGGCACGAGGTGAAGCT
GATCCCTCCGCAATATGTCAAACCGTACGTCAAGCGCGGCAAGAACGATACGGCCGATGCCGAAGCGCTGTGCGAGGCGATGAGCCGGCCGACGATGCGG
TTTGTGCCGGTCAAGACCGCTAAACAACAGGCGGCATTGATGCTGGTTGGCCTGCGCGACCGGCTGATCCGCAATCGCACGCAGCTCGCCAATGCGATCC
GCGGCTATGCGGCAGAATACGGGCTGATCGCTGCCAGGGGGATGTGCAAGATCGAACCGCTGCTCGAGCGTATCGCGGCAGATAAGATGCTACCGGATTT
GGCGCGGGAGCTGTTCGCGCTCCATGCCAAGGAATATGCGCAGCTGCAAACACAACTGAAAGACGTCGACGCCAAATTGATGGCTTGGCATCGGGCCGAG
GCGTGCAGCCAGCGCCTGGCGCAAATCCCCGGTGTCGGCCCGATCGGTGCGTCGCTACTGGTGATGAAGACGCCGGCACCTGAGACCTTTCGATCGGCCC
GACACTTTGCCGCCTGGCTTGGCCTAACGCCGAAGGATCATTCAACTGCCGGCAGGGTCAGGCTCGGTGTGATCACGCGCGCCGGCGACGAAGCCTTGCG
CAGCGTGCTGGTGGCCGGGGCTACTGCTGTCATCCGGCATGTCCGACGCGGCCGAGGCGCCGTCTCCCCCTGGCTCGTCGACCTACTCAAGCGCAAGCCG
CCGAAACTCGCTGCCGTGGCGCTAGCGAACAAGATCGCGCGCATCGCCTGGAAGCTAATGGTAAGCGGCGAGATCTACGGAGCGAAGCCTATGCCGCCAG
CCTCGGCGCGCGCCGCATAGAGATCGGCCAGACACGGGGAACATCTCTACTGTTACCGTGCTGAGCTGGTGCCGGAACTTGCAAGAGAATGAGCAGTTGG
TGTGATCGATCGATCCGAGACGCGAGACACTCCGTTGACCCCATTGGTCGTTGTAGACCGCCATGATGTTGGGAACTCGCGTCGCGGAAACCATCTTGGC
CAGCGGTTATGTGCGACCGTACACAAAGGCCGGACATATGAGCGCAAGCGATCCGATCAAATCATTGCTCAACAATCTTGCGAGAAGGGTGCCGTCCACA
TA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1041 bp | 346 aa | 80 | 1120 | + | No |
Chemistry : DEDD
ORF sequence :
MEQIIRIGIDTSKHVFQLHGVNAAEEPVLRKKLRRKEVVALFEKLSPTVVAIEACGGSHHWARLLQSFGHEVKLIPPQYVKPYVKRGKNDTADAEALCEA
MSRPTMRFVPVKTAKQQAALMLVGLRDRLIRNRTQLANAIRGYAAEYGLIAARGMCKIEPLLERIAADKMLPDLARELFALHAKEYAQLQTQLKDVDAKL
MAWHRAEACSQRLAQIPGVGPIGASLLVMKTPAPETFRSARHFAAWLGLTPKDHSTAGRVRLGVITRAGDEALRSVLVAGATAVIRHVRRGRGAVSPWLV
DLLKRKPPKLAAVALANKIARIAWKLMVSGEIYGAKPMPPASARAA
MSRPTMRFVPVKTAKQQAALMLVGLRDRLIRNRTQLANAIRGYAAEYGLIAARGMCKIEPLLERIAADKMLPDLARELFALHAKEYAQLQTQLKDVDAKL
MAWHRAEACSQRLAQIPGVGPIGASLLVMKTPAPETFRSARHFAAWLGLTPKDHSTAGRVRLGVITRAGDEALRSVLVAGATAVIRHVRRGRGAVSPWLV
DLLKRKPPKLAAVALANKIARIAWKLMVSGEIYGAKPMPPASARAA
Blast result :
Comments
The IR of this IS are not at its termini. In the IS sequence as given 7 nt separate IRl from the left-hand end of the element and 3 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.
There are 2 copies in the Mesorhizobium loti MAFF303099 genome. The transposase gene corresponds to orfs mlr5965 and mlr6063 and the transposase protein is 78, 74, 73 and 38% identical to those of ISMlo1, ISRsp4, ISShsp1 and IS1111, respectively.
By analogy with IS4321, ISMlo2 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
There are 2 copies in the Mesorhizobium loti MAFF303099 genome. The transposase gene corresponds to orfs mlr5965 and mlr6063 and the transposase protein is 78, 74, 73 and 38% identical to those of ISMlo1, ISRsp4, ISShsp1 and IS1111, respectively.
By analogy with IS4321, ISMlo2 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
References
1] Kaneko et al. (2000) DNA Res. 7, 331-338
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384