ISMsi1
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_011666 | ND | Methylocella silvestris | Methylocella silvestris BL2 |
DNA section
IS Length : 2307 bp
Ends
IR Length : 20/27
IRL : TGTGAGATCGCGTGGAATGTTGTCCCCTTGAGGGGCTGATTTCGCGTTGA
IRR : TGTCAATTCGCTTTGAAAGCTGTCCCCGCTTTCGCTTTGAATGTTAGCCC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TATGAGGCGCGGATACACGA | CAGACGCGGTAAAGGGCAGT | 0 | |
TACGTTTCGCCCGGCACCTC | ATCAAGGCGGTGCAAGGCAC | 0 |
DNA sequence
TGTGAGATCGCGTGGAATGTTGTCCCCTTGAGGGGCTGATTTCGCGTTGAAGATTGTCCCCCTAGTTCGGCTTTCAGCGATCCTTCCGGCTTTGCGGAGG
GCTCTGGGGATGATGGGCGTGGAGACGATCGCGCGGATCCGGTTCGAGCATTTCCAGAACGGCAAAGGGATCAAGCGGATCGCGCTGGAGCTGGGGATTG
CGCGCGACACGGTCCGCAAAATTCTGCGCTCCGGCGTGACGGAGTTCGCCTACAAGCGCACGGTTCAGCCGCAGCGCAAGCTGGGCGACTGGGTGGAGAC
GCTGACAGCGATCCTGGAGGCGGAGGCCAAGCTGCCGCGACGCGAGCGGCGCTCGACCATCCGTTTATTCGAGGAACTCCGCGGCCGGGGCTATGACGGC
GCGCATGACAGTGTGCATCGGTTCGTCAAGAGCTGGCGCGCCGAACGGGCCCGGGCGCCGGCGCAGGCTTTCAGTCCGCTGCGGTTCGATCCGGGCGAAG
CCCATCAGTTCGATTGGAGCCATGAGGCGATAGAGCTTCAGGGATTGCCGCTGACGGTGAAGCTGGCGCAGATGCGGCTCGCTTACAGCCGCATGCCTTT
CGTGCGCGCCTATTTCCGCGAGACGCAGGAGATGGTGTTCGACGCGCACGATCGCGCCTTCGCTTTTTATGGCGGCGTCTGCCGGCGCGGGATTTATGAC
AACATGCGCACGGCGGTGGAGGCCGTCTTCGTCGGCAGCGCCCGCACGTATAACCGGCGCTTTCTCCAACTTTGCTCGCATCATCTCGTCGAACCTGTAG
CCTGCACGCCTGCCGCCGGTTGGGAGAAGGGCCAGGTCGAGAACCAGGTCGGCACGATGCGCGACGTGTTGTTCCGGCCAAAGCCAAAGGTGAAGTCGCT
GGATGAGCTGAATGCTTGGCTGGCCGAGCAATGCGTCTCCTACGCCAAACGGGTCAAGCACCCCGAGTTCAGGGATCGGACGATCTTTGAAGTGTTCGAG
GAAGAACGGCCGCGCCTCATGCCCTTCCTCGGCCCGTTCGCGGGTTCATCGAGAAGCCGATGCGCGCGACGACAACCTGCCTGATCGCGCATGATCGGAA
CAAATACAGCGTCGACGCCAAGGCGGCGGGGCGCGCTGTGCTGGTGCGCGTCTACGCCGATCGCATTGTGGCTCTCCTCGGCGAGGAGATTGTCGCGGAT
CACCCACGCAGCTTCAAGCGCGATCAAGTTGTCTACGAGGCTTGCGCCGAAGCCTTGGAGGCCGGCGTCGCCAATGGCGACGTCGTTGTGGCGATCCTCG
CCCGCAACCGCCAACCTCCGGCGCCGCCAAGCATCACAACGCCCGAGGCGCTCAAACTCAAAGCCGAGCCCGTCGCCGACTGCGCCCGCTACGACAATCT
GCGCAAAAGAAGGGAGGATGCGCTATGGAGCGTCATCAGATCATCGACGCCATGACCGGATTGAAGCTTTACGGAATGCGCGCCAGTTTCGACGAGATCG
CCGGCAAAGGGCTGGCGCGGCGCGAAGAACTCTACCCGCTGCTCGGCAGCCTGATCCGCGCCGAACTGGCGCATCGCCAGTCCAGATCGATCAACTACCG
CATCAGCGGCGCCAAGTTCCCCGTGCTGAAAGATCTCGACGCCTTCGTCTTCAAAGACACGCCTGCCGACGAAGGCCTGATCCGCGAACTCGCCACCGGA
GACTTCATCGACGCCAAGTGTAATCTCATCCTGATCGGCGGCACAGGCACAGGCAAAACCCATCTGGCGATCGCCATCGCCGCCGCCGTCATTCGCGCCC
GAGCGCGCGGCCGTTTCTTCAACCTTGTCGACCTCGTCAACAAGCTCGAACAGAAAAAGGCCGCCGGAAAAAGCGGCAGGATCGTCGAAGCCATGCTGCG
TCAGGATTTGATCGTCATCGACGAACTCGGCTATTTGCCGTTCAGCCACGCCGGCGCGCAGCTGTTGTTCTATCTGATCAGCAAGCTCTATGAAAACACC
TTGATCATTATCACGACCAATCTTGCCTTCGCCGATTGGCCGCAAGTCTTTGGCGACGCCAAAATGACCACCGCCATGCTCGATCGGCTCACGCATCACT
GCGATATCGTCGAGACCGGCAACGAGAGCTGGCGCTTCAAGAACCGCGTCTGACCATCAACCAACAAAGCGCCCCATGGCCGGCGAAACAGCCGCGGCTC
TGCAACCCCGACCAGCGACGCCGCCGCTGTTCCCGCTCTTCCGGGCTCTAACCGAGGGGGCTAACATTCAAAGCGAAAGCGGGGACAGCTTTCAAAGCGA
ATTGACA
GCTCTGGGGATGATGGGCGTGGAGACGATCGCGCGGATCCGGTTCGAGCATTTCCAGAACGGCAAAGGGATCAAGCGGATCGCGCTGGAGCTGGGGATTG
CGCGCGACACGGTCCGCAAAATTCTGCGCTCCGGCGTGACGGAGTTCGCCTACAAGCGCACGGTTCAGCCGCAGCGCAAGCTGGGCGACTGGGTGGAGAC
GCTGACAGCGATCCTGGAGGCGGAGGCCAAGCTGCCGCGACGCGAGCGGCGCTCGACCATCCGTTTATTCGAGGAACTCCGCGGCCGGGGCTATGACGGC
GCGCATGACAGTGTGCATCGGTTCGTCAAGAGCTGGCGCGCCGAACGGGCCCGGGCGCCGGCGCAGGCTTTCAGTCCGCTGCGGTTCGATCCGGGCGAAG
CCCATCAGTTCGATTGGAGCCATGAGGCGATAGAGCTTCAGGGATTGCCGCTGACGGTGAAGCTGGCGCAGATGCGGCTCGCTTACAGCCGCATGCCTTT
CGTGCGCGCCTATTTCCGCGAGACGCAGGAGATGGTGTTCGACGCGCACGATCGCGCCTTCGCTTTTTATGGCGGCGTCTGCCGGCGCGGGATTTATGAC
AACATGCGCACGGCGGTGGAGGCCGTCTTCGTCGGCAGCGCCCGCACGTATAACCGGCGCTTTCTCCAACTTTGCTCGCATCATCTCGTCGAACCTGTAG
CCTGCACGCCTGCCGCCGGTTGGGAGAAGGGCCAGGTCGAGAACCAGGTCGGCACGATGCGCGACGTGTTGTTCCGGCCAAAGCCAAAGGTGAAGTCGCT
GGATGAGCTGAATGCTTGGCTGGCCGAGCAATGCGTCTCCTACGCCAAACGGGTCAAGCACCCCGAGTTCAGGGATCGGACGATCTTTGAAGTGTTCGAG
GAAGAACGGCCGCGCCTCATGCCCTTCCTCGGCCCGTTCGCGGGTTCATCGAGAAGCCGATGCGCGCGACGACAACCTGCCTGATCGCGCATGATCGGAA
CAAATACAGCGTCGACGCCAAGGCGGCGGGGCGCGCTGTGCTGGTGCGCGTCTACGCCGATCGCATTGTGGCTCTCCTCGGCGAGGAGATTGTCGCGGAT
CACCCACGCAGCTTCAAGCGCGATCAAGTTGTCTACGAGGCTTGCGCCGAAGCCTTGGAGGCCGGCGTCGCCAATGGCGACGTCGTTGTGGCGATCCTCG
CCCGCAACCGCCAACCTCCGGCGCCGCCAAGCATCACAACGCCCGAGGCGCTCAAACTCAAAGCCGAGCCCGTCGCCGACTGCGCCCGCTACGACAATCT
GCGCAAAAGAAGGGAGGATGCGCTATGGAGCGTCATCAGATCATCGACGCCATGACCGGATTGAAGCTTTACGGAATGCGCGCCAGTTTCGACGAGATCG
CCGGCAAAGGGCTGGCGCGGCGCGAAGAACTCTACCCGCTGCTCGGCAGCCTGATCCGCGCCGAACTGGCGCATCGCCAGTCCAGATCGATCAACTACCG
CATCAGCGGCGCCAAGTTCCCCGTGCTGAAAGATCTCGACGCCTTCGTCTTCAAAGACACGCCTGCCGACGAAGGCCTGATCCGCGAACTCGCCACCGGA
GACTTCATCGACGCCAAGTGTAATCTCATCCTGATCGGCGGCACAGGCACAGGCAAAACCCATCTGGCGATCGCCATCGCCGCCGCCGTCATTCGCGCCC
GAGCGCGCGGCCGTTTCTTCAACCTTGTCGACCTCGTCAACAAGCTCGAACAGAAAAAGGCCGCCGGAAAAAGCGGCAGGATCGTCGAAGCCATGCTGCG
TCAGGATTTGATCGTCATCGACGAACTCGGCTATTTGCCGTTCAGCCACGCCGGCGCGCAGCTGTTGTTCTATCTGATCAGCAAGCTCTATGAAAACACC
TTGATCATTATCACGACCAATCTTGCCTTCGCCGATTGGCCGCAAGTCTTTGGCGACGCCAAAATGACCACCGCCATGCTCGATCGGCTCACGCATCACT
GCGATATCGTCGAGACCGGCAACGAGAGCTGGCGCTTCAAGAACCGCGTCTGACCATCAACCAACAAAGCGCCCCATGGCCGGCGAAACAGCCGCGGCTC
TGCAACCCCGACCAGCGACGCCGCCGCTGTTCCCGCTCTTCCGGGCTCTAACCGAGGGGGCTAACATTCAAAGCGAAAGCGGGGACAGCTTTCAAAGCGA
ATTGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1406 bp | 448 aa | 50 | 1455 | + | No |
Chemistry : DDE
ORF sequence :
MMGVETIARIRFEHFQNGKGIKRIALELGIARDTVRKILRSGVTEFAYKRTVQPQRKLGDWVETLTAILEAEAKLPRRERRSTIRLFEELRGRGYDGAHD
SVHRFVKSWRAERARAPAQAFSPLRFDPGEAHQFDWSHEAIELQGLPLTVKLAQMRLAYSRMPFVRAYFRETQEMVFDAHDRAFAFYGGVCRRGIYDNMR
TAVEAVFVGSARTYNRRFLQLCSHHLVEPVACTPAAGWEKGQVENQVGTMRDVLFRPKPKVKSLDELNAWLAEQCVSYAKRVKHPEFRDRTIFEVFEEER
PRLMPFLGPFAGFIEKPMRATTTCLIAHDRNKYSVDAKAAGRAVLVRVYADRIVALLGEEIVADHPRSFKRDQVVYEACAEALEAGVANGDVVVAILARN
RQPPAPPSITTPEALKLKAEPVADCARYDNLRKRREDALWSVIRSSTP
SVHRFVKSWRAERARAPAQAFSPLRFDPGEAHQFDWSHEAIELQGLPLTVKLAQMRLAYSRMPFVRAYFRETQEMVFDAHDRAFAFYGGVCRRGIYDNMR
TAVEAVFVGSARTYNRRFLQLCSHHLVEPVACTPAAGWEKGQVENQVGTMRDVLFRPKPKVKSLDELNAWLAEQCVSYAKRVKHPEFRDRTIFEVFEEER
PRLMPFLGPFAGFIEKPMRATTTCLIAHDRNKYSVDAKAAGRAVLVRVYADRIVALLGEEIVADHPRSFKRDQVVYEACAEALEAGVANGDVVVAILARN
RQPPAPPSITTPEALKLKAEPVADCARYDNLRKRREDALWSVIRSSTP
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
729 bp | 242 aa | 1425 | 2153 | + | No |
AG : IS21 helper
ORF sequence :
MERHQIIDAMTGLKLYGMRASFDEIAGKGLARREELYPLLGSLIRAELAHRQSRSINYRISGAKFPVLKDLDAFVFKDTPADEGLIRELATGDFIDAKCN
LILIGGTGTGKTHLAIAIAAAVIRARARGRFFNLVDLVNKLEQKKAAGKSGRIVEAMLRQDLIVIDELGYLPFSHAGAQLLFYLISKLYENTLIIITTNL
AFADWPQVFGDAKMTTAMLDRLTHHCDIVETGNESWRFKNRV
LILIGGTGTGKTHLAIAIAAAVIRARARGRFFNLVDLVNKLEQKKAAGKSGRIVEAMLRQDLIVIDELGYLPFSHAGAQLLFYLISKLYENTLIIITTNL
AFADWPQVFGDAKMTTAMLDRLTHHCDIVETGNESWRFKNRV
Blast result :
Comments
ISMsi1 is 63% (istA, the transposase) and + 78% (istB, the helper of transposition) aa similar to ISMex39.
istA was reconstructed in silico by joining (50-1045;1044-1455).
istA was reconstructed in silico by joining (50-1045;1044-1455).
References
1] Miriam Land (2009) Direct submission
2] Lucas,S., Lucas,S., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Forester,B., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ovchinnikova,G., Dunfield,P.F., Dedysh,S.N., Liesack,W., Stott,M.B., Smirnova,A.V., Alam,M., Chen,Y., Murrell,J.C. and Richardson,P.(2008) Direct submission GenBank.
2] Lucas,S., Lucas,S., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Forester,B., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ovchinnikova,G., Dunfield,P.F., Dedysh,S.N., Liesack,W., Stott,M.B., Smirnova,A.V., Alam,M., Chen,Y., Murrell,J.C. and Richardson,P.(2008) Direct submission GenBank.