ISMtsp8
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_010511 | ND | Methylobacterium sp. | Methylobacterium sp. 4-46 Methylobacterium sp. 4-46 plasmid pM44601 |
DNA section
IS Length : 2371 bp
Ends
IR Length : 36/50
IRL : TGTGAGCGGGCGTTGGATCCTCCTCACTTGTGGGCCTGCAATTTTCCCCA
IRR : TGTCAACGGGCTTCGAACCTTCCGCAATTTCGGGCGCGCAATTTTCCTCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACGACGAGGA | CCTTCG | TCGGCGAGGT | 6 |
GCCGCGGGTG | GGCGGG | TTCCGGTCGT | 6 |
GGCCGATGCC | GACGCC | GGGCCGGGCC | 6 |
CTAGCCTGCA | CCGAT | CACAGCGTGA | 5 |
AGGCGTAGGT | CTGCG | TACTGGTCGT | 5 |
GCTGGACCAT | GGACG | CGTCGAACCC | 5 |
GCCCTTGTCC | ATCAG | CGCGTCCAGC | 5 |
CCTGAGCAGG | CGAATACGCT | 0 | |
GCCGAATACG | GCAGGTTCAA | 0 |
DNA sequence
TGTGAGCGGGCGTTGGATCCTCCTCACTTGTGGGCCTGCAATTTTCCCCACCCTCCCGGCCGCCCCTGAGCCTCCTGTTTGAGGCGGGATGCCCGCGCGG
CGGCCGTCTCACCGAAGCTGGTCCCCGTGGCGGGAGGGAGCGGCGCGGTGGTCAGCCTCGGGGAACTCATGACGATCCTGGACCTCCACCGGCAGGGCCT
CTCGGTCACCGCCATCGCCCGCCAGCTCGGCCTCGACCGCAAGACCGTCGCCAAGTACATCGCCCGCGGCCTCGAGCCGCCCGTCTACGGACCGCGATCC
CCCCGCGCGCGGGCCACCGACGCCTTCCTGCCCTACCTGCGCGAGCGCCTGGCCGCCTACCCGCAGCTTACGGCCGTCCGTCTCTGGCGCGAGTTGAAGG
AGCGCGGCTTCGCAGGGGCCTACACCGCCGTGAAGCGAGCCGTGGCCCTGCTTCGCCCCTCAGCTCCTCTGCCTATCGAGCGCCGCTTCGAGACCCCGCC
GGGCGAGCAGGCCCAGGTCGACCTCGCCCGCTTCGAGGTCGTCTTCGCCGACGAGCCGGGCGTGACCCGCATCGTCTGGCTGTTCGCGATGGTGCTGGGC
CACTCGCGCTATCTCTGGGCCCGCTTCGTCGTCCACCAGGATCTGCAGACGGTCCTGCGCTGCCACATCGCCGCCTTCCAGGCCCTTGGAGGCGCCCCGC
GCGAGATCCTCTACGACCGCATGAAGACCGCCGTGATCGGCGAGGATCCCGACGGCTTGGTCATCTATAACCGCAGCCTTCTCGATCTCGCGCGCCACTA
CGGGTTCCTGCCGCGCGCCTGCCGTCCCTACCGGGCCAAGACCAAGGGCAAGGTCGAGCGCCCGTTCCGCTACCTGCGCGAGGACTTCTTCCTCGCCCGC
TCGTTCCGCAACCTCGACGACCTGAACGACCAGCTGCGGCACTGGCTCGACACCGTGGCCAACGCCCGCCGGCACGCCACGACCAAGCGGATCGTCGCCG
ACGCCTTCGCGGAGGAGCGCAGCCAGCTGCGGGCGCTGCCGCCCGTGCCCTACGAAGCCGTGCTCAGCCTGGAGCGGCGCGTCACCCACGAGGGCTTCGT
CTCGGTGGCGGGCAATCTCTACAGCGTGCCCGACACCACCCGCCGCCGCGCCCTGGAGGTGCACGTGCTGGCCGATCAGATCCGCATCTACGAGGCGGGT
GAGCTCGTTGCCTGCCACCTGCCCCTGGAAGGGCGTGGGCTGACGCAGGTCGATCCAGCCCATCGGCGGCCGCGATCTCCCCCGCCCGAGCCACGAGACC
CTGCCGAGCCGGTGGTCGTCAGGCGCGCCGGCGACCAGGTCGCGCGTCGCCCGCTGGCCATCTACGACGCCGTAGCCCGCCAACTCGCGGGAGCCGGCGT
GCCCAGGACCGACCGGGGAGACGCGGCATGAGCGGAACTGGCGAGCTGATCCCCCCACTGGTCGAGCGGATCAAGGCCACGCTGGTAGGGCTGAAGATGC
CGCGCGCCCTGGAGATCGTCGACACCACCGTGCGGCGGCTGGAGCGCGGCGAACTCAGCGCGCTGGAGGCGGTCGATGCCCTGCTGAGCGAGGAGCTGAG
CCTACGCGAGAGCCGGCGGGTGAAGACCGCGCTGGTGATGGCGCGGCTCTCGACGGTCAAGACGCTGTCGGGCTTCGACTTCGCCTTCCAGCCCTCGCTC
GACCGCACCCGCATCCTGGCCCTGGCCGAGCTGGGCTTCGTGGACCGCTGCGAGGTGCTGCACTTCCTCGGCCCGCCCGGCACCGGCAAGAGCCACTTGG
CGGTGGCCCTCGGGGTCGAGGCGGTGAAGGCGGGCCGCAGCGTGTACTTCACCACCCTGGCCGACCTTGTGGGAACGCTGGCGCGGGCCGAGCGGGAAGG
AACGTTGCGCGAGAAGATCCGCTACTTCTGCCGGCCGGCGCTGCTGATCGTGGACGAGATCGGCTACCTGCCGGTGGTGCCGGGCGGGGGCAACCTGTTC
TTCCAGCTCGTCAACGCGCGCTACGAGCGGGGCGCGATGGTCCTGACCTCGAACCGCGGCTTTGCAGAGTGGGGGGAGGTGTTCGGCGATCCGGTGGTGG
CGACCGCGCTGCTGGACCGGTTGCTTCACCACGCCGTGGTGGTGCAGATCGAGGGCTCAAGCTACCGGCTGCGCCAGCACACCGCGCTCATGCCCGAGCA
CATCCGCTCGAAGGCAGCCCTGCAGGCTCCGCCGCTCGCCCCGCCTCCGCGTCGGCGCGGACGCCCGCCCAAGAATGGAGGTGCTCACCTCGGCATCGCC
TGACCGCCGAGCCCGCCGATCTGAGGAAAATTGCGCGCCCGAAATTGCGGAAGGTTCGAAGCCCGTTGACA
CGGCCGTCTCACCGAAGCTGGTCCCCGTGGCGGGAGGGAGCGGCGCGGTGGTCAGCCTCGGGGAACTCATGACGATCCTGGACCTCCACCGGCAGGGCCT
CTCGGTCACCGCCATCGCCCGCCAGCTCGGCCTCGACCGCAAGACCGTCGCCAAGTACATCGCCCGCGGCCTCGAGCCGCCCGTCTACGGACCGCGATCC
CCCCGCGCGCGGGCCACCGACGCCTTCCTGCCCTACCTGCGCGAGCGCCTGGCCGCCTACCCGCAGCTTACGGCCGTCCGTCTCTGGCGCGAGTTGAAGG
AGCGCGGCTTCGCAGGGGCCTACACCGCCGTGAAGCGAGCCGTGGCCCTGCTTCGCCCCTCAGCTCCTCTGCCTATCGAGCGCCGCTTCGAGACCCCGCC
GGGCGAGCAGGCCCAGGTCGACCTCGCCCGCTTCGAGGTCGTCTTCGCCGACGAGCCGGGCGTGACCCGCATCGTCTGGCTGTTCGCGATGGTGCTGGGC
CACTCGCGCTATCTCTGGGCCCGCTTCGTCGTCCACCAGGATCTGCAGACGGTCCTGCGCTGCCACATCGCCGCCTTCCAGGCCCTTGGAGGCGCCCCGC
GCGAGATCCTCTACGACCGCATGAAGACCGCCGTGATCGGCGAGGATCCCGACGGCTTGGTCATCTATAACCGCAGCCTTCTCGATCTCGCGCGCCACTA
CGGGTTCCTGCCGCGCGCCTGCCGTCCCTACCGGGCCAAGACCAAGGGCAAGGTCGAGCGCCCGTTCCGCTACCTGCGCGAGGACTTCTTCCTCGCCCGC
TCGTTCCGCAACCTCGACGACCTGAACGACCAGCTGCGGCACTGGCTCGACACCGTGGCCAACGCCCGCCGGCACGCCACGACCAAGCGGATCGTCGCCG
ACGCCTTCGCGGAGGAGCGCAGCCAGCTGCGGGCGCTGCCGCCCGTGCCCTACGAAGCCGTGCTCAGCCTGGAGCGGCGCGTCACCCACGAGGGCTTCGT
CTCGGTGGCGGGCAATCTCTACAGCGTGCCCGACACCACCCGCCGCCGCGCCCTGGAGGTGCACGTGCTGGCCGATCAGATCCGCATCTACGAGGCGGGT
GAGCTCGTTGCCTGCCACCTGCCCCTGGAAGGGCGTGGGCTGACGCAGGTCGATCCAGCCCATCGGCGGCCGCGATCTCCCCCGCCCGAGCCACGAGACC
CTGCCGAGCCGGTGGTCGTCAGGCGCGCCGGCGACCAGGTCGCGCGTCGCCCGCTGGCCATCTACGACGCCGTAGCCCGCCAACTCGCGGGAGCCGGCGT
GCCCAGGACCGACCGGGGAGACGCGGCATGAGCGGAACTGGCGAGCTGATCCCCCCACTGGTCGAGCGGATCAAGGCCACGCTGGTAGGGCTGAAGATGC
CGCGCGCCCTGGAGATCGTCGACACCACCGTGCGGCGGCTGGAGCGCGGCGAACTCAGCGCGCTGGAGGCGGTCGATGCCCTGCTGAGCGAGGAGCTGAG
CCTACGCGAGAGCCGGCGGGTGAAGACCGCGCTGGTGATGGCGCGGCTCTCGACGGTCAAGACGCTGTCGGGCTTCGACTTCGCCTTCCAGCCCTCGCTC
GACCGCACCCGCATCCTGGCCCTGGCCGAGCTGGGCTTCGTGGACCGCTGCGAGGTGCTGCACTTCCTCGGCCCGCCCGGCACCGGCAAGAGCCACTTGG
CGGTGGCCCTCGGGGTCGAGGCGGTGAAGGCGGGCCGCAGCGTGTACTTCACCACCCTGGCCGACCTTGTGGGAACGCTGGCGCGGGCCGAGCGGGAAGG
AACGTTGCGCGAGAAGATCCGCTACTTCTGCCGGCCGGCGCTGCTGATCGTGGACGAGATCGGCTACCTGCCGGTGGTGCCGGGCGGGGGCAACCTGTTC
TTCCAGCTCGTCAACGCGCGCTACGAGCGGGGCGCGATGGTCCTGACCTCGAACCGCGGCTTTGCAGAGTGGGGGGAGGTGTTCGGCGATCCGGTGGTGG
CGACCGCGCTGCTGGACCGGTTGCTTCACCACGCCGTGGTGGTGCAGATCGAGGGCTCAAGCTACCGGCTGCGCCAGCACACCGCGCTCATGCCCGAGCA
CATCCGCTCGAAGGCAGCCCTGCAGGCTCCGCCGCTCGCCCCGCCTCCGCGTCGGCGCGGACGCCCGCCCAAGAATGGAGGTGCTCACCTCGGCATCGCC
TGACCGCCGAGCCCGCCGATCTGAGGAAAATTGCGCGCCCGAAATTGCGGAAGGTTCGAAGCCCGTTGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1284 bp | 427 aa | 148 | 1431 | + | No |
Chemistry : DDE
ORF sequence :
VVSLGELMTILDLHRQGLSVTAIARQLGLDRKTVAKYIARGLEPPVYGPRSPRARATDAFLPYLRERLAAYPQLTAVRLWRELKERGFAGAYTAVKRAVA
LLRPSAPLPIERRFETPPGEQAQVDLARFEVVFADEPGVTRIVWLFAMVLGHSRYLWARFVVHQDLQTVLRCHIAAFQALGGAPREILYDRMKTAVIGED
PDGLVIYNRSLLDLARHYGFLPRACRPYRAKTKGKVERPFRYLREDFFLARSFRNLDDLNDQLRHWLDTVANARRHATTKRIVADAFAEERSQLRALPPV
PYEAVLSLERRVTHEGFVSVAGNLYSVPDTTRRRALEVHVLADQIRIYEAGELVACHLPLEGRGLTQVDPAHRRPRSPPPEPRDPAEPVVVRRAGDQVAR
RPLAIYDAVARQLAGAGVPRTDRGDAA
LLRPSAPLPIERRFETPPGEQAQVDLARFEVVFADEPGVTRIVWLFAMVLGHSRYLWARFVVHQDLQTVLRCHIAAFQALGGAPREILYDRMKTAVIGED
PDGLVIYNRSLLDLARHYGFLPRACRPYRAKTKGKVERPFRYLREDFFLARSFRNLDDLNDQLRHWLDTVANARRHATTKRIVADAFAEERSQLRALPPV
PYEAVLSLERRVTHEGFVSVAGNLYSVPDTTRRRALEVHVLADQIRIYEAGELVACHLPLEGRGLTQVDPAHRRPRSPPPEPRDPAEPVVVRRAGDQVAR
RPLAIYDAVARQLAGAGVPRTDRGDAA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
876 bp | 291 aa | 1428 | 2303 | + | No |
AG : IS21 helper
ORF sequence :
MSGTGELIPPLVERIKATLVGLKMPRALEIVDTTVRRLERGELSALEAVDALLSEELSLRESRRVKTALVMARLSTVKTLSGFDFAFQPSLDRTRILALA
ELGFVDRCEVLHFLGPPGTGKSHLAVALGVEAVKAGRSVYFTTLADLVGTLARAEREGTLREKIRYFCRPALLIVDEIGYLPVVPGGGNLFFQLVNARYE
RGAMVLTSNRGFAEWGEVFGDPVVATALLDRLLHHAVVVQIEGSSYRLRQHTALMPEHIRSKAALQAPPLAPPPRRRGRPPKNGGAHLGIA
ELGFVDRCEVLHFLGPPGTGKSHLAVALGVEAVKAGRSVYFTTLADLVGTLARAEREGTLREKIRYFCRPALLIVDEIGYLPVVPGGGNLFFQLVNARYE
RGAMVLTSNRGFAEWGEVFGDPVVATALLDRLLHHAVVVQIEGSSYRLRQHTALMPEHIRSKAALQAPPLAPPPRRRGRPPKNGGAHLGIA
Blast result :
Comments
ISMtsp8 is 76%(orfA), and 88%(orfB) aa similar to ISMex13.
References
1] Ming-Chun Lee and David Robinson, direct submission.
2] Copeland,A., Lucas,S., Lapidus,A., Glavina del Rio,T., Dalin,E.,Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Chertkov,O., Brettin,T.,Detter,J.C., Han,C., Kuske,C.R., Schmutz,J., Larimer,F., Land,M.,Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2008) Direct submission GenBank.
2] Copeland,A., Lucas,S., Lapidus,A., Glavina del Rio,T., Dalin,E.,Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Chertkov,O., Brettin,T.,Detter,J.C., Han,C., Kuske,C.R., Schmutz,J., Larimer,F., Land,M.,Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2008) Direct submission GenBank.