ISMno23
- Family IS91
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_011894 | ND | Methylobacterium nodulans | Methylobacterium nodulans ORS 2060 |
DNA section
IS Length : 2271 bp
Ends
oriIS : TGAACCGCTTCCGCGGTAGGGGCGCTTGGCAATGCCGATGACGTGATCGGCCCGGAGAGGCTTGTCTGAAGATCGTGGGGCCCCCCTGGTCTCGGACAGG II struct. : No
terIS : CTCCGCTCCGCGCGGTCAGGATTGACACCTCATGAGCCCCCATCCGGACGCGAGCCTGATGCTGGACCGCCCTGCGGTGCGCCTCCCCAGCTTGGACGCA II struct. : No
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GCCAGC GAAC | CAAG CCCTGG |
DNA sequence
TGAACCGCTTCCGCGGTAGGGGCGCTTGGCAATGCCGATGACGTGATCGGCCCGGAGAGGCTTGTCTGAAGATCGTGGGGCCCCCCTGGTCTCGGACAGG
AGCCCCTCATGAGCAACCAAGCCGTCAGCCCGCTGCGCCGCCGCATGATCGAGGACATGACCATCCGGCACTTCAGTGCCAAGACGCAGGCCGACCCCAT
CCGCGCCGTGAGAACCTTCGCAGCCTTTCTCGGTCAGTCACCCGATCAGGCCGAGCCAGAAGACCTGCGTCGCTTCCAGCTGCACAGGGCCGCCCCGGGC
GCATCCCCGGCCACGATGAACGCCGCCGTCTCAGCCCTGCGCTTCTTCTTTGGCATCACCCTCGGCTGGGTCGGCTTCGGCGACCGGCTCGCCACCGTCC
GCTCGCCCGAGCGGCTGCCGGTGGTGCTCAGCCCCGAAGAGGTGGCGCTCCTGCTCGGCTGCGCACCGGGGCTGAAGGACCGCGCCGCCCTCAGTATCGC
CTATGGCTGCGGCCTACGCGTCTCGGAGATCGCCAACCTGAAGGTCACGGACATCGACAGCGCCCGCATGCTGATCCGGGTCGAGCAGGGTAAGGGCCGC
AAGGACCGCTTCGTCATGCTCGCCCCCGACCTGCTCGATCTGCTGCGGCACTGGTGGAAGCTGACGCGCTCCCGCGGCTGGCTGTTCCCGGGGCGGGAGC
CCGGTCAGCCGATCACGGTCCGCCAGCTCAACCGCGCCTGCCATGCGGCCGCCGCCAGCGCCCGGCTCGACAAGCGCGTCTCGATCGCGTCTCGATGCAC
ACCTTGCGGCACAGCTTCGCCACGCATCTCCTGGAGCAGAAGATCGACATCCGCCTCATCCAGGTGCTGCTCGGCCACAGGAAGCTCGACACCACGGCGC
TCTACACCCGCGTGGCACGAGGACGCTGCGCGCCGTCGCGAGCCCGCTGAGCCAGCTGCGGCCCAGGGACGTGCCGCCGGTCTGAGGAGGGGGGCGCGTG
CCGCGCCCCGCCCTGGAACTCGCCGACGTCCTCAACCAGCATGGCGAGGCCTCCCTGCGGGCCCATGCCGGACACCTCAGTCGCGGCCAGCGCAAGGTCA
TTGCGGCGATCCGGGCCTGCCGTACCGCGGCGCTCGGCGGCCATGTCGCACGCTGCCAGGCCTGCGATCAGATCCAGGTCTCCTCCAACTCCTGCCGCAA
CCGGCACTGCCCGAAGTGTCAGGGGCAGACCGCGCGGGCGTGGCTGGAACGGCAGCAGGCCGACCTGCTGCCGGTGGCCTACTTCCACGTCGTCTTCAGC
CGGCCGGCGCCGATCGCCAGGGTGGCGTTCCAGAACAAGGCCGTGGTCTACCACCTGCTGCTGCGGACCGCCGCCGACACGCTGCTCACCCTCGCGGCCG
ACCCCAAGCACCTCGGGGCGAGGATTGGGTTGACCGCCGTGCTCCACACCTGGGGCTCGGCGATGACCCATCATCCGCACGTGCCCGTCATCGTCCCCGG
CGGCGGGCTGTCCCCGGACGGGGCGCGCTGGATCGCCTGCCGGCCCGGCTTCTTCCTGCCCGTCCGTGTGCTCTCGCGCCTGTTCCGCGGGCTGTTCCTC
GACCGGCTGAGCCAAGCGCATCGCGCCGGCCGGCTCACCTTCGGGGCTGAGCTGTCTCGTCTGGCCGAGCCGATGGCCTTCGCGGCCCTACTGGCCCCGC
TGCGTCGCGCGGAATGGGTGGTCTACGCCAAGCCGCCGCTGGCCGGCCCCGAGGCGGTGCTCACCTACCTGTCGGGCTACACGCATCGCGTGGCGATCGC
CAACAGCCGGCTGCTGAGGCTGGACGAGCACGGGGTCACGGTCCGCTGGACGGATGACCGCGTCCGGGCTGGGGCGAGGGGGCGTGAGTGGATCAAGCGT
ATGACGCTGTCGGCCGACGAGTTCATCCGCCGCTTCCTGCTGCACGTGCTGCCCGATGGCTTCCACCGCATCCGCCACGACGGCCTGTTCGCCCGCGGGA
CGCGGGCTGGGGCCATCGCCACGATCCGGGCTCTGCTCGCTTCGGGGACGTCTCACGATCCGGTCAAGCCCGAACCTGCCCGCGCCCCGCCGGCTCCTCC
GGCTTGTCCCCGCTGCGGCGGCTGCCTGTCCATCGTCGAGCGCTTCCGCCGCGGCGAGACCCCGCGCTTGCCTCCGCTCCGCGCGGTCAGGATTGACACC
TCATGAGCCCCCATCCGGACGCGAGCCTGATGCTGGACCGCCCTGCGGTGCGCCTCCCCAGCTTGGACGCA
AGCCCCTCATGAGCAACCAAGCCGTCAGCCCGCTGCGCCGCCGCATGATCGAGGACATGACCATCCGGCACTTCAGTGCCAAGACGCAGGCCGACCCCAT
CCGCGCCGTGAGAACCTTCGCAGCCTTTCTCGGTCAGTCACCCGATCAGGCCGAGCCAGAAGACCTGCGTCGCTTCCAGCTGCACAGGGCCGCCCCGGGC
GCATCCCCGGCCACGATGAACGCCGCCGTCTCAGCCCTGCGCTTCTTCTTTGGCATCACCCTCGGCTGGGTCGGCTTCGGCGACCGGCTCGCCACCGTCC
GCTCGCCCGAGCGGCTGCCGGTGGTGCTCAGCCCCGAAGAGGTGGCGCTCCTGCTCGGCTGCGCACCGGGGCTGAAGGACCGCGCCGCCCTCAGTATCGC
CTATGGCTGCGGCCTACGCGTCTCGGAGATCGCCAACCTGAAGGTCACGGACATCGACAGCGCCCGCATGCTGATCCGGGTCGAGCAGGGTAAGGGCCGC
AAGGACCGCTTCGTCATGCTCGCCCCCGACCTGCTCGATCTGCTGCGGCACTGGTGGAAGCTGACGCGCTCCCGCGGCTGGCTGTTCCCGGGGCGGGAGC
CCGGTCAGCCGATCACGGTCCGCCAGCTCAACCGCGCCTGCCATGCGGCCGCCGCCAGCGCCCGGCTCGACAAGCGCGTCTCGATCGCGTCTCGATGCAC
ACCTTGCGGCACAGCTTCGCCACGCATCTCCTGGAGCAGAAGATCGACATCCGCCTCATCCAGGTGCTGCTCGGCCACAGGAAGCTCGACACCACGGCGC
TCTACACCCGCGTGGCACGAGGACGCTGCGCGCCGTCGCGAGCCCGCTGAGCCAGCTGCGGCCCAGGGACGTGCCGCCGGTCTGAGGAGGGGGGCGCGTG
CCGCGCCCCGCCCTGGAACTCGCCGACGTCCTCAACCAGCATGGCGAGGCCTCCCTGCGGGCCCATGCCGGACACCTCAGTCGCGGCCAGCGCAAGGTCA
TTGCGGCGATCCGGGCCTGCCGTACCGCGGCGCTCGGCGGCCATGTCGCACGCTGCCAGGCCTGCGATCAGATCCAGGTCTCCTCCAACTCCTGCCGCAA
CCGGCACTGCCCGAAGTGTCAGGGGCAGACCGCGCGGGCGTGGCTGGAACGGCAGCAGGCCGACCTGCTGCCGGTGGCCTACTTCCACGTCGTCTTCAGC
CGGCCGGCGCCGATCGCCAGGGTGGCGTTCCAGAACAAGGCCGTGGTCTACCACCTGCTGCTGCGGACCGCCGCCGACACGCTGCTCACCCTCGCGGCCG
ACCCCAAGCACCTCGGGGCGAGGATTGGGTTGACCGCCGTGCTCCACACCTGGGGCTCGGCGATGACCCATCATCCGCACGTGCCCGTCATCGTCCCCGG
CGGCGGGCTGTCCCCGGACGGGGCGCGCTGGATCGCCTGCCGGCCCGGCTTCTTCCTGCCCGTCCGTGTGCTCTCGCGCCTGTTCCGCGGGCTGTTCCTC
GACCGGCTGAGCCAAGCGCATCGCGCCGGCCGGCTCACCTTCGGGGCTGAGCTGTCTCGTCTGGCCGAGCCGATGGCCTTCGCGGCCCTACTGGCCCCGC
TGCGTCGCGCGGAATGGGTGGTCTACGCCAAGCCGCCGCTGGCCGGCCCCGAGGCGGTGCTCACCTACCTGTCGGGCTACACGCATCGCGTGGCGATCGC
CAACAGCCGGCTGCTGAGGCTGGACGAGCACGGGGTCACGGTCCGCTGGACGGATGACCGCGTCCGGGCTGGGGCGAGGGGGCGTGAGTGGATCAAGCGT
ATGACGCTGTCGGCCGACGAGTTCATCCGCCGCTTCCTGCTGCACGTGCTGCCCGATGGCTTCCACCGCATCCGCCACGACGGCCTGTTCGCCCGCGGGA
CGCGGGCTGGGGCCATCGCCACGATCCGGGCTCTGCTCGCTTCGGGGACGTCTCACGATCCGGTCAAGCCCGAACCTGCCCGCGCCCCGCCGGCTCCTCC
GGCTTGTCCCCGCTGCGGCGGCTGCCTGTCCATCGTCGAGCGCTTCCGCCGCGGCGAGACCCCGCGCTTGCCTCCGCTCCGCGCGGTCAGGATTGACACC
TCATGAGCCCCCATCCGGACGCGAGCCTGATGCTGGACCGCCCTGCGGTGCGCCTCCCCAGCTTGGACGCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
842 bp | 280 aa | 109 | 950 | + | No |
AG : IS91 integrase/resolvase
ORF sequence :
MSNQAVSPLRRRMIEDMTIRHFSAKTQADPIRAVRTFAAFLGQSPDQAEPEDLRRFQLHRAAPGASPATMNAAVSALRFFFGITLGWVGFGDRLATVRSP
ERLPVVLSPEEVALLLGCAPGLKDRAALSIAYGCGLRVSEIANLKVTDIDSARMLIRVEQGKGRKDRFVMLAPDLLDLLRHWWKLTRSRGWLFPGREPGQ
PITVRQLNRACHAAAASARLDKRVSIAVSMHTLRHSFATHLLEQKIDIRLIQVLLGHRKLDTTALYTRVARGRCAPSRAR
ERLPVVLSPEEVALLLGCAPGLKDRAALSIAYGCGLRVSEIANLKVTDIDSARMLIRVEQGKGRKDRFVMLAPDLLDLLRHWWKLTRSRGWLFPGREPGQ
PITVRQLNRACHAAAASARLDKRVSIAVSMHTLRHSFATHLLEQKIDIRLIQVLLGHRKLDTTALYTRVARGRCAPSRAR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1209 bp | 402 aa | 998 | 2206 | + | No |
Chemistry : Y2
ORF sequence :
VPRPALELADVLNQHGEASLRAHAGHLSRGQRKVIAAIRACRTAALGGHVARCQACDQIQVSSNSCRNRHCPKCQGQTARAWLERQQADLLPVAYFHVVF
SRPAPIARVAFQNKAVVYHLLLRTAADTLLTLAADPKHLGARIGLTAVLHTWGSAMTHHPHVPVIVPGGGLSPDGARWIACRPGFFLPVRVLSRLFRGLF
LDRLSQAHRAGRLTFGAELSRLAEPMAFAALLAPLRRAEWVVYAKPPLAGPEAVLTYLSGYTHRVAIANSRLLRLDEHGVTVRWTDDRVRAGARGREWIK
RMTLSADEFIRRFLLHVLPDGFHRIRHDGLFARGTRAGAIATIRALLASGTSHDPVKPEPARAPPAPPACPRCGGCLSIVERFRRGETPRLPPLRAVRID
TS
SRPAPIARVAFQNKAVVYHLLLRTAADTLLTLAADPKHLGARIGLTAVLHTWGSAMTHHPHVPVIVPGGGLSPDGARWIACRPGFFLPVRVLSRLFRGLF
LDRLSQAHRAGRLTFGAELSRLAEPMAFAALLAPLRRAEWVVYAKPPLAGPEAVLTYLSGYTHRVAIANSRLLRLDEHGVTVRWTDDRVRAGARGREWIK
RMTLSADEFIRRFLLHVLPDGFHRIRHDGLFARGTRAGAIATIRALLASGTSHDPVKPEPARAPPAPPACPRCGGCLSIVERFRRGETPRLPPLRAVRID
TS
Blast result :
Comments
ISMno23 is 63% (orfA), and 64% (orfB, the Transposase) aa similar to ISAzo26.
There is a punctual mutation inducing a frameshift. We have reconstructed the ORFA in silico by joining (109-788;788-950)
There is a punctual mutation inducing a frameshift. We have reconstructed the ORFA in silico by joining (109-788;788-950)
References
1] Ming-Chun Lee and David Robinson (2009) Direct submission.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2009) Direct submission GenBank.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2009) Direct submission GenBank.