ISMno24
- Family IS91
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP001350 | ND | Methylobacterium nodulans | Methylobacterium nodulans ORS 2060 plasmid pMNOD01 |
DNA section
IS Length : 2157 bp
Ends
oriIS : GGCGGTCGTGATCAGGCTCGGGGCCGCGGTCATGACGTGTTGATCCGAACGGCGCTTGGGCGCAGCTTCGGGACCTCGCCCCGGCGGAAGCGCTCGATGA II struct. : No
terIS : GGTCATGTCCTCGATCATCCGCCGGCGCAGCGGAGAGACGGCGTCGTGGGTCATGGGAGGCTCCCATCTGGGGTGAGGTGAACCTCACGATCCTCAGACA II struct. : No
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
TCTGTCGACT | GGGCGACCGC |
DNA sequence
GGCGGTCGTGATCAGGCTCGGGGCCGCGGTCATGACGTGTTGATCCGAACGGCGCTTGGGCGCAGCTTCGGGACCTCGCCCCGGCGGAAGCGCTCGATGA
CGACGAGGCGTCCGCCGCAGCACGGGCAGGCCGGTCCCGCGGGTGGGTCGGTCATGGCGGCGGGTTCGGGCTTGGCCGCGCACGGAGCCTGCATCGGCGC
CAGCAGGCTCCTGATCCGGGTGATGTTGGCAGCCCGCGTGCCGCTGGCGAACAGGCCGTAGTGGCGGATGCGGTGGAAGCCGTCAGGCAGCACGTGGAGC
AGGAAGCGGCGGATGAACTCGTCCGGGGCCAGCGTCATGGTCTTGATCCACTCGCGACCCTTCGCGCCATCCCGAGCCCGGTAGTCCTTCCAGCGGAACG
TGACGCCGCGCTCGTCCATAGCGATGAGCCGGCTGTTGGCGATGGCGACGCGGTGGGTGTAGCGCGACAGGTAGGCGAGCACGGCTTCCGGCCCAGCGAA
AGGGCGCTTGGCGTAGACGATCCACTCGGCGCGCCGCAGCGGTGCGAGATGAGCTGCAAAGGCGCCCGCCTCGGCCAGCGGGGCCAACTCGCCGTGGAAG
CCGAGCCTGCCGACGCGATGAGCCTCGGCCAGTCGGGTGAGGAAGAGCCGGCGGAACAGGCGTGAGAGCACCCGCACGGGCAGGAAGAAGCCTGGCCGGC
AGGCGATCCAGCGCGATCCGTCAGACGAGAGGCCGCCGCCCGGCACGACGATATGCACGTGCGGATGGTGCGTCATGGCCGAGCCCCAGGTGTGGAGCAC
GGCGGTGAGCCCGATCTTTGCGCCCAGATGCTTCGGATCGGCGGCGATGGTGAGGAGCGTGTCGGCGGCGGTGCGGAACAGCAGGTCGTATACTGCGGCT
TTGTTCTGGAAGGCCACGGCCGCGACCGGCGCCGGCAGGGTGAAGACGACGTGGTAGTAGGGCACGGGCAGCAGGTCGGCCTGACGCCGCTCGAGCCACG
CCCGCGCCGTTTGGCCCTGGCACTTCGGGCAGTGCCGGTTGCGGCACGAGTTGGAGGCGATGGCGACGTGGTCGCAGCCTTCGCAGCGCACGACATGACC
GCCGAGCGCCGCGGTGCGGCAGGCTCGGATCGCGGCCATGACCTTCATCTGGCCGTGGCTCGGGCGGTGACGCGCGAGGAAGGCGTCGCCATGCCGGTTC
AGGATATCCGCGACCTCAAGAGAAGACCGCGCCACGACGGGCGCGGCAGCCTACGGCGGTGGTGGGGATGCGAGCAGCGCGAGCGGGCTCGTGACCTCGC
GGATCGCCTTGAGCGCGACCCGGGTATAGACGGCGGTGGTGTCGAGCTTGCGGTGCCCGAGCAGCACCTGAATGACGCGGATGTCGGTCTTGCGCTCCAG
GAGGTGGGTGGCGAAGCTGTGCCGCAGCGTGTGCATCGAGACGCGCTTGTCGATCCGGGCGGCCTCGGCGGCGGCATGGCAGGCGCGGTTGAGCTGACGC
GTGGTGATCGGCTGGGCGGGCTGCTGGCCGGGGAAGAGCCAGCCGCGCGGGCGCTTCACCCGCCACCACTGCCGCAGCAGGTCGAGGAGGTCGGGCGAGA
GCATGACGAAGCGGTCCTTGCGGCCCTTGCCCTGCTCGACCCGGATCAGCATGCGCGCGCTATCGATGTCTGTGACCTTCAGGTTGGCGATCTCGGAGAC
GCGCAGGCCGCAGCCGTAGGCGACGCTCAACGCCGCGCGGTACTTCAGGCTCGGCGCGTGCGCGAGCAGGAGTGCCACCTCCTCGGGGCTGAGCACGACG
GGCAGCCGCTCGGGCGTGGGGATGCGGGCCAAGCGGTCGGCAAAGCCCGATCGGCCGAGCGTGACGTGGAAGAAGAACCGCAGCGCTGTGATGGCCAAGT
TCATGCGGGCATAGCTCGCGCCGAGCGAGGCCATGTGGAGCTGGTAGCGCCGGAGGTCCTCCGGCTCGGCCAGGTCCGGCGCGCGGCCGAGGAAGATCGT
GAACTCGCGGATCTGTCGGATGTAGTCGCGCTTTGTGTGCTCGCCGAACCGGCGGATGGTCATGTCCTCGATCATCCGCCGGCGCAGCGGAGAGACGGCG
TCGTGGGTCATGGGAGGCTCCCATCTGGGGTGAGGTGAACCTCACGATCCTCAGACA
CGACGAGGCGTCCGCCGCAGCACGGGCAGGCCGGTCCCGCGGGTGGGTCGGTCATGGCGGCGGGTTCGGGCTTGGCCGCGCACGGAGCCTGCATCGGCGC
CAGCAGGCTCCTGATCCGGGTGATGTTGGCAGCCCGCGTGCCGCTGGCGAACAGGCCGTAGTGGCGGATGCGGTGGAAGCCGTCAGGCAGCACGTGGAGC
AGGAAGCGGCGGATGAACTCGTCCGGGGCCAGCGTCATGGTCTTGATCCACTCGCGACCCTTCGCGCCATCCCGAGCCCGGTAGTCCTTCCAGCGGAACG
TGACGCCGCGCTCGTCCATAGCGATGAGCCGGCTGTTGGCGATGGCGACGCGGTGGGTGTAGCGCGACAGGTAGGCGAGCACGGCTTCCGGCCCAGCGAA
AGGGCGCTTGGCGTAGACGATCCACTCGGCGCGCCGCAGCGGTGCGAGATGAGCTGCAAAGGCGCCCGCCTCGGCCAGCGGGGCCAACTCGCCGTGGAAG
CCGAGCCTGCCGACGCGATGAGCCTCGGCCAGTCGGGTGAGGAAGAGCCGGCGGAACAGGCGTGAGAGCACCCGCACGGGCAGGAAGAAGCCTGGCCGGC
AGGCGATCCAGCGCGATCCGTCAGACGAGAGGCCGCCGCCCGGCACGACGATATGCACGTGCGGATGGTGCGTCATGGCCGAGCCCCAGGTGTGGAGCAC
GGCGGTGAGCCCGATCTTTGCGCCCAGATGCTTCGGATCGGCGGCGATGGTGAGGAGCGTGTCGGCGGCGGTGCGGAACAGCAGGTCGTATACTGCGGCT
TTGTTCTGGAAGGCCACGGCCGCGACCGGCGCCGGCAGGGTGAAGACGACGTGGTAGTAGGGCACGGGCAGCAGGTCGGCCTGACGCCGCTCGAGCCACG
CCCGCGCCGTTTGGCCCTGGCACTTCGGGCAGTGCCGGTTGCGGCACGAGTTGGAGGCGATGGCGACGTGGTCGCAGCCTTCGCAGCGCACGACATGACC
GCCGAGCGCCGCGGTGCGGCAGGCTCGGATCGCGGCCATGACCTTCATCTGGCCGTGGCTCGGGCGGTGACGCGCGAGGAAGGCGTCGCCATGCCGGTTC
AGGATATCCGCGACCTCAAGAGAAGACCGCGCCACGACGGGCGCGGCAGCCTACGGCGGTGGTGGGGATGCGAGCAGCGCGAGCGGGCTCGTGACCTCGC
GGATCGCCTTGAGCGCGACCCGGGTATAGACGGCGGTGGTGTCGAGCTTGCGGTGCCCGAGCAGCACCTGAATGACGCGGATGTCGGTCTTGCGCTCCAG
GAGGTGGGTGGCGAAGCTGTGCCGCAGCGTGTGCATCGAGACGCGCTTGTCGATCCGGGCGGCCTCGGCGGCGGCATGGCAGGCGCGGTTGAGCTGACGC
GTGGTGATCGGCTGGGCGGGCTGCTGGCCGGGGAAGAGCCAGCCGCGCGGGCGCTTCACCCGCCACCACTGCCGCAGCAGGTCGAGGAGGTCGGGCGAGA
GCATGACGAAGCGGTCCTTGCGGCCCTTGCCCTGCTCGACCCGGATCAGCATGCGCGCGCTATCGATGTCTGTGACCTTCAGGTTGGCGATCTCGGAGAC
GCGCAGGCCGCAGCCGTAGGCGACGCTCAACGCCGCGCGGTACTTCAGGCTCGGCGCGTGCGCGAGCAGGAGTGCCACCTCCTCGGGGCTGAGCACGACG
GGCAGCCGCTCGGGCGTGGGGATGCGGGCCAAGCGGTCGGCAAAGCCCGATCGGCCGAGCGTGACGTGGAAGAAGAACCGCAGCGCTGTGATGGCCAAGT
TCATGCGGGCATAGCTCGCGCCGAGCGAGGCCATGTGGAGCTGGTAGCGCCGGAGGTCCTCCGGCTCGGCCAGGTCCGGCGCGCGGCCGAGGAAGATCGT
GAACTCGCGGATCTGTCGGATGTAGTCGCGCTTTGTGTGCTCGCCGAACCGGCGGATGGTCATGTCCTCGATCATCCGCCGGCGCAGCGGAGAGACGGCG
TCGTGGGTCATGGGAGGCTCCCATCTGGGGTGAGGTGAACCTCACGATCCTCAGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1206 bp | 401 aa | 1235 | 30 | - | No |
Chemistry : Y2
ORF sequence :
MARSSLEVADILNRHGDAFLARHRPSHGQMKVMAAIRACRTAALGGHVVRCEGCDHVAIASNSCRNRHCPKCQGQTARAWLERRQADLLPVPYYHVVFTL
PAPVAAVAFQNKAAVYDLLFRTAADTLLTIAADPKHLGAKIGLTAVLHTWGSAMTHHPHVHIVVPGGGLSSDGSRWIACRPGFFLPVRVLSRLFRRLFLT
RLAEAHRVGRLGFHGELAPLAEAGAFAAHLAPLRRAEWIVYAKRPFAGPEAVLAYLSRYTHRVAIANSRLIAMDERGVTFRWKDYRARDGAKGREWIKTM
TLAPDEFIRRFLLHVLPDGFHRIRHYGLFASGTRAANITRIRSLLAPMQAPCAAKPEPAAMTDPPAGPACPCCGGRLVVIERFRRGEVPKLRPSAVRINT
S
PAPVAAVAFQNKAAVYDLLFRTAADTLLTIAADPKHLGAKIGLTAVLHTWGSAMTHHPHVHIVVPGGGLSSDGSRWIACRPGFFLPVRVLSRLFRRLFLT
RLAEAHRVGRLGFHGELAPLAEAGAFAAHLAPLRRAEWIVYAKRPFAGPEAVLAYLSRYTHRVAIANSRLIAMDERGVTFRWKDYRARDGAKGREWIKTM
TLAPDEFIRRFLLHVLPDGFHRIRHYGLFASGTRAANITRIRSLLAPMQAPCAAKPEPAAMTDPPAGPACPCCGGRLVVIERFRRGEVPKLRPSAVRINT
S
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
861 bp | 286 aa | 2111 | 1251 | - | No |
AG : IS91 integrase/resolvase
ORF sequence :
MTHDAVSPLRRRMIEDMTIRRFGEHTKRDYIRQIREFTIFLGRAPDLAEPEDLRRYQLHMASLGASYARMNLAITALRFFFHVTLGRSGFADRLARIPTP
ERLPVVLSPEEVALLLAHAPSLKYRAALSVAYGCGLRVSEIANLKVTDIDSARMLIRVEQGKGRKDRFVMLSPDLLDLLRQWWRVKRPRGWLFPGQQPAQ
PITTRQLNRACHAAAEAARIDKRVSMHTLRHSFATHLLERKTDIRVIQVLLGHRKLDTTAVYTRVALKAIREVTSPLALLASPPPP
ERLPVVLSPEEVALLLAHAPSLKYRAALSVAYGCGLRVSEIANLKVTDIDSARMLIRVEQGKGRKDRFVMLSPDLLDLLRQWWRVKRPRGWLFPGQQPAQ
PITTRQLNRACHAAAEAARIDKRVSMHTLRHSFATHLLERKTDIRVIQVLLGHRKLDTTAVYTRVALKAIREVTSPLALLASPPPP
Blast result :
Comments
ISMno24 is 80% (ORFA : the transposase) and 85% (ORFB : integrase) aa similar to ISMno23.
References
1] Ming-Chun Lee and David Robinson, direct submission.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2009) Direct submission GenBank.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2009) Direct submission GenBank.