ISMno5
- Family ISL3
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_011894 | ND | Methylobacterium nodulans | Methylobacterium nodulans ORS 2060 Methylobacterium nodulans ORS 2060 plasmid pMNOD02 Methylobacterium nodulans ORS 2060 plasmid pMNOD01 |
DNA section
IS Length : 1728 bp
Ends
IR Length : 23
IRL : GGTTCTTCCGCACTTTGCGTGGAACGGTCGCGGTGACGCGGGAAGCGCAG
IRR : GGTTCTTCCGCACTTTGCGTGGATCAGGCAGCGATGAGGATGCGGCTACG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGATTGCCCC | GTTTTTGT | GGACGGTTAG | 8 |
CTCACGGTCC | AAGAAATG | GGGTCTGGTT | 8 |
AGAGCTCTCC | ACTTTTCC | GGGTTAAGTC | 8 |
AAAGGTGTCC | ACTTCTCC | TGAACAGACT | 8 |
CGGTTTTCGT | ACCAATCCGG | 0 | |
CCGGATTGGT | ACGAAAACCG | 0 |
DNA sequence
GGTTCTTCCGCACTTTGCGTGGAACGGTCGCGGTGACGCGGGAAGCGCAGGCTGATTCACGAGGCGGATGATCCTGTCCTCATCCCTGCCGGGCTGCACG
ATCGAGCGGGTCTTTCGCCACACCGATCACCTTGTCGTCATCGCCCATGGCCGCCGTTGTCATGGTCGATGTCCGACCTGTGGCACGCCCAGTTCCGCCG
TTCACAGCCGCTATGATCGCCGTCCGGCCGATCTGCCGAGCATGGGCCAGCCTGTGACGCTGCGCCTGCGTATCCGACGCTTCTACTGCCATCATCCCGC
TTGCCGCCGCCGCACGTTCGCCGAGCCTCTTCCGCGGCTGATACCGCCGCGAGCGCGCCGGACCCGCCGCCTCGCTCAGGCCCAGACCCGGATCGGGCTT
GCAGTCGGCGGCGAGGCCGGCGCGCGCCTGACCGGCCACCTGGGGATGCAAACCAGCCCCGACACGATCCTGCGCCTCGTGCACCGCCTCCCGTTGCCGA
GAGCGAACGCGCCGCGCGCCGTGGGCATCGACGACTGGGCCATCCGCAAAGGTCGGAGCTACGGCACGCTTCTCGTCGACCTCGAACGGCGATGCCCGAT
CGACCTGCTGCCCGACCGCTCAGGGGCGACCGTTGCCGCATGGCTGCGTCGCCATCCCAGCATCCAGATCGTCGCCCGCGACCGCTCGACCGAGTACGCC
CGAGCGGCCACCGCGGGCGCGCCGGCCGCCCTCCAGGTCGCCGACCGATGGCACCTGCTCCTCAACCTGCGCCAGGTTCTCGAACGCTGGCTTGGCCGCG
TCCATGGCCGGCTGCGACAGCTCCCTCCTCTTGCGAGTGGTGACGGACGACAGCCAGGGGAGCGCCCGCGCGCCTATCGCCGCAGCGCAGCCGAGATTGC
CGTCAGCCTCGACAGCCGCGCCCGCCGGCTGGCGGCCTATGAGGACGTGCGCCGACGCCATCTCGCTGGCGAGACCCTCCTGGCGATCGGCCGCGCCACG
GGTCTGGCGCGAGCGACCGTGCGCAAGTACGCCCAGGCCGAGAGCTTCCCGGAGCGCGCGATCCGCAGACCCAATCCCTCTCGCCTCGATCCCTACCTCG
CCCATCTGGAGCAGCGCATGGCCGAGGGCTGCGAGAACGCTATGGCGCTCTGGCGCGAGATCCGCCGCCAGGGCTTCGCGGGAACCCATCGGCAGGTGCA
CCGCTTCGTCGCCGAGCGGCGCACGGCTCGCAAATGGCTGTCGCAGCCCGCCTCGACGAGCACTGAGGCGATCAGACCCTCACCCATCGCCTCACCCAAG
CAACTGGCCTGGATCCTGGTGCAGCCCCTCGCGACACTGCAGCCCCGCGCCGCGGCTGACCTCGCCCGCATCCGGCAGGATCCTGAAGCCGCACGGATCG
CCGATCTGGCGCGGCGGTTCACGATGCTCGTGCGTGCCTGTGGCCTGGGCGGCGACCGGCCGGCGGACCCTGCCAGCGAGCTCGACAGATGGCTGCTCGA
AACCCGGAACTGCGGTGTCGCTGCGCTGGAGACCTTCGCGGCCGGTTTGGCGCAGGATGGGGCGGCCGTTCGGGCGGCGCTGACGACGTCCTGGAGCAAT
GCTCAGGCGGAAGGGCAGATCAGTCGGCTGAAGATGCTCAAGCGCACCATGTACGGTCGCGCCAGCTTCGCACTCCTCCGTAGCCGCATCCTCATCGCTG
CCTGATCCACGCAAAGTGCGGAAGAACC
ATCGAGCGGGTCTTTCGCCACACCGATCACCTTGTCGTCATCGCCCATGGCCGCCGTTGTCATGGTCGATGTCCGACCTGTGGCACGCCCAGTTCCGCCG
TTCACAGCCGCTATGATCGCCGTCCGGCCGATCTGCCGAGCATGGGCCAGCCTGTGACGCTGCGCCTGCGTATCCGACGCTTCTACTGCCATCATCCCGC
TTGCCGCCGCCGCACGTTCGCCGAGCCTCTTCCGCGGCTGATACCGCCGCGAGCGCGCCGGACCCGCCGCCTCGCTCAGGCCCAGACCCGGATCGGGCTT
GCAGTCGGCGGCGAGGCCGGCGCGCGCCTGACCGGCCACCTGGGGATGCAAACCAGCCCCGACACGATCCTGCGCCTCGTGCACCGCCTCCCGTTGCCGA
GAGCGAACGCGCCGCGCGCCGTGGGCATCGACGACTGGGCCATCCGCAAAGGTCGGAGCTACGGCACGCTTCTCGTCGACCTCGAACGGCGATGCCCGAT
CGACCTGCTGCCCGACCGCTCAGGGGCGACCGTTGCCGCATGGCTGCGTCGCCATCCCAGCATCCAGATCGTCGCCCGCGACCGCTCGACCGAGTACGCC
CGAGCGGCCACCGCGGGCGCGCCGGCCGCCCTCCAGGTCGCCGACCGATGGCACCTGCTCCTCAACCTGCGCCAGGTTCTCGAACGCTGGCTTGGCCGCG
TCCATGGCCGGCTGCGACAGCTCCCTCCTCTTGCGAGTGGTGACGGACGACAGCCAGGGGAGCGCCCGCGCGCCTATCGCCGCAGCGCAGCCGAGATTGC
CGTCAGCCTCGACAGCCGCGCCCGCCGGCTGGCGGCCTATGAGGACGTGCGCCGACGCCATCTCGCTGGCGAGACCCTCCTGGCGATCGGCCGCGCCACG
GGTCTGGCGCGAGCGACCGTGCGCAAGTACGCCCAGGCCGAGAGCTTCCCGGAGCGCGCGATCCGCAGACCCAATCCCTCTCGCCTCGATCCCTACCTCG
CCCATCTGGAGCAGCGCATGGCCGAGGGCTGCGAGAACGCTATGGCGCTCTGGCGCGAGATCCGCCGCCAGGGCTTCGCGGGAACCCATCGGCAGGTGCA
CCGCTTCGTCGCCGAGCGGCGCACGGCTCGCAAATGGCTGTCGCAGCCCGCCTCGACGAGCACTGAGGCGATCAGACCCTCACCCATCGCCTCACCCAAG
CAACTGGCCTGGATCCTGGTGCAGCCCCTCGCGACACTGCAGCCCCGCGCCGCGGCTGACCTCGCCCGCATCCGGCAGGATCCTGAAGCCGCACGGATCG
CCGATCTGGCGCGGCGGTTCACGATGCTCGTGCGTGCCTGTGGCCTGGGCGGCGACCGGCCGGCGGACCCTGCCAGCGAGCTCGACAGATGGCTGCTCGA
AACCCGGAACTGCGGTGTCGCTGCGCTGGAGACCTTCGCGGCCGGTTTGGCGCAGGATGGGGCGGCCGTTCGGGCGGCGCTGACGACGTCCTGGAGCAAT
GCTCAGGCGGAAGGGCAGATCAGTCGGCTGAAGATGCTCAAGCGCACCATGTACGGTCGCGCCAGCTTCGCACTCCTCCGTAGCCGCATCCTCATCGCTG
CCTGATCCACGCAAAGTGCGGAAGAACC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1638 bp | 545 aa | 68 | 1705 | + | No |
Chemistry : Unknow
ORF sequence :
MILSSSLPGCTIERVFRHTDHLVVIAHGRRCHGRCPTCGTPSSAVHSRYDRRPADLPSMGQPVTLRLRIRRFYCHHPACRRRTFAEPLPRLIPPRARRTR
RLAQAQTRIGLAVGGEAGARLTGHLGMQTSPDTILRLVHRLPLPRANAPRAVGIDDWAIRKGRSYGTLLVDLERRCPIDLLPDRSGATVAAWLRRHPSIQ
IVARDRSTEYARAATAGAPAALQVADRWHLLLNLRQVLERWLGRVHGRLRQLPPLASGDGRQPGERPRAYRRSAAEIAVSLDSRARRLAAYEDVRRRHLA
GETLLAIGRATGLARATVRKYAQAESFPERAIRRPNPSRLDPYLAHLEQRMAEGCENAMALWREIRRQGFAGTHRQVHRFVAERRTARKWLSQPASTSTE
AIRPSPIASPKQLAWILVQPLATLQPRAAADLARIRQDPEAARIADLARRFTMLVRACGLGGDRPADPASELDRWLLETRNCGVAALETFAAGLAQDGAA
VRAALTTSWSNAQAEGQISRLKMLKRTMYGRASFALLRSRILIAA
RLAQAQTRIGLAVGGEAGARLTGHLGMQTSPDTILRLVHRLPLPRANAPRAVGIDDWAIRKGRSYGTLLVDLERRCPIDLLPDRSGATVAAWLRRHPSIQ
IVARDRSTEYARAATAGAPAALQVADRWHLLLNLRQVLERWLGRVHGRLRQLPPLASGDGRQPGERPRAYRRSAAEIAVSLDSRARRLAAYEDVRRRHLA
GETLLAIGRATGLARATVRKYAQAESFPERAIRRPNPSRLDPYLAHLEQRMAEGCENAMALWREIRRQGFAGTHRQVHRFVAERRTARKWLSQPASTSTE
AIRPSPIASPKQLAWILVQPLATLQPRAAADLARIRQDPEAARIADLARRFTMLVRACGLGGDRPADPASELDRWLLETRNCGVAALETFAAGLAQDGAA
VRAALTTSWSNAQAEGQISRLKMLKRTMYGRASFALLRSRILIAA
Blast result :
Comments
ISMno5 is 67% aa similar to ISMex10.
References
1] Ming-Chun Lee and David Robinson, direct submission.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2009) Direct submission GenBank.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Marx,C.J. and Richardson,P. (2009) Direct submission GenBank.