ISMac5
- Family IS1634
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_003552 | ND | Methanosarcina acetivorans | Methanosarcina acetivorans C2A |
DNA section
IS Length : 1719 bp
Ends
IR Length : 19/20
IRL : CCTAGATTCGGCAGGTAAACAAATATCTTAATATTATTACTTTTATATCT
IRR : CCTAGATTCGGCAGGTTAACGTAATTAGAGATAGATATTTTCATATTTCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTATTTCATT | ATAAAA | TATTATTTTA | 6 |
DNA sequence
CCTAGATTCGGCAGGTAAACAAATATCTTAATATTATTACTTTTATATCTCTTTCCTGGTCTTCCCATGGCAGAAAAAAACAGCAGTAGAAGAGTTGAAT
CCTCCCTAAAACGTACAAGGTTCTTAGGTCACCTTGGTCTTATGGCTGGAGTTTTCCGAGAACTTGAGGTTGACAAACTGATCGATGAAAAACTTCCCAA
AGAAAGGGATCATACTGTCCCTCACTCAGTCTGCATCCTTGCCATGTTGCTCAATGGTCTTGGTTTCGTAGGGCAACGTCTATACCTGTTTCCTGATTTT
TTTAAAAACATTTCTACGGAAAGGCTTTTCGGAGACGGTATAACAAGAGAGGATCTGAATCAATACGTTATCGGAGAGACTCTTGACAGAATCGTAAAAT
ATGGCCCTACAAAACTGTTTACGGAAATTGCTCTTCACATTATGACTCGTCTCCCTATTCCTGTTCATTGTTTACACGCTGACACTACAAGTGTCAGCGT
TTATGGGGATTATGATGACGAAGAAACTGAGTCTATTGACATTACTTTTGGAATTCCCAAAAACGGAAGATGGGACCTCAAACAATTTGTACTTAGTTTG
ATTGTTAATCAGCATGGGATACCTCTTTTCATGAACACACATTCAGGAAATTCTTCCGACAAAAACACAATTCTGGAAGCGATCCAGTCTCTCAAATCAG
TTTTAAGACCTGAAAGCGAAGTTTACTACGTCGCTGATAGTTCCTTTTACACAGACAATAATATCAAGAACATGGGAAAGTCATTCTGGATCAGTCGTGT
TCCTGCAACAATTACCGAGGCAAAGGAACTGCTAACTGCAAATCTGAACCTGAAAACGCTAAAAAGCGACGAAAGATACTCATTTTATCAAACCTTTGTG
GAATATGGTGGAATCAAACAAAAGTGGGTTTTGCTGCTTTCTCACAAGATGAAAGAGAAGAAAGAGCAAACTCTCAGGACGAAGCTTGAAAAAGAGGTTG
AAAAAGCAGAGAAGTCTTTTAAAGAACTGAAAGGAGAGGACTTTTTTTGCGAAGAGGATGCATTAAAAGCTGCAGAAAAATGGATTCAAGATTTCCCTTC
TGTCTCATTTGAAAAAGTAGATGTGAAATCCATTAAAAAACGTGAGTTGGGGAAAAGAGGCAGACCTTCAAAAGATGAGCAATTAAAGACTTATTACAGG
ATTAATGGAATCATAAAGGTTAATGATGCTTTTGTTTTAAATGAAATGGATAAAATGGGACTTTTTATTCTTGCAAGTAATGATATCAATCTTTCTCCTG
AGGATATGCTGAAGTATTACAAAGGTCAGGATAACGTGGAAAAAGGATTCAGATTCTTGAAAAGTAACACCTTTAGCATATCGAAGGTTTACCTCAAGAA
CAAAAAGAGAATTGAAGCGCTGACTATGATAATGGTTCTCTGCTTGATGATTTATTCAATTGCAGAATGGAAATTAAGGACAAAATTAGAAGAAGAAAAT
GAAACGATTCCAGATCAAAAAGGGAAACAAACAAAAAGACCTACAATGAGATGGATATTTTTCAATTTTCAGGGAATTACAGAACTTATTTATCAGAACG
AAGGACAAATGAAGTCAGAAATATTGAATATGGAGGAGATTCACTGGAAGATACTGGGTCTAATGGGAGAGAAATATGAAAATATCTATCTCTAATTACG
TTAACCTGCCGAATCTAGG
CCTCCCTAAAACGTACAAGGTTCTTAGGTCACCTTGGTCTTATGGCTGGAGTTTTCCGAGAACTTGAGGTTGACAAACTGATCGATGAAAAACTTCCCAA
AGAAAGGGATCATACTGTCCCTCACTCAGTCTGCATCCTTGCCATGTTGCTCAATGGTCTTGGTTTCGTAGGGCAACGTCTATACCTGTTTCCTGATTTT
TTTAAAAACATTTCTACGGAAAGGCTTTTCGGAGACGGTATAACAAGAGAGGATCTGAATCAATACGTTATCGGAGAGACTCTTGACAGAATCGTAAAAT
ATGGCCCTACAAAACTGTTTACGGAAATTGCTCTTCACATTATGACTCGTCTCCCTATTCCTGTTCATTGTTTACACGCTGACACTACAAGTGTCAGCGT
TTATGGGGATTATGATGACGAAGAAACTGAGTCTATTGACATTACTTTTGGAATTCCCAAAAACGGAAGATGGGACCTCAAACAATTTGTACTTAGTTTG
ATTGTTAATCAGCATGGGATACCTCTTTTCATGAACACACATTCAGGAAATTCTTCCGACAAAAACACAATTCTGGAAGCGATCCAGTCTCTCAAATCAG
TTTTAAGACCTGAAAGCGAAGTTTACTACGTCGCTGATAGTTCCTTTTACACAGACAATAATATCAAGAACATGGGAAAGTCATTCTGGATCAGTCGTGT
TCCTGCAACAATTACCGAGGCAAAGGAACTGCTAACTGCAAATCTGAACCTGAAAACGCTAAAAAGCGACGAAAGATACTCATTTTATCAAACCTTTGTG
GAATATGGTGGAATCAAACAAAAGTGGGTTTTGCTGCTTTCTCACAAGATGAAAGAGAAGAAAGAGCAAACTCTCAGGACGAAGCTTGAAAAAGAGGTTG
AAAAAGCAGAGAAGTCTTTTAAAGAACTGAAAGGAGAGGACTTTTTTTGCGAAGAGGATGCATTAAAAGCTGCAGAAAAATGGATTCAAGATTTCCCTTC
TGTCTCATTTGAAAAAGTAGATGTGAAATCCATTAAAAAACGTGAGTTGGGGAAAAGAGGCAGACCTTCAAAAGATGAGCAATTAAAGACTTATTACAGG
ATTAATGGAATCATAAAGGTTAATGATGCTTTTGTTTTAAATGAAATGGATAAAATGGGACTTTTTATTCTTGCAAGTAATGATATCAATCTTTCTCCTG
AGGATATGCTGAAGTATTACAAAGGTCAGGATAACGTGGAAAAAGGATTCAGATTCTTGAAAAGTAACACCTTTAGCATATCGAAGGTTTACCTCAAGAA
CAAAAAGAGAATTGAAGCGCTGACTATGATAATGGTTCTCTGCTTGATGATTTATTCAATTGCAGAATGGAAATTAAGGACAAAATTAGAAGAAGAAAAT
GAAACGATTCCAGATCAAAAAGGGAAACAAACAAAAAGACCTACAATGAGATGGATATTTTTCAATTTTCAGGGAATTACAGAACTTATTTATCAGAACG
AAGGACAAATGAAGTCAGAAATATTGAATATGGAGGAGATTCACTGGAAGATACTGGGTCTAATGGGAGAGAAATATGAAAATATCTATCTCTAATTACG
TTAACCTGCCGAATCTAGG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1629 bp | 542 aa | 67 | 1695 | + | No |
Chemistry : DDE
ORF sequence :
MAEKNSSRRVESSLKRTRFLGHLGLMAGVFRELEVDKLIDEKLPKERDHTVPHSVCILAMLLNGLGFVGQRLYLFPDFFKNISTERLFGDGITREDLNQY
VIGETLDRIVKYGPTKLFTEIALHIMTRLPIPVHCLHADTTSVSVYGDYDDEETESIDITFGIPKNGRWDLKQFVLSLIVNQHGIPLFMNTHSGNSSDKN
TILEAIQSLKSVLRPESEVYYVADSSFYTDNNIKNMGKSFWISRVPATITEAKELLTANLNLKTLKSDERYSFYQTFVEYGGIKQKWVLLLSHKMKEKKE
QTLRTKLEKEVEKAEKSFKELKGEDFFCEEDALKAAEKWIQDFPSVSFEKVDVKSIKKRELGKRGRPSKDEQLKTYYRINGIIKVNDAFVLNEMDKMGLF
ILASNDINLSPEDMLKYYKGQDNVEKGFRFLKSNTFSISKVYLKNKKRIEALTMIMVLCLMIYSIAEWKLRTKLEEENETIPDQKGKQTKRPTMRWIFFN
FQGITELIYQNEGQMKSEILNMEEIHWKILGLMGEKYENIYL
VIGETLDRIVKYGPTKLFTEIALHIMTRLPIPVHCLHADTTSVSVYGDYDDEETESIDITFGIPKNGRWDLKQFVLSLIVNQHGIPLFMNTHSGNSSDKN
TILEAIQSLKSVLRPESEVYYVADSSFYTDNNIKNMGKSFWISRVPATITEAKELLTANLNLKTLKSDERYSFYQTFVEYGGIKQKWVLLLSHKMKEKKE
QTLRTKLEKEVEKAEKSFKELKGEDFFCEEDALKAAEKWIQDFPSVSFEKVDVKSIKKRELGKRGRPSKDEQLKTYYRINGIIKVNDAFVLNEMDKMGLF
ILASNDINLSPEDMLKYYKGQDNVEKGFRFLKSNTFSISKVYLKNKKRIEALTMIMVLCLMIYSIAEWKLRTKLEEENETIPDQKGKQTKRPTMRWIFFN
FQGITELIYQNEGQMKSEILNMEEIHWKILGLMGEKYENIYL
Blast result :
Comments
ISMac5 is 54% aa similar to ISPlu4. There are 7 full copies and 2 partial copies in Methanosarcina acetivorans C2A genome.
References
1] Galagan,J.E., Nusbaum,C., Roy,A., Endrizzi,M.G., Macdonald,P., FitzHugh,W., Calvo,S., Engels,R., Smirnov,S., Atnoor,D., Brown,A., Allen,N., Naylor,J., Stange-Thomann,N., DeArellano,K., Johnson,R., Linton,L., McEwan,P., McKernan,K., Talamas,J., Tirrell,A., Ye,W., Zimmer,A., Barber,R.D., Cann,I., Graham,D.E., Grahame,D.A.,Guss,A., Hedderich,R., Ingram-Smith,C., Kuettner,C.H., Krzycki,J.A., Leigh,J.A., Li,W., Liu,J., Mukhopadhyay,B., Reeve,J.N., Smith,K., Springer,T.A., Umayam,L.A., White,O., White,R.H., de Macario,E.C., Ferry,J.G., Jarrell,K.F., Jing,H., Macario,A.J.L., Paulsen,I., Pritchett,M., Sowers,K.R., Swanson,R.V., Zinder,S.H., Lander,E., Metcalf,W.W. and Birren,B. (2002) Genome Res. 12 (4), 532-542