ISMac8
- Family IS66
- Group ISBst12
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_003552 | ND | Methanosarcina acetivorans | Methanosarcina acetivorans C2A |
DNA section
IS Length : 1603 bp
Ends
IR Length : 13/15
IRL : GTAACTATTCATCTATACCTTCGTAAAAGCAATTTTTCCTCGTTTCGAGG
IRR : GTAACTATTCAGCCACCGATAAAAAAGTATGTTTTTTGAGCAATGCCGTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCGTATCTGA | CATTTATT | TGTGTATCAT | 8 |
ACTAATTTAA | GGTTTTAT | AACAATAATG | 8 |
TAAGTTGAGA | AAATAATT | GGGAAAATAT | 8 |
DNA sequence
GTAACTATTCATCTATACCTTCGTAAAAGCAATTTTTCCTCGTTTCGAGGAAAAATTGGATATAAATTGAATACTATTTCATTTCGTTATCATGCTTACA
CGTGAAGAGATCCTTGAAATTTATGAAGCTGGTCCTGAAGCAGTTATTGCTGTAATCCAGAGACTTGAGTATATCATAGAGAAACAAGCTTCTCAGATTG
CTGAACTTGAAGAACGTGTAAGAATTTTAGAGGCTCGTTTGAATCAAAACAGCCAAAATAGCAGTAAACCTCCTTCTACTGATGTTTTTTGTAACGAAAA
ACCAAAACCTAAGAGTCTCCGTAAGAGTAGCGGCAAAAAAGCCGGTGGTCAAAAAGGTCATCCTGGAAAAACTCTTAAACTGGTTGAAAACCCTGATTAG
ATAAAATACCATTCGCCGGAATACTGCGATCATTGTGGTCATCATCTTGAAGATACTGAAGTTCAGGACTATGAACGTAGGCAAGAAGCTGAAATTCCTC
CTGCCCAAATCATATTTACTGAACATCGTTGTGAAATCAAGAAGTGTCCTCACTGCGGGAAAGTTAACAAAGGTTCTTTTCCAGAGTCTATAAAATTCCC
TATTCAATATGGTCCTCGTCTTTTAGCCTCAATTCTGTATTTGAGGAACTATCAATTTATTCCATATGAAAGGATTTGTGATTTGGTAGAGGACTTCTAC
GGCGTACGTATCAGCCCTGCTACCATAAAAAGGGCGGAAATAGAATGTTTTCAGAACCTGCAACCTTTTGAAGAAGCTGCTATGAAACATCTATTAGCGT
CTCATACTGCTCATTGTGATGAGACAGGGATGAGAGTTTTAGGGACGAAATGGTGGCTTCATGTTGTCTCAAACAATTTATGGACCTACTATTTCCCACA
TCCAAAAAGAGGGACAGAAGCAATGGATGCTCTGGGATTTCTTCTGCAATACAATGGGGTAGCAGTTCATGATGGATTTGCTTCATATAACAAGTATGAA
TGTGAACATGCTCTGTGCAACGCTCATCTTAAACGGGAACTTACCGGGATTGAAGAGAATTTTGAGCAGCAATGGGCTAAAGAGATAAATGAACTACTCA
GTGAGATGAAAAAGTATACTGATGAATGTAGGGAGATGGAAATTCCAATAGATCCAGAAAAAGTAAGGGAACTCGAGGGAATATACGATGCAATAATGCA
GGGAGGAATTGAAGAAAATCCACCGCCTGATCCCTTAAAAGAACAGGTGAAAAAGAGAGGAAGAAAAGCACAAACAAAAGCAAAGAATCTCCTTGATAGG
TTCATACTACACAAGGAACAGATTCTGCGATTCCTCAATAACCTGAGAGTTTCGTTTGACAACAATCAAGCGGAAAGAGATATCAGGATGATGAAGCTAC
AACAGAAAATATCAGGAACTTTCAGAAGTATAGAAGGAGCGGTAGCTTTCTGCAGAATTAGGGCTTACATATCATCAATTAAAAAGAATGAACTCAATGT
CATGGATGCTATTCTAGCGGCGCTCAATGGAGCGCCGCTATTAGCCTGAGAATTACGGCATTGCTCAAAAAACATACTTTTTTATCGGTGGCTGAATAGT
TAC
CGTGAAGAGATCCTTGAAATTTATGAAGCTGGTCCTGAAGCAGTTATTGCTGTAATCCAGAGACTTGAGTATATCATAGAGAAACAAGCTTCTCAGATTG
CTGAACTTGAAGAACGTGTAAGAATTTTAGAGGCTCGTTTGAATCAAAACAGCCAAAATAGCAGTAAACCTCCTTCTACTGATGTTTTTTGTAACGAAAA
ACCAAAACCTAAGAGTCTCCGTAAGAGTAGCGGCAAAAAAGCCGGTGGTCAAAAAGGTCATCCTGGAAAAACTCTTAAACTGGTTGAAAACCCTGATTAG
ATAAAATACCATTCGCCGGAATACTGCGATCATTGTGGTCATCATCTTGAAGATACTGAAGTTCAGGACTATGAACGTAGGCAAGAAGCTGAAATTCCTC
CTGCCCAAATCATATTTACTGAACATCGTTGTGAAATCAAGAAGTGTCCTCACTGCGGGAAAGTTAACAAAGGTTCTTTTCCAGAGTCTATAAAATTCCC
TATTCAATATGGTCCTCGTCTTTTAGCCTCAATTCTGTATTTGAGGAACTATCAATTTATTCCATATGAAAGGATTTGTGATTTGGTAGAGGACTTCTAC
GGCGTACGTATCAGCCCTGCTACCATAAAAAGGGCGGAAATAGAATGTTTTCAGAACCTGCAACCTTTTGAAGAAGCTGCTATGAAACATCTATTAGCGT
CTCATACTGCTCATTGTGATGAGACAGGGATGAGAGTTTTAGGGACGAAATGGTGGCTTCATGTTGTCTCAAACAATTTATGGACCTACTATTTCCCACA
TCCAAAAAGAGGGACAGAAGCAATGGATGCTCTGGGATTTCTTCTGCAATACAATGGGGTAGCAGTTCATGATGGATTTGCTTCATATAACAAGTATGAA
TGTGAACATGCTCTGTGCAACGCTCATCTTAAACGGGAACTTACCGGGATTGAAGAGAATTTTGAGCAGCAATGGGCTAAAGAGATAAATGAACTACTCA
GTGAGATGAAAAAGTATACTGATGAATGTAGGGAGATGGAAATTCCAATAGATCCAGAAAAAGTAAGGGAACTCGAGGGAATATACGATGCAATAATGCA
GGGAGGAATTGAAGAAAATCCACCGCCTGATCCCTTAAAAGAACAGGTGAAAAAGAGAGGAAGAAAAGCACAAACAAAAGCAAAGAATCTCCTTGATAGG
TTCATACTACACAAGGAACAGATTCTGCGATTCCTCAATAACCTGAGAGTTTCGTTTGACAACAATCAAGCGGAAAGAGATATCAGGATGATGAAGCTAC
AACAGAAAATATCAGGAACTTTCAGAAGTATAGAAGGAGCGGTAGCTTTCTGCAGAATTAGGGCTTACATATCATCAATTAAAAAGAATGAACTCAATGT
CATGGATGCTATTCTAGCGGCGCTCAATGGAGCGCCGCTATTAGCCTGAGAATTACGGCATTGCTCAAAAAACATACTTTTTTATCGGTGGCTGAATAGT
TAC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1458 bp | 485 aa | 92 | 1549 | + | No |
Chemistry : DDE
ORF sequence :
MLTREEILEIYEAGPEAVIAVIQRLEYIIEKQASQIAELEERVRILEARLNQNSQNSSKPPSTDVFCNEKPKPKSLRKSSGKKAGGQKGHPGKTLKLVEN
PDXIKYHSPEYCDHCGHHLEDTEVQDYERRQEAEIPPAQIIFTEHRCEIKKCPHCGKVNKGSFPESIKFPIQYGPRLLASILYLRNYQFIPYERICDLVE
DFYGVRISPATIKRAEIECFQNLQPFEEAAMKHLLASHTAHCDETGMRVLGTKWWLHVVSNNLWTYYFPHPKRGTEAMDALGFLLQYNGVAVHDGFASYN
KYECEHALCNAHLKRELTGIEENFEQQWAKEINELLSEMKKYTDECREMEIPIDPEKVRELEGIYDAIMQGGIEENPPPDPLKEQVKKRGRKAQTKAKNL
LDRFILHKEQILRFLNNLRVSFDNNQAERDIRMMKLQQKISGTFRSIEGAVAFCRIRAYISSIKKNELNVMDAILAALNGAPLLA
PDXIKYHSPEYCDHCGHHLEDTEVQDYERRQEAEIPPAQIIFTEHRCEIKKCPHCGKVNKGSFPESIKFPIQYGPRLLASILYLRNYQFIPYERICDLVE
DFYGVRISPATIKRAEIECFQNLQPFEEAAMKHLLASHTAHCDETGMRVLGTKWWLHVVSNNLWTYYFPHPKRGTEAMDALGFLLQYNGVAVHDGFASYN
KYECEHALCNAHLKRELTGIEENFEQQWAKEINELLSEMKKYTDECREMEIPIDPEKVRELEGIYDAIMQGGIEENPPPDPLKEQVKKRGRKAQTKAKNL
LDRFILHKEQILRFLNNLRVSFDNNQAERDIRMMKLQQKISGTFRSIEGAVAFCRIRAYISSIKKNELNVMDAILAALNGAPLLA
Blast result :
Comments
ISMac8 is 56% aa similar to ISBst12. There are 3 full copies and two partial copies in Methanosarcina acetivorans C2A genome.
References
1] Galagan,J.E., Nusbaum,C., Roy,A., Endrizzi,M.G., Macdonald,P., FitzHugh,W., Calvo,S., Engels,R., Smirnov,S., Atnoor,D., Brown,A., Allen,N., Naylor,J., Stange-Thomann,N., DeArellano,K., Johnson,R., Linton,L., McEwan,P., McKernan,K., Talamas,J., Tirrell,A., Ye,W., Zimmer,A., Barber,R.D., Cann,I., Graham,D.E., Grahame,D.A., Guss,A.M., Hedderich,R., Ingram-Smith,C., Kuettner,H.C., Krzycki,J.A., Leigh,J.A., Li,W., Liu,J., Mukhopadhyay,B., Reeve,J.N., Smith,K., Springer,T.A., Umayam,L.A., White,O., White,R.H., Conway de Macario,E., Ferry,J.G., Jarrell,K.F., Jing,H., Macario,A.J., Paulsen,I., Pritchett,M., Sowers,K.R., Swanson,R.V., Zinder,S.H., Lander,E., Metcalf,W.W. and Birren,B.(2002) Genome Res. 12 (4), 532-542