ISMac3
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_003552 | ND | Methanosarcina acetivorans | Methanosarcina acetivorans C2A |
DNA section
IS Length : 2199 bp
Ends
IR Length : 16
IRL : TGTCAAGGGCGGTTAAGAATCTTCCAATTACGGCGGTTTAAAAACTTCCA
IRR : TGTCAAGGGCGGTTAACTTTTTTCCATTTTGGGCGGTTTAGAATTTGCCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCATATTCAA | ACTT | ACTAATATCG | 4 |
DNA sequence
TGTCAAGGGCGGTTAAGAATCTTCCAATTACGGCGGTTTAAAAACTTCCAATTCTCAATCCTTGTGAAAAGTTCGAGGATTGTAGATAATGCTGAAAACG
GAGGAATGGCTATTGATACGAGATTTGTATTCACAAGGCTTCAGCATCAGTGAGATCTCTAGAAGAACAGGTTATGCTAGGGAAACTGTGAGGAAATATC
TTAAAAAGAAAACTGCCCCAGAACCTCAGAAACGTCCGCCAAAACCGAGTAAACTTGATCCTTTCAAACCTTACATACAAGAAAAACTCAAAGAAGGTCC
TTATACTGCTGTTCGCCTTTATAGGGAAATCAAAGAAATGGGTTTTGATGGAGGAAAAACCATAGTCAAGGACTTCGTAAGAGAAGTCCGACCTAAACAG
GGAGTCCCTGCTGTACTCCGCTATGAAACAAAACCAGGTGTACAGGCTCAGGTTGACTGGGCAGAGATGGGAACAGTTGAGGTTGATGGAAAGGTAAAGA
AACTCTTTTGCTTCAACATGATTCTTGGATATTCCAGAATGAGATATGTTGAATTTACACTGAGCATAGACACTCCCACTCTTATTCAGTGTCATCTGAA
CGCTTTTGAGTACTTTGGAGGATTTACACAGGAGATCCTCTATGATAACATGAAACAGGTTGTTATCAAAAGAGCCTTAAAATCATCAGATTCCGAATGG
AACCCACAGTTTGAGGAGTTCTTCAAATGCTTTGGTTTTATTCCACGGTTATGCAGGCCTTACAGGCCTCAGACAAAAGGGAAAATTGAAAATACAGTCG
GTTTCGTCAAGAGGGATTTCTTCCTTGGAAGAAGGTTTACCTCTCTCGAAGACCTGAACGCCCAAGTTCACAGGTGGTTGGAAAGGGTAAATTCAACTGT
CCACGGAACAACCTATCAAATCCCTCTTGAACGCTTTAAGGAGGAGAAACTGAGCCCTCTGGATCAGGTTCCTCCTTACAAAGTTGTCCATAAGGAGACC
AGAAAGGTCTCCAGAGACTGTTATATTTCGTTCCTTGGAAATAAGTATTCTGTTCCTTACAGGTTTGCAGGGAGAACTGCAGAGCTTCAGATCCTTGAAG
GAATATTCGAGGTCTATGTTGATTATGAGAAAGTCTGTGAACATGAAATCCTTCCTGGAAACTGCAGAGTTTCAAGGAAAAAGGAACATTTCCAGGGTCT
CCTGAGTGAGATTCTTAAAGAGAACTCAAAATGCAAAAAAGATTCACAGATCCCGTTGAAGTTCTCAGATCCCGAAGTTGAAAAAAGGTCTCTTGATGTC
TATGAAATATTTAGTGAAGGTGGTTTTGAATGAACAACTTCACCTATGAGAGACTTCACAATAACCTGCAATACCTGAAACTTAATTCTATCGAAGAGCT
TCTGGATAACTACCTTGAAATTGCTGCAAGGGACAACAAGACAACAATGGAAGTCCTTGATTACTTGTTTGAACAGGAAAAGAAACACAGAGAAGCTGTT
GCAATTGAGAGAAGGATGAAAAGTGCAGTTTTTCCCGTTAAAAAGACTCTTGAGGAATTCGATTTTGAATTTCAGAAATCCATTGATAAAAAAGCAATCG
AAGACCTTGCAACCTTGAGATTTGTTCATAATTCAGAGAATGTCGTTTTCCTTGGTCCTCCCGGAGTTGGAAAGTCTCATCTTGCAATCGCTCTTGGGAT
TGAAGTAGCAAAAGCAGGGATTTCGGTTTACTTTACCAATACAGGAAACCTTATCGAGAAGTTGAAAATAGCAAATCGAGAAGGAATGCTTGAAAAGAAA
CTAAGGGACTTGATGAAATATAAAGTGCTGATAATTGACGAAATAGGGTATCTCCCATTTGACGAAGAAGGAGCTCACTGCCTATTTCAGCTGATCTCAA
GACGGTATGAAAAGAGTTCAACGATCTTGACATCAAATAAATCATATGGAGAATGGGGAGAGATATTCAAGGACCATGTAATAGCGGCTGCTGTACTTGA
TAGGATTCTCCACCATTCAACTACGATTAACATCAAAGGGGAAAGTTACAGGCTGAAAGAAAGGAAGAAACAGGGAATAAAAACAGGAAATATATGCCAG
TAATTTCTAAAAGTTTATGAAAAATTGAGTAAAATTTATATTAAAAGGTTGGCAAATTCTAAACCGCCCAAAATGGAAAAAAGTTAACCGCCCTTGACA
GAGGAATGGCTATTGATACGAGATTTGTATTCACAAGGCTTCAGCATCAGTGAGATCTCTAGAAGAACAGGTTATGCTAGGGAAACTGTGAGGAAATATC
TTAAAAAGAAAACTGCCCCAGAACCTCAGAAACGTCCGCCAAAACCGAGTAAACTTGATCCTTTCAAACCTTACATACAAGAAAAACTCAAAGAAGGTCC
TTATACTGCTGTTCGCCTTTATAGGGAAATCAAAGAAATGGGTTTTGATGGAGGAAAAACCATAGTCAAGGACTTCGTAAGAGAAGTCCGACCTAAACAG
GGAGTCCCTGCTGTACTCCGCTATGAAACAAAACCAGGTGTACAGGCTCAGGTTGACTGGGCAGAGATGGGAACAGTTGAGGTTGATGGAAAGGTAAAGA
AACTCTTTTGCTTCAACATGATTCTTGGATATTCCAGAATGAGATATGTTGAATTTACACTGAGCATAGACACTCCCACTCTTATTCAGTGTCATCTGAA
CGCTTTTGAGTACTTTGGAGGATTTACACAGGAGATCCTCTATGATAACATGAAACAGGTTGTTATCAAAAGAGCCTTAAAATCATCAGATTCCGAATGG
AACCCACAGTTTGAGGAGTTCTTCAAATGCTTTGGTTTTATTCCACGGTTATGCAGGCCTTACAGGCCTCAGACAAAAGGGAAAATTGAAAATACAGTCG
GTTTCGTCAAGAGGGATTTCTTCCTTGGAAGAAGGTTTACCTCTCTCGAAGACCTGAACGCCCAAGTTCACAGGTGGTTGGAAAGGGTAAATTCAACTGT
CCACGGAACAACCTATCAAATCCCTCTTGAACGCTTTAAGGAGGAGAAACTGAGCCCTCTGGATCAGGTTCCTCCTTACAAAGTTGTCCATAAGGAGACC
AGAAAGGTCTCCAGAGACTGTTATATTTCGTTCCTTGGAAATAAGTATTCTGTTCCTTACAGGTTTGCAGGGAGAACTGCAGAGCTTCAGATCCTTGAAG
GAATATTCGAGGTCTATGTTGATTATGAGAAAGTCTGTGAACATGAAATCCTTCCTGGAAACTGCAGAGTTTCAAGGAAAAAGGAACATTTCCAGGGTCT
CCTGAGTGAGATTCTTAAAGAGAACTCAAAATGCAAAAAAGATTCACAGATCCCGTTGAAGTTCTCAGATCCCGAAGTTGAAAAAAGGTCTCTTGATGTC
TATGAAATATTTAGTGAAGGTGGTTTTGAATGAACAACTTCACCTATGAGAGACTTCACAATAACCTGCAATACCTGAAACTTAATTCTATCGAAGAGCT
TCTGGATAACTACCTTGAAATTGCTGCAAGGGACAACAAGACAACAATGGAAGTCCTTGATTACTTGTTTGAACAGGAAAAGAAACACAGAGAAGCTGTT
GCAATTGAGAGAAGGATGAAAAGTGCAGTTTTTCCCGTTAAAAAGACTCTTGAGGAATTCGATTTTGAATTTCAGAAATCCATTGATAAAAAAGCAATCG
AAGACCTTGCAACCTTGAGATTTGTTCATAATTCAGAGAATGTCGTTTTCCTTGGTCCTCCCGGAGTTGGAAAGTCTCATCTTGCAATCGCTCTTGGGAT
TGAAGTAGCAAAAGCAGGGATTTCGGTTTACTTTACCAATACAGGAAACCTTATCGAGAAGTTGAAAATAGCAAATCGAGAAGGAATGCTTGAAAAGAAA
CTAAGGGACTTGATGAAATATAAAGTGCTGATAATTGACGAAATAGGGTATCTCCCATTTGACGAAGAAGGAGCTCACTGCCTATTTCAGCTGATCTCAA
GACGGTATGAAAAGAGTTCAACGATCTTGACATCAAATAAATCATATGGAGAATGGGGAGAGATATTCAAGGACCATGTAATAGCGGCTGCTGTACTTGA
TAGGATTCTCCACCATTCAACTACGATTAACATCAAAGGGGAAAGTTACAGGCTGAAAGAAAGGAAGAAACAGGGAATAAAAACAGGAAATATATGCCAG
TAATTTCTAAAAGTTTATGAAAAATTGAGTAAAATTTATATTAAAAGGTTGGCAAATTCTAAACCGCCCAAAATGGAAAAAAGTTAACCGCCCTTGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1245 bp | 414 aa | 89 | 1333 | + | No |
Chemistry : DDE
ORF sequence :
MLKTEEWLLIRDLYSQGFSISEISRRTGYARETVRKYLKKKTAPEPQKRPPKPSKLDPFKPYIQEKLKEGPYTAVRLYREIKEMGFDGGKTIVKDFVREV
RPKQGVPAVLRYETKPGVQAQVDWAEMGTVEVDGKVKKLFCFNMILGYSRMRYVEFTLSIDTPTLIQCHLNAFEYFGGFTQEILYDNMKQVVIKRALKSS
DSEWNPQFEEFFKCFGFIPRLCRPYRPQTKGKIENTVGFVKRDFFLGRRFTSLEDLNAQVHRWLERVNSTVHGTTYQIPLERFKEEKLSPLDQVPPYKVV
HKETRKVSRDCYISFLGNKYSVPYRFAGRTAELQILEGIFEVYVDYEKVCEHEILPGNCRVSRKKEHFQGLLSEILKENSKCKKDSQIPLKFSDPEVEKR
SLDVYEIFSEGGFE
RPKQGVPAVLRYETKPGVQAQVDWAEMGTVEVDGKVKKLFCFNMILGYSRMRYVEFTLSIDTPTLIQCHLNAFEYFGGFTQEILYDNMKQVVIKRALKSS
DSEWNPQFEEFFKCFGFIPRLCRPYRPQTKGKIENTVGFVKRDFFLGRRFTSLEDLNAQVHRWLERVNSTVHGTTYQIPLERFKEEKLSPLDQVPPYKVV
HKETRKVSRDCYISFLGNKYSVPYRFAGRTAELQILEGIFEVYVDYEKVCEHEILPGNCRVSRKKEHFQGLLSEILKENSKCKKDSQIPLKFSDPEVEKR
SLDVYEIFSEGGFE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
774 bp | 257 aa | 1330 | 2103 | + | No |
AG : IS21 helper
ORF sequence :
MNNFTYERLHNNLQYLKLNSIEELLDNYLEIAARDNKTTMEVLDYLFEQEKKHREAVAIERRMKSAVFPVKKTLEEFDFEFQKSIDKKAIEDLATLRFVH
NSENVVFLGPPGVGKSHLAIALGIEVAKAGISVYFTNTGNLIEKLKIANREGMLEKKLRDLMKYKVLIIDEIGYLPFDEEGAHCLFQLISRRYEKSSTIL
TSNKSYGEWGEIFKDHVIAAAVLDRILHHSTTINIKGESYRLKERKKQGIKTGNICQ
NSENVVFLGPPGVGKSHLAIALGIEVAKAGISVYFTNTGNLIEKLKIANREGMLEKKLRDLMKYKVLIIDEIGYLPFDEEGAHCLFQLISRRYEKSSTIL
TSNKSYGEWGEIFKDHVIAAAVLDRILHHSTTINIKGESYRLKERKKQGIKTGNICQ
Blast result :
Comments
ISMac3 is 58%(ORF1) aa similar to IS5376 and 66% (ORF2) to IS21.
There are 11 full copies and one partial copy in Methanosarcina acetivorans C2A genome.
There are 11 full copies and one partial copy in Methanosarcina acetivorans C2A genome.
References
1] Galagan,J.E., Nusbaum,C., Roy,A., Endrizzi,M.G., Macdonald,P., FitzHugh,W., Calvo,S., Engels,R., Smirnov,S., Atnoor,D., Brown,A., Allen,N., Naylor,J., Stange-Thomann,N., DeArellano,K., Johnson,R., Linton,L., McEwan,P., McKernan,K., Talamas,J., Tirrell,A., Ye,W., Zimmer,A., Barber,R.D., Cann,I., Graham,D.E., Grahame,D.A., Guss,A., Hedderich,R., Ingram-Smith,C., Kuettner,C.H., Krzycki,J.A., Leigh,J.A., Li,W., Liu,J., Mukhopadhyay,B., Reeve,J.N., Smith,K., Springer,T.A., Umayam,L.A., White,O., White,R.H., de Macario,E.C., Ferry,J.G., Jarrell,K.F., Jing,H., Macario,A.J.L., Paulsen,I., Pritchett,M., Sowers,K.R., Swanson,R.V., Zinder,S.H., Lander,E., Metcalf,W.W. and Birren,B.(2002) Genome Res. 12 (4), 532-542