ISMac9
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_003552 | ND | Methanosarcina acetivorans | Methanosarcina acetivorans C2A |
DNA section
IS Length : 2199 bp
Ends
IR Length : 21/26
IRL : TGTCAACGGCGGTTAAGAATTCCCCATTTTCGGCGGATTAAAATTCCCCA
IRR : TGTCAACAGCGGTTTAATTTTCCCCACATTGGTCGGTTTAAATTTCCCCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATCTTTTAACTT | ATTG | AAACTACATCG | 4 |
DNA sequence
TGTCAACGGCGGTTAAGAATTCCCCATTTTCGGCGGATTAAAATTCCCCAATTCTCAATCCTTGAGAAAAGTTCGAGGATTGTGAATCATGCTGAAAAAG
GAGGATTTATTCTTGATTCGAGATTTAAGTTCACAAAACTTGAGCATTAGTGAAATCGCCAGACAAACCGGTTTTGACAGGAAAACTGTGAGGAAATATC
TCCAGCTGAAAACCTTACCTGAACCCCAGAAACGTCCCGGAAGAAAGAGCAAGCTTGATCCATATAAACCTTATATACTCAAAAAGCTTGAAGAAGGCTC
CTACACTACTGCTCGGCTCTATCGGGAAATCAAAGAAATGGGTTTTGATGGAGGAATGACCATCGTCAAGGACTTTGTAAGAGAAGTCCGACCTCAGCAG
GGAGTCCCTGCTGTATTCCGCTATGAAACAAAACCAGGTGTACAGGCTCAGGTTGACTGGGCAGAGATGGGAACAGTTGAGGTTGATGGAAAGATAAAGA
AACTCTTTTGCTTCAACATGATTCTTGGATATTCCAGGATGAAATATGTTGAATTTACACTGGGCATAGACACTTCCACTCTTATCCAGTGTCATCTGAA
CGCCTTTGAGTACTTTGGAGGATTTACACAGGAGATTCTCTATGATAACATGAAACAGGTTGTTATCAAAAGAGCCTTAAAATCATCAGATTCTGAATGG
AACTCACAGTTTGAGGATTTCTTCAAATGCTTTGGTTTTATTCCACGGTTATGCAGGCCTTACAGGCCTCAGACAAAAGGTAAAATTGAAAATACGGTAG
GCTATGTCAAGAGGGATTTCTTCCTTGGAAGACGATTTACCTCTCTCGAAGACCTGAACGCCCAAGTTCACAGTTGGTTGGAAAGGGTAAATTCAACTGT
CCACGGAACAACCTATCAAATCCCCCTTGAACGTTTTAAGGAGGAGAAACTGATCCCTCTGGATCAGGTTCCTCCTTACAAAGTTGTCCATAAGGAGACC
AGAAAGGTCTCCAGAGACTGTTATATTTCGTTCCTTGGAAATAAGTATTCTGTTCCTTACAGGTTTGCAGGAAGAACTGCAGAGCTTCAGATTTTTGAAG
GAATATTCGAGGTCTATGTTGATTATGAGAAGGTTTGTGAACATGAAATTCTTTCAGGTAATTGTAGGGTTTCCAGAAAAAAGGAACATTTTCAGGGCCT
CCTGAGTGAGATTCTTAAAGAGAATTCAAAATGCAAGAAAGAATTACAGATTCCGTTGAAGTTCTCAGGTCCTGAAGTTGAAAAGAGGTCTCTTGACATC
TATGAAACATTCAGTGACGGTGATTTTGAATGAACAATTTCAGCTATGAGAGACTTCACAGTAACCTGCAATACCTCAAACTGAATACTATTGAAGAGGT
TCTGGACAACTATCTTGAAATTGCTGCAAGAGATAGCAAGACAACAATGGAAGTACTTGATTATCTGTTTGAACAGGAAAAGAAGCACAGAGAAGCTGCT
GCAATTGAGAGAAGGATGAAAAGTGCAGCATTTCCTGTGAAAAAGACGCTTGATGAATTCGATTTTGAGTTTCAGTCATCTATTGATAAAAAAGTCATAG
AAGACCTTGCAACGTTGAGATTTGTTCATAACGTAGAAAACGTTGTTTTCCTTGGTCCTCCCGGAGTTGGAAAGTCTCATCTTGCAATCGCTCTTGGGAT
TGAAGTAGCAAAAGCAGGGATTTCGGTTTACTTTACCAATACAGGAAACCTTATCGAGAAGTTGAAAATAGCAAATCGAGAAGGAATGCTTGAAAAGAAA
CTCAAAGGCTTTATGAAATTTAAAGTTCTGATCATTGATGAAATGGGTTATCTCCCATTTGATGAGGAAGGAGCTCACTGTTTATTTCAGTTGATTTCCA
GACGTTATGAAAAGAGTTCGACCATCTTTACGTCAAATAAATCATATGGAGAATGGGGAGAGATATTCAAAGACCAGGTAATAGCGGCTGCTGTACTTGA
TAGAATTCTCCATCACTGTACTACAATTAACATCAGAGGAGAAAGTTACAGGCTGAAAGAAAGGAAGAAACATGGTATAAAATCAGGAAATATCTACCAG
TAATTTCTAACAAGTTTATGAAAAATTGAGTAAAATTTATATCAAAAGATGGGGAAATTTAAACCGACCAATGTGGGGAAAATTAAACCGCTGTTGACA
GAGGATTTATTCTTGATTCGAGATTTAAGTTCACAAAACTTGAGCATTAGTGAAATCGCCAGACAAACCGGTTTTGACAGGAAAACTGTGAGGAAATATC
TCCAGCTGAAAACCTTACCTGAACCCCAGAAACGTCCCGGAAGAAAGAGCAAGCTTGATCCATATAAACCTTATATACTCAAAAAGCTTGAAGAAGGCTC
CTACACTACTGCTCGGCTCTATCGGGAAATCAAAGAAATGGGTTTTGATGGAGGAATGACCATCGTCAAGGACTTTGTAAGAGAAGTCCGACCTCAGCAG
GGAGTCCCTGCTGTATTCCGCTATGAAACAAAACCAGGTGTACAGGCTCAGGTTGACTGGGCAGAGATGGGAACAGTTGAGGTTGATGGAAAGATAAAGA
AACTCTTTTGCTTCAACATGATTCTTGGATATTCCAGGATGAAATATGTTGAATTTACACTGGGCATAGACACTTCCACTCTTATCCAGTGTCATCTGAA
CGCCTTTGAGTACTTTGGAGGATTTACACAGGAGATTCTCTATGATAACATGAAACAGGTTGTTATCAAAAGAGCCTTAAAATCATCAGATTCTGAATGG
AACTCACAGTTTGAGGATTTCTTCAAATGCTTTGGTTTTATTCCACGGTTATGCAGGCCTTACAGGCCTCAGACAAAAGGTAAAATTGAAAATACGGTAG
GCTATGTCAAGAGGGATTTCTTCCTTGGAAGACGATTTACCTCTCTCGAAGACCTGAACGCCCAAGTTCACAGTTGGTTGGAAAGGGTAAATTCAACTGT
CCACGGAACAACCTATCAAATCCCCCTTGAACGTTTTAAGGAGGAGAAACTGATCCCTCTGGATCAGGTTCCTCCTTACAAAGTTGTCCATAAGGAGACC
AGAAAGGTCTCCAGAGACTGTTATATTTCGTTCCTTGGAAATAAGTATTCTGTTCCTTACAGGTTTGCAGGAAGAACTGCAGAGCTTCAGATTTTTGAAG
GAATATTCGAGGTCTATGTTGATTATGAGAAGGTTTGTGAACATGAAATTCTTTCAGGTAATTGTAGGGTTTCCAGAAAAAAGGAACATTTTCAGGGCCT
CCTGAGTGAGATTCTTAAAGAGAATTCAAAATGCAAGAAAGAATTACAGATTCCGTTGAAGTTCTCAGGTCCTGAAGTTGAAAAGAGGTCTCTTGACATC
TATGAAACATTCAGTGACGGTGATTTTGAATGAACAATTTCAGCTATGAGAGACTTCACAGTAACCTGCAATACCTCAAACTGAATACTATTGAAGAGGT
TCTGGACAACTATCTTGAAATTGCTGCAAGAGATAGCAAGACAACAATGGAAGTACTTGATTATCTGTTTGAACAGGAAAAGAAGCACAGAGAAGCTGCT
GCAATTGAGAGAAGGATGAAAAGTGCAGCATTTCCTGTGAAAAAGACGCTTGATGAATTCGATTTTGAGTTTCAGTCATCTATTGATAAAAAAGTCATAG
AAGACCTTGCAACGTTGAGATTTGTTCATAACGTAGAAAACGTTGTTTTCCTTGGTCCTCCCGGAGTTGGAAAGTCTCATCTTGCAATCGCTCTTGGGAT
TGAAGTAGCAAAAGCAGGGATTTCGGTTTACTTTACCAATACAGGAAACCTTATCGAGAAGTTGAAAATAGCAAATCGAGAAGGAATGCTTGAAAAGAAA
CTCAAAGGCTTTATGAAATTTAAAGTTCTGATCATTGATGAAATGGGTTATCTCCCATTTGATGAGGAAGGAGCTCACTGTTTATTTCAGTTGATTTCCA
GACGTTATGAAAAGAGTTCGACCATCTTTACGTCAAATAAATCATATGGAGAATGGGGAGAGATATTCAAAGACCAGGTAATAGCGGCTGCTGTACTTGA
TAGAATTCTCCATCACTGTACTACAATTAACATCAGAGGAGAAAGTTACAGGCTGAAAGAAAGGAAGAAACATGGTATAAAATCAGGAAATATCTACCAG
TAATTTCTAACAAGTTTATGAAAAATTGAGTAAAATTTATATCAAAAGATGGGGAAATTTAAACCGACCAATGTGGGGAAAATTAAACCGCTGTTGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1245 bp | 414 aa | 89 | 1333 | + | No |
Chemistry : DDE
ORF sequence :
MLKKEDLFLIRDLSSQNLSISEIARQTGFDRKTVRKYLQLKTLPEPQKRPGRKSKLDPYKPYILKKLEEGSYTTARLYREIKEMGFDGGMTIVKDFVREV
RPQQGVPAVFRYETKPGVQAQVDWAEMGTVEVDGKIKKLFCFNMILGYSRMKYVEFTLGIDTSTLIQCHLNAFEYFGGFTQEILYDNMKQVVIKRALKSS
DSEWNSQFEDFFKCFGFIPRLCRPYRPQTKGKIENTVGYVKRDFFLGRRFTSLEDLNAQVHSWLERVNSTVHGTTYQIPLERFKEEKLIPLDQVPPYKVV
HKETRKVSRDCYISFLGNKYSVPYRFAGRTAELQIFEGIFEVYVDYEKVCEHEILSGNCRVSRKKEHFQGLLSEILKENSKCKKELQIPLKFSGPEVEKR
SLDIYETFSDGDFE
RPQQGVPAVFRYETKPGVQAQVDWAEMGTVEVDGKIKKLFCFNMILGYSRMKYVEFTLGIDTSTLIQCHLNAFEYFGGFTQEILYDNMKQVVIKRALKSS
DSEWNSQFEDFFKCFGFIPRLCRPYRPQTKGKIENTVGYVKRDFFLGRRFTSLEDLNAQVHSWLERVNSTVHGTTYQIPLERFKEEKLIPLDQVPPYKVV
HKETRKVSRDCYISFLGNKYSVPYRFAGRTAELQIFEGIFEVYVDYEKVCEHEILSGNCRVSRKKEHFQGLLSEILKENSKCKKELQIPLKFSGPEVEKR
SLDIYETFSDGDFE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
774 bp | 257 aa | 1330 | 2103 | + | No |
AG : IS21 helper
ORF sequence :
MNNFSYERLHSNLQYLKLNTIEEVLDNYLEIAARDSKTTMEVLDYLFEQEKKHREAAAIERRMKSAAFPVKKTLDEFDFEFQSSIDKKVIEDLATLRFVH
NVENVVFLGPPGVGKSHLAIALGIEVAKAGISVYFTNTGNLIEKLKIANREGMLEKKLKGFMKFKVLIIDEMGYLPFDEEGAHCLFQLISRRYEKSSTIF
TSNKSYGEWGEIFKDQVIAAAVLDRILHHCTTINIRGESYRLKERKKHGIKSGNIYQ
NVENVVFLGPPGVGKSHLAIALGIEVAKAGISVYFTNTGNLIEKLKIANREGMLEKKLKGFMKFKVLIIDEMGYLPFDEEGAHCLFQLISRRYEKSSTIF
TSNKSYGEWGEIFKDQVIAAAVLDRILHHCTTINIRGESYRLKERKKHGIKSGNIYQ
Blast result :
Comments
ISMac9 is 93%(ORF1) and 95% (ORF2) aa similar to ISMac3. There are 7 full copies and one partial copy in Mathanosarcina acetivorans C2A genome.
References
1] Galagan,J.E., Nusbaum,C., Roy,A., Endrizzi,M.G., Macdonald,P., FitzHugh,W., Calvo,S., Engels,R., Smirnov,S., Atnoor,D., Brown,A., Allen,N., Naylor,J., Stange-Thomann,N., DeArellano,K., Johnson,R., Linton,L., McEwan,P., McKernan,K., Talamas,J., Tirrell,A., Ye,W., Zimmer,A., Barber,R.D., Cann,I., Graham,D.E., Grahame,D.A., Guss,A.M., Hedderich,R., Ingram-Smith,C., Kuettner,H.C., Krzycki,J.A., Leigh,J.A., Li,W., Liu,J., Mukhopadhyay,B., Reeve,J.N., Smith,K., Springer,T.A., Umayam,L.A., White,O., White,R.H., Conway de Macario,E., Ferry,J.G., Jarrell,K.F., Jing,H., Macario,A.J., Paulsen,I., Pritchett,M., Sowers,K.R., Swanson,R.V., Zinder,S.H., Lander,E., Metcalf,W.W. and Birren,B.(2002) Genome Res. 12 (4), 532-542