ISMca1
- Family IS5
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_002977 | ND | Methylococcus capsulatus | Methylococcus capsulatus Bath |
DNA section
IS Length : 1336 bp
Ends
IR Length : 15
IRL : CAGGCCGTTGAAAAACTCCCCCGCAGCCGCCCGTGCTGCCGGTACCATGT
IRR : CAGGCCGTTGAAAAAGTGTCCATCGAAGCGGCCATGTACCCGTTTGCGTG
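The IR length of 15 can be cross-checked against the two end sequences listed above. The following is a minimal sketch, assuming that IRR is written as the reverse complement of the right end (so both ends read from the outside in) and that the IR length counts the leading run of identical bases. The helper names `shared_prefix_length` and `reverse_complement` are illustrative and not part of the record.

```python
# End sequences copied from the "Ends" fields of this record.
IRL = "CAGGCCGTTGAAAAACTCCCCCGCAGCCGCCCGTGCTGCCGGTACCATGT"
IRR = "CAGGCCGTTGAAAAAGTGTCCATCGAAGCGGCCATGTACCCGTTTGCGTG"

# Final 36 bases of the element, copied from the last line of the DNA sequence.
RIGHT_END = "CATGGCCGCTTCGATGGACACTTTTTCAACGGCCTG"


def shared_prefix_length(a, b):
    """Count identical leading bases before the first mismatch."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def reverse_complement(seq):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq))


ir_len = shared_prefix_length(IRL, IRR)
print("Perfectly matching terminal bases:", ir_len)

# Assumption check: IRR should start with the reverse complement of the right end.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR[:len(RIGHT_END)]
print("IRR is reverse complement of right end:", irr_matches_right_end)
```

Under these assumptions, the two ends are identical for their first 15 bases and diverge at position 16, which agrees with the stated IR length.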
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCCGTGGAGT | CTAA | GGGGAGAGGA | 4 |
TGTTTCATCC | TTAA | GCAGCAGGGA | 4 |
ATGACGCTCC | TTAA | AAGTTGGTTT | 4 |
TTTGATAAAA | TTAA | AGAATTCTCT | 4 |
CGGCGATCGG | CTAA | GGATCGTTCG | 4 |
CACCGCCGAT | CTAA | AGTTGGCGGA | 4 |
GCACGAATTT | ATAA | ATGTAACTGT | 4 |
TGCGAGCCAT | CTAG | AGGCCAACTG | 4 |
ACTATCACGC | TTAA | GGAGAATGAC | 4 |
GCGTTCGGAT | ATAG | GTGCCCTTGG | 4 |
TTCTGTATGC | TTAG | GTCACTATGC | 4 |
GAAGCGGTGC | TTAA | GGTACATGAA | 4 |
ATTCTTTTCC | CTAA | ATCGCGGTAT | 4 |
ATTGATTTTC | TTAG | AGATTTCGAC | 4 |
AAGGCAAGCT | TTAA | AGTCCCGACC | 4 |
ATGGAACGGC | CTAA | GGACTTTACT | 4 |
GGGAGATCTT | CTAA | TGAGTATGAA | 4 |
TCCGTGACCT | TTAA | GTACTCGGCG | 4 |
ACTGCGGCAG | ATAA | ACCTCCCGCC | 4 |
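The direct repeats in the table above can be summarized position by position. This is a minimal sketch that tallies base usage across the 19 listed repeats; the tally only describes these sites and is not a statement about target preference beyond this record. The name `position_counts` is illustrative.

```python
from collections import Counter

# Direct repeats copied in order from the insertion site table.
DIRECT_REPEATS = [
    "CTAA", "TTAA", "TTAA", "TTAA", "CTAA", "CTAA", "ATAA", "CTAG",
    "TTAA", "ATAG", "TTAG", "TTAA", "CTAA", "TTAG", "TTAA", "CTAA",
    "CTAA", "TTAA", "ATAA",
]


def position_counts(seqs):
    """Return one Counter of bases per alignment column."""
    return [Counter(column) for column in zip(*seqs)]


counts = position_counts(DIRECT_REPEATS)
for position, tally in enumerate(counts, start=1):
    print(position, dict(tally))
```

In these 19 sites, positions 2 and 3 are always T and A, while positions 1 and 4 vary, so the repeats fit the pattern NTAR (N = A, C or T here; R = A or G).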
DNA sequence
CAGGCCGTTGAAAAACTCCCCCGCAGCCGCCCGTGCTGCCGGTACCATGTGAGCGGCGGCGACGACGAGACAGCAGACACCGAAGATGAGAGGCACACAG
AACTTCCAAGGGGCGATGTTCAGCTACATCAGCCTTGAAGAGCGGGTACCGGCCAGACACCCGCTGCGCAAGCTGCGCGCGCTGGTCGATGCCTTGCTGG
CCAGCATGAGCGCGGAATTCGAGGCGGTCTATGCCCGCCGTGGCCGCCCTTCGGTGCCGCCCGAAATGCTGCTCAAGGCGTTGCTGCTGCAAATCCTGTT
TTCCATCCGCAGCGAGCGGCTGCTGGTGGAGGCCATCGACTACAACCTGCTGTACCGCTGGTTCGTGGGCCTGAACCTGGAAGACAAGGTGTGGGACCAC
TCCACCTTCAGCGCCAACCGCCAGCGGCTGTTCAACGAAGACCTCGCCCGCGTGTTCTTTGAGCGGGTCAAATACACCGCGGACTGGGCGAAGTTGATCG
GTGACGAGCACTTCAGCGTCGACGGCACACTCATCGAGGCCTGGGCCTCGCAAAAGAGCTTCAAGCGCAAGGACGCAAGCGGCAGTGACGACGGCGCACC
GCCCCAGGGTCGCAACCCCGAGGTGGATTTCAAGGGCGAGACCCGTCGCAACGACACCCACGCCAGCACGACAGATGCCGATGCGCGGCTGTTCAAGAAA
GCTGCAGGCGACAAGTCCCGCCTGTGCCACATGGGTCACATCCTCATGGACAACCGACACGGGCTGGTGGTGGACGTCGAAATCACCCATGCCAGCGGCA
CGGCCGAGCGGCAGGCCGCACTCAAGATGCTCCAGCGCCAAAAGCGCAAAGCCGGCCGACTCACCGTGGGGGCGGACAAGGGCTATGACTGCCGTGCCTT
CGTGCAGGGCTGCCGCAAGCTGGGGATCACCCCGCACGTGGCGGCCAAAGCCAAGCACTCGGCCATTGACGGACGCACCCAGCGGCACGAAGGCTACAAG
GTGAGCCTGAGGGTGCGCAAACGCATCGAGGAGGCCTTCGGCTGGATCAAGACCGTGGGCGGTCTGGCCAAGACCAAGCTCATCGGGCATGCCAAGCTGG
CGGGGCAGGCGCTGATGTGCTTTGCCGCGTACAACCTCGTGCGCATGGGCTCCCTCGGTGGCTGGTGGGATGCGCATCATGCGTGATTGCGGGGGTCAGT
GCGCCCAAAATGGGCGAGCAGCCCCCAAAGGGGGAGCCCAAGCGGCTGTCGGAGCCGAGAAAAACGGCTTGCGCCGGCCTCGGACCCACGCAAACGGGTA
CATGGCCGCTTCGATGGACACTTTTTCAACGGCCTG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1101 | 366 | 86 | 1186 | + | No |
Chemistry : DDE
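The ORF 1 coordinates and lengths given above can be checked with simple arithmetic. This is a minimal sketch, assuming 1-based inclusive coordinates and a span that includes the stop codon; that assumption is consistent with the DNA sequence shown, which has ATG at positions 86 to 88 and TGA at positions 1184 to 1186. The function name `orf_lengths` is illustrative.

```python
# Values copied from this record.
ORF_BEGIN = 86
ORF_END = 1186
IS_LENGTH = 1336


def orf_lengths(begin, end):
    """Return (nucleotide length, amino acid length) for an ORF.

    Assumes 1-based inclusive coordinates and that the final codon
    in the span is the stop codon, which is not translated.
    """
    nt = end - begin + 1
    if nt % 3 != 0:
        raise ValueError("ORF span is not a multiple of three")
    return nt, nt // 3 - 1


nt_len, aa_len = orf_lengths(ORF_BEGIN, ORF_END)
print("ORF length (bp):", nt_len)
print("ORF length (aa):", aa_len)

# Untranslated bases on either side of the ORF within the element.
left_flank = ORF_BEGIN - 1
right_flank = IS_LENGTH - ORF_END
print("Bases before ORF:", left_flank, "Bases after ORF:", right_flank)
```

With the values in this record the sketch gives 1101 bp and 366 aa, matching the table, and leaves 85 bp upstream and 150 bp downstream of the ORF within the 1336 bp element.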
ORF sequence :
MRGTQNFQGAMFSYISLEERVPARHPLRKLRALVDALLASMSAEFEAVYARRGRPSVPPEMLLKALLLQILFSIRSERLLVEAIDYNLLYRWFVGLNLED
KVWDHSTFSANRQRLFNEDLARVFFERVKYTADWAKLIGDEHFSVDGTLIEAWASQKSFKRKDASGSDDGAPPQGRNPEVDFKGETRRNDTHASTTDADA
RLFKKAAGDKSRLCHMGHILMDNRHGLVVDVEITHASGTAERQAALKMLQRQKRKAGRLTVGADKGYDCRAFVQGCRKLGITPHVAAKAKHSAIDGRTQR
HEGYKVSLRVRKRIEEAFGWIKTVGGLAKTKLIGHAKLAGQALMCFAAYNLVRMGSLGGWWDAHHA
Comments
There are 20 copies of ISMca1 in this strain. One copy lacks the terminus of its right end sequence, which lies relatively close to a phage region.
ISMca1 is 89% similar to ISAav3 at the amino acid level.
References
[1] Ward,N., Larsen,O., Sakwa,J., Bruseth,L., Khouri,H., Durkin,A.S., Dimitrov,G., Jiang,L., Scanlan,D., Kang,K.H., Lewis,M., Nelson,K.E., Methe,B., Wu,M., Heidelberg,J.F., Paulsen,I.T., Fouts,D., Ravel,J., Tettelin,H., Ren,Q., Read,T., DeBoy,R.T., Seshadri,R., Salzberg,S.L., Jensen,H.B., Birkeland,N.K., Nelson,W.C., Dodson,R.J., Grindhaug,S.H., Holt,I., Eidhammer,I., Jonasen,I., Vanaken,S., Utterback,T., Feldblyum,T.V., Fraser,C.M., Lillehaug,J.R. and Eisen,J.A. (2004) PLoS Biol. 2(10):e303.