ISOcsp1
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
HQ144191 | ND | Ochrobactrum sp. | Lysobacter ruishenii CTN-1T Ochrobactrum sp. CTN-11 Pseudomonas sp. CTN-3 Pseudoxanthomonas sp. CTN-8 Bordetella sp. CTN-10 Shinella sp. CTN-13 Caulobacter sp. CTN-14 Rhizobium sp. CTN-15 |
DNA section
IS Length : 1955 bp
Ends
IR Length : 27/35
IRL : TGTTGCGCGCAGAGTAAAACTGGGCCACTTCGCGCAAGGTAAATCTGAAC
IRR : TGCCAAGCGCAGCCCAAATCTGAGCCACTTCGCGCGCCCGCGCGCAAAGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGCGTGCGCC | TGGG | CTCACGGATT | 4 |
DNA sequence
TGTTGCGCGCAGAGTAAAACTGGGCCACTTCGCGCAAGGTAAATCTGAACCACTTTTCCGTACAGTCTGCCTTTTCCGCGCGAAGGCCGACGATGCTTTT
GAAGGAACAGTGGATGCAGATCCATGTGCTCAAGGCCCAAGGGCTGTCGCTGCGCGAGATCGCGCGACGGCTGGGCGTCTCTCGTAATACCGTAACTCGC
TACTTGGCTGCGGACGATGTCCCCCGTTACAAGCAGCGTGAGGCTCGCCCGACCAAGCTCGACCCGTTTCACGATTACATCCGCGAGCGCATGCAAGCTG
CCTTTCCGGATTGGATCGCGGCGCCCGCGCTATTGCGCGAGCTAAGAGCGCGAGGCTATCAGGGACAGCTGCGCAGTCTCCAAGCCTTCATGCACGCGCA
TAAGCCTTTGCAAGCGGCTGATCCGATCGTGCGGTTCGAGACCGCGCCGGGTCATCAGATGCAATGCGACTTCGTCGTCTTTCGCCGTGGCGCTGACCCG
CTTTACGCTTTTACCGCGATTCTGGGGTTCAGTCGCTGGCGCTGGGCCAGATTCACCACGAACGAGCGGGCGGAAACGCTGATGACCTGCCATCACGCAC
TGTTCGAGACGTTGGGCGGGGTTCCGCGGGAGATCCTGTACGACAACGCCAAGACTATCGTCGCGTCGCGCGATGCTTACGCAGAGGGCCATCACCGCTG
GCATCCCGGCCTCCTGGACCTTGCCAAGCGTTATGGATTCATTCCCCGCCTCTGTCGTCCGTATCGTGCCCAAACCAAAGGCAAAGTCGAACGCTTCCAT
CGGTACCTGCGTGGCAACTTCTACGTGCCGTTGACCAGTTGGTTGAAGCAGTCGGGCCTGGTGCTGGACGTCGATACAGCTAACGCCGAAGTCGGCAAGT
GGCTGCGCGACGTGGCGAACCAGCGTGTGCATCCGGTAACCGGATACGCGCCAGCGGAGTTGTTCACTCAGCGAGAGCGCGCGTTCTTGCGTGAGCTGCC
AGCATTCTCCCAGCCCTCCCAGCTCACGAAGGTCACACAAGCTGCTCCGTCGCTGGACGCCACGCTGCAACACCCCCTGTCGGTCTATCAGCAACTTTTG
ACTGAGGTACGAGCGTGAACCTGCAACAGGAGCGAATTGATGCCCATTGCCAGACGCTCAAGCTCGAGGGCTTGATGCAGGCATACCGCGCAATGGCCAG
TGACGCGGTGAACAAGGACTGGAGCTTCATGGACTATCTGGAGCACGTGCTGGAACATGAGCGTGATACCCGCCAAGTCCGCTCACGCCAGACGCTGGTT
CGCATGGCCGGGTTCCCAGCGGTCAAGACGCTGGACGAGTACGACTACGGCTTTGCTGTGGGAGCGCCTCGGAAGTTGATCGACGAACTGGCGACGCTGC
GGTTCATCGAACGCAGTCAGAACGCCGTGTTGCTTGGTCCCTCCGGCGTCGGCAAGACTCATCTGGCGATTGCCATCGGATACGCCGCCACACAAGCCGG
GATCAAGACCAAGTTCATCACCGCCGCCGACCTCATGCTGCAACTCGAGGCCGCCAGGCGTCAGGAGCGATACGACGCCGTCCTGCGACACAACATCCTC
GGCCCACGCCTGCTAATCGTTGACGAGATCGGCTACTTACCCCTGTCCGGCGACCAAGCCAGCCACTTCTTCCAGATCGTCGCCAAGCGATATGAACGCG
GCTCCATGATCCTCACCAGCAATCTTCCCTTCACCCAATGGGATCAGACCTTCGGCGGCAATACCACGCTGACTGCGGCAATGTTGGACCGGATACTGCA
TCACGCGCACATCGTCCAAATCAAAGGAGATAGTTATCGGCTGAAACAGCAACGCAAGGCCGGCCACGTGCCCGCCACGAAGAACTAATCAACTGGTTCA
CATTTACTTTGCGCGCGGGCGCGCGAAGTGGCTCAGATTTGGGCTGCGCTTGGCA
GAAGGAACAGTGGATGCAGATCCATGTGCTCAAGGCCCAAGGGCTGTCGCTGCGCGAGATCGCGCGACGGCTGGGCGTCTCTCGTAATACCGTAACTCGC
TACTTGGCTGCGGACGATGTCCCCCGTTACAAGCAGCGTGAGGCTCGCCCGACCAAGCTCGACCCGTTTCACGATTACATCCGCGAGCGCATGCAAGCTG
CCTTTCCGGATTGGATCGCGGCGCCCGCGCTATTGCGCGAGCTAAGAGCGCGAGGCTATCAGGGACAGCTGCGCAGTCTCCAAGCCTTCATGCACGCGCA
TAAGCCTTTGCAAGCGGCTGATCCGATCGTGCGGTTCGAGACCGCGCCGGGTCATCAGATGCAATGCGACTTCGTCGTCTTTCGCCGTGGCGCTGACCCG
CTTTACGCTTTTACCGCGATTCTGGGGTTCAGTCGCTGGCGCTGGGCCAGATTCACCACGAACGAGCGGGCGGAAACGCTGATGACCTGCCATCACGCAC
TGTTCGAGACGTTGGGCGGGGTTCCGCGGGAGATCCTGTACGACAACGCCAAGACTATCGTCGCGTCGCGCGATGCTTACGCAGAGGGCCATCACCGCTG
GCATCCCGGCCTCCTGGACCTTGCCAAGCGTTATGGATTCATTCCCCGCCTCTGTCGTCCGTATCGTGCCCAAACCAAAGGCAAAGTCGAACGCTTCCAT
CGGTACCTGCGTGGCAACTTCTACGTGCCGTTGACCAGTTGGTTGAAGCAGTCGGGCCTGGTGCTGGACGTCGATACAGCTAACGCCGAAGTCGGCAAGT
GGCTGCGCGACGTGGCGAACCAGCGTGTGCATCCGGTAACCGGATACGCGCCAGCGGAGTTGTTCACTCAGCGAGAGCGCGCGTTCTTGCGTGAGCTGCC
AGCATTCTCCCAGCCCTCCCAGCTCACGAAGGTCACACAAGCTGCTCCGTCGCTGGACGCCACGCTGCAACACCCCCTGTCGGTCTATCAGCAACTTTTG
ACTGAGGTACGAGCGTGAACCTGCAACAGGAGCGAATTGATGCCCATTGCCAGACGCTCAAGCTCGAGGGCTTGATGCAGGCATACCGCGCAATGGCCAG
TGACGCGGTGAACAAGGACTGGAGCTTCATGGACTATCTGGAGCACGTGCTGGAACATGAGCGTGATACCCGCCAAGTCCGCTCACGCCAGACGCTGGTT
CGCATGGCCGGGTTCCCAGCGGTCAAGACGCTGGACGAGTACGACTACGGCTTTGCTGTGGGAGCGCCTCGGAAGTTGATCGACGAACTGGCGACGCTGC
GGTTCATCGAACGCAGTCAGAACGCCGTGTTGCTTGGTCCCTCCGGCGTCGGCAAGACTCATCTGGCGATTGCCATCGGATACGCCGCCACACAAGCCGG
GATCAAGACCAAGTTCATCACCGCCGCCGACCTCATGCTGCAACTCGAGGCCGCCAGGCGTCAGGAGCGATACGACGCCGTCCTGCGACACAACATCCTC
GGCCCACGCCTGCTAATCGTTGACGAGATCGGCTACTTACCCCTGTCCGGCGACCAAGCCAGCCACTTCTTCCAGATCGTCGCCAAGCGATATGAACGCG
GCTCCATGATCCTCACCAGCAATCTTCCCTTCACCCAATGGGATCAGACCTTCGGCGGCAATACCACGCTGACTGCGGCAATGTTGGACCGGATACTGCA
TCACGCGCACATCGTCCAAATCAAAGGAGATAGTTATCGGCTGAAACAGCAACGCAAGGCCGGCCACGTGCCCGCCACGAAGAACTAATCAACTGGTTCA
CATTTACTTTGCGCGCGGGCGCGCGAAGTGGCTCAGATTTGGGCTGCGCTTGGCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1026 bp | 341 aa | 93 | 1118 | + | No |
Chemistry : DDE
ORF sequence :
MLLKEQWMQIHVLKAQGLSLREIARRLGVSRNTVTRYLAADDVPRYKQREARPTKLDPFHDYIRERMQAAFPDWIAAPALLRELRARGYQGQLRSLQAFM
HAHKPLQAADPIVRFETAPGHQMQCDFVVFRRGADPLYAFTAILGFSRWRWARFTTNERAETLMTCHHALFETLGGVPREILYDNAKTIVASRDAYAEGH
HRWHPGLLDLAKRYGFIPRLCRPYRAQTKGKVERFHRYLRGNFYVPLTSWLKQSGLVLDVDTANAEVGKWLRDVANQRVHPVTGYAPAELFTQRERAFLR
ELPAFSQPSQLTKVTQAAPSLDATLQHPLSVYQQLLTEVRA
HAHKPLQAADPIVRFETAPGHQMQCDFVVFRRGADPLYAFTAILGFSRWRWARFTTNERAETLMTCHHALFETLGGVPREILYDNAKTIVASRDAYAEGH
HRWHPGLLDLAKRYGFIPRLCRPYRAQTKGKVERFHRYLRGNFYVPLTSWLKQSGLVLDVDTANAEVGKWLRDVANQRVHPVTGYAPAELFTQRERAFLR
ELPAFSQPSQLTKVTQAAPSLDATLQHPLSVYQQLLTEVRA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
774 bp | 257 aa | 1115 | 1888 | + | No |
AG : IS21 helper
ORF sequence :
MNLQQERIDAHCQTLKLEGLMQAYRAMASDAVNKDWSFMDYLEHVLEHERDTRQVRSRQTLVRMAGFPAVKTLDEYDYGFAVGAPRKLIDELATLRFIER
SQNAVLLGPSGVGKTHLAIAIGYAATQAGIKTKFITAADLMLQLEAARRQERYDAVLRHNILGPRLLIVDEIGYLPLSGDQASHFFQIVAKRYERGSMIL
TSNLPFTQWDQTFGGNTTLTAAMLDRILHHAHIVQIKGDSYRLKQQRKAGHVPATKN
SQNAVLLGPSGVGKTHLAIAIGYAATQAGIKTKFITAADLMLQLEAARRQERYDAVLRHNILGPRLLIVDEIGYLPLSGDQASHFFQIVAKRYERGSMIL
TSNLPFTQWDQTFGGNTTLTAAMLDRILHHAHIVQIKGDSYRLKQQRKAGHVPATKN
Blast result :
Comments
ISOcsp1 is 84%(orfA, the transposase) and 94%(orfB, helper of transposition) aa similar to ISBcen13.
References
1] Liang,B. and Jiang,J. (2010) Direct submission.