BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SMu1602
(291 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|24380128|ref|NP_722083.1| hypothetical protein SMU.1760c... 541 e-152
gi|15675456|ref|NP_269630.1| hypothetical protein SPy_1564 ... 349 1e-94
gi|71903954|ref|YP_280757.1| hypothetical cytosolic protein... 349 1e-94
gi|56808099|ref|ZP_00365890.1| COG3649: Uncharacterized pro... 346 8e-94
gi|94994791|ref|YP_602889.1| hypothetical cytosolic protein... 345 1e-93
gi|94988912|ref|YP_597013.1| hypothetical cytosolic protein... 344 3e-93
gi|15612902|ref|NP_241205.1| hypothetical protein BH0339 [B... 254 4e-66
gi|68055788|ref|ZP_00539929.1| CRISPR-associated protein TM... 246 1e-63
gi|124485669|ref|YP_001030285.1| hypothetical protein Mlab_... 244 3e-63
gi|56965356|ref|YP_177088.1| hypothetical protein ABC3594 [... 243 1e-62
gi|150389452|ref|YP_001319501.1| CRISPR-associated protein,... 224 6e-57
gi|56477287|ref|YP_158876.1| hypothetical protein ebA3286 [... 219 2e-55
gi|119026200|ref|YP_910045.1| CRISPR-associated protein TM1... 212 2e-53
gi|78188974|ref|YP_379312.1| CRISPR-associated TM1801 famil... 211 3e-53
gi|152978953|ref|YP_001344582.1| CRISPR-associated protein,... 210 8e-53
gi|117926796|ref|YP_867413.1| CRISPR-associated protein, Cs... 207 8e-52
gi|94269288|ref|ZP_01291422.1| CRISPR-associated protein TM... 206 1e-51
gi|85860586|ref|YP_462788.1| hypothetical cytosolic protein... 200 1e-49
gi|148658415|ref|YP_001278620.1| CRISPR-associated protein,... 196 1e-48
gi|110601407|ref|ZP_01389595.1| CRISPR-associated protein T... 196 1e-48
gi|89902663|ref|YP_525134.1| CRISPR-associated protein TM18... 193 9e-48
gi|21673958|ref|NP_662023.1| hypothetical protein CT1132 [C... 156 2e-36
gi|145220108|ref|YP_001130817.1| CRISPR-associated protein,... 152 3e-35
gi|78187158|ref|YP_375201.1| CRISPR-associated TM1801 famil... 150 8e-35
gi|85714553|ref|ZP_01045540.1| CRISPR-associated protein [N... 147 9e-34
gi|120586829|ref|YP_961174.1| CRISPR-associated protein, Cs... 145 2e-33
gi|46562130|ref|YP_009172.1| CRISPR-associated TM1801 famil... 144 9e-33
gi|83592167|ref|YP_425919.1| CRISPR-associated TM1801 famil... 142 2e-32
gi|78222285|ref|YP_384032.1| CRISPR-associated TM1801 famil... 142 3e-32
gi|108758563|ref|YP_635130.1| CRISPR-associated protein, Cs... 141 5e-32
gi|154506184|ref|ZP_02042922.1| hypothetical protein RUMGNA... 140 7e-32
gi|114566030|ref|YP_753184.1| hypothetical protein Swol_047... 139 3e-31
gi|75676129|ref|YP_318550.1| CRISPR-associated TM1801 famil... 137 9e-31
gi|83589359|ref|YP_429368.1| CRISPR-associated TM1801 famil... 136 1e-30
gi|150007823|ref|YP_001302566.1| uncharacterized protein pr... 135 2e-30
gi|146284060|ref|YP_001174213.1| CRISPR-associated protein,... 135 2e-30
gi|134298869|ref|YP_001112365.1| CRISPR-associated protein,... 135 3e-30
gi|153091351|gb|EDN73325.1| hypothetical protein MHA_0345 [... 135 3e-30
gi|83645584|ref|YP_434019.1| uncharacterized protein predic... 135 4e-30
gi|91774959|ref|YP_544715.1| CRISPR-associated protein TM18... 134 9e-30
gi|154495122|ref|ZP_02034127.1| hypothetical protein PARMER... 134 1e-29
gi|154495727|ref|ZP_02034423.1| hypothetical protein BACCAP... 132 2e-29
gi|67158962|ref|ZP_00419749.1| CRISPR-associated protein TM... 132 2e-29
gi|34496682|ref|NP_900897.1| hypothetical protein CV_1227 [... 131 4e-29
gi|118726119|ref|ZP_01574750.1| CRISPR-associated protein, ... 131 6e-29
gi|114331017|ref|YP_747239.1| CRISPR-associated protein, Cs... 130 1e-28
gi|94984344|ref|YP_603708.1| CRISPR-associated protein Csd2... 130 1e-28
gi|53804985|ref|YP_113167.1| CRISPR-associated TM1801 famil... 128 4e-28
gi|21244564|ref|NP_644146.1| hypothetical protein XAC3840 [... 125 2e-27
gi|58580492|ref|YP_199508.1| hypothetical protein XOO0869 [... 125 3e-27
gi|126664601|ref|ZP_01735585.1| CRISPR-associated protein, ... 124 7e-27
gi|52425041|ref|YP_088178.1| hypothetical protein MS0986 [M... 124 7e-27
gi|84703568|ref|ZP_01017396.1| CRISPR-associated protein [P... 120 1e-25
gi|68549523|ref|ZP_00588986.1| CRISPR-associated protein TM... 118 4e-25
gi|149126262|ref|ZP_01851159.1| CRISPR-associated protein, ... 114 7e-24
gi|45658744|ref|YP_002830.1| hypothetical protein LIC12914 ... 113 1e-23
gi|24213386|ref|NP_710867.1| hypothetical protein LA0686 [L... 112 2e-23
gi|86742030|ref|YP_482430.1| CRISPR-associated protein TM18... 105 3e-21
gi|119357241|ref|YP_911885.1| CRISPR-associated protein, Cs... 96 2e-18
gi|46255206|ref|YP_006118.1| hypothetical protein TT_P0135 ... 96 2e-18
gi|51891450|ref|YP_074141.1| hypothetical protein STH312 [S... 92 5e-17
gi|59801386|ref|YP_208098.1| hypothetical protein, putative... 91 7e-17
gi|147678326|ref|YP_001212541.1| hypothetical protein PTH_1... 83 2e-14
gi|109646593|ref|ZP_01370497.1| Uncharacterized protein pre... 80 1e-13
gi|149125882|ref|ZP_01850852.1| CRISPR-associated protein, ... 80 2e-13
gi|125975680|ref|YP_001039590.1| CRISPR-associated protein,... 77 2e-12
gi|154249620|ref|YP_001410445.1| CRISPR-associated protein,... 71 1e-10
gi|145622619|ref|ZP_01778576.1| CRISPR-associated protein, ... 69 4e-10
gi|78043120|ref|YP_360970.1| CRISPR-associated protein, Csh... 66 2e-09
gi|114844588|ref|ZP_01455032.1| CRISPR-associated protein T... 66 3e-09
gi|89895517|ref|YP_519004.1| hypothetical protein DSY2771 [... 66 3e-09
gi|150398985|ref|YP_001322752.1| CRISPR-associated protein,... 66 3e-09
gi|109645854|ref|ZP_01369774.1| CRISPR-associated protein, ... 65 6e-09
gi|114568025|ref|YP_755179.1| hypothetical protein Swol_252... 64 7e-09
gi|89895552|ref|YP_519039.1| hypothetical protein DSY2806 [... 64 8e-09
gi|153869181|ref|ZP_01998851.1| CRISPR-associated protein T... 64 2e-08
gi|109672064|ref|ZP_01374306.1| crispr-associated protein, ... 63 2e-08
gi|21226665|ref|NP_632587.1| hypothetical protein MM_0563 [... 62 5e-08
gi|146295132|ref|YP_001178903.1| CRISPR-associated protein,... 61 8e-08
gi|134045809|ref|YP_001097295.1| CRISPR-associated protein,... 60 1e-07
gi|116753951|ref|YP_843069.1| CRISPR-associated protein, Cs... 60 2e-07
gi|124004088|ref|ZP_01688935.1| crispr-associated protein, ... 60 2e-07
gi|108803121|ref|YP_643058.1| CRISPR-associated protein Csh... 59 4e-07
gi|89211072|ref|ZP_01189450.1| CRISPR-associated protein TM... 58 7e-07
gi|154175378|ref|YP_001408164.1| crispr-associated protein,... 57 1e-06
gi|84489232|ref|YP_447464.1| hypothetical protein Msp_0420 ... 57 2e-06
gi|124521530|ref|ZP_01696443.1| CRISPR-associated protein, ... 55 4e-06
>gi|24380128|ref|NP_722083.1| hypothetical protein SMU.1760c [Streptococcus mutans UA159]
gi|24378127|gb|AAN59389.1|AE015004_8 conserved hypothetical protein [Streptococcus mutans UA159]
Length = 291
Score = 541 bits (1393), Expect = e-152, Method: Composition-based stats.
Identities = 291/291 (100%), Positives = 291/291 (100%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF
Sbjct: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF
Sbjct: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVY 180
DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVY
Sbjct: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVY 180
Query: 181 VIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLG 240
VIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLG
Sbjct: 181 VIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLG 240
Query: 241 NVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
NVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL
Sbjct: 241 NVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
>gi|15675456|ref|NP_269630.1| hypothetical protein SPy_1564 [Streptococcus pyogenes M1 GAS]
gi|71911101|ref|YP_282651.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS5005]
gi|13622647|gb|AAK34351.1| conserved hypothetical protein [Streptococcus pyogenes M1 GAS]
gi|71853883|gb|AAZ51906.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS5005]
Length = 282
Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats.
Identities = 195/292 (66%), Positives = 228/292 (78%), Gaps = 11/292 (3%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 1 MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 60
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
VQA +R++D SL++R F K +E ++ NA W DVR+FGQVF +
Sbjct: 61 VQANERIEDDFRSLEKR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 111
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
S VRGPVSIS AKSLE +V S+QIT+STN E + + S TMGTKHFVDYGV
Sbjct: 112 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 170
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
YV+KGSIN FAEKTGFS DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 171 YVLKGSINAYFAEKTGFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 230
Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
GNVSSARVFDLLE+ + ++K +Y+ Y IHLNQE+LA+YEAKGL +EI+EGL
Sbjct: 231 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTLEILEGL 282
>gi|71903954|ref|YP_280757.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS6180]
gi|94990878|ref|YP_598978.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10270]
gi|71803049|gb|AAX72402.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS6180]
gi|94544386|gb|ABF34434.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10270]
Length = 287
Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats.
Identities = 196/292 (67%), Positives = 228/292 (78%), Gaps = 11/292 (3%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 6 MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 65
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
VQA +R++D SL++R F K +E ++ NA W DVR+FGQVF +
Sbjct: 66 VQANERIEDDFRSLERR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 116
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
S VRGPVSIS AKSLE +V S+QIT+STN E + + S TMGTKHFVDYGV
Sbjct: 117 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 175
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
YV+KGSIN FAEKTGFS DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 176 YVLKGSINAYFAEKTGFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 235
Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
GNVSSARVFDLLE+ + ++K +Y+ Y IHLNQE+LA+YEAKGL VEI+EGL
Sbjct: 236 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTVEILEGL 287
>gi|56808099|ref|ZP_00365890.1| COG3649: Uncharacterized protein predicted to be involved in DNA
repair [Streptococcus pyogenes M49 591]
Length = 282
Score = 346 bits (888), Expect = 8e-94, Method: Composition-based stats.
Identities = 195/292 (66%), Positives = 227/292 (77%), Gaps = 11/292 (3%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 1 MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 60
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
VQA +R++D SL++R F K +E ++ NA W DVR+FGQVF +
Sbjct: 61 VQANERIEDDFRSLEKR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 111
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
S VRGPVSIS AKSLE +V S+QIT+STN E + + S TMGTKHFVDYGV
Sbjct: 112 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 170
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
YV+KGSIN FAEKTGFS DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 171 YVLKGSINAYFAEKTGFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 230
Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
GNVSSARVFDLLE+ + ++K +Y+ Y IHLNQE+LA+YEAKGL VEI+E L
Sbjct: 231 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTVEILERL 282
>gi|94994791|ref|YP_602889.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10750]
gi|94548299|gb|ABF38345.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10750]
Length = 287
Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats.
Identities = 194/292 (66%), Positives = 227/292 (77%), Gaps = 11/292 (3%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 6 MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 65
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
VQA +R++D SL++R F K +E ++ NA W DVR+FGQVF +
Sbjct: 66 VQANERIEDDFRSLERR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 116
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
S VRGPVSIS AKSLE +V S+QIT+STN E + + S TMGTKHFVDYGV
Sbjct: 117 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 175
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
YV++GSIN FAEKTGFS DAE IKEVLVSL ENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 176 YVLEGSINAYFAEKTGFSQEDAEAIKEVLVSLCENDASSARPEGSMRVCEVFWFTHSSKL 235
Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
GNVSSARVFDLLE+ + ++K +Y+ Y IHLNQE+LA+YEAKGL VEI+EGL
Sbjct: 236 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTVEILEGL 287
>gi|94988912|ref|YP_597013.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS9429]
gi|94992804|ref|YP_600903.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS2096]
gi|94542420|gb|ABF32469.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS9429]
gi|94546312|gb|ABF36359.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS2096]
Length = 287
Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats.
Identities = 194/292 (66%), Positives = 228/292 (78%), Gaps = 11/292 (3%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 6 MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 65
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
VQA +R++D SL++R F K +E ++ NA W DVR+FGQVF +
Sbjct: 66 VQANERIEDDFRSLEKR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 116
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
S VRGPVSIS AKSLE +V S+QIT+STN E + + S TMGTKHFVDYGV
Sbjct: 117 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 175
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
YV+KGSIN FAEKT FS DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 176 YVLKGSINAYFAEKTVFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 235
Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
GNVSSARVFDLLE+++ ++K +Y+ Y IHLNQE+LA+YEAKGL +EI+EGL
Sbjct: 236 GNVSSARVFDLLEYNQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTLEILEGL 287
>gi|15612902|ref|NP_241205.1| hypothetical protein BH0339 [Bacillus halodurans C-125]
gi|10172952|dbj|BAB04058.1| BH0339 [Bacillus halodurans C-125]
Length = 283
Score = 254 bits (649), Expect = 4e-66, Method: Composition-based stats.
Identities = 150/292 (51%), Positives = 192/292 (65%), Gaps = 14/292 (4%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
+L+ KIDF V + V +AN NGDPL+ N PR + +G +SDV+IKRKIRNR+ DM +PIF
Sbjct: 3 ILDHKIDFAVILSVTKANPNGDPLNGNRPRQNYDGHGEISDVAIKRKIRNRLLDMEEPIF 62
Query: 61 VQARDRVDDCIYSLKQRLE-NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
VQ+ DR D SL+ R + N E +K K SV+ F K EW+DVRSFGQVFA
Sbjct: 63 VQSDDRKADSFKSLRDRADSNPELAKMLK---AKNASVDEFAKIACQEWMDVRSFGQVFA 119
Query: 120 FDGYS-AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYG 178
F G + + VRGPVSI A S++ + S QITKS NS K S TMG KH VD+G
Sbjct: 120 FKGSNLSVGVRGPVSIHTATSIDPIDIVSTQITKSVNSVTGDKRS--SDTMGMKHRVDFG 177
Query: 179 VYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNK 238
VYV KGSIN AEKTGF++ DAE IK L++LFEND+SSARP+GSM V +V+W+ HS+K
Sbjct: 178 VYVFKGSINTQLAEKTGFTNEDAEKIKRALITLFENDSSSARPDGSMEVHKVYWWEHSSK 237
Query: 239 LGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEG 290
LG SSA+V L+ + + S++DYA+ L YE GL VE+I+G
Sbjct: 238 LGQYSSAKVHRSLKIESKTDTPKSFDDYAVEL-------YELDGLGVEVIDG 282
>gi|68055788|ref|ZP_00539929.1| CRISPR-associated protein TM1801 [Exiguobacterium sibiricum 255-15]
gi|68007634|gb|EAM86884.1| CRISPR-associated protein TM1801 [Exiguobacterium sibiricum 255-15]
Length = 285
Score = 246 bits (629), Expect = 1e-63, Method: Composition-based stats.
Identities = 149/294 (50%), Positives = 184/294 (62%), Gaps = 16/294 (5%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L++KIDF V + V +AN NGDPL+ N PR + YG +SDV+IKRKIRNR+QDMG+ IFV
Sbjct: 4 LDRKIDFTVILSVTKANPNGDPLNGNRPRQNYDGYGEISDVAIKRKIRNRLQDMGESIFV 63
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
Q+ DR D SL+ R E DT+K+ K K + + + A W+DVR+FGQVFAF
Sbjct: 64 QSNDRNLDGYASLRDRAEAN---DTIKKLMKTKNASDQVAAEACATWMDVRAFGQVFAFK 120
Query: 122 GYSAANV----RGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDY 177
G A V RGPVSI A S+ + SMQITKS NSE S E S TMG KH VD+
Sbjct: 121 GDKAGGVSVGVRGPVSIHTATSVAPIDVTSMQITKSVNSE--SGKERGSDTMGMKHRVDH 178
Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSN 237
GVYV GSIN AEKT F+ DAE K LVSLFENDASSARPEGSM V +V+W+ H +
Sbjct: 179 GVYVFNGSINTQLAEKTNFTQEDAEKFKLALVSLFENDASSARPEGSMEVHKVYWWEHDS 238
Query: 238 KLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
KLG SSA+V + + S+ DY I ++ E G EI++GL
Sbjct: 239 KLGRFSSAKVHRAVSVQALTDEPKSFLDYEIKVDALE-------GFTPEILDGL 285
>gi|124485669|ref|YP_001030285.1| hypothetical protein Mlab_0847 [Methanocorpusculum labreanum Z]
gi|124363210|gb|ABN07018.1| CRISPR-associated protein, Csd2 family [Methanocorpusculum
labreanum Z]
Length = 307
Score = 244 bits (624), Expect = 3e-63, Method: Composition-based stats.
Identities = 150/314 (47%), Positives = 187/314 (59%), Gaps = 36/314 (11%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L KIDF V + + AN NGDPL+ N PRT+ G MSDV IKRK+RNR+QDMGQP+FV
Sbjct: 4 LNNKIDFAVVISAKNANPNGDPLNGNCPRTNFAGIGEMSDVCIKRKLRNRLQDMGQPVFV 63
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
Q+ DR D SLK R + E F T +KK + + K EW+DVRSFGQVFA+
Sbjct: 64 QSLDRRTDSFLSLKDRADGCEAFKTALADNKKDDAEKIACK----EWIDVRSFGQVFAYK 119
Query: 121 ----------------------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSEL 158
+G + +RGPVS+ A S++ V S QI KS N E
Sbjct: 120 GKKQNKKNKNEQKNDSESEDGEEGCVSIGIRGPVSVHPAFSVDAVEITSQQIVKSVNGET 179
Query: 159 NSKTELE--SSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDA 216
+ K + S TMG KH VD+G+YV GSIN AEKTGFSD DAE IKE L++LFENDA
Sbjct: 180 DKKNPYKRGSDTMGLKHRVDFGLYVFYGSINCQLAEKTGFSDEDAESIKEALMTLFENDA 239
Query: 217 SSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELA 276
SSARPEGSM V +V W+ H++K G SSA+V L +K K+ S +DYAI ++ E
Sbjct: 240 SSARPEGSMEVVQVAWWKHNSKSGQYSSAKVHRTLHVEKRKETPMSVDDYAIKIDDLE-- 297
Query: 277 EYEAKGLQVEIIEG 290
GL+ EI EG
Sbjct: 298 -----GLKPEIFEG 306
>gi|56965356|ref|YP_177088.1| hypothetical protein ABC3594 [Bacillus clausii KSM-K16]
gi|56911600|dbj|BAD66127.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
Length = 285
Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats.
Identities = 146/293 (49%), Positives = 195/293 (66%), Gaps = 16/293 (5%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L+ KIDF V + V +AN NGDPL+ N PR + +G +SDV+IKRKIRNR+QDMG+P+FV
Sbjct: 4 LDHKIDFAVILSVSKANPNGDPLNGNRPRQNYDGHGEISDVAIKRKIRNRLQDMGEPVFV 63
Query: 62 QARDRVDDCIYSLKQRLE-NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
Q+ DR D SL++R + N E +K K S ++F + EW+DVRSFGQVFAF
Sbjct: 64 QSDDRKVDSHKSLRERADSNPELAKMLK---AKNASSDDFAQIACEEWIDVRSFGQVFAF 120
Query: 121 DGYS-AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGV 179
G + + VRGPVSI A S++ + S QITKS NS K S TMG KH VD+GV
Sbjct: 121 KGSNLSVGVRGPVSIHTATSIDPIDIVSTQITKSVNSVTGDKRS--SDTMGMKHRVDFGV 178
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
YV KGSIN AEKTGF++ DAE IK+ LV+LFEND+SSARP+GSM V +V+W+ HS+KL
Sbjct: 179 YVFKGSINTQLAEKTGFTNEDAEKIKQALVTLFENDSSSARPDGSMEVHKVYWWEHSSKL 238
Query: 240 GNVSSARVFDLLEFDKEKQDKDSY--EDYAIHLNQEELAEYEAKGLQVEIIEG 290
G SSA+V L+ + + ++ E+Y++ L+ E GL VE+++G
Sbjct: 239 GQYSSAKVHRSLKVEAKTDSPKAFDEENYSVELSDLE-------GLSVEVLDG 284
>gi|150389452|ref|YP_001319501.1| CRISPR-associated protein, Csd2 family [Alkaliphilus
metalliredigens QYMF]
gi|149949314|gb|ABR47842.1| CRISPR-associated protein, Csd2 family [Alkaliphilus
metalliredigens QYMF]
Length = 286
Score = 224 bits (570), Expect = 6e-57, Method: Composition-based stats.
Identities = 136/296 (45%), Positives = 175/296 (59%), Gaps = 19/296 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
E KIDF V + V+ AN NGDPL+ N PR + +G +SDV IKRKIRNR QDM Q IFV
Sbjct: 4 FENKIDFAVVISVKNANPNGDPLNGNRPRENYDGFGEISDVCIKRKIRNRFQDMDQAIFV 63
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
Q+ +R D SLK R D +E K K E + K +W+DVRSFGQVFAF
Sbjct: 64 QSDERRTDGFRSLKDRA------DGCEELKKSSKDKEQYAKIACEKWIDVRSFGQVFAFK 117
Query: 121 ----DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELES-STMGTKHFV 175
D + +RGPVSI A S+ + SMQITKS N E + +S TMG KH V
Sbjct: 118 KGGDDNSVSIGIRGPVSIHSAVSISPIEISSMQITKSVNGETGKDPDKKSPDTMGMKHRV 177
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
++G YVI GSIN A KTGF+ D+E++K+ L++LFEND SSARP+GSM V +++W+ H
Sbjct: 178 EFGAYVIYGSINTQLATKTGFNQEDSELVKKALITLFENDCSSARPDGSMEVCKLYWWKH 237
Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
++K+G SSA+V L + DY I E L GL EI +G+
Sbjct: 238 NSKIGQYSSAKVHRTLRIAPTIEMPKDINDYNI--THETL-----DGLAPEIYDGI 286
>gi|56477287|ref|YP_158876.1| hypothetical protein ebA3286 [Azoarcus sp. EbN1]
gi|56313330|emb|CAI07975.1| conserved hypothetical protein [Azoarcus sp. EbN1]
Length = 280
Score = 219 bits (558), Expect = 2e-55, Method: Composition-based stats.
Identities = 130/256 (50%), Positives = 162/256 (63%), Gaps = 7/256 (2%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L+ KIDF V + V+ AN NGDPL+ N PRTD G ++DV +KRK+R+R+Q+ G IFV
Sbjct: 4 LQNKIDFAVVIRVKHANPNGDPLNGNRPRTDYGGLGEITDVCLKRKLRDRLQETGHAIFV 63
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
Q+ DR D SL+ R E+ E KE KK E K+ +W DVR+FGQVFAF
Sbjct: 64 QSDDRKIDGEPSLRTRAES-EKNGLGKEAFKKGAKREETAKKACEKWFDVRAFGQVFAFG 122
Query: 122 GYSAAN-----VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVD 176
AN +RGP++I A S E V S QITKS + E ++ S TMG KH VD
Sbjct: 123 KGDDANGVSIPIRGPLTIQSAFSKEPVSITSTQITKSVSGE-GDGSKRGSDTMGMKHRVD 181
Query: 177 YGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHS 236
G+Y GSINP AE+TGFSDADAE IK +L LFENDASSARP+GSM V +V W+ H+
Sbjct: 182 SGIYECFGSINPQLAERTGFSDADAETIKTILPKLFENDASSARPDGSMEVLKVVWWKHN 241
Query: 237 NKLGNVSSARVFDLLE 252
K G SSA+V LL+
Sbjct: 242 CKAGQYSSAKVHRLLK 257
>gi|119026200|ref|YP_910045.1| CRISPR-associated protein TM1801 [Bifidobacterium adolescentis ATCC
15703]
gi|154489041|ref|ZP_02029890.1| hypothetical protein BIFADO_02351 [Bifidobacterium adolescentis
L2-32]
gi|118765784|dbj|BAF39963.1| CRISPR-associated protein TM1801 [Bifidobacterium adolescentis ATCC
15703]
gi|154083178|gb|EDN82223.1| hypothetical protein BIFADO_02351 [Bifidobacterium adolescentis
L2-32]
Length = 281
Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats.
Identities = 135/295 (45%), Positives = 174/295 (58%), Gaps = 24/295 (8%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
LE KIDF + V AN NGDPL+ N PRT ++ G +SDV++KRKIRNR+QD G+ +FV
Sbjct: 4 LENKIDFAIAFAVNNANPNGDPLNGNRPRTTSEGLGEVSDVALKRKIRNRLQDAGESVFV 63
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
Q+ DR DD SL R + T+ + +K+K+V ++ WLDVRSFGQVFAF
Sbjct: 64 QSDDRSDDGAKSLSDRFNT--YLKTLPKDEQKQKNV--VFGKVCERWLDVRSFGQVFAFK 119
Query: 122 GYSAAN-----VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVD 176
+ VRGPVSI A S+ + +QITKS NSE + S TMG K+ V
Sbjct: 120 KAKDTDEVSIGVRGPVSIQPAFSINPIAIDDVQITKSVNSETTDSGKKSSDTMGMKYRVS 179
Query: 177 -YGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
VYV GSI+P AE+TGFS DAE IKE LV+LFEND SSARP GSM V +V WF H
Sbjct: 180 GRAVYVTYGSISPQLAERTGFSAEDAEKIKEALVTLFENDESSARPSGSMEVLDVVWFAH 239
Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEG 290
+ K G SSA+V + D D ++++ + GL+ E+IEG
Sbjct: 240 NCKGGQYSSAKVHRSVSVDA---------DGTVNVDDASIP-----GLRYEVIEG 280
>gi|78188974|ref|YP_379312.1| CRISPR-associated TM1801 family protein [Chlorobium chlorochromatii
CaD3]
gi|78171173|gb|ABB28269.1| CRISPR-associated protein TM1801 [Chlorobium chlorochromatii CaD3]
Length = 280
Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats.
Identities = 127/266 (47%), Positives = 162/266 (60%), Gaps = 19/266 (7%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQP--- 58
L QKIDF + + V AN NGDPL+ N PRTD +G M+DV +KRKIRNR+ ++
Sbjct: 4 LNQKIDFAIIMRVTNANPNGDPLNGNRPRTDLDGHGEMTDVCLKRKIRNRIMELKDKEQK 63
Query: 59 ----IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSF 114
IFVQ D D SLK R E+ E K K ++ K+ +W DVR+F
Sbjct: 64 YQFDIFVQPDDSKRDSHTSLKARFES--------EIGKNVKDKDDAAKKACKKWFDVRAF 115
Query: 115 GQVFAFDGYSAAN----VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG 170
GQ+FAFDG ++ VRGPVSI A S+E V S+QITKS + + S TMG
Sbjct: 116 GQLFAFDGEESSGLSIPVRGPVSIHSAFSVEPVNVSSIQITKSVSGNEGKNGKRSSDTMG 175
Query: 171 TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREV 230
KH VDYG+YV GS+NP AE+TGFSD DA++I E+L LFENDASSARP+GSM V V
Sbjct: 176 MKHRVDYGIYVTYGSMNPQLAERTGFSDEDAKVIMEILPKLFENDASSARPDGSMEVVSV 235
Query: 231 FWFTHSNKLGNVSSARVFDLLEFDKE 256
W+ H +K G SSA+V L +++
Sbjct: 236 IWWKHGSKAGKHSSAKVHKSLHVNED 261
>gi|152978953|ref|YP_001344582.1| CRISPR-associated protein, Csd2 family [Actinobacillus succinogenes
130Z]
gi|150840676|gb|ABR74647.1| CRISPR-associated protein, Csd2 family [Actinobacillus succinogenes
130Z]
Length = 279
Score = 210 bits (535), Expect = 8e-53, Method: Composition-based stats.
Identities = 127/260 (48%), Positives = 167/260 (64%), Gaps = 7/260 (2%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L +KIDF + ++V AN NGDPL+ N PRTD G M+DV +KRKIR+R+Q G+ IFV
Sbjct: 3 LTKKIDFALILKVTNANPNGDPLNGNRPRTDFAGIGEMTDVCLKRKIRDRLQSNGESIFV 62
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
Q+ ++ D + SL R ++K+ E KK + + K A+WLDVRSFGQVFAF
Sbjct: 63 QSDEKKTDGMTSLANRAKDKDV-GLGAEAFGKKANKDETAKAACAKWLDVRSFGQVFAFG 121
Query: 121 ---DGYSAA-NVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVD 176
DG + VRGPV+I A S+E V S QITKS + E T+ S TMG KH VD
Sbjct: 122 KSDDGAGVSIAVRGPVTIQSAFSVEPVNITSTQITKSVSGE-GDGTKRGSDTMGMKHRVD 180
Query: 177 YGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHS 236
G+YV G+++P AE+TGFSD DA+ IK VL LFE DASSARPEGSM+V ++ W+ H+
Sbjct: 181 SGIYVAFGAMSPQLAERTGFSDEDADKIKAVLTKLFEGDASSARPEGSMQVLKLIWWEHN 240
Query: 237 NKLGNVSSARVFDLLEFDKE 256
K G SSA+V L+ + +
Sbjct: 241 CKSGQYSSAKVHGSLKVNTD 260
>gi|117926796|ref|YP_867413.1| CRISPR-associated protein, Csd2 family [Magnetococcus sp. MC-1]
gi|117610552|gb|ABK46007.1| CRISPR-associated protein, Csd2 family [Magnetococcus sp. MC-1]
Length = 272
Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats.
Identities = 117/252 (46%), Positives = 159/252 (63%), Gaps = 16/252 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L KIDF V V+ AN NGDPL+ N PR + G +SDV++KRK+R+R+ + G IFV
Sbjct: 3 LSHKIDFAVIFAVKNANPNGDPLNGNRPRLTFDNLGEVSDVALKRKLRDRLLEGGHAIFV 62
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
Q+ DR +D SLK R + GSK + E ++ A+WLDVR+FGQ+FA+
Sbjct: 63 QSNDRNNDGATSLKDRSDKTL-------GSKL--TSEELAQKACAQWLDVRAFGQLFAWK 113
Query: 122 GYSAAN------VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
G A +RGPV+ A S+ V S+QITKS N+E +S+ + S TMG KH V
Sbjct: 114 GTKGAGDGVSVAIRGPVTFQSAFSIAPVDISSIQITKSVNTEGDSEKK-GSDTMGMKHRV 172
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
D+G+Y+ GS+NP A++T FSD DA++IK+ L LFEND S+ARP GSM VR+V W+ H
Sbjct: 173 DHGIYLFYGSMNPQLAQRTHFSDEDAQLIKQTLPRLFENDESTARPAGSMEVRKVLWWQH 232
Query: 236 SNKLGNVSSARV 247
+ G SSA+V
Sbjct: 233 NCAAGQYSSAKV 244
>gi|94269288|ref|ZP_01291422.1| CRISPR-associated protein TM1801 [delta proteobacterium MLMS-1]
gi|93451274|gb|EAT02162.1| CRISPR-associated protein TM1801 [delta proteobacterium MLMS-1]
Length = 287
Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats.
Identities = 131/276 (47%), Positives = 160/276 (57%), Gaps = 31/276 (11%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L KIDF V V AN NGDPL+ N PRT + G +SDV IKRKIRNR+ + G+ IFV
Sbjct: 3 LSNKIDFAVIFRVVNANPNGDPLNGNRPRTIYEGNGEVSDVCIKRKIRNRLMEAGKAIFV 62
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
Q+ D D SL+ R D+V G K + K+ A WLDVR+FGQ+FAF
Sbjct: 63 QSDDNKIDEHSSLRSRA------DSVLSGIKGSDEI---AKKACATWLDVRAFGQLFAFK 113
Query: 121 --------------------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNS 160
D + +RGPVS+ A S+ V S QITKS + E
Sbjct: 114 AAGGQKTKAKEGEAAAGAGDDKGVSIGIRGPVSVQSAFSITPVSVTSTQITKSVSGE-GD 172
Query: 161 KTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR 220
++ S TMG KH VD GVYV GS+NP AEKTGFSDADAE IK +L LFENDASSAR
Sbjct: 173 GSKRGSDTMGMKHRVDRGVYVFYGSMNPQLAEKTGFSDADAETIKAILPKLFENDASSAR 232
Query: 221 PEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKE 256
PEGSM V +VFW+ H +K G SSA+V L +++
Sbjct: 233 PEGSMAVEKVFWWRHDSKAGQYSSAKVHRSLSVNED 268
>gi|85860586|ref|YP_462788.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
gi|85723677|gb|ABC78620.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
Length = 291
Score = 200 bits (508), Expect = 1e-49, Method: Composition-based stats.
Identities = 129/277 (46%), Positives = 167/277 (60%), Gaps = 30/277 (10%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRM--------- 52
L +KIDF + + V+ AN NGDPL+ N PRTD + G ++DV +KRKIR+R+
Sbjct: 4 LSKKIDFAIIMSVKNANPNGDPLNGNRPRTDYEGLGEITDVCLKRKIRDRLVEQYVSLKN 63
Query: 53 --QDMGQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAE--- 107
+ GQ IFVQ+ DR D SL+ R E++ K G KK N K A+
Sbjct: 64 EEEKKGQAIFVQSDDRKIDGETSLRNRAESE------KNGLGKKAFGANAKKDETAKSAC 117
Query: 108 --WLDVRSFGQVFAF------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELN 159
W DVR+FGQVFAF DG S VRGPV+I A S + V S QITKS + E +
Sbjct: 118 EKWFDVRAFGQVFAFGKGNDADGVSIP-VRGPVTIQSAFSRDLVSISSTQITKSVSGEGD 176
Query: 160 SKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSA 219
K S TMG KH VD G+YV G++NP AE+TGFSD DA +IK +L +FENDASSA
Sbjct: 177 GKKR-SSDTMGMKHRVDRGIYVTYGTMNPQLAERTGFSDKDAAVIKAILPKIFENDASSA 235
Query: 220 RPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKE 256
RPEGSM V +V W+ H++K G+ SSA+V L+ + +
Sbjct: 236 RPEGSMEVVKVLWWQHNSKSGDYSSAKVHRSLKVNPD 272
>gi|148658415|ref|YP_001278620.1| CRISPR-associated protein, Csd2 family [Roseiflexus sp. RS-1]
gi|148570525|gb|ABQ92670.1| CRISPR-associated protein, Csd2 family [Roseiflexus sp. RS-1]
Length = 324
Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats.
Identities = 142/339 (41%), Positives = 182/339 (53%), Gaps = 70/339 (20%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMG----- 56
L +KIDF V + VR AN NGDPL+ N PRT + G +SDV+IKRKIRNR+ ++
Sbjct: 5 LSKKIDFAVILCVRNANPNGDPLNGNRPRTTYEGLGEISDVAIKRKIRNRVMELAKEIER 64
Query: 57 ---------------------QPIFVQARDRVDDCIYSLKQRLEN--KEFFDTVKEGSKK 93
PIFVQ+ D +D SL++R E K+F + + +
Sbjct: 65 IKEEERTEAQKQAYERLKGVEHPIFVQSDDNRNDEYTSLRERAEAYLKDFMN--DRSAYQ 122
Query: 94 KKSVENFVKQINAEWLDVRSFGQVFAF------------------DGYSAANVRGPVSIS 135
KK+ E W DVR+FGQVF F DG S +RGPVSI
Sbjct: 123 KKACET--------WFDVRAFGQVFPFKGKGKGKKGDKSNEGEESDGVSIG-IRGPVSIH 173
Query: 136 WAKSLEKVVTQ--SMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEK 193
A S+ V+ + S+QITKS ++E K S TMG KH VD+GVYV GSINP A K
Sbjct: 174 PAFSVVPVLDRVSSIQITKSVSNEPGEKRG--SDTMGMKHRVDHGVYVFYGSINPQLASK 231
Query: 194 TGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEF 253
TGFSD DA +IKE L +LF NDASSARPEGS+ V V+W+ H++ G SSA+V L
Sbjct: 232 TGFSDDDAAVIKEALRTLFRNDASSARPEGSIEVYRVYWWKHNSPNGQYSSAKVHRSLRV 291
Query: 254 DKEK--QDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEG 290
+D S +DY I L + GL+ E+IEG
Sbjct: 292 RVRDGIEDPKSIDDYVIELQ-------DLDGLKPEVIEG 323
>gi|110601407|ref|ZP_01389595.1| CRISPR-associated protein TM1801 [Geobacter sp. FRC-32]
gi|110547870|gb|EAT61108.1| CRISPR-associated protein TM1801 [Geobacter sp. FRC-32]
Length = 288
Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats.
Identities = 126/264 (47%), Positives = 151/264 (57%), Gaps = 28/264 (10%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
L KIDF V +V AN NGDPL+ N PRT + G +SDV IKRKIRNR+ + GQ IFV
Sbjct: 3 LSNKIDFAVVFKVTNANPNGDPLNGNRPRTIYEGNGEVSDVCIKRKIRNRLMEAGQRIFV 62
Query: 62 QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
Q+ D +D SLK R + E D +K +K K W DVR+FGQ+FAF
Sbjct: 63 QSDDSKNDKHPSLKSRAD--EVLDGIKAADEKAKKA-------CETWFDVRAFGQLFAFK 113
Query: 121 -----------------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTE 163
D + +RGPVS+ A S+ V S QITKS + E T+
Sbjct: 114 AAGGKKGKAKEGEEPGDDKGVSIGIRGPVSVQSAFSISPVSLTSTQITKSVSGE-GDGTK 172
Query: 164 LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEG 223
S TMG KH VD G+Y GS+NP A KTGFSDADA IK VL LFENDASSARPEG
Sbjct: 173 RGSDTMGMKHRVDRGIYTFYGSMNPQLAVKTGFSDADAAAIKAVLPRLFENDASSARPEG 232
Query: 224 SMRVREVFWFTHSNKLGNVSSARV 247
SM V +V W+ H+ G SSA+V
Sbjct: 233 SMEVLKVVWWQHNCASGQCSSAKV 256
>gi|89902663|ref|YP_525134.1| CRISPR-associated protein TM1801 [Rhodoferax ferrireducens T118]
gi|89347400|gb|ABD71603.1| CRISPR-associated protein TM1801 [Rhodoferax ferrireducens DSM
15236]
Length = 294
Score = 193 bits (491), Expect = 9e-48, Method: Composition-based stats.
Identities = 123/273 (45%), Positives = 154/273 (56%), Gaps = 37/273 (13%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRM--------- 52
L+ KIDF V + V+ AN NGDPL+ N PRTD ++G M+DVSIKRKIR+R+
Sbjct: 4 LQNKIDFAVILRVKNANPNGDPLNGNRPRTDYSNFGEMTDVSIKRKIRDRLLERWVAAGK 63
Query: 53 QDMGQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVR 112
D G IFVQ+ DR D SL+ R E K + ++WLDVR
Sbjct: 64 ADDGNMIFVQSDDRKADEYKSLRARAEAV---------LGKALGSDQTALLACSKWLDVR 114
Query: 113 SFGQVFAFDGYSAAN------------------VRGPVSISWAKSLEKVVTQSMQITKST 154
+FGQ+FA A +RGPV++ A S+E + S QITKS
Sbjct: 115 AFGQLFALKSNKKAGKKNDDGSDDEGDTGVSIGIRGPVTVQSAFSVEPIDITSTQITKSV 174
Query: 155 NSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFEN 214
+ E T+ S TMGTKH VD G+Y GS+NP AEKTGFSDADA+ +K VL LFEN
Sbjct: 175 SGE-GDGTKRSSDTMGTKHRVDQGIYRFFGSMNPQLAEKTGFSDADAQALKAVLPKLFEN 233
Query: 215 DASSARPEGSMRVREVFWFTHSNKLGNVSSARV 247
D SSARP GSM V +V W+ H+ K G SSA+V
Sbjct: 234 DESSARPAGSMEVVKVIWWQHNCKSGQYSSAKV 266
>gi|21673958|ref|NP_662023.1| hypothetical protein CT1132 [Chlorobium tepidum TLS]
gi|21647100|gb|AAM72365.1| CRISPR-associated protein, TM1801 family [Chlorobium tepidum TLS]
Length = 299
Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats.
Identities = 101/295 (34%), Positives = 180/295 (61%), Gaps = 26/295 (8%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
++++ DF+V +V++ N NGDP + N+PR DA+ GL++DV +KRK+RN +Q +GQ IF
Sbjct: 4 VDKRYDFVVLFDVQDGNPNGDPDAGNLPRIDAETGMGLVTDVCLKRKVRNYVQLLGQDIF 63
Query: 61 VQAR----DRVDDCIYSLKQRLENKEFFDTVKEGSKKKK-------SVENFVKQINAEWL 109
++ + +++D+ +L L N D+ K+GSK+ K V+ Q+ ++
Sbjct: 64 IKEKAILNNKIDEAYKALNIDL-NAAPADS-KDGSKRNKPGVAQGGEVDKGRVQMCTKYY 121
Query: 110 DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELES 166
D+R+FG V + G +A VRGP+ +++A+S+E VV IT+ +T +E ++ ++
Sbjct: 122 DIRAFGAVMS-TGANAGQVRGPIQMTFARSVEPVVALEHSITRMAVATEAEAEKQSG-DN 179
Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
TMG K+ V YG+Y G ++ N A +TGFS D ++ + L+++FE+D S+AR G M
Sbjct: 180 RTMGRKYTVPYGLYRAHGFVSANLASQTGFSAEDLDLFWDALLNMFEHDRSAAR--GLMS 237
Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAE 277
R ++ F HS+ LGN ++++F+ + K K+D + S++DY + +++ L E
Sbjct: 238 TRGLYVFEHSSALGNAPASQLFERITV-KRKEDSEGPARSFKDYEVLVDESNLGE 291
>gi|145220108|ref|YP_001130817.1| CRISPR-associated protein, Csd2 family [Prosthecochloris
vibrioformis DSM 265]
gi|145206272|gb|ABP37315.1| CRISPR-associated protein, Csd2 family [Prosthecochloris
vibrioformis DSM 265]
Length = 282
Score = 152 bits (384), Expect = 3e-35, Method: Composition-based stats.
Identities = 95/281 (33%), Positives = 167/281 (59%), Gaps = 19/281 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
++++ DF++ +V++ N NGDP + N+PR DA+ GL+SDV +KRK+RN +Q GQ IF
Sbjct: 4 IKKRYDFVLLFDVQDGNPNGDPDAGNLPRIDAETGMGLVSDVCLKRKVRNYVQLAGQEIF 63
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
++ + ++ I ++ E VK +K K+ E + +++ D+R+FG V +
Sbjct: 64 IKEKGVLNTLIAESHEQPE-------VKSKTKGDKT-EAARSWMCSKYYDIRTFGAVMS- 114
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTKHFVDY 177
G +A VRGPV I++A+S+E VV IT+ +T +E K + + TMG K+ + Y
Sbjct: 115 TGENAGQVRGPVQITFARSVEPVVALEHSITRMAVATEAEA-EKQDGGNRTMGRKYTIPY 173
Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSN 237
G+Y+ G ++ N A +TGFS+ D ++ + L+++FE+D S+AR G M R ++ F H++
Sbjct: 174 GLYLAHGFVSANLANQTGFSEKDLQLFWDALLNMFEHDRSAAR--GMMSTRGLYIFEHNS 231
Query: 238 KLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQEEL 275
LGN + +F+ + ++ S+ +Y+I +NQ L
Sbjct: 232 ALGNAPAHSLFERISVSRKNPASGPARSFAEYSIDINQANL 272
>gi|78187158|ref|YP_375201.1| CRISPR-associated TM1801 family protein [Pelodictyon luteolum DSM
273]
gi|78167060|gb|ABB24158.1| CRISPR-associated protein TM1801 [Pelodictyon luteolum DSM 273]
Length = 299
Score = 150 bits (379), Expect = 8e-35, Method: Composition-based stats.
Identities = 94/289 (32%), Positives = 172/289 (59%), Gaps = 18/289 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
L ++ DF + +V++ N NGDP + N+PR DA+ GL++DV +KRK+RN +Q G+ IF
Sbjct: 4 LTKRYDFALLFDVQDGNPNGDPDAGNLPRIDAETGMGLVTDVCLKRKVRNYVQLSGKDIF 63
Query: 61 VQARDRVDDCIYSL--KQRLENKEFFDTVKEGSKKKK-------SVENFVKQINAEWLDV 111
++ + ++ I + +Q+++ + +K+G K+ K VE + + + D+
Sbjct: 64 IKEKAVLNTLISNAYEEQKIDLTKDPVDLKDGKKRNKDGTAQGGEVEKGRSYMCSRYYDI 123
Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK-STNSELNS-KTELESSTM 169
R+FG V + G +A VRGP+ I++A+S+E VV IT+ + +E ++ K ++ TM
Sbjct: 124 RTFGAVMS-TGANAGQVRGPIQITFARSVEPVVALEHSITRMAVTTEADAEKQSGDNRTM 182
Query: 170 GTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVRE 229
G K+ V YG+Y G ++ + A +TGFS D ++ E L ++FE+D S+AR G M R
Sbjct: 183 GRKYTVPYGLYCSHGFVSAHLANQTGFSAEDLKLFWEALQNMFEHDRSAAR--GMMSTRG 240
Query: 230 VFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQEEL 275
++ F HS LGN + ++F+ ++ +++ + + S+EDY + +++ L
Sbjct: 241 LYVFEHSTALGNAPAHKLFERIKVERKPESEGPARSFEDYTVTIDESGL 289
>gi|85714553|ref|ZP_01045540.1| CRISPR-associated protein [Nitrobacter sp. Nb-311A]
gi|85698438|gb|EAQ36308.1| CRISPR-associated protein [Nitrobacter sp. Nb-311A]
Length = 317
Score = 147 bits (370), Expect = 9e-34, Method: Composition-based stats.
Identities = 108/320 (33%), Positives = 171/320 (53%), Gaps = 44/320 (13%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
L + DF++ +V N NGDP + N+PR D + ++GL+SDVS+KRK+RN R
Sbjct: 4 LANRYDFVLLFDVMRGNPNGDPDAGNLPRLDPETNHGLVSDVSLKRKVRNYIEFARNGVA 63
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQIN----AEWLDV 111
G I+VQ +++ K L + D V + +K E+ ++ + DV
Sbjct: 64 GFNIYVQEGAILNE--QHRKAYLAVRPGDDKVAKDTKLNPKSEDEAARLRDFMCRNFFDV 121
Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKT--------- 162
R+FG V + G +A VRGPV +++A+S+E +V Q + IT+ + KT
Sbjct: 122 RAFGAVMS-TGINAGQVRGPVQMTFAQSVEPIVPQEITITRMAATTPAEKTLRAEGQEEG 180
Query: 163 --ELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR 220
++ TMG K+ V YG+Y G ++ AE+TGFSDAD + ++E L S+FE+D S+AR
Sbjct: 181 NDRTDNRTMGRKYIVPYGLYRSHGFVSAKLAERTGFSDADLDALREALTSMFEHDRSAAR 240
Query: 221 PEGSMRVREVFWFTHSNKLGNVSSARVFDLL--------EFDKEKQDKDS------YEDY 266
G M +R+ F H+N LGN + +FD + EF K + D+ + DY
Sbjct: 241 --GEMAMRKAIAFKHANPLGNAPAHELFDRVKVGRGVDDEFRKIDRRLDNLPPAREFADY 298
Query: 267 AIHLNQEELAEYEAKGLQVE 286
AI +++ EL E G+++E
Sbjct: 299 AIEIDRNELPE----GVEIE 314
>gi|120586829|ref|YP_961174.1| CRISPR-associated protein, Csd2 family [Desulfovibrio vulgaris
subsp. vulgaris DP4]
gi|120564243|gb|ABM29986.1| CRISPR-associated protein, Csd2 family [Desulfovibrio vulgaris
subsp. vulgaris DP4]
Length = 290
Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats.
Identities = 96/283 (33%), Positives = 160/283 (56%), Gaps = 19/283 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQ--- 57
+ + +F++ +V N NGDP + NMPR D + +GL++DV +KRKIRN + +
Sbjct: 4 IANRYEFVLLFDVENGNPNGDPDAGNMPRIDPETGHGLVTDVCLKRKIRNHVALTKEGAE 63
Query: 58 --PIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
I++Q + +++ + K + + + KSV ++ + + D+R+FG
Sbjct: 64 RFNIYIQEKAILNETHERAYTACKLKPEPKKLPKKVEDAKSVTDW---MCTNFYDIRTFG 120
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
V + + VRGPV +++A+S+E VV Q + IT+ +T +E K + ++ TMG K
Sbjct: 121 AVMTTE-VNCGQVRGPVQMAFARSVEPVVPQEVSITRMAVTTKAEA-EKQQGDNRTMGRK 178
Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
H V YG+YV G I+ AEKTGFSD D + + LV++FE+D S+AR G M R++
Sbjct: 179 HIVPYGLYVAHGFISAPLAEKTGFSDEDLTLFWDALVNMFEHDRSAAR--GLMSSRKLIV 236
Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQ 272
F H NKLGN + ++FDL++ + + S+ DYA+ + Q
Sbjct: 237 FKHQNKLGNAPAHKLFDLVKVSRAEGSSGPARSFADYAVTVGQ 279
>gi|46562130|ref|YP_009172.1| CRISPR-associated TM1801 family protein [Desulfovibrio vulgaris
subsp. vulgaris str. Hildenborough]
gi|46447667|gb|AAS94333.1| CRISPR-associated protein, TM1801 family [Desulfovibrio vulgaris
subsp. vulgaris str. Hildenborough]
Length = 290
Score = 144 bits (362), Expect = 9e-33, Method: Composition-based stats.
Identities = 94/283 (33%), Positives = 159/283 (56%), Gaps = 19/283 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQ--- 57
+ + +F++ +V N NGDP + NMPR D + +GL++DV +KRKIRN + +
Sbjct: 4 IANRYEFVLLFDVENGNPNGDPDAGNMPRIDPETGHGLVTDVCLKRKIRNHVALTKEGAE 63
Query: 58 --PIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
I++Q + +++ + K + + + K V ++ + + D+R+FG
Sbjct: 64 RFNIYIQEKAILNETHERAYTACDLKPEPKKLPKKVEDAKRVTDW---MCTNFYDIRTFG 120
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
V + + VRGPV +++A+S+E VV Q + IT+ +T +E K + ++ TMG K
Sbjct: 121 AVMTTE-VNCGQVRGPVQMAFARSVEPVVPQEVSITRMAVTTKAEA-EKQQGDNRTMGRK 178
Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
H V YG+YV G I+ AEKTGFSD D + + LV++FE+D S+AR G M R++
Sbjct: 179 HIVPYGLYVAHGFISAPLAEKTGFSDEDLTLFWDALVNMFEHDRSAAR--GLMSSRKLIV 236
Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQ 272
F H N+LGN + ++FDL++ + + S+ DYA+ + Q
Sbjct: 237 FKHQNRLGNAPAHKLFDLVKVSRAEGSSGPARSFADYAVTVGQ 279
>gi|83592167|ref|YP_425919.1| CRISPR-associated TM1801 family protein [Rhodospirillum rubrum ATCC
11170]
gi|83575081|gb|ABC21632.1| CRISPR-associated protein TM1801 [Rhodospirillum rubrum ATCC 11170]
Length = 317
Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats.
Identities = 107/323 (33%), Positives = 163/323 (50%), Gaps = 42/323 (13%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
L Q+ DF+V +V N NGDP + N PR D + ++GL+SDV +KRKIRN + +D
Sbjct: 4 LTQRHDFVVLFDVTNGNPNGDPDAGNTPRLDPETNHGLVSDVCLKRKIRNYVELAKGEDS 63
Query: 56 GQPIFVQARDRVDDCIYS--LKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRS 113
G I+VQ ++D + R + ++ K + + + A + DVR+
Sbjct: 64 GFHIYVQEGAILNDQHRKAYVALRPDKEKAAKEAKLNPQNDDEAKALRAFMCANFFDVRT 123
Query: 114 FGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKT----------E 163
FG V + G + VRGPV S+A+S+E +V + IT+ + KT
Sbjct: 124 FGAVMS-TGINCGQVRGPVQFSFARSIEPIVPLEISITRMAATNEKEKTAQREGQEGDER 182
Query: 164 LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEG 223
E+ TMG KH + YG+Y G I+ AE+TGF + D +++ E + +FE+D S+AR G
Sbjct: 183 TENRTMGRKHIIPYGLYRAHGFISAKLAERTGFDEGDLDLLLEAVEQMFEHDRSAAR--G 240
Query: 224 SMRVREVFWFTHSNKLGNVSSARVFDLLE----FDKEKQDKD-----------SYEDYAI 268
M VR++ F H+N LGN + +FD + D E + D S+ DY +
Sbjct: 241 EMAVRKLIVFRHANALGNAPAHSLFDRVTVGRVIDGEVRAVDDPAIDNRPPARSFGDYRV 300
Query: 269 HLNQEELAEYEAKGLQVEIIEGL 291
+ +E L E VEIIE L
Sbjct: 301 TIGREGLPE------GVEIIERL 317
>gi|78222285|ref|YP_384032.1| CRISPR-associated TM1801 family protein [Geobacter metallireducens
GS-15]
gi|78193540|gb|ABB31307.1| CRISPR-associated protein TM1801 [Geobacter metallireducens GS-15]
Length = 284
Score = 142 bits (357), Expect = 3e-32, Method: Composition-based stats.
Identities = 92/282 (32%), Positives = 162/282 (57%), Gaps = 21/282 (7%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQ-----DM 55
++ + DF++ +V++ N NGDP + N+PR D + +GL++DV +KRK+RN +Q
Sbjct: 3 IQNRYDFVLFFDVKDGNPNGDPDAGNLPRIDPETGHGLVTDVCLKRKVRNYVQLDKELSQ 62
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
G IFV+ + +++ I ++ K KE +K ++ F + ++ DVR+FG
Sbjct: 63 GYDIFVKEKAILNNLIDEAHEQENVK-----AKEKGEKTEAARQF---MCGKYFDVRTFG 114
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
V + G +A VRGPV +++A+S++++V IT+ +T +E K + ++ TMG K
Sbjct: 115 AVMS-TGKNAGQVRGPVQLTFARSVDQIVPLEHSITRMAVATPAEA-EKQDGDNRTMGRK 172
Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
V Y +Y G I+ A +TGFS+ D E+ + LV++FE+D S+AR G M R++
Sbjct: 173 FTVPYALYRCHGFISAPLAAQTGFSEEDLELFWQSLVNMFEHDRSAAR--GQMSARKLIV 230
Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEE 274
F H +K+GN + +FDL+ + + S++ Y I + +E
Sbjct: 231 FKHDSKMGNAPAHHLFDLVSAEASETPVRSFDQYDICVPTQE 272
>gi|108758563|ref|YP_635130.1| CRISPR-associated protein, Csd3 family [Myxococcus xanthus DK 1622]
gi|108462443|gb|ABF87628.1| CRISPR-associated protein, Csd3 family [Myxococcus xanthus DK 1622]
Length = 304
Score = 141 bits (355), Expect = 5e-32, Method: Composition-based stats.
Identities = 101/298 (33%), Positives = 169/298 (56%), Gaps = 28/298 (9%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRN--RMQDMGQP 58
L+Q+ DF++ +V + N NGDP + N+PR DA+ +GL++DVS+KRK+RN + G+P
Sbjct: 4 LKQRHDFVLFFDVLDGNPNGDPDAGNLPRIDAETGHGLVTDVSLKRKVRNFVLLTQEGKP 63
Query: 59 ---IFVQAR----DRVDDCIYSLKQRLENKEFFDTVKEGSKKK-------KSVENFVKQI 104
IFV+ + +R+ D SL L K ++G K+ VE +
Sbjct: 64 GLDIFVKEKAILNNRIADGYKSLGIDLNEKP--ARAEDGKKRNDKGRAQGSEVEKGRAWM 121
Query: 105 NAEWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSK 161
+ DVR+FG V + G +A VRGPV +++A+S++ +V+Q IT+ +T E K
Sbjct: 122 CKTFFDVRTFGAVMS-TGPNAGQVRGPVQLTFARSVDPIVSQEHSITRMAVATEDEA-EK 179
Query: 162 TELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARP 221
++ TMG K+ V YG+Y G ++P+ A++TGF AD E++ + +FE D S+AR
Sbjct: 180 QGGDNRTMGRKNTVPYGLYRAHGFVSPHLAKQTGFGTADLELLFQSFTHMFELDRSAAR- 238
Query: 222 EGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD--SYEDYAIHLNQEELAE 277
G M +R+V F H ++LGN + +FD + + + +K S+ DY + +++ L +
Sbjct: 239 -GLMSMRKVVVFKHGSELGNAPAHALFDRVLAVRAQPEKPARSFSDYEVRVDKAGLPQ 295
>gi|154506184|ref|ZP_02042922.1| hypothetical protein RUMGNA_03726 [Ruminococcus gnavus ATCC 29149]
gi|153793683|gb|EDN76103.1| hypothetical protein RUMGNA_03726 [Ruminococcus gnavus ATCC 29149]
Length = 303
Score = 140 bits (354), Expect = 7e-32, Method: Composition-based stats.
Identities = 93/296 (31%), Positives = 176/296 (59%), Gaps = 23/296 (7%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM---- 55
+++ + +F+V +V N NGDP + NMPR D + GL++DV +KRKIRN ++ +
Sbjct: 4 VIKNRYEFVVLFDVENGNPNGDPDAGNMPRIDPESGLGLVTDVCLKRKIRNYIETVKEDA 63
Query: 56 -GQPIFVQ-----ARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAE-W 108
G I+++ R + C + + K+ +++K+ K ++V+ ++ + +
Sbjct: 64 EGYKIYIKDDVPLNRSDREACASVGVEETDEKKVTESLKKLKKNDENVDVKIRDYMCQNF 123
Query: 109 LDVRSFGQV---FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK-STNSELNSKTEL 164
D+R+FG V F + VRGPV + +A+S++ +V+Q + IT+ + +E ++ +
Sbjct: 124 FDIRTFGAVMTTFVKAALNCGQVRGPVQLGFARSIDPIVSQEVTITRVAITTEKDAANK- 182
Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEK-TGFSDADAEIIKEVLVSLFENDASSARPEG 223
S+ MG K V YG+Y ++G ++ N A K TGFS+ D E++ E ++++FE+D S+AR G
Sbjct: 183 -STEMGRKSVVPYGLYRVEGYVSANLARKVTGFSEEDLELLWEAIINMFEHDHSAAR--G 239
Query: 224 SMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDK--DSYEDYAIHLNQEELAE 277
+M VRE+ F HS +LG+ + ++FD +E K+++ + SY DY + +++E + +
Sbjct: 240 NMAVRELIVFKHSKELGDCPAYKLFDSVEVRKKEEVEYPRSYRDYIVEVHEENIPD 295
>gi|114566030|ref|YP_753184.1| hypothetical protein Swol_0478 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114336965|gb|ABI67813.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 288
Score = 139 bits (349), Expect = 3e-31, Method: Composition-based stats.
Identities = 90/284 (31%), Positives = 160/284 (56%), Gaps = 16/284 (5%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQ--- 57
++ + +F++ +V N NGDP + NMPR D + +GL+SDV IKRKIRN + + +
Sbjct: 5 IKNRYEFVLFFDVENGNPNGDPDADNMPRIDPETSFGLVSDVCIKRKIRNYVALLKENED 64
Query: 58 --PIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
I+VQ + +++ K+ E+ + K+ K E + + + D+R+FG
Sbjct: 65 DYQIYVQEKAVLNN---QHKKAYEHFKIKPESKKLPKDTAQAEAITQFMCKNFYDIRTFG 121
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
V + + VRGPV + +++SL+ +V Q + IT+ + N + + TMG KH V
Sbjct: 122 AVMTTE-VNCGQVRGPVQLGFSRSLDPIVPQEITITRMAVT--NERDLEKERTMGRKHIV 178
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
+Y +Y +G I+ A+KTGFS+ D E++ + L+++F++D S+AR G M R+++ F H
Sbjct: 179 NYALYRAEGFISAPLADKTGFSEEDLELLWDALINMFDHDRSAAR--GKMSSRKLYVFKH 236
Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKD--SYEDYAIHLNQEELAE 277
+KLGN ++ +FD + K +K ++ DY I + E+ +
Sbjct: 237 DSKLGNAPASILFDAITVKKINGNKPVRNFSDYEISVTGNEIPQ 280
>gi|75676129|ref|YP_318550.1| CRISPR-associated TM1801 family protein [Nitrobacter winogradskyi
Nb-255]
gi|74420999|gb|ABA05198.1| CRISPR-associated protein TM1801 [Nitrobacter winogradskyi Nb-255]
Length = 316
Score = 137 bits (344), Expect = 9e-31, Method: Composition-based stats.
Identities = 104/325 (32%), Positives = 171/325 (52%), Gaps = 49/325 (15%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQD 54
ML + DF++ +V + N NGDP + N+PR D + ++GL+SDVS+KRK+RN R
Sbjct: 3 MLTNRYDFVLLFDVTKGNPNGDPDAGNLPRLDPETNHGLVSDVSLKRKVRNYVDLVRSGT 62
Query: 55 MGQPIFVQARDRVDDCIYSLKQRLE------NKEFFDTVKEGSKKKKSVENFVKQINAEW 108
G I+V+ ++D + L +KE ++ + KK E K +
Sbjct: 63 DGHHIYVEEAAILNDKHRQAYKALRPDDPKVDKEAKLNPRDDVEAKKLREFMCKN----F 118
Query: 109 LDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE-- 163
DVR+FG V + G +A VRGPV +++A S+E +V Q + IT+ + +E + E
Sbjct: 119 FDVRTFGAVMS-TGINAGQVRGPVQMTFANSVEPIVPQEISITRMAATNEAEKKQRAEGG 177
Query: 164 ------LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDAS 217
+++ TMG K+ V YG+Y G ++ AE+TGFS+AD E+ E L S+FE+D S
Sbjct: 178 EEGNDRVDNRTMGRKYIVPYGLYRAHGFVSAKLAERTGFSEADLELTFEALTSMFEHDRS 237
Query: 218 SARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDK---------EKQDK----DSYE 264
+AR G M R++ F H N LG+ + +F+ + + ++ D ++
Sbjct: 238 AAR--GEMTTRKLVVFKHGNALGSAPAHALFERVRIGRNIDGQFRRIDRSDNYPPARAFS 295
Query: 265 DYAIHLNQEELAEYEAKGLQVEIIE 289
DYA+ ++++ + VEIIE
Sbjct: 296 DYAVEIDRDNPPD------GVEIIE 314
>gi|83589359|ref|YP_429368.1| CRISPR-associated TM1801 family protein [Moorella thermoacetica
ATCC 39073]
gi|83572273|gb|ABC18825.1| CRISPR-associated protein TM1801 [Moorella thermoacetica ATCC
39073]
Length = 297
Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats.
Identities = 90/291 (30%), Positives = 155/291 (53%), Gaps = 30/291 (10%)
Query: 3 EQKIDFMVTVEVREANANGDPLSVNMPRTDAKDY-GLMSDVSIKRKIRN-----RMQDMG 56
E + DF++ +VR+ N NGDP + N+PR D + GL++DV +KRKIR+ R +
Sbjct: 8 EVRHDFVLLFDVRDGNPNGDPDAGNLPRLDPETMQGLVTDVCLKRKIRDWVDMTRGSEAN 67
Query: 57 QPIFVQARDRVDDCIYSLKQRLENKEFFDTVKE---GSKKKKSVENFVKQ-INAEWLDVR 112
I+VQ ++ +++ +D + E GSK+ + + + +Q + + D+R
Sbjct: 68 MKIYVQHHGILN---------AQHQRAYDAIGEKSTGSKQNREIVDKARQWMCQNFYDIR 118
Query: 113 SFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELES------ 166
FG V G + VRGP+ +++A+S++ +V + IT+ + + E
Sbjct: 119 MFGAVMT-TGVNCGQVRGPMQLTFARSIDPIVPLDISITRVAITRVEDAATSEQGEGGKV 177
Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
+ MG K V YG+Y+ G NP+FA TG S AD EI E L +++ D S++R G M
Sbjct: 178 TEMGRKTLVPYGLYLGYGFFNPHFAADTGVSAADLEIFWEALQRMWDVDRSASR--GMMA 235
Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDK--DSYEDYAIHLNQEEL 275
R ++ F+H++ LGN + +F L+ + K S+ DY + +N+E+L
Sbjct: 236 CRGLYIFSHASALGNAPADNLFKLITVKRRDGVKAARSFADYQVTINEEDL 286
>gi|150007823|ref|YP_001302566.1| uncharacterized protein predicted to be involved in DNA repair
[Parabacteroides distasonis ATCC 8503]
gi|149936247|gb|ABR42944.1| uncharacterized protein predicted to be involved in DNA repair
[Parabacteroides distasonis ATCC 8503]
Length = 283
Score = 135 bits (341), Expect = 2e-30, Method: Composition-based stats.
Identities = 90/284 (31%), Positives = 160/284 (56%), Gaps = 24/284 (8%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQ-----DM 55
++ +IDF+ +V++ N NGDP + N+PR DA+ GL++DV +KRK+RN +Q +
Sbjct: 4 IKNRIDFVYIFDVQDGNPNGDPDAGNLPRVDAETGMGLVTDVCLKRKVRNYVQVAKGLED 63
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
G IF++ + ++ I E VK K + F+ + + DVR+FG
Sbjct: 64 GYDIFIKEKAVLNTLIDKAHDDSE-------VKNAKDKTDAARRFMCK---NYFDVRTFG 113
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
V + G +A VRGP+ ++A+S++ + IT+ +T++E ++ ++ TMG K
Sbjct: 114 AVMS-TGKNAGQVRGPIQFTFARSVDPIAAAEHSITRMAVATDAEAKKQSG-DNRTMGRK 171
Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
V YG+Y+ G I+ N A++TGFS+ D + E L ++F+ D S+AR G M +++
Sbjct: 172 ATVPYGLYICHGFISANLAQQTGFSEEDLALFWEALKNMFDMDRSAAR--GLMSAQKLIV 229
Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKD-SYEDYAIHLNQEEL 275
F H + LGN + ++FDL++ +K S+ DY + +++E +
Sbjct: 230 FKHDSVLGNAPANKLFDLVKVEKVCDGAPRSFGDYHVTIDKEHV 273
>gi|146284060|ref|YP_001174213.1| CRISPR-associated protein, TM1801 family [Pseudomonas stutzeri
A1501]
gi|145572265|gb|ABP81371.1| CRISPR-associated protein, TM1801 family [Pseudomonas stutzeri
A1501]
Length = 289
Score = 135 bits (341), Expect = 2e-30, Method: Composition-based stats.
Identities = 87/287 (30%), Positives = 155/287 (54%), Gaps = 18/287 (6%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN--RMQDMGQ 57
++ + +F+ +V N NGDP + N+PR D + + GL++DV +KRK+RN ++ G
Sbjct: 3 VIANRYEFVYLFDVTNGNPNGDPDAGNLPRLDPETNQGLVTDVCLKRKLRNYVALEQEGA 62
Query: 58 P---IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSF 114
P I++Q + +++ KQ E K+ K + + + DVR+F
Sbjct: 63 PGYAIYMQEKSVLNN---QHKQAYEALGIESEAKKLPKDEAKARELTGWMCKNFFDVRAF 119
Query: 115 GQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHF 174
G V + +A VRGP+ +++A S++ V+ + IT+ + N K + TMG KH
Sbjct: 120 GAVMTTE-VNAGQVRGPIQLAFATSIDPVLPMEISITRMAVT--NEKDLEKERTMGRKHI 176
Query: 175 VDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFT 234
V YG+Y G ++ AE+TGFS+ D ++ L+++FE+D S+AR G M R++ F
Sbjct: 177 VPYGLYRAHGFVSAKLAERTGFSEEDLGLLWRALINMFEHDRSAAR--GEMAARKLIVFK 234
Query: 235 HSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAE 277
H + +GN + +FD ++ ++ + + S++DY I +N E L +
Sbjct: 235 HEHPMGNAPAHVLFDSVQIERVEGEAHTPARSFKDYQISVNAEALPQ 281
>gi|134298869|ref|YP_001112365.1| CRISPR-associated protein, Csd2 family [Desulfotomaculum reducens
MI-1]
gi|134051569|gb|ABO49540.1| CRISPR-associated protein, Csd2 family [Desulfotomaculum reducens
MI-1]
Length = 288
Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats.
Identities = 90/286 (31%), Positives = 158/286 (55%), Gaps = 23/286 (8%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM----- 55
++ + +F++ +V N NGDP + NMPR DA+ GL++DV +KRKIRN + +
Sbjct: 5 IKNRYEFVLFFDVENGNPNGDPDAGNMPRIDAETGLGLVTDVCLKRKIRNYVDIVKNGIE 64
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA----EWLDV 111
G I+V+ +++ Q ++ D +K SKK E +++ A + D+
Sbjct: 65 GFDIYVREGSILNN------QHMKAYSALD-IKPESKKLPKNEEDARRVKAFMCKHFYDI 117
Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGT 171
R+FG V + + VRGPV I++A+S++ +V Q + IT+ + + E + MG
Sbjct: 118 RTFGAVMTTE-VNCGQVRGPVQINFARSIDPIVQQEVTITRMAVTSVKD-AEKKDREMGR 175
Query: 172 KHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVF 231
KH V Y +Y +G ++ + AEK+GF+ D E++ + L ++F++D S+AR G M R++
Sbjct: 176 KHIVPYALYRAEGYVSAHLAEKSGFTKEDLELLWQSLTNMFDHDRSAAR--GKMATRKLI 233
Query: 232 WFTHSNKLGNVSSARVFDLLEFDKEK--QDKDSYEDYAIHLNQEEL 275
F H LGN S+ +FD++E ++ + +Y DY + +N L
Sbjct: 234 IFEHETALGNASAHSLFDMVEVSRKDLIRPPRAYSDYKVTVNMAAL 279
>gi|153091351|gb|EDN73325.1| hypothetical protein MHA_0345 [Mannheimia haemolytica PHL213]
Length = 287
Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats.
Identities = 92/290 (31%), Positives = 162/290 (55%), Gaps = 30/290 (10%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQ-----DM 55
++ + +F+ +V N NGDP + NMPR D + GL++DV +KRKIRN ++ +
Sbjct: 3 IQNRYEFVFFFDVTNGNPNGDPDAGNMPRLDPETSKGLVTDVCLKRKIRNFIEMSYENEA 62
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDT--VKEGSKKKKSVENFVKQINA----EWL 109
G I+V+ + ++ L+NK ++ ++ +KK E ++I A +
Sbjct: 63 GYEIYVKEKSVLN---------LQNKRAYEALGIESEAKKLPKEEAKAREITAWMCKNFF 113
Query: 110 DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTM 169
D+R+FG V + ++ VRGPV +++A+S++ ++ + IT+ + N K + TM
Sbjct: 114 DIRTFGAVMTTE-VNSGQVRGPVQLAFAQSIDPIIPLEVSITRMAVT--NEKDLEKERTM 170
Query: 170 GTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVRE 229
G K+ V Y +Y + G I+ AEKTGFSD D + + + L +FE+D S+AR G M R+
Sbjct: 171 GRKYIVPYALYRVHGFISAKLAEKTGFSDEDVQKLWQALQLMFEHDRSAAR--GEMAARK 228
Query: 230 VFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDS----YEDYAIHLNQEEL 275
+ F H ++LGN + ++FD ++ ++ +KD+ Y+DY I + E L
Sbjct: 229 LIVFKHDSELGNQPAHKLFDSVKVERINGEKDTPAKDYDDYCISVQTEGL 278
>gi|83645584|ref|YP_434019.1| uncharacterized protein predicted to be involved in DNA repair
[Hahella chejuensis KCTC 2396]
gi|83633627|gb|ABC29594.1| uncharacterized protein predicted to be involved in DNA repair
[Hahella chejuensis KCTC 2396]
Length = 297
Score = 135 bits (339), Expect = 4e-30, Method: Composition-based stats.
Identities = 92/293 (31%), Positives = 158/293 (53%), Gaps = 28/293 (9%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN----RMQDM- 55
+ + +F+ +V N NGDP + N+PR D + ++GL++DV +KRK+RN DM
Sbjct: 4 IANRYEFVFLFDVTNGNPNGDPDAGNLPRLDPETNHGLVTDVCLKRKVRNFVALEKSDMN 63
Query: 56 ----GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA----E 107
G I+VQ + +++ KQ E K+ KK E +++ A
Sbjct: 64 GSAPGFNIYVQEKSVLNN---QHKQAWEALGIPPDAKDKYKKLPKDEAKARELTAWMCNN 120
Query: 108 WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESS 167
+ D+R+FG V + + VRGPV ++A S++ V + IT+ + ++ +LES
Sbjct: 121 FFDIRAFGAVMTME-VNCGQVRGPVQFAFATSVDPVTPLEISITRMA---VTNERDLESE 176
Query: 168 -TMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
TMG KH V YG+Y G I+ AE+TGFS+ D +++ L+++FE+D S+AR G M
Sbjct: 177 RTMGRKHIVPYGLYRAHGFISAKLAERTGFSEEDLQLLWRALINMFEHDRSAAR--GEMA 234
Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEEL 275
R++ F H + +G+ S +FD ++ ++ + + D SY DY + ++ E +
Sbjct: 235 ARKLIVFKHEHPMGDAPSHVLFDKVKVERSEGEADTPARSYNDYRVTVDSESI 287
>gi|91774959|ref|YP_544715.1| CRISPR-associated protein TM1801 [Methylobacillus flagellatus KT]
gi|91708946|gb|ABE48874.1| CRISPR-associated protein TM1801 [Methylobacillus flagellatus KT]
Length = 293
Score = 134 bits (336), Expect = 9e-30, Method: Composition-based stats.
Identities = 95/304 (31%), Positives = 167/304 (54%), Gaps = 35/304 (11%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDA-KDYGLMSDVSIKRKIRN--RMQDMGQP 58
+ + +F+ +V N NGDP + NMPR D GL++DV +KRKIRN + + G P
Sbjct: 3 ITNRYEFVYFFDVTNGNPNGDPDAGNMPRLDPDSSKGLVTDVCLKRKIRNFVELTEEGHP 62
Query: 59 ---IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQ------INAEWL 109
I+V+ + ++ L+NK+ ++ + + KK ++ K + A +
Sbjct: 63 GFEIYVKEKGILN---------LQNKKAYEALSITPEPKKLPKDEAKAREVTAWMCANFF 113
Query: 110 DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELES 166
DVR+FG V + ++ VRGPV +++AKS++ ++ + IT+ +T E +++ +
Sbjct: 114 DVRTFGAVMTTE-VNSGQVRGPVQLAFAKSIDPIIPLELSITRMAVTTEKEAEAQSG-GN 171
Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
TMG KH V YG+Y + G ++ +EKTGFSD D + + L +FE+D S+AR G M
Sbjct: 172 RTMGRKHIVPYGLYRVHGFVSAKLSEKTGFSDDDLAKLWQALTLMFEHDRSAAR--GEMA 229
Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAEYEAKG 282
R++ F H++ LG+ + +FD ++ ++ ++D ++ DY I L+ L +A G
Sbjct: 230 ARKLVVFKHADALGSAPAHLLFDRVKVERVSGERDTPATTFSDYRIVLDTNGL---DALG 286
Query: 283 LQVE 286
+ VE
Sbjct: 287 VTVE 290
>gi|154495122|ref|ZP_02034127.1| hypothetical protein PARMER_04169 [Parabacteroides merdae ATCC
43184]
gi|154085672|gb|EDN84717.1| hypothetical protein PARMER_04169 [Parabacteroides merdae ATCC
43184]
Length = 288
Score = 134 bits (336), Expect = 1e-29, Method: Composition-based stats.
Identities = 90/289 (31%), Positives = 160/289 (55%), Gaps = 26/289 (8%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
+ + DF+ +V++ N NGDP + N+PR D + GL+SDV +KRK+RN +Q
Sbjct: 5 INNRYDFIYLFDVQDGNPNGDPDAGNLPRVDPETGEGLVSDVCLKRKVRNFVQ------I 58
Query: 61 VQARDRVDDCIYSLKQRLEN-------KEFFDTVKEGSKKKKSVENFVKQINAEWLDVRS 113
V+ +R+ D K L N +E +KE K ++ ++ + + D+R+
Sbjct: 59 VKGGERLYDIFIKEKAVLNNLIADAHKQEGVKDIKEKGDKTEAARQWMCR---NFYDIRT 115
Query: 114 FGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMG 170
FG V + G +A VRGP+ ++A+S+ +VT IT+ +T E ++ ++ TMG
Sbjct: 116 FGAVLS-TGENAGQVRGPIQFTFARSISPIVTAEHSITRMAVATEDEAKKQSG-DNRTMG 173
Query: 171 TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREV 230
K V YG+Y G I+ + A +TGF++ D + + L ++F++D S+AR G M R++
Sbjct: 174 RKFTVPYGLYKANGFISAHLAAQTGFNEDDLNLFWDSLKNMFDHDHSAAR--GMMNARKL 231
Query: 231 FWFTHSNKLGNVSSARVFDL--LEFDKEKQDKDSYEDYAIHLNQEELAE 277
F HS LGN S+ +F L ++ +++ S++DY + +++++L E
Sbjct: 232 IVFKHSTALGNASAHSLFGLVKVQLKDDQRPPRSFDDYIVTIDKDKLPE 280
>gi|154495727|ref|ZP_02034423.1| hypothetical protein BACCAP_00006 [Bacteroides capillosus ATCC
29799]
gi|150274925|gb|EDN01973.1| hypothetical protein BACCAP_00006 [Bacteroides capillosus ATCC
29799]
Length = 298
Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats.
Identities = 91/288 (31%), Positives = 157/288 (54%), Gaps = 18/288 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
++ + +F++ +V N NGDP + NMPR D + G+++DV +KRKIRN ++ + +
Sbjct: 5 IKNRYEFVILFDVENGNPNGDPDAGNMPRVDPETGLGIVTDVCLKRKIRNYVETVKEDA- 63
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWL-----DVRSFG 115
R V D + + E D ++ K+KK + + +W+ D+R+FG
Sbjct: 64 TGYRIYVKDGVPLNRSDAEAYAELDVTEKTVKEKKKANPDLDRKIRDWMCANFYDIRTFG 123
Query: 116 QV---FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
V F + VRGPV + +A+S+E VV Q + IT+ + + E + + MG K
Sbjct: 124 AVMTTFVKAALNCGQVRGPVQLGFARSVEPVVPQEVTITRVAITT-EADAEKKGTEMGRK 182
Query: 173 HFVDYGVYVIKGSINPNFAEKT-GFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVF 231
+ V YG+Y +G I+ N A KT GFS+ D ++ E ++++FE+D S+AR G M VRE+
Sbjct: 183 YIVPYGLYRCEGYISANLARKTTGFSEEDLSLLWEAILNMFEHDHSAAR--GKMAVRELI 240
Query: 232 WFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEEL 275
F H ++LG + ++FD + ++ D SY+DY + +++ L
Sbjct: 241 VFKHDSELGCAPAWKLFDAVSVTRKNPDDPAPARSYQDYTVAVDEAAL 288
>gi|67158962|ref|ZP_00419749.1| CRISPR-associated protein TM1801 [Azotobacter vinelandii AvOP]
gi|67084459|gb|EAM03943.1| CRISPR-associated protein TM1801 [Azotobacter vinelandii AvOP]
Length = 302
Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats.
Identities = 89/296 (30%), Positives = 159/296 (53%), Gaps = 29/296 (9%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
+ + +F+ EV N NGDP + N+PR D + + GL++DV +KRKIRN +
Sbjct: 4 IANRYEFVYLFEVTNGNPNGDPDAGNLPRLDPETNQGLVTDVCLKRKIRNYVALEKSDAE 63
Query: 56 GQP-----IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKK----KKSVENFVKQINA 106
G+P I++Q + ++ +Q + E K G KK ++ + +
Sbjct: 64 GKPEQSYVIYMQEKAVLNQ---QHEQAWKACEIPPDAKNGYKKLPADTAKAKSLTDWMCS 120
Query: 107 EWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE 163
+ DVR+FG V G + VRGP+ +++A S++ ++ + IT+ +T E ++
Sbjct: 121 NFFDVRAFGAVMT-TGVNCGQVRGPIQLAFATSIDPIIPLEVSITRMAVTTEKEAEEQSG 179
Query: 164 LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEG 223
++ TMG KH + YG+Y G I+ AE+TGFS+ D ++ L ++FE+D S+AR G
Sbjct: 180 -DNRTMGRKHIIPYGLYRAHGFISAKLAERTGFSEDDLALLWRALENMFEHDRSAAR--G 236
Query: 224 SMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEEL 275
M R++ F H + +GN + ++FDL++ + + + D S+ DY I ++++ L
Sbjct: 237 EMAARKLIVFKHEHPMGNAPAHKLFDLVKVARTEGEADTPARSFADYQISIDRDGL 292
>gi|34496682|ref|NP_900897.1| hypothetical protein CV_1227 [Chromobacterium violaceum ATCC 12472]
gi|34102537|gb|AAQ58902.1| conserved hypothetical protein [Chromobacterium violaceum ATCC
12472]
Length = 288
Score = 131 bits (330), Expect = 4e-29, Method: Composition-based stats.
Identities = 86/284 (30%), Positives = 147/284 (51%), Gaps = 18/284 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
L + +F+ +V N NGDP + N+PR D + + GL++DV +KRK+RN + +
Sbjct: 3 LANRYEFVYLFDVSNGNPNGDPDAGNLPRLDPETNQGLVTDVCLKRKLRNYVALEKENEP 62
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
G I++Q + ++ + K+ E K+ K + + + DVR+FG
Sbjct: 63 GFAIYMQEKSVLN---HQHKRAYEALSLEPEPKKLPKDQAKARELTAWMCKNFFDVRAFG 119
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
V + +A VRGP+ +++A S++ VV + IT+ + N K + TMG KH V
Sbjct: 120 AVMTTE-VNAGQVRGPIQLTFATSIDPVVPLEVSITRMAVT--NDKDLEKERTMGRKHIV 176
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
YG+Y G ++ AE+ GF + D +++ L+++FE+D S+AR G M R++ F H
Sbjct: 177 PYGLYRAHGFVSAKLAERAGFGEEDLQLLWRGLINMFEHDRSAAR--GEMTARKLIAFKH 234
Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDS----YEDYAIHLNQEEL 275
LGN + R+FD + + DS + DY + L++ L
Sbjct: 235 ECALGNAPAHRLFDSVRVSRADGAGDSPARGFTDYRVELDRAVL 278
>gi|118726119|ref|ZP_01574750.1| CRISPR-associated protein, Csd2 family [Clostridium cellulolyticum
H10]
gi|118664499|gb|EAV71130.1| CRISPR-associated protein, Csd2 family [Clostridium cellulolyticum
H10]
Length = 294
Score = 131 bits (329), Expect = 6e-29, Method: Composition-based stats.
Identities = 88/292 (30%), Positives = 167/292 (57%), Gaps = 28/292 (9%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM---- 55
+++ + +F V EV+ N NGDP + NMPR D + YG+++DV +KRKIRN ++ +
Sbjct: 4 IIKNRYEFTVLFEVKNGNPNGDPDAGNMPRIDPETGYGIVTDVCLKRKIRNYIETVKADS 63
Query: 56 -GQPIFVQARDRVDDCIYSLKQRLENKEFFDTV--KEGSKKKKSVENFVKQINAE-WLDV 111
G I+++ D + + E +F KE KK+ V+ +K + + D+
Sbjct: 64 TGYKIYIK------DGVPLERSDREAFTYFGISDEKEAQSKKEEVDIKIKDFMCKNFFDI 117
Query: 112 RSFGQV---FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK--STNSELNSKTELES 166
R+FG V F + VRGPV + +A+S++++V Q + IT+ +T + +K E E
Sbjct: 118 RTFGAVMTTFVKAKLNCGQVRGPVQLGFARSIDQIVQQEISITRVVATTEKDAAKKETE- 176
Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEK-TGFSDADAEIIKEVLVSLFENDASSARPEGSM 225
MG K+ + Y +Y + G I+ N A+K TGF++ D ++ + ++++FE++ S+AR G+M
Sbjct: 177 --MGRKYIIPYALYRVDGYISANLAQKTTGFNEDDLSMLWDAVINMFEHEHSAAR--GNM 232
Query: 226 RVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDS--YEDYAIHLNQEEL 275
VRE+ F H ++ GN + ++FD + ++ + ++DY++ ++++ +
Sbjct: 233 SVRELIVFKHDSEFGNCPAYKLFDAVSVCRKDKSNPPRCFDDYSVDIDEKAI 284
>gi|114331017|ref|YP_747239.1| CRISPR-associated protein, Csd2 family protein [Nitrosomonas
eutropha C91]
gi|114308031|gb|ABI59274.1| CRISPR-associated protein, Csd2 family protein [Nitrosomonas
eutropha C91]
Length = 304
Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats.
Identities = 93/303 (30%), Positives = 163/303 (53%), Gaps = 37/303 (12%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
L+ + DF++ +V++ N NGDP + N+PR DA+ GLM+DV++KRK+RN +
Sbjct: 3 LKNRYDFVLLFDVKDGNPNGDPDAGNLPRVDAETGLGLMTDVALKRKVRNFVS------M 56
Query: 61 VQARDRVDDCIYSLKQRLE------------NKEFFDTVK-----EGSKKKKSVENFVKQ 103
+ + V + + K+R E NK + + EG KK+ N V++
Sbjct: 57 TRDQSEVTESLNGDKKRFEIYVKEKAILNNQNKRAYVGIGKPELLEGEDKKRKGGNAVEE 116
Query: 104 INAEWL-----DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK-STNSE 157
+W+ DVR+FG V + G + VRGPV +++A+S+ +V IT+ + +E
Sbjct: 117 AR-QWMCKNFYDVRTFGAVMS-TGINCGQVRGPVQLTFARSINPIVALEHSITRMAVATE 174
Query: 158 LNS-KTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDA 216
+ + K ++ TMG K V YG+YV G I+ + A +T F + D E++ + L S+FE+D
Sbjct: 175 VEAEKQSGDNRTMGRKFTVPYGLYVAHGFISAHLANQTDFGEDDLELLWQALESMFEHDR 234
Query: 217 SSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLE--FDKEKQDKDSYEDYAIHLNQEE 274
S+AR G M R ++ F H+++LGN + +F ++ E + DY + +++ +
Sbjct: 235 SAAR--GEMATRGLYVFKHNSELGNAPAHSLFARIQPKLKNENSIVRDFSDYTVMVDEAD 292
Query: 275 LAE 277
L +
Sbjct: 293 LPQ 295
>gi|94984344|ref|YP_603708.1| CRISPR-associated protein Csd2 [Deinococcus geothermalis DSM 11300]
gi|94554625|gb|ABF44539.1| CRISPR-associated protein Csd2 [Deinococcus geothermalis DSM 11300]
Length = 325
Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats.
Identities = 86/249 (34%), Positives = 141/249 (56%), Gaps = 27/249 (10%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
L + +F++ +V N NGDP S N PR D +D +GL+SDV++KR++RN +Q G+ IF
Sbjct: 8 LRNRYEFLLLFDVENGNPNGDPDSGNAPRVDPEDGHGLVSDVALKRRVRNYVQAAGEQIF 67
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
+Q ++ I+ KQ GSK K+ V+ + + + DVR+FG V +
Sbjct: 68 IQHGTNLNRPIFQAKQ---------ASGGGSKGKQDVDAARRWMCEHFYDVRTFGAVMS- 117
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSE--LNSKT------------ELES 166
G +A VRGPV +++A+SL+ V IT+ +E N+KT E +
Sbjct: 118 TGANAGQVRGPVQLTFARSLDPVFAIEASITRGAVAEDIKNAKTLDDFLNWEAQQDEDKL 177
Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
TMG K + YG++ KG ++ + A+ TGFS+AD +++ E L++++E+D S+++ G M
Sbjct: 178 RTMGRKSLIPYGLFATKGFVSAHLAQGTGFSEADLKLLLEALLNMYEHDRSASK--GLMS 235
Query: 227 VREVFWFTH 235
R +F F H
Sbjct: 236 SRRLFVFRH 244
>gi|53804985|ref|YP_113167.1| CRISPR-associated TM1801 family protein [Methylococcus capsulatus
str. Bath]
gi|53758746|gb|AAU93037.1| CRISPR-associated protein, TM1801 family [Methylococcus capsulatus
str. Bath]
Length = 309
Score = 128 bits (322), Expect = 4e-28, Method: Composition-based stats.
Identities = 88/289 (30%), Positives = 153/289 (52%), Gaps = 29/289 (10%)
Query: 3 EQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRM---QDMGQP 58
+ + DF++ EV++ N NGDP + N+PR DA+ +GL++DV +KRKIRN + Q P
Sbjct: 6 QNRYDFVLLFEVKDGNPNGDPDAGNLPRLDAETGHGLVTDVCLKRKIRNFVGLTQGDAAP 65
Query: 59 IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKK--KSVENFVKQINAEWLDVRSFGQ 116
+ +++ + +R D + K+K V++ + + + DVR+FG
Sbjct: 66 YEIYVKEKA--VLNRQHERAYQALGVDLGADEGKRKGGDKVDDARRWMCQNFFDVRTFGA 123
Query: 117 VFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTKH 173
V + G + VRGPV +++A+S+ +V IT+ +T +E K ++ TMG KH
Sbjct: 124 VMS-TGVNCGQVRGPVQLTFARSISPIVALEHSITRMAVATEAEA-EKQGGDNRTMGRKH 181
Query: 174 FVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
V YG+Y G ++ + A++TGFS+ D E++ + L +F++D S+AR G M R ++ F
Sbjct: 182 TVPYGLYRAHGFVSAHLAQQTGFSEKDLELLWQALSQMFDHDHSAAR--GEMATRGLYVF 239
Query: 234 THSNK------------LGNVSSARVFDLLEFDKEKQDKDSYE--DYAI 268
H LG + ++FDL+ + + + E DYA+
Sbjct: 240 KHVGTDTDPDQRKQQAMLGCAPAHKLFDLIRVEPKDTGRPPREFGDYAV 288
>gi|21244564|ref|NP_644146.1| hypothetical protein XAC3840 [Xanthomonas axonopodis pv. citri str.
306]
gi|21110240|gb|AAM38682.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 288
Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats.
Identities = 84/281 (29%), Positives = 146/281 (51%), Gaps = 18/281 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
+ + +F+ +V N NGDP + N+PR D + + GL++DV++KRKIRN +
Sbjct: 4 IAHRYEFVYLFDVANGNPNGDPDAGNLPRLDPETNRGLVTDVALKRKIRNYVALEKDNAP 63
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
G I++Q + +++ KQ K+ K+ + + DVR+FG
Sbjct: 64 GYTIYMQEKSVLNN---QHKQAYTALGIEHEAKKLPKEGDKARQLTAWMCENFFDVRTFG 120
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
V + + VRGPV +++A S+E V+ + IT+ + N K + TMG KH +
Sbjct: 121 AVMTTE-VNTGQVRGPVQLAFATSVEPVLPLEVSITRVAVT--NEKDLEKERTMGRKHIL 177
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
YG+Y G ++ AE+TGFS+ D +++ L +LFE+D S+AR G M R++ F H
Sbjct: 178 PYGLYRAHGFVSAKLAERTGFSEEDLQLLWRALTNLFEHDRSAAR--GEMAARKLIVFEH 235
Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQ 272
+ +GN + +FD ++ ++ Q S+ DY + ++
Sbjct: 236 EHPMGNAPAHVLFDKVKVERIDQADQGPARSFSDYRVVIDH 276
>gi|58580492|ref|YP_199508.1| hypothetical protein XOO0869 [Xanthomonas oryzae pv. oryzae
KACC10331]
gi|84622452|ref|YP_449824.1| hypothetical protein XOO_0795 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58425086|gb|AAW74123.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
KACC10331]
gi|84366392|dbj|BAE67550.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 288
Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats.
Identities = 84/281 (29%), Positives = 148/281 (52%), Gaps = 18/281 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQ-----DM 55
+ + +F+ +V N NGDP + N+PR D + + GL++DV++KRKIRN +
Sbjct: 4 IANRYEFVYLFDVINGNPNGDPDAGNLPRLDPETNRGLVTDVALKRKIRNYVALEQEAQA 63
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
G I++Q + +++ KQ K+ K + + + DVR+FG
Sbjct: 64 GYAIYMQEKSVLNN---QHKQAYAALGIEHEAKKLPKDEAKARELTSWMCKNFFDVRTFG 120
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
V + ++ VRGPV +++A S+E V+ + IT+ + N K + TMG KH +
Sbjct: 121 AVMTTE-VNSGQVRGPVQLAFASSVEPVLPLEVSITRVAVT--NEKDLEKERTMGRKHIL 177
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
YG+Y G I+ AE+TGFS+ D +++ L +LFE+D S+AR G M R++ F H
Sbjct: 178 PYGLYRAHGFISAKLAERTGFSEDDLQLLWRALTNLFEHDRSAAR--GEMSARKLIVFKH 235
Query: 236 SNKLGNVSSARVFDLLEFDK----EKQDKDSYEDYAIHLNQ 272
+++GN + +FD + ++ + S+ DY + +++
Sbjct: 236 EHQMGNAPAHVLFDKVTVERVAEVDAGPARSFADYRVTIDR 276
>gi|126664601|ref|ZP_01735585.1| CRISPR-associated protein, TM1801 family [Marinobacter sp. ELB17]
gi|126630927|gb|EBA01541.1| CRISPR-associated protein, TM1801 family [Marinobacter sp. ELB17]
Length = 290
Score = 124 bits (311), Expect = 7e-27, Method: Composition-based stats.
Identities = 80/283 (28%), Positives = 156/283 (55%), Gaps = 15/283 (5%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQP-- 58
L + +F+ +V++ N NGDP + N+PR DA+ GL++DV +KRKIRN + + +
Sbjct: 3 LNNRYEFVFLFDVKDGNPNGDPDAGNLPRIDAETGQGLVTDVCLKRKIRNYVGMVKEETP 62
Query: 59 ---IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
I+++ + ++ + LE K K+ KK++ + + + + D+R+FG
Sbjct: 63 PFEIYIKEKAVLNRSNLRAYEALELKH---ESKKLPKKEEDAKRITQWMCQNFFDIRTFG 119
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKS--TNSELNSKTELESSTMGTKH 173
V + + + VRGP+ +++A+S++ VV+ IT+ T + + ++ TMG K
Sbjct: 120 AVMSTE-VNTGQVRGPIQMNFARSIDPVVSAEHSITRMAVTTEKEAEQQGGDNRTMGRKF 178
Query: 174 FVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
+ YG+Y G ++ N A +TGFS+ D E+ + L+++ ++D S++R G M ++ F
Sbjct: 179 TIPYGLYRCHGYVSANLAGQTGFSEEDLELFWDALINMLDHDRSASR--GEMSPCALYVF 236
Query: 234 THSNKLGNVSSARVFDLLEFDK-EKQDKDSYEDYAIHLNQEEL 275
H + LGN + ++F+L+E K + DY + ++++ L
Sbjct: 237 KHESALGNAPARKLFELIEIHKVSDGPARDFRDYEVTVHRDRL 279
>gi|52425041|ref|YP_088178.1| hypothetical protein MS0986 [Mannheimia succiniciproducens MBEL55E]
gi|52307093|gb|AAU37593.1| unknown [Mannheimia succiniciproducens MBEL55E]
Length = 321
Score = 124 bits (311), Expect = 7e-27, Method: Composition-based stats.
Identities = 84/284 (29%), Positives = 153/284 (53%), Gaps = 18/284 (6%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDY-GLMSDVSIKRKIRNRMQ-----DM 55
++ + +F+ +V N NGDP + NMPR D + GL++DV +KRKIRN ++
Sbjct: 37 IQNRYEFVYFFDVTNGNPNGDPDAGNMPRLDPESSKGLVTDVCLKRKIRNFVELANENQA 96
Query: 56 GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
G I+V+ + ++ K+ E E K+ K + + + + D+RSFG
Sbjct: 97 GYEIYVKEKSVLN---LQNKRAYEALEIEPEAKKLPKDEAKARDITAWMCKNFFDIRSFG 153
Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
V + ++ VRGPV +++A+S++ ++ + IT+ + N K + TMG K+ V
Sbjct: 154 AVMTTE-VNSGQVRGPVQLAFAQSIDPIIPLEVSITRMAVT--NEKDLEKERTMGRKYIV 210
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
Y +Y + G I+ N A KTGFS+ D + + + L +FE+D S+AR G M R++ F H
Sbjct: 211 PYALYRVHGFISANLAAKTGFSEEDLQKLWQALQLMFEHDRSAAR--GEMAARKLIVFKH 268
Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDS----YEDYAIHLNQEEL 275
+ LG+V + ++FD ++ ++ + + + DY I + +++
Sbjct: 269 DSALGSVPAHKLFDSVKVERINGESGTPATGFADYQISIEKDKF 312
>gi|84703568|ref|ZP_01017396.1| CRISPR-associated protein [Parvularcula bermudensis HTCC2503]
gi|84690002|gb|EAQ15843.1| CRISPR-associated protein [Parvularcula bermudensis HTCC2503]
Length = 304
Score = 120 bits (301), Expect = 1e-25, Method: Composition-based stats.
Identities = 76/251 (30%), Positives = 129/251 (51%), Gaps = 8/251 (3%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQPIFVQARD 65
+F++ +V N NGDP + NMPR D + + GL+SDV++KRK+RN +
Sbjct: 10 EFVLYFDVMNGNPNGDPDAGNMPRLDPETNKGLVSDVALKRKVRNYVALASDNRIYMTEG 69
Query: 66 RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
+ ++ E D K ++ + A + DVR+FG V + G +A
Sbjct: 70 STLNLLHKEAWAAVMPEITDKFNILPKDRQKARELTAWMCANFWDVRTFGAVMS-TGVNA 128
Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
VRGPV ++A+S+E ++ + IT+ + + TMG KH V YG+Y G
Sbjct: 129 GQVRGPVQFTFARSVEAILPLEISITRMAATTEKDAEDKHGRTMGRKHIVPYGLYRAHGF 188
Query: 186 INPNFA----EKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGN 241
++ A + TGFS+ D E++ + L ++F++D S+AR G M R++ F H + LGN
Sbjct: 189 VSAPLASDETKGTGFSEDDLELLWQALGNMFDHDRSAAR--GEMATRKLVVFRHESALGN 246
Query: 242 VSSARVFDLLE 252
+ +F+ ++
Sbjct: 247 AQAQSLFERVQ 257
>gi|68549523|ref|ZP_00588986.1| CRISPR-associated protein TM1801 [Pelodictyon phaeoclathratiforme
BU-1]
gi|68243611|gb|EAN25809.1| CRISPR-associated protein TM1801 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 346
Score = 118 bits (296), Expect = 4e-25, Method: Composition-based stats.
Identities = 90/338 (26%), Positives = 167/338 (49%), Gaps = 66/338 (19%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQ--- 57
+E + +F++ +V+ N NGDP + N+PR D + +G+ +DV +KRKIRN ++ + +
Sbjct: 5 IENRYEFVLLFDVKNGNPNGDPDAGNLPRIDPETGHGITTDVCLKRKIRNYVELLKKNES 64
Query: 58 PIFVQARD--------------RVDDCIY-----SLKQRL------------ENKEFF-- 84
P + R+ D+ IY L+ L EN +
Sbjct: 65 PYEIHVREGEFLSEHHKRAHNALTDEKIYVFVPADLRGELSSFAEYPEGVGFENDAIYFQ 124
Query: 85 -----DTVKEGSKKKKSVENFVKQ------------INAEWL-----DVRSFGQVFAFDG 122
D VK+ +K K++ + K I +W+ DVR+FG V +
Sbjct: 125 LSSDIDKVKKDVEKLKNITDASKAKIKELFVDGKSVIAKKWMCKNFFDVRTFGAVMSTGD 184
Query: 123 YSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVI 182
+ VRGPV +S+++S+E +V ++I S + +N + K V YG+Y +
Sbjct: 185 KTCGQVRGPVQLSFSRSIEPIV--GLEIAMSRTAAVNVDKSSDKGLGARKSIVPYGLYRV 242
Query: 183 KGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNV 242
G I+ A++TGF++ D E++ L+++F++D S++R G M R++F F H ++LGN
Sbjct: 243 HGFISAPLAKQTGFTEDDLELLWSALINMFDHDRSASR--GEMASRQLFVFQHESELGNT 300
Query: 243 SSARVFDLLEFDKEKQDKDS---YEDYAIHLNQEELAE 277
+ ++F+ ++ +++ YEDY I +++ L +
Sbjct: 301 PAHKLFERIKVERKPLSNGPARFYEDYQITIDETNLGK 338
>gi|149126262|ref|ZP_01851159.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
gi|148517078|gb|EDK90356.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
Length = 359
Score = 114 bits (285), Expect = 7e-24, Method: Composition-based stats.
Identities = 94/353 (26%), Positives = 162/353 (45%), Gaps = 83/353 (23%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
L+++ DF++ +V N NGDP + NMPR D + +GL+SDV +KRK+RN ++ +
Sbjct: 6 LQRRHDFVLYFDVTNGNPNGDPDAGNMPRMDPETGHGLVSDVCLKRKVRNYVEMAAE--- 62
Query: 61 VQARDRVDDCIYSLKQRLEN---KEFFDTVKEGSKKKKSVENFVKQINAE---------- 107
RD + + IY + + N +E + ++ K ++ + + + E
Sbjct: 63 ADGRDPIRNRIYVTEGAVLNEKHREAYLALRPDDPKARTDKKLTPKSDEEAVLIRRFMCD 122
Query: 108 -WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE 163
+ D+R+FG V + G +A VRGPV +S+A+S+E V+ + IT+ + +E N + +
Sbjct: 123 NFFDIRTFGAVLS-TGINAGQVRGPVQVSFARSVEPVLPLEVSITRMAATNEAERNERQD 181
Query: 164 LESS----------------------------------------TMGTKHFVDYGVYVIK 183
E TMG KH V YG+Y
Sbjct: 182 GEDKAGKRGDKRTMGRKHMAATNEAERNERQDGDDEAEKRGDKRTMGRKHIVPYGLYRAH 241
Query: 184 GSINPNFA----EKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
G ++ A + TGFSD D ++ E L ++FE+D S+ R G M R + F H++ L
Sbjct: 242 GYVSAPLASHPVKGTGFSDGDLALLFEALRNMFEHDRSATR--GEMATRRLVVFRHASAL 299
Query: 240 GNVSSARVFDLLEFDKEKQDK---------------DSYEDYAIHLNQEELAE 277
GN + +F+ + + + S+ DYAI +++E L +
Sbjct: 300 GNAPAQSLFERVRTLRAHKGSVHEIGAPGTDNWPPARSFADYAITVDREGLPQ 352
>gi|45658744|ref|YP_002830.1| hypothetical protein LIC12914 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|45601988|gb|AAS71467.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
Length = 306
Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats.
Identities = 88/301 (29%), Positives = 153/301 (50%), Gaps = 37/301 (12%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
++ + +F+ +V++ N NGDP + N PR D + GL++DVS+KRKIRN +
Sbjct: 5 IQNRYEFVYLFDVKDGNPNGDPDAGNQPRVDPETGNGLITDVSLKRKIRNYVT------I 58
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA--------EWL--- 109
V++ +D K L V G+K + S + ++ EW+
Sbjct: 59 VKSATPPNDIYIKEKAVLIETHEKAYVAVGAKLETSKKEEKEKRTGGDQVGKAREWMCKN 118
Query: 110 --DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTEL 164
DVR+FG V A +A V+GP+ ++A+S++ V+ IT+ +T E + +
Sbjct: 119 FYDVRTFGAVMALK-VNAGVVKGPIQFTFARSIDPVINLEHSITRMAVATKKEAEDQ-DG 176
Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGS 224
++ TMG KH + YG+Y G I+ +FA TGFS+ D E+ L ++F++D S+AR G
Sbjct: 177 DNRTMGRKHTISYGLYRAHGFISAHFANDTGFSEEDLELFWSSLQNMFDHDRSAAR--GE 234
Query: 225 MRVREVFWFTH--------SNKLGNVSSARVFDLLEFDKEKQDKDS--YEDYAIHLNQEE 274
M R ++ F H KLG + ++F+L+ K+ + + DY++ + + +
Sbjct: 235 MNCRGLYVFKHVGDGKNTNQAKLGVAPAHKLFNLISVSKKDNSTPARDFSDYSVKIQESD 294
Query: 275 L 275
L
Sbjct: 295 L 295
>gi|24213386|ref|NP_710867.1| hypothetical protein LA0686 [Leptospira interrogans serovar Lai
str. 56601]
gi|24194142|gb|AAN47885.1|AE011255_1 conserved hypothetical protein [Leptospira interrogans serovar Lai
str. 56601]
Length = 306
Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats.
Identities = 88/301 (29%), Positives = 153/301 (50%), Gaps = 37/301 (12%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
++ + +F+ +V++ N NGDP + N PR D + GL++DVS+KRKIRN +
Sbjct: 5 IQNRYEFVYLFDVKDGNPNGDPDAGNQPRVDPETGNGLITDVSLKRKIRNYVT------I 58
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA--------EWL--- 109
V++ +D K L V G+K + S + ++ EW+
Sbjct: 59 VKSATPPNDIYIKEKAVLIETHEKAYVAVGAKLETSKKEEKEKRTGGDQVGKAREWMCKN 118
Query: 110 --DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTEL 164
DVR+FG V A +A V+GP+ ++A+S++ V+ IT+ +T E + +
Sbjct: 119 FYDVRTFGAVMALK-VNAGVVKGPIQFTFARSIDPVINLEHSITRMAVATKKEAEVQ-DG 176
Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGS 224
++ TMG KH + YG+Y G I+ +FA TGFS+ D E+ L ++F++D S+AR G
Sbjct: 177 DNRTMGRKHTISYGLYRAHGFISAHFANDTGFSEEDLELFWSSLQNMFDHDRSAAR--GE 234
Query: 225 MRVREVFWFTH--------SNKLGNVSSARVFDLLEFDKEKQDKDS--YEDYAIHLNQEE 274
M R ++ F H KLG + ++F+L+ K+ + + DY++ + + +
Sbjct: 235 MNCRGLYVFKHVGDEKNTNQAKLGVAPAHKLFNLISVSKKDNSTPARDFSDYSVKIQESD 294
Query: 275 L 275
L
Sbjct: 295 L 295
>gi|86742030|ref|YP_482430.1| CRISPR-associated protein TM1801 [Frankia sp. CcI3]
gi|86568892|gb|ABD12701.1| CRISPR-associated protein TM1801 [Frankia sp. CcI3]
Length = 295
Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats.
Identities = 78/254 (30%), Positives = 127/254 (50%), Gaps = 29/254 (11%)
Query: 3 EQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQP--- 58
E+K D ++ +V + N NGDP + N PRTD + +GL++DV+IKRK+R+ + +
Sbjct: 8 EKKHDMVLLFDVTDGNPNGDPDNGNRPRTDDETGHGLVTDVAIKRKVRDTIGLAAEAEGL 67
Query: 59 ------IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWL--- 109
IFV+A ++L RLE ++ G K +++ EWL
Sbjct: 68 DLTRYQIFVEAG-------HALNTRLEESYLVKGLELGKK----IDDAKAAKAREWLANR 116
Query: 110 --DVRSFGQVFAF-DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELES 166
D+R FG V + S +RGP+ + A+SL+ V+ IT+ T + + E
Sbjct: 117 YVDIRLFGAVLSTGKTQSLGQIRGPIQVGMARSLDPVLPVDHAITRVTQTTQADIDKGER 176
Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
+ MG K V YG+Y + + +TG S AD ++ LV++F++D S+ R G M
Sbjct: 177 TEMGGKWTVPYGLYRAEIHYSAPRGRQTGVSAADLDLFLCTLVNMFDHDRSATR--GEMA 234
Query: 227 VREVFWFTHSNKLG 240
R ++ F+H N G
Sbjct: 235 TRGLYVFSHHNAFG 248
>gi|119357241|ref|YP_911885.1| CRISPR-associated protein, Csd2 family [Chlorobium phaeobacteroides
DSM 266]
gi|119354590|gb|ABL65461.1| CRISPR-associated protein, Csd2 family [Chlorobium phaeobacteroides
DSM 266]
Length = 366
Score = 96.3 bits (238), Expect = 2e-18, Method: Composition-based stats.
Identities = 64/228 (28%), Positives = 129/228 (56%), Gaps = 18/228 (7%)
Query: 64 RDRVDDCIYSLKQRLEN---KEFFDTVKEGSKKKKSVENFVK---QINAEWLDVRSFGQV 117
++ + D + + K+ L K + +KE +K + E + ++ ++ D+R+FG V
Sbjct: 135 KNEIKDWMKAEKESLSKNVIKVISEALKEAKPRKPTAEETSRGKEKMCQDYYDIRTFGAV 194
Query: 118 FAFDGY-SAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTKH 173
+ + VRGP+ +++A+S+E +V IT+ +T +E ++ ++ TMG K+
Sbjct: 195 MSLKSAPNCGQVRGPIQMTFARSVEPIVALEHSITRMAVATEAEAEKQSG-DNRTMGRKY 253
Query: 174 FVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
V YG+Y G ++ N A +TGFS+ D ++ L+++F++D S+AR G M R ++ F
Sbjct: 254 TVPYGLYRAHGFVSANLAHQTGFSENDLDLFWNALLNMFDHDRSAAR--GLMSTRGLYVF 311
Query: 234 THSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAE 277
HS+ LGN ++++F+ + K K+D + S+++Y + +++ L E
Sbjct: 312 EHSSVLGNAPASQLFERITV-KRKEDSEGPARSFKEYDVLIDESSLGE 358
>gi|46255206|ref|YP_006118.1| hypothetical protein TT_P0135 [Thermus thermophilus HB27]
gi|46198055|gb|AAS82465.1| hypothetical conserved protein [Thermus thermophilus HB27]
Length = 383
Score = 95.9 bits (237), Expect = 2e-18, Method: Composition-based stats.
Identities = 71/222 (31%), Positives = 110/222 (49%), Gaps = 14/222 (6%)
Query: 68 DDCIYSLKQRLEN--KEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
++ + K+ LE KE VK ++ E ++ + D+R FG V + G +A
Sbjct: 148 EEAVAPFKKELEKLAKELAKAVKGRKITEEDRERAQAKLLERFFDIRMFGAVLS-TGLNA 206
Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKS--TNSELNSKTELESSTMGTKHFVDYGVYVIK 183
VRGPV +++A+SL+ + + IT+ T E ++ E E MG K V YG+Y
Sbjct: 207 GQVRGPVQLTFARSLDPIAPLEVSITRVAITREEDRARKETE---MGRKPLVPYGLYRAH 263
Query: 184 GSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVS 243
G NP A KTG D E + + L LFE D S+AR G M VR + F+H + GN
Sbjct: 264 GFFNPFLAAKTGVQPEDLEALWDALQHLFELDRSAAR--GEMTVRGLAVFSHEDAKGNAP 321
Query: 244 SARVFDLLEFDKEK--QDKDSYEDYAIHLNQEELAEYEAKGL 283
+ R+F L+ ++ + + S+ DY + +E EA G
Sbjct: 322 AHRLFGLIRVERREGVEAPRSFADYRVRAPKE--GSLEAHGF 361
>gi|51891450|ref|YP_074141.1| hypothetical protein STH312 [Symbiobacterium thermophilum IAM
14863]
gi|51855139|dbj|BAD39297.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM
14863]
Length = 370
Score = 91.7 bits (226), Expect = 5e-17, Method: Composition-based stats.
Identities = 54/180 (30%), Positives = 94/180 (52%), Gaps = 4/180 (2%)
Query: 88 KEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQS 147
++G ++ + K + + D+R FG V + G +A VRGPV +++A+S +
Sbjct: 167 RKGELTAETQDKARKWLCQTYYDIRMFGAVLS-TGLNAGQVRGPVQLTFARSQHPITPLD 225
Query: 148 MQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEV 207
+ IT+ + + + MG K V YG+Y G NP AEKTG +D D I+ +
Sbjct: 226 LSITRQARTT-TVRMATGPTEMGRKPIVPYGLYRAHGFFNPFLAEKTGVTDDDLRILWDA 284
Query: 208 LVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYA 267
L LF+ D S+ R G M +R ++ FTH + G + ++F+L++ DK + + D++
Sbjct: 285 LQHLFDYDRSAVR--GEMNMRGLWVFTHDDAKGCAPTHKLFELIQTDKLRNGVEVPRDFS 342
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 49/65 (75%), Gaps = 2/65 (3%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQD-MGQPI 59
+ ++ +F++ +VR+ N NGDP + N+PR D + +GL++DV++KRK+R+ + +G+PI
Sbjct: 6 VSKRHEFVLLFDVRDGNPNGDPDAGNLPRIDPETMHGLVTDVALKRKVRDYVSGVLGKPI 65
Query: 60 FVQAR 64
F+Q++
Sbjct: 66 FIQSK 70
>gi|59801386|ref|YP_208098.1| hypothetical protein, putative phage associated protein [Neisseria
gonorrhoeae FA 1090]
gi|59718281|gb|AAW89686.1| hypothetical protein, putative phage associated protein [Neisseria
gonorrhoeae FA 1090]
Length = 180
Score = 90.9 bits (224), Expect = 7e-17, Method: Composition-based stats.
Identities = 52/148 (35%), Positives = 88/148 (59%), Gaps = 5/148 (3%)
Query: 108 WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESS 167
+ D+R+FG V + ++ VRGPV +++A+S++ +V + IT+ + N K +
Sbjct: 5 FFDIRTFGAVMTTE-VNSGQVRGPVQLAFAQSIDPIVPPEVSITRMAVT--NEKDLEKER 61
Query: 168 TMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRV 227
TMG K+ V Y VY + G I+ N A KTGFSD D + + L +FE+D S+AR G M
Sbjct: 62 TMGRKYIVPYVVYRVHGFISANLAAKTGFSDDDLAKLWQALTLMFEHDRSAAR--GEMAA 119
Query: 228 REVFWFTHSNKLGNVSSARVFDLLEFDK 255
R++ F H + LG+ + ++FD ++ ++
Sbjct: 120 RKLVVFKHDSALGSQPAHKLFDAVKVER 147
>gi|147678326|ref|YP_001212541.1| hypothetical protein PTH_1991 [Pelotomaculum thermopropionicum SI]
gi|146274423|dbj|BAF60172.1| Uncharacterized protein [Pelotomaculum thermopropionicum SI]
Length = 380
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 58/203 (28%), Positives = 105/203 (51%), Gaps = 15/203 (7%)
Query: 82 EFFDTVKEGSKKKKSVENFVKQINA-EWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSL 140
E + +G K V + +K A E+ D+R FG V G +A V GPV I++A+S+
Sbjct: 171 EKLQSATKGQKLTAEVRSKIKTTMASEFYDIRMFGAVLTM-GTNAGQVLGPVQITFARSV 229
Query: 141 EKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTG----- 195
V ++ IT++ + + + + + MG K + YG+YV G NP AEK
Sbjct: 230 SPVFPMNLTITRTAITRESDRLR-KQTEMGQKPIIPYGLYVAHGFYNPKLAEKLNPGSEL 288
Query: 196 -FSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNK--LGNVSSARVFDLLE 252
+ D +++ E L ++FE D S++R G M R ++ FTH ++ G + ++F+L++
Sbjct: 289 LVKEDDLKLLWEALCNMFEYDRSASR--GEMACRGLYIFTHEDEKGYGKAPAHKLFELVK 346
Query: 253 FDKEK--QDKDSYEDYAIHLNQE 273
+ + + S++DY + L +
Sbjct: 347 ITERDPGRPQRSFDDYTVMLEDK 369
>gi|109646593|ref|ZP_01370497.1| Uncharacterized protein predicted to be involved in DNA repair-like
[Desulfitobacterium hafniense DCB-2]
gi|109641839|gb|EAT51393.1| Uncharacterized protein predicted to be involved in DNA repair-like
[Desulfitobacterium hafniense DCB-2]
Length = 294
Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats.
Identities = 67/244 (27%), Positives = 115/244 (47%), Gaps = 22/244 (9%)
Query: 9 MVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF-----VQA 63
++ +EV +NANGDP + PR G +S VS KRK+R+ + D + V
Sbjct: 11 LMVIEVVNSNANGDPDRESDPRQRPNGIGEISPVSFKRKLRDLVGDHDSVFYQNLPEVYV 70
Query: 64 RDRVDDCIYSLKQRLE---NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
++ CI + R E +K +K FVK+ + D R FG F
Sbjct: 71 KNSDHYCILESRGRDRKSIQSEMSKDIKNFEQKSFLESTFVKK----YWDARIFGNTFLE 126
Query: 121 DGYSAANVR-GPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHF--VDY 177
+G + ++ G V S+ V I + TN+ E +++ M F V++
Sbjct: 127 EGANKGFIKTGVVQFGVGTSISPV-----NIIRHTNTNKAGVQEGKNAGMAPLAFRIVEH 181
Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSN 237
GVY + +NPN+A KTG + D +++K ++ ++ + S+ RP+ +R+R ++ H N
Sbjct: 182 GVYCMPFFVNPNYAAKTGCTQEDIDLLKLLIPKAYDLNRSAIRPD--VRIRHAWYIEHLN 239
Query: 238 KLGN 241
LG+
Sbjct: 240 ALGS 243
>gi|149125882|ref|ZP_01850852.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
gi|148517451|gb|EDK90656.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
Length = 242
Score = 79.7 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 59/198 (29%), Positives = 103/198 (52%), Gaps = 30/198 (15%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
L+++ DF++ +V N NGDP + NMPR D + +GL+SDV +KRK+RN ++ +
Sbjct: 6 LQRRHDFVLYFDVTNGNPNGDPDAGNMPRMDPETGHGLVSDVCLKRKVRNYVEMAAE--- 62
Query: 61 VQARDRVDDCIYSLKQRLEN---KEFFDTVKEGSKKKKSVENFVKQINAE---------- 107
RD + + IY + + N +E + ++ K ++ + + + E
Sbjct: 63 ADGRDPIRNRIYVTEGAVLNEKHREAYLALRPDDPKARTDKKLTPKSDEEAVLIRRFMCD 122
Query: 108 -WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE 163
+ D+R+FG V + G +A VRGPV +S+A+S+E V+ + IT+ + +E N + +
Sbjct: 123 NFFDIRTFGAVLS-TGINAGQVRGPVQVSFARSVEPVLPLEVSITRMAATNEAERNERQD 181
Query: 164 LESS--------TMGTKH 173
E TMG KH
Sbjct: 182 GEDKAGKRGDKRTMGRKH 199
>gi|125975680|ref|YP_001039590.1| CRISPR-associated protein, Csh2 family [Clostridium thermocellum
ATCC 27405]
gi|125715905|gb|ABN54397.1| CRISPR-associated protein, Csh2 family [Clostridium thermocellum
ATCC 27405]
Length = 305
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 76/293 (25%), Positives = 147/293 (50%), Gaps = 36/293 (12%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM---- 55
M++ + + + +V +AN NGDPL N PR D + +++DV +KR IR+ + D
Sbjct: 1 MIKNRQEILFLYDVTDANPNGDPLDENKPRIDEETGINIVTDVRLKRTIRDYLYDYKGFD 60
Query: 56 ---GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVR 112
G+ IFV + +E+++ +K+G + K V +I + +D+R
Sbjct: 61 GSNGKDIFV--------------REIESEK--GGIKDGKARAKDFNENVDEILQKAIDIR 104
Query: 113 SFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
FG V D ++ GPV + +SL KV +++ K T + + + + + T +
Sbjct: 105 LFGGVIPLDK-ASITFTGPVQFNMGRSLNKV---NLKHIKGTGAFASGEGKAQ-KTFREE 159
Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR--VREV 230
+ V Y + G IN N A++TG +D D +++ + + + +N + ++ R +R V
Sbjct: 160 YIVPYSIIAFHGIINENAAKRTGLTDEDVDLLDDAMWNGTKNLITRSKMGHMPRLMLRVV 219
Query: 231 FWFTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQ--EELAEYEAK 281
+ + +G++ + R+ L FD E++ S +D++I L++ +ELA Y K
Sbjct: 220 YKPGENFFIGDLQN-RIS--LNFDVEEEKIRSIKDFSIKLDELIDELANYGDK 269
>gi|154249620|ref|YP_001410445.1| CRISPR-associated protein, Csh2 family [Fervidobacterium nodosum
Rt17-B1]
gi|154153556|gb|ABS60788.1| CRISPR-associated protein, Csh2 family [Fervidobacterium nodosum
Rt17-B1]
Length = 302
Score = 70.9 bits (172), Expect = 1e-10, Method: Composition-based stats.
Identities = 63/215 (29%), Positives = 102/215 (47%), Gaps = 28/215 (13%)
Query: 18 NANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPIFVQARDRVDDCIYSLKQ 76
N NGDP N PR D ++ L+SD+ +KR IR+ + + G IFV+ ++DD + ++
Sbjct: 24 NPNGDPDEENRPRMDNEREINLVSDLRLKRYIRDYLYEKGYDIFVR---KIDDKPVTAEK 80
Query: 77 RLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSAANVRGPVSISW 136
R+E+ F KE +I ++ +DVR FG G + A + GPV +W
Sbjct: 81 RMED---FKNSKE------------DEILSKLIDVRLFGATMPVKGNNRAYI-GPVQFNW 124
Query: 137 AKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGF 196
SL KV IT S+ + + T+G V Y + G ++ AEKT
Sbjct: 125 GYSLNKVELLEASIT----SQFATSENAQQGTIGKDFRVKYSLIAFFGVVSGRRAEKTKL 180
Query: 197 SDADAEIIKEVLVSLFENDASSAR----PEGSMRV 227
++ D +++ E +V A+ ++ P MRV
Sbjct: 181 TNEDLKLLDEAMVKAIPLQATRSKIGQYPRLYMRV 215
>gi|145622619|ref|ZP_01778576.1| CRISPR-associated protein, Csh2 family [Petrotoga mobilis SJ95]
gi|144946978|gb|EDJ82013.1| CRISPR-associated protein, Csh2 family [Petrotoga mobilis SJ95]
Length = 302
Score = 68.9 bits (167), Expect = 4e-10, Method: Composition-based stats.
Identities = 60/234 (25%), Positives = 101/234 (43%), Gaps = 23/234 (9%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPI 59
M + + + V++AN NGDPL+ N PR D + L+SDV IKR IR+ + MG+ +
Sbjct: 1 MFNGRKELLFVYSVKDANPNGDPLNANHPRYDEETGQVLVSDVRIKRTIRDELMRMGEDV 60
Query: 60 FVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
F+ + +LK+R E + + G + K +D R FG FA
Sbjct: 61 FIDGEPK------TLKERFEELKTKLSTTNGDETLKKC-----------IDTRLFGVTFA 103
Query: 120 FDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGV 179
G + GPV W +SL K + +Q T + +K E T+ ++ V + +
Sbjct: 104 L-GKESFAWTGPVQFKWGRSLHKTKVEFVQGTGA----FVTKEGGEQRTIRNEYIVPFAL 158
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
N +E+T +D D + + + N + ++ E R W+
Sbjct: 159 IGTYAIGNQYASERTQATDEDFNKLTQAAWNGTNNLITRSKTEHRSRFLMEIWY 212
>gi|78043120|ref|YP_360970.1| CRISPR-associated protein, Csh2 family [Carboxydothermus
hydrogenoformans Z-2901]
gi|77995235|gb|ABB14134.1| CRISPR-associated protein, Csh2 family [Carboxydothermus
hydrogenoformans Z-2901]
Length = 313
Score = 66.2 bits (160), Expect = 2e-09, Method: Composition-based stats.
Identities = 63/214 (29%), Positives = 99/214 (46%), Gaps = 19/214 (8%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQPI 59
+++ + + + + N NGDP N PR D + L+SDV +KR +R+ +Q+ G+ I
Sbjct: 4 LIKNNSEILFIYDAKLTNPNGDPDDENRPRMDYETKTNLVSDVRLKRYVRDYLQEKGKEI 63
Query: 60 FVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
FV +V+ + +RLE K T K SK V + + +DVR FG
Sbjct: 64 FVA---KVEGETVNATERLE-KLLGKTSKNISKDDVPV------LLEKLVDVRLFGATMP 113
Query: 120 F---DGYSAANVRGPVSISWAKSLEKV-VTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
DG S+ GPV +W SL KV + +S IT S +S + E TMG + +
Sbjct: 114 IKSEDGGSSLTFTGPVQFNWGYSLNKVELVESNTIT----SRFSSTSGNEQGTMGKDYRL 169
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLV 209
Y + G + + A+ T ++ E LV
Sbjct: 170 YYSLIAFHGIVAAHRAKFTQLDMETLSLLDEALV 203
>gi|114844588|ref|ZP_01455032.1| CRISPR-associated protein TM1801 [Thermoanaerobacter ethanolicus
X514]
gi|114805397|gb|EAU57194.1| CRISPR-associated protein TM1801 [Thermoanaerobacter ethanolicus
X514]
Length = 293
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 63/229 (27%), Positives = 99/229 (43%), Gaps = 28/229 (12%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPIFVQARD 65
+ + + + N NGDP N PR D ++ L+SD+ +KR IR+ + G IFV+
Sbjct: 7 EILYIYDAKLTNPNGDPDEENRPRMDYEREINLVSDLRLKRYIRDYLMLKGYDIFVRL-- 64
Query: 66 RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
VDD + +R+++ E S + +EN W+DVR FG +
Sbjct: 65 -VDDKPVTADKRVKDLE-------DSSNEWILEN--------WIDVRMFGATMTVQKDTK 108
Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
+ GP+ +W SL KV IT S +S T+G V Y + G
Sbjct: 109 TFI-GPIQFNWGYSLNKVELLEASIT----SHFSSSETFAQGTIGKDFRVKYSLIAFSGV 163
Query: 186 INPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR----PEGSMRVREV 230
++ + AEKT + D ++ E L N + ++ P MRV V
Sbjct: 164 VSGHRAEKTKLKEDDLYLLDEALKHAIPNLVTRSKIGQYPRIYMRVEYV 212
>gi|89895517|ref|YP_519004.1| hypothetical protein DSY2771 [Desulfitobacterium hafniense Y51]
gi|89334965|dbj|BAE84560.1| hypothetical protein [Desulfitobacterium hafniense Y51]
Length = 334
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 67/220 (30%), Positives = 103/220 (46%), Gaps = 30/220 (13%)
Query: 7 DFMVTVEVREANANGDPLSVN-----MPRTDAKDYGLMSDVSIKRKIRNRM----QDMGQ 57
+ + V++ N DPL+ + P D + +SDVSIKR +R+ + QD G+
Sbjct: 23 EILFVKSVKDGIPNRDPLNDSDARRLFPEEDGRIS--LSDVSIKRDVRDFVIALEQDGGK 80
Query: 58 P----IFVQARDRVDDCIYSLKQRLENKEFF--DTVKEGSKKKKSVENFVKQINAEWLDV 111
IFVQ ++V+D K +L + K K+KK+ E+ + DV
Sbjct: 81 DQKNHIFVQ--EKVND-----KGKLLGRGSLAEGIAKSVGKEKKAKEDMKSVLIEHCFDV 133
Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG- 170
R+FG V++ N+ GPV WA SL V TQ +Q T S ++S E E T G
Sbjct: 134 RTFGIVYSVK--PKFNLTGPVQFGWAHSLHPVDTQYVQGTVVMPS-MDSTAEGEGKTQGT 190
Query: 171 --TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVL 208
T + V + V+ + IN AE +G + D E++ L
Sbjct: 191 IWTSYTVPFAVFAMPAVINAKAAEHSGMTAEDQELLLRAL 230
>gi|150398985|ref|YP_001322752.1| CRISPR-associated protein, Csh2 family [Methanococcus vannielii SB]
gi|150011688|gb|ABR54140.1| CRISPR-associated protein, Csh2 family [Methanococcus vannielii SB]
Length = 291
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 68/275 (24%), Positives = 132/275 (48%), Gaps = 25/275 (9%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTDAKDYGL-MSDVSIKRKIRNRMQDMGQPIFVQARD 65
+F++ + AN NGD L+ N PR D L +SDV IKR IR+ G+ + VQ +
Sbjct: 6 EFLLIWDSTMANPNGDMLNDNKPRQDEATGQLEVSDVRIKRFIRDHWISNGKNVLVQTKT 65
Query: 66 RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
+ + + + ++ + E + K + + N++ + +++DV+ FG V Y
Sbjct: 66 DKNGKVMTCQGIVK-----EMASENNLKDEEIPNYLLE---KYIDVKLFGAVITKPKY-- 115
Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
++ GP+ I+W+KS+ + + MQ N+ S + +++ +K Y ++
Sbjct: 116 -DITGPLQIAWSKSVHEADVKFMQ----GNAAYASGEGKDQASIWSKFISPYALFKTYAV 170
Query: 186 INPNFAEKTGF--SDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVS 243
N AEK G SD D K+ L++ +N S+++ + + EV + ++NKL
Sbjct: 171 YNDKVAEKQGINVSDDDLNDFKDALLNGLKNYRSTSKNQMPRLLIEVIY--NNNKLDG-- 226
Query: 244 SARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEY 278
+ ++ E QD + + + +N +L+EY
Sbjct: 227 ---ELNYVDITYENQDLELRDISQVVINLGKLSEY 258
>gi|109645854|ref|ZP_01369774.1| CRISPR-associated protein, CT1132 family [Desulfitobacterium
hafniense DCB-2]
gi|109643803|gb|EAT53356.1| CRISPR-associated protein, CT1132 family [Desulfitobacterium
hafniense DCB-2]
Length = 316
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 66/213 (30%), Positives = 101/213 (47%), Gaps = 30/213 (14%)
Query: 14 VREANANGDPLSVN-----MPRTDAKDYGLMSDVSIKRKIRNRM----QDMGQP----IF 60
V++ N DPL+ + P D + +SDVSIKR +R+ + QD G+ IF
Sbjct: 12 VKDGIPNRDPLNDSDARRLFPEEDGRIS--LSDVSIKRDVRDFVIALEQDGGKDQKNHIF 69
Query: 61 VQARDRVDDCIYSLKQRLENKEFF--DTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVF 118
VQ ++V+D K +L + K K+KK+ E+ + DVR+FG V+
Sbjct: 70 VQ--EKVND-----KGKLLGRGSLAEGIAKSVGKEKKAKEDMKSVLIEHCFDVRTFGIVY 122
Query: 119 AFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG---TKHFV 175
+ N+ GPV WA SL V TQ +Q T S ++S + E T G T + V
Sbjct: 123 SVK--PKFNLTGPVQFGWAHSLHPVDTQYVQGTVVMPS-MDSTADGEGKTQGTIWTSYTV 179
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVL 208
+ V+ + IN AE +G + D E++ L
Sbjct: 180 PFAVFAMPAVINAKAAEHSGMTAEDQELLLRAL 212
>gi|114568025|ref|YP_755179.1| hypothetical protein Swol_2520 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114338960|gb|ABI69808.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 301
Score = 64.3 bits (155), Expect = 7e-09, Method: Composition-based stats.
Identities = 72/277 (25%), Positives = 117/277 (42%), Gaps = 30/277 (10%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLM-SDVSIKRKIRNRMQDMGQPIF 60
+Q+ +++ V++AN NGDPL+ N PR D M SDV +KR R+ G+ +F
Sbjct: 3 FKQRREYLFLYTVKDANPNGDPLNENHPRYDGDTAQAMASDVRVKRTTRDEWVRSGEIVF 62
Query: 61 VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
V + SLK R E KK + ++ ++I + LDVR FG FA
Sbjct: 63 VDGEPK------SLKTRFE-----------ELKKITGKSDAREIMKQCLDVRLFGVTFAL 105
Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDY--- 177
G A GPV W +SL + +Q T + +E + S ++ V +
Sbjct: 106 -GKEAFAWTGPVQFKWGRSLHSASFEFVQGTAAFATERGGADNRQRS-FRNEYLVPFALM 163
Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFE-NDASSARPEGSMRVREVFWFTHS 236
GVY I + ++DA E ++ +L L++ D R + + R + T+
Sbjct: 164 GVYAIANQY------ASQYTDAADEDLQRMLDGLWQGTDNLITRSKNEHKSRLLIEITYK 217
Query: 237 NKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQE 273
A + D+E + D + A+ QE
Sbjct: 218 EDFNGKIGALDDKVTLLDREGKVMDREKQKALRSLQE 254
>gi|89895552|ref|YP_519039.1| hypothetical protein DSY2806 [Desulfitobacterium hafniense Y51]
gi|89335000|dbj|BAE84595.1| hypothetical protein [Desulfitobacterium hafniense Y51]
Length = 287
Score = 64.3 bits (155), Expect = 8e-09, Method: Composition-based stats.
Identities = 58/183 (31%), Positives = 88/183 (48%), Gaps = 23/183 (12%)
Query: 39 MSDVSIKRKIRNRMQDMGQP--------IFVQARDRVDDCIYSLKQRLENKEFF--DTVK 88
+SDVSIKR +R+ + D+ + IFVQ ++V+D K +L + K
Sbjct: 11 LSDVSIKRDVRDFVIDLEEDGGKEQKNHIFVQ--EKVND-----KGKLLGRGSLAEGIAK 63
Query: 89 EGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSM 148
K+KK+ E+ + DVR+FG V++ N+ GPV WA SL V TQ +
Sbjct: 64 RVGKEKKAKEDMKSVLIEHCFDVRTFGIVYSVK--PKFNLTGPVQFGWAHSLHPVDTQYV 121
Query: 149 QITKSTNSELNSKTELESSTMG---TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIK 205
Q T S +S E E T G T + V + V+V+ IN A+ +G + D E++
Sbjct: 122 QGTVVMPST-DSTAEGEGKTQGTIWTSYTVPFAVFVMPAVINAKAAQHSGMTPEDQELLL 180
Query: 206 EVL 208
L
Sbjct: 181 RAL 183
>gi|153869181|ref|ZP_01998851.1| CRISPR-associated protein TM1801 [Beggiatoa sp. PS]
gi|152074276|gb|EDN71148.1| CRISPR-associated protein TM1801 [Beggiatoa sp. PS]
Length = 327
Score = 63.5 bits (153), Expect = 2e-08, Method: Composition-based stats.
Identities = 59/213 (27%), Positives = 94/213 (44%), Gaps = 23/213 (10%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYG--LMSDVSIKRKIRN--------- 50
L + + + E + N NGDPL N PRTD D G ++DV IKR +R+
Sbjct: 4 LTNRYEILFLYECTDCNPNGDPLDENRPRTDP-DTGEATITDVRIKRTVRDYFIAQEPDV 62
Query: 51 --RMQDMGQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEW 108
R+ + G+ I ++ ++ D + R K+F + EG KKK + + I ++
Sbjct: 63 EKRLAN-GKEILIRDTEKPDGTLSQGSDRA--KQFSQELTEGKKKKGDNQKLQEVILSQC 119
Query: 109 LDVRSFGQVFAF-DGYSAANVRGPVSIS-WAKSLEKVVTQSMQITKSTNSELNSKTELES 166
+D R FG G S+ + GPV S + +SL KV +Q T + SK +
Sbjct: 120 IDARLFGTAVPLGKGESSLKLTGPVQFSAFNRSLHKVSPVMVQQTAA----FASKATAQQ 175
Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDA 199
+ V Y + G +N A+ T + A
Sbjct: 176 KGFAERWLVPYALIAAYGMVNEGAAQTTHMTKA 208
>gi|109672064|ref|ZP_01374306.1| crispr-associated protein, Csh2 family [Campylobacter concisus
13826]
gi|112800899|gb|EAT98243.1| crispr-associated protein, Csh2 family [Campylobacter concisus
13826]
Length = 313
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 69/302 (22%), Positives = 132/302 (43%), Gaps = 33/302 (10%)
Query: 4 QKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIFVQ 62
QK + + + N NGD L N PR D + ++DV IKR IR+ + +
Sbjct: 2 QKKEILFLWDGENWNPNGDMLKDNAPRRDDETGVAEVTDVRIKRTIRDEIMKKDEASIFI 61
Query: 63 ARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDG 122
R++D + K +++ K++ K+I ++++D+R+FG V
Sbjct: 62 KEYRIEDALLDAKT---------AIRQSINIKQNKSELQKEILSKFIDIRAFGGVLPISD 112
Query: 123 -----------YSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNS---KTELESST 168
+ GPV +KSL KV + ++ T + +S+ + K + + +T
Sbjct: 113 KDKMKQDKEIKTAGVQFTGPVQFRLSKSLNKVEVEHVKGTGAFSSDYDPNDPKKQKDQAT 172
Query: 169 MGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVR 228
+ F+ Y ++ G I+ A KTGFS+AD I + L +N + ++ + R
Sbjct: 173 FREEEFIKYAIFATYGIIDNYNAAKTGFSEADEAKILKALWHGTKNLTTRSKIGQTPRFM 232
Query: 229 EVFWFTHSNKLGNVSSARVFDLLEFDKEKQDK--DSYEDYAIHLN--QEELAEYEAKGLQ 284
+ + G+++++ + EK+D+ S +Y I + +LA Y A +
Sbjct: 233 LIITYKDDTFAGDLNNS-----ISLKSEKEDRVIRSINEYTIDFTNLKNKLARYAANIEK 287
Query: 285 VE 286
+E
Sbjct: 288 IE 289
>gi|21226665|ref|NP_632587.1| hypothetical protein MM_0563 [Methanosarcina mazei Go1]
gi|20904948|gb|AAM30259.1| hypothetical protein MM_0563 [Methanosarcina mazei Go1]
Length = 291
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 60/232 (25%), Positives = 104/232 (44%), Gaps = 18/232 (7%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTDAKDYGL-MSDVSIKRKIRNRMQDMGQPIFVQARD 65
++++ + AN NGD L+ N PR D L +SDV IKR +R+ Q G + V+ +
Sbjct: 6 EYLLVWDSTMANPNGDMLNDNKPRHDEITGQLEVSDVRIKRFVRDEWQSRGHNVLVRTKK 65
Query: 66 RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
D + S ++ V E +K K++ + E++DVR FG V Y
Sbjct: 66 GDDGKVMSCTALIKE------VMEKAKVKEA--ELPSHLLNEYIDVRLFGAVITKPKY-- 115
Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
++ GP+ + W+KS+ + MQ NS ST+ +K+ Y ++
Sbjct: 116 -DITGPLQVMWSKSVNPAEIKFMQ----GNSAYAGGEGKSQSTIWSKYISPYAIFKTYAV 170
Query: 186 INPNFAEKTGF--SDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
N N A++ G S+ D L++ N S+++ + + EV + H
Sbjct: 171 YNDNAAKRQGIETSEKDLNEFTAALINGLINYRSTSKNQMPRLLVEVIYKEH 222
>gi|146295132|ref|YP_001178903.1| CRISPR-associated protein, Csh2 family [Caldicellulosiruptor
saccharolyticus DSM 8903]
gi|145408708|gb|ABP65712.1| CRISPR-associated protein, Csh2 family [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 294
Score = 60.8 bits (146), Expect = 8e-08, Method: Composition-based stats.
Identities = 57/234 (24%), Positives = 98/234 (41%), Gaps = 34/234 (14%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPI 59
++++ + + T + + N NGDP N PR D K+ L+SDV +KR IR+ D G PI
Sbjct: 5 IIDKNSEILFTYDAKLCNPNGDPDEENRPRMDWEKEINLVSDVRVKRYIRDYADDQGIPI 64
Query: 60 FVQARDRVDDCIYSLKQRLENKEFF--DTVKEGSKKKKSVENFVKQINAEWLDVRSFGQV 117
+V +++E K + +K + +E F+ D+R FG
Sbjct: 65 YV--------------RKIEGKSVKPEEVIKSVGEDIDELETFI--------DIRLFGAT 102
Query: 118 FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDY 177
+ + GPV +W SL KV IT S S + + +G + V Y
Sbjct: 103 IPIKKETRTYI-GPVQFNWGYSLNKVELLEASIT----SHFASDEKKQQGAIGKDYRVKY 157
Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR----PEGSMRV 227
G ++ A++T ++ D + + + A+ ++ P MRV
Sbjct: 158 SFIAFSGIVSARRAKETRLTEDDLKFLDRAMKEAIPLQATRSKIGQYPRLYMRV 211
>gi|134045809|ref|YP_001097295.1| CRISPR-associated protein, Csh2 family [Methanococcus maripaludis
C5]
gi|132663434|gb|ABO35080.1| CRISPR-associated protein, Csh2 family [Methanococcus maripaludis
C5]
Length = 292
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 55/206 (26%), Positives = 95/206 (46%), Gaps = 34/206 (16%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQD-MGQPIFVQAR 64
+ + + + N NGD + N PR D + L+SDV +KR IR+ + +G+ IF+
Sbjct: 14 EILFIYDAEKTNPNGDMDNQNKPRMDWDTNTNLVSDVRLKRYIRDYFEKYLGEEIFIT-- 71
Query: 65 DRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYS 124
+ K+ + K +++ KQ + + +DVR FG VFA +G S
Sbjct: 72 --------------------ENAKDSKDRAKQLDSNKKQ-HTDLIDVRLFGAVFAEEG-S 109
Query: 125 AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKG 184
+++ GPV +W SL +V Q S+ S +G + V Y V G
Sbjct: 110 NSHISGPVQFNWGYSLNEVELQESTTITSSFSSGTG--------VGKDYRVKYSVIAFNG 161
Query: 185 SINPNFAEKTGFSDADAEIIKEVLVS 210
+IN N A+ + S+ D ++ E +++
Sbjct: 162 AINGNAAKTSTLSEKDIVLLDEAILN 187
>gi|116753951|ref|YP_843069.1| CRISPR-associated protein, Csh2 family [Methanosaeta thermophila
PT]
gi|116665402|gb|ABK14429.1| CRISPR-associated protein, Csh2 family [Methanosaeta thermophila
PT]
Length = 321
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 54/222 (24%), Positives = 100/222 (45%), Gaps = 35/222 (15%)
Query: 2 LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM-GQPI 59
+ + + + ++R+ N NGDP+ N PR D + L++DV +KR IR+ + + G I
Sbjct: 4 VSNRSELLFIYDIRDGNPNGDPMDENKPRMDEETGVNLVTDVRLKRTIRDYLHNFKGLEI 63
Query: 60 FVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
FV+ + IY E +++ ++ K ++I +E +DVR FG V
Sbjct: 64 FVR------EIIYD--------EENGYIQDAKRRAKDFGEDQERILSECIDVRLFGGVIP 109
Query: 120 F------------DGYSAAN---VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTEL 164
+G S + GPV +SL +V + ++ T + SK +
Sbjct: 110 LEKRRQNKQKDEGEGSSKGDSITYTGPVQFKMGRSLHRVALKHIKGTGA----FASKEGM 165
Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKE 206
+T ++ + Y + + G IN N A+ T ++ D ++ E
Sbjct: 166 TQATFREEYVLPYSLILFYGIINENAAKHTALTEEDVRLLLE 207
>gi|124004088|ref|ZP_01688935.1| crispr-associated protein, Csh2 family [Microscilla marina ATCC
23134]
gi|123990667|gb|EAY30147.1| crispr-associated protein, Csh2 family [Microscilla marina ATCC
23134]
Length = 303
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 67/280 (23%), Positives = 129/280 (46%), Gaps = 29/280 (10%)
Query: 1 MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLM-SDVSIKRKIRNRMQDM---- 55
+++ + + + E+ AN NG+PL N PR D++D ++ SDV +KR +R+ +
Sbjct: 4 VIQNRSEILFLYEIENANPNGNPLDENRPRFDSEDSTIIVSDVRLKRTVRDYWYEYEGFN 63
Query: 56 ---GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVR 112
G+ IFV+ Q E + + V +G ++ K+ +++ +DVR
Sbjct: 64 GEGGKDIFVRE-----------TQYQEGDKSY--VSDGKRRAKAFGESKEKVLEACIDVR 110
Query: 113 SFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
+FG V ++ + GP +SL KV + Q T + S + +T T+
Sbjct: 111 TFGGVIPLTK-ASITLTGPTQFQMGRSLHKVEIATEQGTGA----FASGDKKSQATFRTE 165
Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
+ V Y + G IN A+ + ++AD ++ + L +N S ++ + + V
Sbjct: 166 YKVPYALIGFNGIINEKAAKYSQMTEADRALLLDGLWEGTKNLISRSKFGQTPVLMLVVN 225
Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQ 272
+ ++ LGN+ RV L+ +K + S D+ + L+Q
Sbjct: 226 YKDASHLGNLRQ-RV--ALQTEKNELALRSLNDFELDLSQ 262
>gi|108803121|ref|YP_643058.1| CRISPR-associated protein Csh2 [Rubrobacter xylanophilus DSM 9941]
gi|108764364|gb|ABG03246.1| CRISPR-associated protein Csh2 [Rubrobacter xylanophilus DSM 9941]
Length = 314
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 54/199 (27%), Positives = 87/199 (43%), Gaps = 16/199 (8%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPIFVQARD 65
D + + + N NGDP N PR D A L+SDV +KR +R+ + G+ I+V+ +
Sbjct: 7 DILYLYDAKLTNPNGDPDDENRPRMDEATGRNLVSDVRLKRYLRDYWLNAGEDIWVRRTE 66
Query: 66 RVDDCIYSLKQRLE------NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
+ + S KQR+ N+E + E ++ + F + DVR FG
Sbjct: 67 QEETT--SAKQRMSVLLEDYNRENGTNLNE--RQARQSREFKDWLLGRLRDVRLFGATMP 122
Query: 120 FDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGV 179
+ ++ GPV SW SL +V + +T S + E E T G V Y +
Sbjct: 123 MEN-TSVTFTGPVQFSWGYSLNRVEINN----SATISSHFAGRENEYGTFGKDWRVHYSL 177
Query: 180 YVIKGSINPNFAEKTGFSD 198
G ++ N A T ++
Sbjct: 178 LAFYGIVSRNRARHTRLTE 196
>gi|89211072|ref|ZP_01189450.1| CRISPR-associated protein TM1801 [Halothermothrix orenii H 168]
gi|89159297|gb|EAR78967.1| CRISPR-associated protein TM1801 [Halothermothrix orenii H 168]
Length = 302
Score = 58.2 bits (139), Expect = 7e-07, Method: Composition-based stats.
Identities = 54/205 (26%), Positives = 92/205 (44%), Gaps = 21/205 (10%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDM-GQPIFVQAR 64
+ + + + +N NGD + N PR D L+SDV +KR IR+ +Q + G+ +FV
Sbjct: 12 EILFLYDAKRSNPNGDMDNENKPRMDWDTGTNLVSDVRLKRYIRDYLQKVKGKNLFVSEE 71
Query: 65 DRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYS 124
++ EN+ ++ S K + + +I E DV FG V G
Sbjct: 72 ----------AEKAENRVHQILGRKPSSNKPVTDEELTKIAEECCDVIYFGAVLGTSG-G 120
Query: 125 AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKG 184
++ GPV +W SL KV +Q +K+ S +S +G + V Y G
Sbjct: 121 NTHLTGPVQFNWGYSLNKV---ELQESKTITSSFSS-----GEGVGKDYRVKYSFIAFSG 172
Query: 185 SINPNFAEKTGFSDADAEIIKEVLV 209
IN A+ T ++ D +++ E ++
Sbjct: 173 GINGLAAKDTKLTENDVKLLDEAII 197
>gi|154175378|ref|YP_001408164.1| crispr-associated protein, Csh2 family [Campylobacter curvus
525.92]
gi|112802844|gb|EAU00188.1| crispr-associated protein, Csh2 family [Campylobacter curvus
525.92]
Length = 306
Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats.
Identities = 69/269 (25%), Positives = 125/269 (46%), Gaps = 40/269 (14%)
Query: 18 NANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQ-PIFVQARDR---VDDCIY 72
N NGD L N PR D + + +DV IKR IR+ + + IFV+ ++ V DC
Sbjct: 16 NPNGDMLRENAPRIDDETNIAEATDVRIKRTIRDEIMKKDEGAIFVKEYNKDENVLDCKT 75
Query: 73 SLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDG---------- 122
++++ + ++ +K ++E ++I ++++D+R+FG V
Sbjct: 76 AIREVINIRQ----------EKAAIE---REILSKFIDIRAFGGVLPISDKDEMKADKEI 122
Query: 123 -YSAANVRGPVSISWAKSLEKVVTQSMQITKS--TNSELNSKTELESSTMGTKHFVDYGV 179
+ GPV +KSL +V Q ++ T + + S+ +KT E +G F YGV
Sbjct: 123 KTAGVQFTGPVQFRMSKSLHRVQIQHIKGTGAFASGSDKGAKTFREEDFLGYAIFATYGV 182
Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
+ + N A+KT FS+ DA +I L + +N + ++ + R + +
Sbjct: 183 F------DNNNAKKTNFSEDDANVILSALWNGTKNLITRSKMGQTPRFMLIITYKDDTFA 236
Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAI 268
G++++ L+ DKE + S DY I
Sbjct: 237 GDLNNT--IKLIS-DKEDEAIRSVNDYTI 262
>gi|84489232|ref|YP_447464.1| hypothetical protein Msp_0420 [Methanosphaera stadtmanae DSM 3091]
gi|84372551|gb|ABC56821.1| conserved hypothetical protein [Methanosphaera stadtmanae DSM 3091]
Length = 319
Score = 56.6 bits (135), Expect = 2e-06, Method: Composition-based stats.
Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 27/208 (12%)
Query: 7 DFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMG-QPIFVQAR 64
+ + ++ +AN NGDPL N PR D + + +++DV +KR IR+ +++ + +FV+ +
Sbjct: 5 ELLFLYDISDANPNGDPLDENKPRIDEETEINIVTDVRLKRTIRDYLEEFANEELFVKEK 64
Query: 65 DRVDDCIYSLKQRLE------NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVF 118
+ + K R E N E FD K K I + +D R FG
Sbjct: 65 AGKEGGLQDAKTRAEDYLPEGNYESFDEAKNALK---------NNILEKCIDARLFGGTI 115
Query: 119 AFD------GYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
+ + + GPV +SL KV MQ K T + SK T +
Sbjct: 116 PLELKLKKKQTGSITLTGPVQFRMGRSLHKV---DMQYIKGTGA-FASKDGKSQKTFREE 171
Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDAD 200
+ + Y + G IN N A+ T + D
Sbjct: 172 YILPYSLIAFYGVINENAAKSTNLREDD 199
>gi|124521530|ref|ZP_01696443.1| CRISPR-associated protein, CT1132 family [Bacillus coagulans 36D1]
gi|124496739|gb|EAY44321.1| CRISPR-associated protein, CT1132 family [Bacillus coagulans 36D1]
Length = 318
Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats.
Identities = 60/213 (28%), Positives = 99/213 (46%), Gaps = 30/213 (14%)
Query: 14 VREANANGDPLSVN-----MPRTDAKDYGLMSDVSIKRKIRNRMQD--------MGQPIF 60
V++ N DPL+ + P D + +SDVSIKR +R+ + D IF
Sbjct: 14 VKDGIPNRDPLNDSDARRIFPEEDGRIS--LSDVSIKRDVRDFVIDYQADGGSQQKNYIF 71
Query: 61 VQARDRVDDCIYSLKQRLENK-EFFDTVKEGSKKKKSVENFVKQINAEW-LDVRSFGQVF 118
VQ + ++ K +L + + + + K+K + +K + E DVR+FG V+
Sbjct: 72 VQEK-------FNEKGKLLGRGSLAEGIAKAVGKEKESKTDMKSVLLEHSFDVRTFGVVY 124
Query: 119 AFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG---TKHFV 175
+ N+ GPV WA S+ V +Q +Q T S +SK + E T G T + V
Sbjct: 125 SVK--PKFNLTGPVQFGWAHSMHPVDSQYVQGTVVMPST-DSKGDEEGKTQGTIWTSYTV 181
Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVL 208
+ V+ + G IN AE + ++ D E++ L
Sbjct: 182 PFAVFAMPGIINAKNAEHSQMTEEDQELLLRAL 214
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.314 0.130 0.360
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 978,677,796
Number of Sequences: 5470121
Number of extensions: 38056191
Number of successful extensions: 91415
Number of sequences better than 1.0e-05: 93
Number of HSP's better than 0.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 91097
Number of HSP's gapped (non-prelim): 104
length of query: 291
length of database: 1,894,087,724
effective HSP length: 131
effective length of query: 160
effective length of database: 1,177,501,873
effective search space: 188400299680
effective search space used: 188400299680
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 129 (54.3 bits)