BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= SMu1602 
         (291 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|24380128|ref|NP_722083.1|  hypothetical protein SMU.1760c...   541   e-152
gi|15675456|ref|NP_269630.1|  hypothetical protein SPy_1564 ...   349   1e-94
gi|71903954|ref|YP_280757.1|  hypothetical cytosolic protein...   349   1e-94
gi|56808099|ref|ZP_00365890.1|  COG3649: Uncharacterized pro...   346   8e-94
gi|94994791|ref|YP_602889.1|  hypothetical cytosolic protein...   345   1e-93
gi|94988912|ref|YP_597013.1|  hypothetical cytosolic protein...   344   3e-93
gi|15612902|ref|NP_241205.1|  hypothetical protein BH0339 [B...   254   4e-66
gi|68055788|ref|ZP_00539929.1|  CRISPR-associated protein TM...   246   1e-63
gi|124485669|ref|YP_001030285.1|  hypothetical protein Mlab_...   244   3e-63
gi|56965356|ref|YP_177088.1|  hypothetical protein ABC3594 [...   243   1e-62
gi|150389452|ref|YP_001319501.1|  CRISPR-associated protein,...   224   6e-57
gi|56477287|ref|YP_158876.1|  hypothetical protein ebA3286 [...   219   2e-55
gi|119026200|ref|YP_910045.1|  CRISPR-associated protein TM1...   212   2e-53
gi|78188974|ref|YP_379312.1|  CRISPR-associated TM1801 famil...   211   3e-53
gi|152978953|ref|YP_001344582.1|  CRISPR-associated protein,...   210   8e-53
gi|117926796|ref|YP_867413.1|  CRISPR-associated protein, Cs...   207   8e-52
gi|94269288|ref|ZP_01291422.1|  CRISPR-associated protein TM...   206   1e-51
gi|85860586|ref|YP_462788.1|  hypothetical cytosolic protein...   200   1e-49
gi|148658415|ref|YP_001278620.1|  CRISPR-associated protein,...   196   1e-48
gi|110601407|ref|ZP_01389595.1|  CRISPR-associated protein T...   196   1e-48
gi|89902663|ref|YP_525134.1|  CRISPR-associated protein TM18...   193   9e-48
gi|21673958|ref|NP_662023.1|  hypothetical protein CT1132 [C...   156   2e-36
gi|145220108|ref|YP_001130817.1|  CRISPR-associated protein,...   152   3e-35
gi|78187158|ref|YP_375201.1|  CRISPR-associated TM1801 famil...   150   8e-35
gi|85714553|ref|ZP_01045540.1|  CRISPR-associated protein [N...   147   9e-34
gi|120586829|ref|YP_961174.1|  CRISPR-associated protein, Cs...   145   2e-33
gi|46562130|ref|YP_009172.1|  CRISPR-associated TM1801 famil...   144   9e-33
gi|83592167|ref|YP_425919.1|  CRISPR-associated TM1801 famil...   142   2e-32
gi|78222285|ref|YP_384032.1|  CRISPR-associated TM1801 famil...   142   3e-32
gi|108758563|ref|YP_635130.1|  CRISPR-associated protein, Cs...   141   5e-32
gi|154506184|ref|ZP_02042922.1|  hypothetical protein RUMGNA...   140   7e-32
gi|114566030|ref|YP_753184.1|  hypothetical protein Swol_047...   139   3e-31
gi|75676129|ref|YP_318550.1|  CRISPR-associated TM1801 famil...   137   9e-31
gi|83589359|ref|YP_429368.1|  CRISPR-associated TM1801 famil...   136   1e-30
gi|150007823|ref|YP_001302566.1|  uncharacterized protein pr...   135   2e-30
gi|146284060|ref|YP_001174213.1|  CRISPR-associated protein,...   135   2e-30
gi|134298869|ref|YP_001112365.1|  CRISPR-associated protein,...   135   3e-30
gi|153091351|gb|EDN73325.1|  hypothetical protein MHA_0345 [...   135   3e-30
gi|83645584|ref|YP_434019.1|  uncharacterized protein predic...   135   4e-30
gi|91774959|ref|YP_544715.1|  CRISPR-associated protein TM18...   134   9e-30
gi|154495122|ref|ZP_02034127.1|  hypothetical protein PARMER...   134   1e-29
gi|154495727|ref|ZP_02034423.1|  hypothetical protein BACCAP...   132   2e-29
gi|67158962|ref|ZP_00419749.1|  CRISPR-associated protein TM...   132   2e-29
gi|34496682|ref|NP_900897.1|  hypothetical protein CV_1227 [...   131   4e-29
gi|118726119|ref|ZP_01574750.1|  CRISPR-associated protein, ...   131   6e-29
gi|114331017|ref|YP_747239.1|  CRISPR-associated protein, Cs...   130   1e-28
gi|94984344|ref|YP_603708.1|  CRISPR-associated protein Csd2...   130   1e-28
gi|53804985|ref|YP_113167.1|  CRISPR-associated TM1801 famil...   128   4e-28
gi|21244564|ref|NP_644146.1|  hypothetical protein XAC3840 [...   125   2e-27
gi|58580492|ref|YP_199508.1|  hypothetical protein XOO0869 [...   125   3e-27
gi|126664601|ref|ZP_01735585.1|  CRISPR-associated protein, ...   124   7e-27
gi|52425041|ref|YP_088178.1|  hypothetical protein MS0986 [M...   124   7e-27
gi|84703568|ref|ZP_01017396.1|  CRISPR-associated protein [P...   120   1e-25
gi|68549523|ref|ZP_00588986.1|  CRISPR-associated protein TM...   118   4e-25
gi|149126262|ref|ZP_01851159.1|  CRISPR-associated protein, ...   114   7e-24
gi|45658744|ref|YP_002830.1|  hypothetical protein LIC12914 ...   113   1e-23
gi|24213386|ref|NP_710867.1|  hypothetical protein LA0686 [L...   112   2e-23
gi|86742030|ref|YP_482430.1|  CRISPR-associated protein TM18...   105   3e-21
gi|119357241|ref|YP_911885.1|  CRISPR-associated protein, Cs...    96   2e-18
gi|46255206|ref|YP_006118.1|  hypothetical protein TT_P0135 ...    96   2e-18
gi|51891450|ref|YP_074141.1|  hypothetical protein STH312 [S...    92   5e-17
gi|59801386|ref|YP_208098.1|  hypothetical protein, putative...    91   7e-17
gi|147678326|ref|YP_001212541.1|  hypothetical protein PTH_1...    83   2e-14
gi|109646593|ref|ZP_01370497.1|  Uncharacterized protein pre...    80   1e-13
gi|149125882|ref|ZP_01850852.1|  CRISPR-associated protein, ...    80   2e-13
gi|125975680|ref|YP_001039590.1|  CRISPR-associated protein,...    77   2e-12
gi|154249620|ref|YP_001410445.1|  CRISPR-associated protein,...    71   1e-10
gi|145622619|ref|ZP_01778576.1|  CRISPR-associated protein, ...    69   4e-10
gi|78043120|ref|YP_360970.1|  CRISPR-associated protein, Csh...    66   2e-09
gi|114844588|ref|ZP_01455032.1|  CRISPR-associated protein T...    66   3e-09
gi|89895517|ref|YP_519004.1|  hypothetical protein DSY2771 [...    66   3e-09
gi|150398985|ref|YP_001322752.1|  CRISPR-associated protein,...    66   3e-09
gi|109645854|ref|ZP_01369774.1|  CRISPR-associated protein, ...    65   6e-09
gi|114568025|ref|YP_755179.1|  hypothetical protein Swol_252...    64   7e-09
gi|89895552|ref|YP_519039.1|  hypothetical protein DSY2806 [...    64   8e-09
gi|153869181|ref|ZP_01998851.1|  CRISPR-associated protein T...    64   2e-08
gi|109672064|ref|ZP_01374306.1|  crispr-associated protein, ...    63   2e-08
gi|21226665|ref|NP_632587.1|  hypothetical protein MM_0563 [...    62   5e-08
gi|146295132|ref|YP_001178903.1|  CRISPR-associated protein,...    61   8e-08
gi|134045809|ref|YP_001097295.1|  CRISPR-associated protein,...    60   1e-07
gi|116753951|ref|YP_843069.1|  CRISPR-associated protein, Cs...    60   2e-07
gi|124004088|ref|ZP_01688935.1|  crispr-associated protein, ...    60   2e-07
gi|108803121|ref|YP_643058.1|  CRISPR-associated protein Csh...    59   4e-07
gi|89211072|ref|ZP_01189450.1|  CRISPR-associated protein TM...    58   7e-07
gi|154175378|ref|YP_001408164.1|  crispr-associated protein,...    57   1e-06
gi|84489232|ref|YP_447464.1|  hypothetical protein Msp_0420 ...    57   2e-06
gi|124521530|ref|ZP_01696443.1|  CRISPR-associated protein, ...    55   4e-06
>gi|24380128|ref|NP_722083.1| hypothetical protein SMU.1760c [Streptococcus mutans UA159]
 gi|24378127|gb|AAN59389.1|AE015004_8 conserved hypothetical protein [Streptococcus mutans UA159]
          Length = 291

 Score =  541 bits (1393), Expect = e-152,   Method: Composition-based stats.
 Identities = 291/291 (100%), Positives = 291/291 (100%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
           MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF
Sbjct: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF
Sbjct: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVY 180
           DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVY
Sbjct: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVY 180

Query: 181 VIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLG 240
           VIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLG
Sbjct: 181 VIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLG 240

Query: 241 NVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           NVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL
Sbjct: 241 NVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
>gi|15675456|ref|NP_269630.1| hypothetical protein SPy_1564 [Streptococcus pyogenes M1 GAS]
 gi|71911101|ref|YP_282651.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS5005]
 gi|13622647|gb|AAK34351.1| conserved hypothetical protein [Streptococcus pyogenes M1 GAS]
 gi|71853883|gb|AAZ51906.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS5005]
          Length = 282

 Score =  349 bits (895), Expect = 1e-94,   Method: Composition-based stats.
 Identities = 195/292 (66%), Positives = 228/292 (78%), Gaps = 11/292 (3%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
           MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 1   MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 60

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           VQA +R++D   SL++R      F          K +E   ++ NA W DVR+FGQVF +
Sbjct: 61  VQANERIEDDFRSLEKR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 111

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
              S   VRGPVSIS AKSLE +V  S+QIT+STN  E  + +   S TMGTKHFVDYGV
Sbjct: 112 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 170

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           YV+KGSIN  FAEKTGFS  DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 171 YVLKGSINAYFAEKTGFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 230

Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           GNVSSARVFDLLE+ +  ++K +Y+ Y IHLNQE+LA+YEAKGL +EI+EGL
Sbjct: 231 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTLEILEGL 282
>gi|71903954|ref|YP_280757.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS6180]
 gi|94990878|ref|YP_598978.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10270]
 gi|71803049|gb|AAX72402.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS6180]
 gi|94544386|gb|ABF34434.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10270]
          Length = 287

 Score =  349 bits (895), Expect = 1e-94,   Method: Composition-based stats.
 Identities = 196/292 (67%), Positives = 228/292 (78%), Gaps = 11/292 (3%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
           MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 6   MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 65

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           VQA +R++D   SL++R      F          K +E   ++ NA W DVR+FGQVF +
Sbjct: 66  VQANERIEDDFRSLERR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 116

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
              S   VRGPVSIS AKSLE +V  S+QIT+STN  E  + +   S TMGTKHFVDYGV
Sbjct: 117 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 175

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           YV+KGSIN  FAEKTGFS  DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 176 YVLKGSINAYFAEKTGFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 235

Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           GNVSSARVFDLLE+ +  ++K +Y+ Y IHLNQE+LA+YEAKGL VEI+EGL
Sbjct: 236 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTVEILEGL 287
>gi|56808099|ref|ZP_00365890.1| COG3649: Uncharacterized protein predicted to be involved in DNA
           repair [Streptococcus pyogenes M49 591]
          Length = 282

 Score =  346 bits (888), Expect = 8e-94,   Method: Composition-based stats.
 Identities = 195/292 (66%), Positives = 227/292 (77%), Gaps = 11/292 (3%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
           MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 1   MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 60

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           VQA +R++D   SL++R      F          K +E   ++ NA W DVR+FGQVF +
Sbjct: 61  VQANERIEDDFRSLEKR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 111

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
              S   VRGPVSIS AKSLE +V  S+QIT+STN  E  + +   S TMGTKHFVDYGV
Sbjct: 112 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 170

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           YV+KGSIN  FAEKTGFS  DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 171 YVLKGSINAYFAEKTGFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 230

Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           GNVSSARVFDLLE+ +  ++K +Y+ Y IHLNQE+LA+YEAKGL VEI+E L
Sbjct: 231 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTVEILERL 282
>gi|94994791|ref|YP_602889.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10750]
 gi|94548299|gb|ABF38345.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10750]
          Length = 287

 Score =  345 bits (886), Expect = 1e-93,   Method: Composition-based stats.
 Identities = 194/292 (66%), Positives = 227/292 (77%), Gaps = 11/292 (3%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
           MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 6   MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 65

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           VQA +R++D   SL++R      F          K +E   ++ NA W DVR+FGQVF +
Sbjct: 66  VQANERIEDDFRSLERR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 116

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
              S   VRGPVSIS AKSLE +V  S+QIT+STN  E  + +   S TMGTKHFVDYGV
Sbjct: 117 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 175

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           YV++GSIN  FAEKTGFS  DAE IKEVLVSL ENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 176 YVLEGSINAYFAEKTGFSQEDAEAIKEVLVSLCENDASSARPEGSMRVCEVFWFTHSSKL 235

Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           GNVSSARVFDLLE+ +  ++K +Y+ Y IHLNQE+LA+YEAKGL VEI+EGL
Sbjct: 236 GNVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTVEILEGL 287
>gi|94988912|ref|YP_597013.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS9429]
 gi|94992804|ref|YP_600903.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS2096]
 gi|94542420|gb|ABF32469.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS9429]
 gi|94546312|gb|ABF36359.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS2096]
          Length = 287

 Score =  344 bits (883), Expect = 3e-93,   Method: Composition-based stats.
 Identities = 194/292 (66%), Positives = 228/292 (78%), Gaps = 11/292 (3%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
           MLE KIDFMVT+EV+EANANGDPL+ NMPRTDAK YG+MSDVSIKRKIRNR+QDMG+ IF
Sbjct: 6   MLEHKIDFMVTLEVKEANANGDPLNGNMPRTDAKGYGVMSDVSIKRKIRNRLQDMGKSIF 65

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           VQA +R++D   SL++R      F          K +E   ++ NA W DVR+FGQVF +
Sbjct: 66  VQANERIEDDFRSLEKR------FSQHFTAKTPDKEIE---EKANALWFDVRAFGQVFTY 116

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNS-ELNSKTELESSTMGTKHFVDYGV 179
              S   VRGPVSIS AKSLE +V  S+QIT+STN  E  + +   S TMGTKHFVDYGV
Sbjct: 117 LKKSIG-VRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGV 175

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           YV+KGSIN  FAEKT FS  DAE IKEVLVSLFENDASSARPEGSMRV EVFWFTHS+KL
Sbjct: 176 YVLKGSINAYFAEKTVFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKL 235

Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           GNVSSARVFDLLE+++  ++K +Y+ Y IHLNQE+LA+YEAKGL +EI+EGL
Sbjct: 236 GNVSSARVFDLLEYNQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTLEILEGL 287
>gi|15612902|ref|NP_241205.1| hypothetical protein BH0339 [Bacillus halodurans C-125]
 gi|10172952|dbj|BAB04058.1| BH0339 [Bacillus halodurans C-125]
          Length = 283

 Score =  254 bits (649), Expect = 4e-66,   Method: Composition-based stats.
 Identities = 150/292 (51%), Positives = 192/292 (65%), Gaps = 14/292 (4%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF 60
           +L+ KIDF V + V +AN NGDPL+ N PR +   +G +SDV+IKRKIRNR+ DM +PIF
Sbjct: 3   ILDHKIDFAVILSVTKANPNGDPLNGNRPRQNYDGHGEISDVAIKRKIRNRLLDMEEPIF 62

Query: 61  VQARDRVDDCIYSLKQRLE-NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
           VQ+ DR  D   SL+ R + N E    +K    K  SV+ F K    EW+DVRSFGQVFA
Sbjct: 63  VQSDDRKADSFKSLRDRADSNPELAKMLK---AKNASVDEFAKIACQEWMDVRSFGQVFA 119

Query: 120 FDGYS-AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYG 178
           F G + +  VRGPVSI  A S++ +   S QITKS NS    K    S TMG KH VD+G
Sbjct: 120 FKGSNLSVGVRGPVSIHTATSIDPIDIVSTQITKSVNSVTGDKRS--SDTMGMKHRVDFG 177

Query: 179 VYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNK 238
           VYV KGSIN   AEKTGF++ DAE IK  L++LFEND+SSARP+GSM V +V+W+ HS+K
Sbjct: 178 VYVFKGSINTQLAEKTGFTNEDAEKIKRALITLFENDSSSARPDGSMEVHKVYWWEHSSK 237

Query: 239 LGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEG 290
           LG  SSA+V   L+ + +     S++DYA+ L       YE  GL VE+I+G
Sbjct: 238 LGQYSSAKVHRSLKIESKTDTPKSFDDYAVEL-------YELDGLGVEVIDG 282
>gi|68055788|ref|ZP_00539929.1| CRISPR-associated protein TM1801 [Exiguobacterium sibiricum 255-15]
 gi|68007634|gb|EAM86884.1| CRISPR-associated protein TM1801 [Exiguobacterium sibiricum 255-15]
          Length = 285

 Score =  246 bits (629), Expect = 1e-63,   Method: Composition-based stats.
 Identities = 149/294 (50%), Positives = 184/294 (62%), Gaps = 16/294 (5%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L++KIDF V + V +AN NGDPL+ N PR +   YG +SDV+IKRKIRNR+QDMG+ IFV
Sbjct: 4   LDRKIDFTVILSVTKANPNGDPLNGNRPRQNYDGYGEISDVAIKRKIRNRLQDMGESIFV 63

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
           Q+ DR  D   SL+ R E     DT+K+  K K + +    +  A W+DVR+FGQVFAF 
Sbjct: 64  QSNDRNLDGYASLRDRAEAN---DTIKKLMKTKNASDQVAAEACATWMDVRAFGQVFAFK 120

Query: 122 GYSAANV----RGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDY 177
           G  A  V    RGPVSI  A S+  +   SMQITKS NSE  S  E  S TMG KH VD+
Sbjct: 121 GDKAGGVSVGVRGPVSIHTATSVAPIDVTSMQITKSVNSE--SGKERGSDTMGMKHRVDH 178

Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSN 237
           GVYV  GSIN   AEKT F+  DAE  K  LVSLFENDASSARPEGSM V +V+W+ H +
Sbjct: 179 GVYVFNGSINTQLAEKTNFTQEDAEKFKLALVSLFENDASSARPEGSMEVHKVYWWEHDS 238

Query: 238 KLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           KLG  SSA+V   +       +  S+ DY I ++  E       G   EI++GL
Sbjct: 239 KLGRFSSAKVHRAVSVQALTDEPKSFLDYEIKVDALE-------GFTPEILDGL 285
>gi|124485669|ref|YP_001030285.1| hypothetical protein Mlab_0847 [Methanocorpusculum labreanum Z]
 gi|124363210|gb|ABN07018.1| CRISPR-associated protein, Csd2 family [Methanocorpusculum
           labreanum Z]
          Length = 307

 Score =  244 bits (624), Expect = 3e-63,   Method: Composition-based stats.
 Identities = 150/314 (47%), Positives = 187/314 (59%), Gaps = 36/314 (11%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L  KIDF V +  + AN NGDPL+ N PRT+    G MSDV IKRK+RNR+QDMGQP+FV
Sbjct: 4   LNNKIDFAVVISAKNANPNGDPLNGNCPRTNFAGIGEMSDVCIKRKLRNRLQDMGQPVFV 63

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
           Q+ DR  D   SLK R +  E F T    +KK  + +   K    EW+DVRSFGQVFA+ 
Sbjct: 64  QSLDRRTDSFLSLKDRADGCEAFKTALADNKKDDAEKIACK----EWIDVRSFGQVFAYK 119

Query: 121 ----------------------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSEL 158
                                 +G  +  +RGPVS+  A S++ V   S QI KS N E 
Sbjct: 120 GKKQNKKNKNEQKNDSESEDGEEGCVSIGIRGPVSVHPAFSVDAVEITSQQIVKSVNGET 179

Query: 159 NSKTELE--SSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDA 216
           + K   +  S TMG KH VD+G+YV  GSIN   AEKTGFSD DAE IKE L++LFENDA
Sbjct: 180 DKKNPYKRGSDTMGLKHRVDFGLYVFYGSINCQLAEKTGFSDEDAESIKEALMTLFENDA 239

Query: 217 SSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELA 276
           SSARPEGSM V +V W+ H++K G  SSA+V   L  +K K+   S +DYAI ++  E  
Sbjct: 240 SSARPEGSMEVVQVAWWKHNSKSGQYSSAKVHRTLHVEKRKETPMSVDDYAIKIDDLE-- 297

Query: 277 EYEAKGLQVEIIEG 290
                GL+ EI EG
Sbjct: 298 -----GLKPEIFEG 306
>gi|56965356|ref|YP_177088.1| hypothetical protein ABC3594 [Bacillus clausii KSM-K16]
 gi|56911600|dbj|BAD66127.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
          Length = 285

 Score =  243 bits (620), Expect = 1e-62,   Method: Composition-based stats.
 Identities = 146/293 (49%), Positives = 195/293 (66%), Gaps = 16/293 (5%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L+ KIDF V + V +AN NGDPL+ N PR +   +G +SDV+IKRKIRNR+QDMG+P+FV
Sbjct: 4   LDHKIDFAVILSVSKANPNGDPLNGNRPRQNYDGHGEISDVAIKRKIRNRLQDMGEPVFV 63

Query: 62  QARDRVDDCIYSLKQRLE-NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           Q+ DR  D   SL++R + N E    +K    K  S ++F +    EW+DVRSFGQVFAF
Sbjct: 64  QSDDRKVDSHKSLRERADSNPELAKMLK---AKNASSDDFAQIACEEWIDVRSFGQVFAF 120

Query: 121 DGYS-AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGV 179
            G + +  VRGPVSI  A S++ +   S QITKS NS    K    S TMG KH VD+GV
Sbjct: 121 KGSNLSVGVRGPVSIHTATSIDPIDIVSTQITKSVNSVTGDKRS--SDTMGMKHRVDFGV 178

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           YV KGSIN   AEKTGF++ DAE IK+ LV+LFEND+SSARP+GSM V +V+W+ HS+KL
Sbjct: 179 YVFKGSINTQLAEKTGFTNEDAEKIKQALVTLFENDSSSARPDGSMEVHKVYWWEHSSKL 238

Query: 240 GNVSSARVFDLLEFDKEKQDKDSY--EDYAIHLNQEELAEYEAKGLQVEIIEG 290
           G  SSA+V   L+ + +     ++  E+Y++ L+  E       GL VE+++G
Sbjct: 239 GQYSSAKVHRSLKVEAKTDSPKAFDEENYSVELSDLE-------GLSVEVLDG 284
>gi|150389452|ref|YP_001319501.1| CRISPR-associated protein, Csd2 family [Alkaliphilus
           metalliredigens QYMF]
 gi|149949314|gb|ABR47842.1| CRISPR-associated protein, Csd2 family [Alkaliphilus
           metalliredigens QYMF]
          Length = 286

 Score =  224 bits (570), Expect = 6e-57,   Method: Composition-based stats.
 Identities = 136/296 (45%), Positives = 175/296 (59%), Gaps = 19/296 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
            E KIDF V + V+ AN NGDPL+ N PR +   +G +SDV IKRKIRNR QDM Q IFV
Sbjct: 4   FENKIDFAVVISVKNANPNGDPLNGNRPRENYDGFGEISDVCIKRKIRNRFQDMDQAIFV 63

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
           Q+ +R  D   SLK R       D  +E  K  K  E + K    +W+DVRSFGQVFAF 
Sbjct: 64  QSDERRTDGFRSLKDRA------DGCEELKKSSKDKEQYAKIACEKWIDVRSFGQVFAFK 117

Query: 121 ----DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELES-STMGTKHFV 175
               D   +  +RGPVSI  A S+  +   SMQITKS N E     + +S  TMG KH V
Sbjct: 118 KGGDDNSVSIGIRGPVSIHSAVSISPIEISSMQITKSVNGETGKDPDKKSPDTMGMKHRV 177

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
           ++G YVI GSIN   A KTGF+  D+E++K+ L++LFEND SSARP+GSM V +++W+ H
Sbjct: 178 EFGAYVIYGSINTQLATKTGFNQEDSELVKKALITLFENDCSSARPDGSMEVCKLYWWKH 237

Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEGL 291
           ++K+G  SSA+V   L      +      DY I    E L      GL  EI +G+
Sbjct: 238 NSKIGQYSSAKVHRTLRIAPTIEMPKDINDYNI--THETL-----DGLAPEIYDGI 286
>gi|56477287|ref|YP_158876.1| hypothetical protein ebA3286 [Azoarcus sp. EbN1]
 gi|56313330|emb|CAI07975.1| conserved hypothetical protein [Azoarcus sp. EbN1]
          Length = 280

 Score =  219 bits (558), Expect = 2e-55,   Method: Composition-based stats.
 Identities = 130/256 (50%), Positives = 162/256 (63%), Gaps = 7/256 (2%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L+ KIDF V + V+ AN NGDPL+ N PRTD    G ++DV +KRK+R+R+Q+ G  IFV
Sbjct: 4   LQNKIDFAVVIRVKHANPNGDPLNGNRPRTDYGGLGEITDVCLKRKLRDRLQETGHAIFV 63

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
           Q+ DR  D   SL+ R E+ E     KE  KK    E   K+   +W DVR+FGQVFAF 
Sbjct: 64  QSDDRKIDGEPSLRTRAES-EKNGLGKEAFKKGAKREETAKKACEKWFDVRAFGQVFAFG 122

Query: 122 GYSAAN-----VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVD 176
               AN     +RGP++I  A S E V   S QITKS + E    ++  S TMG KH VD
Sbjct: 123 KGDDANGVSIPIRGPLTIQSAFSKEPVSITSTQITKSVSGE-GDGSKRGSDTMGMKHRVD 181

Query: 177 YGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHS 236
            G+Y   GSINP  AE+TGFSDADAE IK +L  LFENDASSARP+GSM V +V W+ H+
Sbjct: 182 SGIYECFGSINPQLAERTGFSDADAETIKTILPKLFENDASSARPDGSMEVLKVVWWKHN 241

Query: 237 NKLGNVSSARVFDLLE 252
            K G  SSA+V  LL+
Sbjct: 242 CKAGQYSSAKVHRLLK 257
>gi|119026200|ref|YP_910045.1| CRISPR-associated protein TM1801 [Bifidobacterium adolescentis ATCC
           15703]
 gi|154489041|ref|ZP_02029890.1| hypothetical protein BIFADO_02351 [Bifidobacterium adolescentis
           L2-32]
 gi|118765784|dbj|BAF39963.1| CRISPR-associated protein TM1801 [Bifidobacterium adolescentis ATCC
           15703]
 gi|154083178|gb|EDN82223.1| hypothetical protein BIFADO_02351 [Bifidobacterium adolescentis
           L2-32]
          Length = 281

 Score =  212 bits (539), Expect = 2e-53,   Method: Composition-based stats.
 Identities = 135/295 (45%), Positives = 174/295 (58%), Gaps = 24/295 (8%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           LE KIDF +   V  AN NGDPL+ N PRT ++  G +SDV++KRKIRNR+QD G+ +FV
Sbjct: 4   LENKIDFAIAFAVNNANPNGDPLNGNRPRTTSEGLGEVSDVALKRKIRNRLQDAGESVFV 63

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
           Q+ DR DD   SL  R     +  T+ +  +K+K+V     ++   WLDVRSFGQVFAF 
Sbjct: 64  QSDDRSDDGAKSLSDRFNT--YLKTLPKDEQKQKNV--VFGKVCERWLDVRSFGQVFAFK 119

Query: 122 GYSAAN-----VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVD 176
                +     VRGPVSI  A S+  +    +QITKS NSE     +  S TMG K+ V 
Sbjct: 120 KAKDTDEVSIGVRGPVSIQPAFSINPIAIDDVQITKSVNSETTDSGKKSSDTMGMKYRVS 179

Query: 177 -YGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
              VYV  GSI+P  AE+TGFS  DAE IKE LV+LFEND SSARP GSM V +V WF H
Sbjct: 180 GRAVYVTYGSISPQLAERTGFSAEDAEKIKEALVTLFENDESSARPSGSMEVLDVVWFAH 239

Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEG 290
           + K G  SSA+V   +  D          D  ++++   +      GL+ E+IEG
Sbjct: 240 NCKGGQYSSAKVHRSVSVDA---------DGTVNVDDASIP-----GLRYEVIEG 280
>gi|78188974|ref|YP_379312.1| CRISPR-associated TM1801 family protein [Chlorobium chlorochromatii
           CaD3]
 gi|78171173|gb|ABB28269.1| CRISPR-associated protein TM1801 [Chlorobium chlorochromatii CaD3]
          Length = 280

 Score =  211 bits (538), Expect = 3e-53,   Method: Composition-based stats.
 Identities = 127/266 (47%), Positives = 162/266 (60%), Gaps = 19/266 (7%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQP--- 58
           L QKIDF + + V  AN NGDPL+ N PRTD   +G M+DV +KRKIRNR+ ++      
Sbjct: 4   LNQKIDFAIIMRVTNANPNGDPLNGNRPRTDLDGHGEMTDVCLKRKIRNRIMELKDKEQK 63

Query: 59  ----IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSF 114
               IFVQ  D   D   SLK R E+        E  K  K  ++  K+   +W DVR+F
Sbjct: 64  YQFDIFVQPDDSKRDSHTSLKARFES--------EIGKNVKDKDDAAKKACKKWFDVRAF 115

Query: 115 GQVFAFDGYSAAN----VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG 170
           GQ+FAFDG  ++     VRGPVSI  A S+E V   S+QITKS +       +  S TMG
Sbjct: 116 GQLFAFDGEESSGLSIPVRGPVSIHSAFSVEPVNVSSIQITKSVSGNEGKNGKRSSDTMG 175

Query: 171 TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREV 230
            KH VDYG+YV  GS+NP  AE+TGFSD DA++I E+L  LFENDASSARP+GSM V  V
Sbjct: 176 MKHRVDYGIYVTYGSMNPQLAERTGFSDEDAKVIMEILPKLFENDASSARPDGSMEVVSV 235

Query: 231 FWFTHSNKLGNVSSARVFDLLEFDKE 256
            W+ H +K G  SSA+V   L  +++
Sbjct: 236 IWWKHGSKAGKHSSAKVHKSLHVNED 261
>gi|152978953|ref|YP_001344582.1| CRISPR-associated protein, Csd2 family [Actinobacillus succinogenes
           130Z]
 gi|150840676|gb|ABR74647.1| CRISPR-associated protein, Csd2 family [Actinobacillus succinogenes
           130Z]
          Length = 279

 Score =  210 bits (535), Expect = 8e-53,   Method: Composition-based stats.
 Identities = 127/260 (48%), Positives = 167/260 (64%), Gaps = 7/260 (2%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L +KIDF + ++V  AN NGDPL+ N PRTD    G M+DV +KRKIR+R+Q  G+ IFV
Sbjct: 3   LTKKIDFALILKVTNANPNGDPLNGNRPRTDFAGIGEMTDVCLKRKIRDRLQSNGESIFV 62

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
           Q+ ++  D + SL  R ++K+      E   KK + +   K   A+WLDVRSFGQVFAF 
Sbjct: 63  QSDEKKTDGMTSLANRAKDKDV-GLGAEAFGKKANKDETAKAACAKWLDVRSFGQVFAFG 121

Query: 121 ---DGYSAA-NVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVD 176
              DG   +  VRGPV+I  A S+E V   S QITKS + E    T+  S TMG KH VD
Sbjct: 122 KSDDGAGVSIAVRGPVTIQSAFSVEPVNITSTQITKSVSGE-GDGTKRGSDTMGMKHRVD 180

Query: 177 YGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHS 236
            G+YV  G+++P  AE+TGFSD DA+ IK VL  LFE DASSARPEGSM+V ++ W+ H+
Sbjct: 181 SGIYVAFGAMSPQLAERTGFSDEDADKIKAVLTKLFEGDASSARPEGSMQVLKLIWWEHN 240

Query: 237 NKLGNVSSARVFDLLEFDKE 256
            K G  SSA+V   L+ + +
Sbjct: 241 CKSGQYSSAKVHGSLKVNTD 260
>gi|117926796|ref|YP_867413.1| CRISPR-associated protein, Csd2 family [Magnetococcus sp. MC-1]
 gi|117610552|gb|ABK46007.1| CRISPR-associated protein, Csd2 family [Magnetococcus sp. MC-1]
          Length = 272

 Score =  207 bits (526), Expect = 8e-52,   Method: Composition-based stats.
 Identities = 117/252 (46%), Positives = 159/252 (63%), Gaps = 16/252 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L  KIDF V   V+ AN NGDPL+ N PR    + G +SDV++KRK+R+R+ + G  IFV
Sbjct: 3   LSHKIDFAVIFAVKNANPNGDPLNGNRPRLTFDNLGEVSDVALKRKLRDRLLEGGHAIFV 62

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFD 121
           Q+ DR +D   SLK R +          GSK   + E   ++  A+WLDVR+FGQ+FA+ 
Sbjct: 63  QSNDRNNDGATSLKDRSDKTL-------GSKL--TSEELAQKACAQWLDVRAFGQLFAWK 113

Query: 122 GYSAAN------VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
           G   A       +RGPV+   A S+  V   S+QITKS N+E +S+ +  S TMG KH V
Sbjct: 114 GTKGAGDGVSVAIRGPVTFQSAFSIAPVDISSIQITKSVNTEGDSEKK-GSDTMGMKHRV 172

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
           D+G+Y+  GS+NP  A++T FSD DA++IK+ L  LFEND S+ARP GSM VR+V W+ H
Sbjct: 173 DHGIYLFYGSMNPQLAQRTHFSDEDAQLIKQTLPRLFENDESTARPAGSMEVRKVLWWQH 232

Query: 236 SNKLGNVSSARV 247
           +   G  SSA+V
Sbjct: 233 NCAAGQYSSAKV 244
>gi|94269288|ref|ZP_01291422.1| CRISPR-associated protein TM1801 [delta proteobacterium MLMS-1]
 gi|93451274|gb|EAT02162.1| CRISPR-associated protein TM1801 [delta proteobacterium MLMS-1]
          Length = 287

 Score =  206 bits (524), Expect = 1e-51,   Method: Composition-based stats.
 Identities = 131/276 (47%), Positives = 160/276 (57%), Gaps = 31/276 (11%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L  KIDF V   V  AN NGDPL+ N PRT  +  G +SDV IKRKIRNR+ + G+ IFV
Sbjct: 3   LSNKIDFAVIFRVVNANPNGDPLNGNRPRTIYEGNGEVSDVCIKRKIRNRLMEAGKAIFV 62

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
           Q+ D   D   SL+ R       D+V  G K    +    K+  A WLDVR+FGQ+FAF 
Sbjct: 63  QSDDNKIDEHSSLRSRA------DSVLSGIKGSDEI---AKKACATWLDVRAFGQLFAFK 113

Query: 121 --------------------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNS 160
                               D   +  +RGPVS+  A S+  V   S QITKS + E   
Sbjct: 114 AAGGQKTKAKEGEAAAGAGDDKGVSIGIRGPVSVQSAFSITPVSVTSTQITKSVSGE-GD 172

Query: 161 KTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR 220
            ++  S TMG KH VD GVYV  GS+NP  AEKTGFSDADAE IK +L  LFENDASSAR
Sbjct: 173 GSKRGSDTMGMKHRVDRGVYVFYGSMNPQLAEKTGFSDADAETIKAILPKLFENDASSAR 232

Query: 221 PEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKE 256
           PEGSM V +VFW+ H +K G  SSA+V   L  +++
Sbjct: 233 PEGSMAVEKVFWWRHDSKAGQYSSAKVHRSLSVNED 268
>gi|85860586|ref|YP_462788.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
 gi|85723677|gb|ABC78620.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
          Length = 291

 Score =  200 bits (508), Expect = 1e-49,   Method: Composition-based stats.
 Identities = 129/277 (46%), Positives = 167/277 (60%), Gaps = 30/277 (10%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRM--------- 52
           L +KIDF + + V+ AN NGDPL+ N PRTD +  G ++DV +KRKIR+R+         
Sbjct: 4   LSKKIDFAIIMSVKNANPNGDPLNGNRPRTDYEGLGEITDVCLKRKIRDRLVEQYVSLKN 63

Query: 53  --QDMGQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAE--- 107
             +  GQ IFVQ+ DR  D   SL+ R E++      K G  KK    N  K   A+   
Sbjct: 64  EEEKKGQAIFVQSDDRKIDGETSLRNRAESE------KNGLGKKAFGANAKKDETAKSAC 117

Query: 108 --WLDVRSFGQVFAF------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELN 159
             W DVR+FGQVFAF      DG S   VRGPV+I  A S + V   S QITKS + E +
Sbjct: 118 EKWFDVRAFGQVFAFGKGNDADGVSIP-VRGPVTIQSAFSRDLVSISSTQITKSVSGEGD 176

Query: 160 SKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSA 219
            K    S TMG KH VD G+YV  G++NP  AE+TGFSD DA +IK +L  +FENDASSA
Sbjct: 177 GKKR-SSDTMGMKHRVDRGIYVTYGTMNPQLAERTGFSDKDAAVIKAILPKIFENDASSA 235

Query: 220 RPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKE 256
           RPEGSM V +V W+ H++K G+ SSA+V   L+ + +
Sbjct: 236 RPEGSMEVVKVLWWQHNSKSGDYSSAKVHRSLKVNPD 272
>gi|148658415|ref|YP_001278620.1| CRISPR-associated protein, Csd2 family [Roseiflexus sp. RS-1]
 gi|148570525|gb|ABQ92670.1| CRISPR-associated protein, Csd2 family [Roseiflexus sp. RS-1]
          Length = 324

 Score =  196 bits (498), Expect = 1e-48,   Method: Composition-based stats.
 Identities = 142/339 (41%), Positives = 182/339 (53%), Gaps = 70/339 (20%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMG----- 56
           L +KIDF V + VR AN NGDPL+ N PRT  +  G +SDV+IKRKIRNR+ ++      
Sbjct: 5   LSKKIDFAVILCVRNANPNGDPLNGNRPRTTYEGLGEISDVAIKRKIRNRVMELAKEIER 64

Query: 57  ---------------------QPIFVQARDRVDDCIYSLKQRLEN--KEFFDTVKEGSKK 93
                                 PIFVQ+ D  +D   SL++R E   K+F +     + +
Sbjct: 65  IKEEERTEAQKQAYERLKGVEHPIFVQSDDNRNDEYTSLRERAEAYLKDFMN--DRSAYQ 122

Query: 94  KKSVENFVKQINAEWLDVRSFGQVFAF------------------DGYSAANVRGPVSIS 135
           KK+ E         W DVR+FGQVF F                  DG S   +RGPVSI 
Sbjct: 123 KKACET--------WFDVRAFGQVFPFKGKGKGKKGDKSNEGEESDGVSIG-IRGPVSIH 173

Query: 136 WAKSLEKVVTQ--SMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEK 193
            A S+  V+ +  S+QITKS ++E   K    S TMG KH VD+GVYV  GSINP  A K
Sbjct: 174 PAFSVVPVLDRVSSIQITKSVSNEPGEKRG--SDTMGMKHRVDHGVYVFYGSINPQLASK 231

Query: 194 TGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEF 253
           TGFSD DA +IKE L +LF NDASSARPEGS+ V  V+W+ H++  G  SSA+V   L  
Sbjct: 232 TGFSDDDAAVIKEALRTLFRNDASSARPEGSIEVYRVYWWKHNSPNGQYSSAKVHRSLRV 291

Query: 254 DKEK--QDKDSYEDYAIHLNQEELAEYEAKGLQVEIIEG 290
                 +D  S +DY I L        +  GL+ E+IEG
Sbjct: 292 RVRDGIEDPKSIDDYVIELQ-------DLDGLKPEVIEG 323
>gi|110601407|ref|ZP_01389595.1| CRISPR-associated protein TM1801 [Geobacter sp. FRC-32]
 gi|110547870|gb|EAT61108.1| CRISPR-associated protein TM1801 [Geobacter sp. FRC-32]
          Length = 288

 Score =  196 bits (498), Expect = 1e-48,   Method: Composition-based stats.
 Identities = 126/264 (47%), Positives = 151/264 (57%), Gaps = 28/264 (10%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIFV 61
           L  KIDF V  +V  AN NGDPL+ N PRT  +  G +SDV IKRKIRNR+ + GQ IFV
Sbjct: 3   LSNKIDFAVVFKVTNANPNGDPLNGNRPRTIYEGNGEVSDVCIKRKIRNRLMEAGQRIFV 62

Query: 62  QARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF- 120
           Q+ D  +D   SLK R +  E  D +K   +K K            W DVR+FGQ+FAF 
Sbjct: 63  QSDDSKNDKHPSLKSRAD--EVLDGIKAADEKAKKA-------CETWFDVRAFGQLFAFK 113

Query: 121 -----------------DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTE 163
                            D   +  +RGPVS+  A S+  V   S QITKS + E    T+
Sbjct: 114 AAGGKKGKAKEGEEPGDDKGVSIGIRGPVSVQSAFSISPVSLTSTQITKSVSGE-GDGTK 172

Query: 164 LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEG 223
             S TMG KH VD G+Y   GS+NP  A KTGFSDADA  IK VL  LFENDASSARPEG
Sbjct: 173 RGSDTMGMKHRVDRGIYTFYGSMNPQLAVKTGFSDADAAAIKAVLPRLFENDASSARPEG 232

Query: 224 SMRVREVFWFTHSNKLGNVSSARV 247
           SM V +V W+ H+   G  SSA+V
Sbjct: 233 SMEVLKVVWWQHNCASGQCSSAKV 256
>gi|89902663|ref|YP_525134.1| CRISPR-associated protein TM1801 [Rhodoferax ferrireducens T118]
 gi|89347400|gb|ABD71603.1| CRISPR-associated protein TM1801 [Rhodoferax ferrireducens DSM
           15236]
          Length = 294

 Score =  193 bits (491), Expect = 9e-48,   Method: Composition-based stats.
 Identities = 123/273 (45%), Positives = 154/273 (56%), Gaps = 37/273 (13%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRM--------- 52
           L+ KIDF V + V+ AN NGDPL+ N PRTD  ++G M+DVSIKRKIR+R+         
Sbjct: 4   LQNKIDFAVILRVKNANPNGDPLNGNRPRTDYSNFGEMTDVSIKRKIRDRLLERWVAAGK 63

Query: 53  QDMGQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVR 112
            D G  IFVQ+ DR  D   SL+ R E             K    +       ++WLDVR
Sbjct: 64  ADDGNMIFVQSDDRKADEYKSLRARAEAV---------LGKALGSDQTALLACSKWLDVR 114

Query: 113 SFGQVFAFDGYSAAN------------------VRGPVSISWAKSLEKVVTQSMQITKST 154
           +FGQ+FA      A                   +RGPV++  A S+E +   S QITKS 
Sbjct: 115 AFGQLFALKSNKKAGKKNDDGSDDEGDTGVSIGIRGPVTVQSAFSVEPIDITSTQITKSV 174

Query: 155 NSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFEN 214
           + E    T+  S TMGTKH VD G+Y   GS+NP  AEKTGFSDADA+ +K VL  LFEN
Sbjct: 175 SGE-GDGTKRSSDTMGTKHRVDQGIYRFFGSMNPQLAEKTGFSDADAQALKAVLPKLFEN 233

Query: 215 DASSARPEGSMRVREVFWFTHSNKLGNVSSARV 247
           D SSARP GSM V +V W+ H+ K G  SSA+V
Sbjct: 234 DESSARPAGSMEVVKVIWWQHNCKSGQYSSAKV 266
>gi|21673958|ref|NP_662023.1| hypothetical protein CT1132 [Chlorobium tepidum TLS]
 gi|21647100|gb|AAM72365.1| CRISPR-associated protein, TM1801 family [Chlorobium tepidum TLS]
          Length = 299

 Score =  156 bits (394), Expect = 2e-36,   Method: Composition-based stats.
 Identities = 101/295 (34%), Positives = 180/295 (61%), Gaps = 26/295 (8%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           ++++ DF+V  +V++ N NGDP + N+PR DA+   GL++DV +KRK+RN +Q +GQ IF
Sbjct: 4   VDKRYDFVVLFDVQDGNPNGDPDAGNLPRIDAETGMGLVTDVCLKRKVRNYVQLLGQDIF 63

Query: 61  VQAR----DRVDDCIYSLKQRLENKEFFDTVKEGSKKKK-------SVENFVKQINAEWL 109
           ++ +    +++D+   +L   L N    D+ K+GSK+ K        V+    Q+  ++ 
Sbjct: 64  IKEKAILNNKIDEAYKALNIDL-NAAPADS-KDGSKRNKPGVAQGGEVDKGRVQMCTKYY 121

Query: 110 DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELES 166
           D+R+FG V +  G +A  VRGP+ +++A+S+E VV     IT+   +T +E   ++  ++
Sbjct: 122 DIRAFGAVMS-TGANAGQVRGPIQMTFARSVEPVVALEHSITRMAVATEAEAEKQSG-DN 179

Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
            TMG K+ V YG+Y   G ++ N A +TGFS  D ++  + L+++FE+D S+AR  G M 
Sbjct: 180 RTMGRKYTVPYGLYRAHGFVSANLASQTGFSAEDLDLFWDALLNMFEHDRSAAR--GLMS 237

Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAE 277
            R ++ F HS+ LGN  ++++F+ +   K K+D +    S++DY + +++  L E
Sbjct: 238 TRGLYVFEHSSALGNAPASQLFERITV-KRKEDSEGPARSFKDYEVLVDESNLGE 291
>gi|145220108|ref|YP_001130817.1| CRISPR-associated protein, Csd2 family [Prosthecochloris
           vibrioformis DSM 265]
 gi|145206272|gb|ABP37315.1| CRISPR-associated protein, Csd2 family [Prosthecochloris
           vibrioformis DSM 265]
          Length = 282

 Score =  152 bits (384), Expect = 3e-35,   Method: Composition-based stats.
 Identities = 95/281 (33%), Positives = 167/281 (59%), Gaps = 19/281 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           ++++ DF++  +V++ N NGDP + N+PR DA+   GL+SDV +KRK+RN +Q  GQ IF
Sbjct: 4   IKKRYDFVLLFDVQDGNPNGDPDAGNLPRIDAETGMGLVSDVCLKRKVRNYVQLAGQEIF 63

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           ++ +  ++  I    ++ E       VK  +K  K+ E     + +++ D+R+FG V + 
Sbjct: 64  IKEKGVLNTLIAESHEQPE-------VKSKTKGDKT-EAARSWMCSKYYDIRTFGAVMS- 114

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTKHFVDY 177
            G +A  VRGPV I++A+S+E VV     IT+   +T +E   K +  + TMG K+ + Y
Sbjct: 115 TGENAGQVRGPVQITFARSVEPVVALEHSITRMAVATEAEA-EKQDGGNRTMGRKYTIPY 173

Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSN 237
           G+Y+  G ++ N A +TGFS+ D ++  + L+++FE+D S+AR  G M  R ++ F H++
Sbjct: 174 GLYLAHGFVSANLANQTGFSEKDLQLFWDALLNMFEHDRSAAR--GMMSTRGLYIFEHNS 231

Query: 238 KLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQEEL 275
            LGN  +  +F+ +   ++        S+ +Y+I +NQ  L
Sbjct: 232 ALGNAPAHSLFERISVSRKNPASGPARSFAEYSIDINQANL 272
>gi|78187158|ref|YP_375201.1| CRISPR-associated TM1801 family protein [Pelodictyon luteolum DSM
           273]
 gi|78167060|gb|ABB24158.1| CRISPR-associated protein TM1801 [Pelodictyon luteolum DSM 273]
          Length = 299

 Score =  150 bits (379), Expect = 8e-35,   Method: Composition-based stats.
 Identities = 94/289 (32%), Positives = 172/289 (59%), Gaps = 18/289 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           L ++ DF +  +V++ N NGDP + N+PR DA+   GL++DV +KRK+RN +Q  G+ IF
Sbjct: 4   LTKRYDFALLFDVQDGNPNGDPDAGNLPRIDAETGMGLVTDVCLKRKVRNYVQLSGKDIF 63

Query: 61  VQARDRVDDCIYSL--KQRLENKEFFDTVKEGSKKKK-------SVENFVKQINAEWLDV 111
           ++ +  ++  I +   +Q+++  +    +K+G K+ K        VE     + + + D+
Sbjct: 64  IKEKAVLNTLISNAYEEQKIDLTKDPVDLKDGKKRNKDGTAQGGEVEKGRSYMCSRYYDI 123

Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK-STNSELNS-KTELESSTM 169
           R+FG V +  G +A  VRGP+ I++A+S+E VV     IT+ +  +E ++ K   ++ TM
Sbjct: 124 RTFGAVMS-TGANAGQVRGPIQITFARSVEPVVALEHSITRMAVTTEADAEKQSGDNRTM 182

Query: 170 GTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVRE 229
           G K+ V YG+Y   G ++ + A +TGFS  D ++  E L ++FE+D S+AR  G M  R 
Sbjct: 183 GRKYTVPYGLYCSHGFVSAHLANQTGFSAEDLKLFWEALQNMFEHDRSAAR--GMMSTRG 240

Query: 230 VFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQEEL 275
           ++ F HS  LGN  + ++F+ ++ +++ + +    S+EDY + +++  L
Sbjct: 241 LYVFEHSTALGNAPAHKLFERIKVERKPESEGPARSFEDYTVTIDESGL 289
>gi|85714553|ref|ZP_01045540.1| CRISPR-associated protein [Nitrobacter sp. Nb-311A]
 gi|85698438|gb|EAQ36308.1| CRISPR-associated protein [Nitrobacter sp. Nb-311A]
          Length = 317

 Score =  147 bits (370), Expect = 9e-34,   Method: Composition-based stats.
 Identities = 108/320 (33%), Positives = 171/320 (53%), Gaps = 44/320 (13%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
           L  + DF++  +V   N NGDP + N+PR D + ++GL+SDVS+KRK+RN     R    
Sbjct: 4   LANRYDFVLLFDVMRGNPNGDPDAGNLPRLDPETNHGLVSDVSLKRKVRNYIEFARNGVA 63

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQIN----AEWLDV 111
           G  I+VQ    +++     K  L  +   D V + +K     E+   ++       + DV
Sbjct: 64  GFNIYVQEGAILNE--QHRKAYLAVRPGDDKVAKDTKLNPKSEDEAARLRDFMCRNFFDV 121

Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKT--------- 162
           R+FG V +  G +A  VRGPV +++A+S+E +V Q + IT+   +    KT         
Sbjct: 122 RAFGAVMS-TGINAGQVRGPVQMTFAQSVEPIVPQEITITRMAATTPAEKTLRAEGQEEG 180

Query: 163 --ELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR 220
               ++ TMG K+ V YG+Y   G ++   AE+TGFSDAD + ++E L S+FE+D S+AR
Sbjct: 181 NDRTDNRTMGRKYIVPYGLYRSHGFVSAKLAERTGFSDADLDALREALTSMFEHDRSAAR 240

Query: 221 PEGSMRVREVFWFTHSNKLGNVSSARVFDLL--------EFDKEKQDKDS------YEDY 266
             G M +R+   F H+N LGN  +  +FD +        EF K  +  D+      + DY
Sbjct: 241 --GEMAMRKAIAFKHANPLGNAPAHELFDRVKVGRGVDDEFRKIDRRLDNLPPAREFADY 298

Query: 267 AIHLNQEELAEYEAKGLQVE 286
           AI +++ EL E    G+++E
Sbjct: 299 AIEIDRNELPE----GVEIE 314
>gi|120586829|ref|YP_961174.1| CRISPR-associated protein, Csd2 family [Desulfovibrio vulgaris
           subsp. vulgaris DP4]
 gi|120564243|gb|ABM29986.1| CRISPR-associated protein, Csd2 family [Desulfovibrio vulgaris
           subsp. vulgaris DP4]
          Length = 290

 Score =  145 bits (367), Expect = 2e-33,   Method: Composition-based stats.
 Identities = 96/283 (33%), Positives = 160/283 (56%), Gaps = 19/283 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQ--- 57
           +  + +F++  +V   N NGDP + NMPR D +  +GL++DV +KRKIRN +    +   
Sbjct: 4   IANRYEFVLLFDVENGNPNGDPDAGNMPRIDPETGHGLVTDVCLKRKIRNHVALTKEGAE 63

Query: 58  --PIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
              I++Q +  +++         + K     + +  +  KSV ++   +   + D+R+FG
Sbjct: 64  RFNIYIQEKAILNETHERAYTACKLKPEPKKLPKKVEDAKSVTDW---MCTNFYDIRTFG 120

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
            V   +  +   VRGPV +++A+S+E VV Q + IT+   +T +E   K + ++ TMG K
Sbjct: 121 AVMTTE-VNCGQVRGPVQMAFARSVEPVVPQEVSITRMAVTTKAEA-EKQQGDNRTMGRK 178

Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
           H V YG+YV  G I+   AEKTGFSD D  +  + LV++FE+D S+AR  G M  R++  
Sbjct: 179 HIVPYGLYVAHGFISAPLAEKTGFSDEDLTLFWDALVNMFEHDRSAAR--GLMSSRKLIV 236

Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQ 272
           F H NKLGN  + ++FDL++  + +       S+ DYA+ + Q
Sbjct: 237 FKHQNKLGNAPAHKLFDLVKVSRAEGSSGPARSFADYAVTVGQ 279
>gi|46562130|ref|YP_009172.1| CRISPR-associated TM1801 family protein [Desulfovibrio vulgaris
           subsp. vulgaris str. Hildenborough]
 gi|46447667|gb|AAS94333.1| CRISPR-associated protein, TM1801 family [Desulfovibrio vulgaris
           subsp. vulgaris str. Hildenborough]
          Length = 290

 Score =  144 bits (362), Expect = 9e-33,   Method: Composition-based stats.
 Identities = 94/283 (33%), Positives = 159/283 (56%), Gaps = 19/283 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQ--- 57
           +  + +F++  +V   N NGDP + NMPR D +  +GL++DV +KRKIRN +    +   
Sbjct: 4   IANRYEFVLLFDVENGNPNGDPDAGNMPRIDPETGHGLVTDVCLKRKIRNHVALTKEGAE 63

Query: 58  --PIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
              I++Q +  +++         + K     + +  +  K V ++   +   + D+R+FG
Sbjct: 64  RFNIYIQEKAILNETHERAYTACDLKPEPKKLPKKVEDAKRVTDW---MCTNFYDIRTFG 120

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
            V   +  +   VRGPV +++A+S+E VV Q + IT+   +T +E   K + ++ TMG K
Sbjct: 121 AVMTTE-VNCGQVRGPVQMAFARSVEPVVPQEVSITRMAVTTKAEA-EKQQGDNRTMGRK 178

Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
           H V YG+YV  G I+   AEKTGFSD D  +  + LV++FE+D S+AR  G M  R++  
Sbjct: 179 HIVPYGLYVAHGFISAPLAEKTGFSDEDLTLFWDALVNMFEHDRSAAR--GLMSSRKLIV 236

Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKD---SYEDYAIHLNQ 272
           F H N+LGN  + ++FDL++  + +       S+ DYA+ + Q
Sbjct: 237 FKHQNRLGNAPAHKLFDLVKVSRAEGSSGPARSFADYAVTVGQ 279
>gi|83592167|ref|YP_425919.1| CRISPR-associated TM1801 family protein [Rhodospirillum rubrum ATCC
           11170]
 gi|83575081|gb|ABC21632.1| CRISPR-associated protein TM1801 [Rhodospirillum rubrum ATCC 11170]
          Length = 317

 Score =  142 bits (358), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 107/323 (33%), Positives = 163/323 (50%), Gaps = 42/323 (13%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
           L Q+ DF+V  +V   N NGDP + N PR D + ++GL+SDV +KRKIRN     + +D 
Sbjct: 4   LTQRHDFVVLFDVTNGNPNGDPDAGNTPRLDPETNHGLVSDVCLKRKIRNYVELAKGEDS 63

Query: 56  GQPIFVQARDRVDDCIYS--LKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRS 113
           G  I+VQ    ++D      +  R + ++     K   +     +     + A + DVR+
Sbjct: 64  GFHIYVQEGAILNDQHRKAYVALRPDKEKAAKEAKLNPQNDDEAKALRAFMCANFFDVRT 123

Query: 114 FGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKT----------E 163
           FG V +  G +   VRGPV  S+A+S+E +V   + IT+   +    KT           
Sbjct: 124 FGAVMS-TGINCGQVRGPVQFSFARSIEPIVPLEISITRMAATNEKEKTAQREGQEGDER 182

Query: 164 LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEG 223
            E+ TMG KH + YG+Y   G I+   AE+TGF + D +++ E +  +FE+D S+AR  G
Sbjct: 183 TENRTMGRKHIIPYGLYRAHGFISAKLAERTGFDEGDLDLLLEAVEQMFEHDRSAAR--G 240

Query: 224 SMRVREVFWFTHSNKLGNVSSARVFDLLE----FDKEKQDKD-----------SYEDYAI 268
            M VR++  F H+N LGN  +  +FD +      D E +  D           S+ DY +
Sbjct: 241 EMAVRKLIVFRHANALGNAPAHSLFDRVTVGRVIDGEVRAVDDPAIDNRPPARSFGDYRV 300

Query: 269 HLNQEELAEYEAKGLQVEIIEGL 291
            + +E L E       VEIIE L
Sbjct: 301 TIGREGLPE------GVEIIERL 317
>gi|78222285|ref|YP_384032.1| CRISPR-associated TM1801 family protein [Geobacter metallireducens
           GS-15]
 gi|78193540|gb|ABB31307.1| CRISPR-associated protein TM1801 [Geobacter metallireducens GS-15]
          Length = 284

 Score =  142 bits (357), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 92/282 (32%), Positives = 162/282 (57%), Gaps = 21/282 (7%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQ-----DM 55
           ++ + DF++  +V++ N NGDP + N+PR D +  +GL++DV +KRK+RN +Q       
Sbjct: 3   IQNRYDFVLFFDVKDGNPNGDPDAGNLPRIDPETGHGLVTDVCLKRKVRNYVQLDKELSQ 62

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
           G  IFV+ +  +++ I    ++   K      KE  +K ++   F   +  ++ DVR+FG
Sbjct: 63  GYDIFVKEKAILNNLIDEAHEQENVK-----AKEKGEKTEAARQF---MCGKYFDVRTFG 114

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
            V +  G +A  VRGPV +++A+S++++V     IT+   +T +E   K + ++ TMG K
Sbjct: 115 AVMS-TGKNAGQVRGPVQLTFARSVDQIVPLEHSITRMAVATPAEA-EKQDGDNRTMGRK 172

Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
             V Y +Y   G I+   A +TGFS+ D E+  + LV++FE+D S+AR  G M  R++  
Sbjct: 173 FTVPYALYRCHGFISAPLAAQTGFSEEDLELFWQSLVNMFEHDRSAAR--GQMSARKLIV 230

Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQEE 274
           F H +K+GN  +  +FDL+  +  +    S++ Y I +  +E
Sbjct: 231 FKHDSKMGNAPAHHLFDLVSAEASETPVRSFDQYDICVPTQE 272
>gi|108758563|ref|YP_635130.1| CRISPR-associated protein, Csd3 family [Myxococcus xanthus DK 1622]
 gi|108462443|gb|ABF87628.1| CRISPR-associated protein, Csd3 family [Myxococcus xanthus DK 1622]
          Length = 304

 Score =  141 bits (355), Expect = 5e-32,   Method: Composition-based stats.
 Identities = 101/298 (33%), Positives = 169/298 (56%), Gaps = 28/298 (9%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRN--RMQDMGQP 58
           L+Q+ DF++  +V + N NGDP + N+PR DA+  +GL++DVS+KRK+RN   +   G+P
Sbjct: 4   LKQRHDFVLFFDVLDGNPNGDPDAGNLPRIDAETGHGLVTDVSLKRKVRNFVLLTQEGKP 63

Query: 59  ---IFVQAR----DRVDDCIYSLKQRLENKEFFDTVKEGSKKK-------KSVENFVKQI 104
              IFV+ +    +R+ D   SL   L  K      ++G K+          VE     +
Sbjct: 64  GLDIFVKEKAILNNRIADGYKSLGIDLNEKP--ARAEDGKKRNDKGRAQGSEVEKGRAWM 121

Query: 105 NAEWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSK 161
              + DVR+FG V +  G +A  VRGPV +++A+S++ +V+Q   IT+   +T  E   K
Sbjct: 122 CKTFFDVRTFGAVMS-TGPNAGQVRGPVQLTFARSVDPIVSQEHSITRMAVATEDEA-EK 179

Query: 162 TELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARP 221
              ++ TMG K+ V YG+Y   G ++P+ A++TGF  AD E++ +    +FE D S+AR 
Sbjct: 180 QGGDNRTMGRKNTVPYGLYRAHGFVSPHLAKQTGFGTADLELLFQSFTHMFELDRSAAR- 238

Query: 222 EGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD--SYEDYAIHLNQEELAE 277
            G M +R+V  F H ++LGN  +  +FD +   + + +K   S+ DY + +++  L +
Sbjct: 239 -GLMSMRKVVVFKHGSELGNAPAHALFDRVLAVRAQPEKPARSFSDYEVRVDKAGLPQ 295
>gi|154506184|ref|ZP_02042922.1| hypothetical protein RUMGNA_03726 [Ruminococcus gnavus ATCC 29149]
 gi|153793683|gb|EDN76103.1| hypothetical protein RUMGNA_03726 [Ruminococcus gnavus ATCC 29149]
          Length = 303

 Score =  140 bits (354), Expect = 7e-32,   Method: Composition-based stats.
 Identities = 93/296 (31%), Positives = 176/296 (59%), Gaps = 23/296 (7%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM---- 55
           +++ + +F+V  +V   N NGDP + NMPR D +   GL++DV +KRKIRN ++ +    
Sbjct: 4   VIKNRYEFVVLFDVENGNPNGDPDAGNMPRIDPESGLGLVTDVCLKRKIRNYIETVKEDA 63

Query: 56  -GQPIFVQ-----ARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAE-W 108
            G  I+++      R   + C     +  + K+  +++K+  K  ++V+  ++    + +
Sbjct: 64  EGYKIYIKDDVPLNRSDREACASVGVEETDEKKVTESLKKLKKNDENVDVKIRDYMCQNF 123

Query: 109 LDVRSFGQV---FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK-STNSELNSKTEL 164
            D+R+FG V   F     +   VRGPV + +A+S++ +V+Q + IT+ +  +E ++  + 
Sbjct: 124 FDIRTFGAVMTTFVKAALNCGQVRGPVQLGFARSIDPIVSQEVTITRVAITTEKDAANK- 182

Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEK-TGFSDADAEIIKEVLVSLFENDASSARPEG 223
            S+ MG K  V YG+Y ++G ++ N A K TGFS+ D E++ E ++++FE+D S+AR  G
Sbjct: 183 -STEMGRKSVVPYGLYRVEGYVSANLARKVTGFSEEDLELLWEAIINMFEHDHSAAR--G 239

Query: 224 SMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDK--DSYEDYAIHLNQEELAE 277
           +M VRE+  F HS +LG+  + ++FD +E  K+++ +   SY DY + +++E + +
Sbjct: 240 NMAVRELIVFKHSKELGDCPAYKLFDSVEVRKKEEVEYPRSYRDYIVEVHEENIPD 295
>gi|114566030|ref|YP_753184.1| hypothetical protein Swol_0478 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
 gi|114336965|gb|ABI67813.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
          Length = 288

 Score =  139 bits (349), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 90/284 (31%), Positives = 160/284 (56%), Gaps = 16/284 (5%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQ--- 57
           ++ + +F++  +V   N NGDP + NMPR D +  +GL+SDV IKRKIRN +  + +   
Sbjct: 5   IKNRYEFVLFFDVENGNPNGDPDADNMPRIDPETSFGLVSDVCIKRKIRNYVALLKENED 64

Query: 58  --PIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
              I+VQ +  +++     K+  E+ +     K+  K     E   + +   + D+R+FG
Sbjct: 65  DYQIYVQEKAVLNN---QHKKAYEHFKIKPESKKLPKDTAQAEAITQFMCKNFYDIRTFG 121

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
            V   +  +   VRGPV + +++SL+ +V Q + IT+   +  N +   +  TMG KH V
Sbjct: 122 AVMTTE-VNCGQVRGPVQLGFSRSLDPIVPQEITITRMAVT--NERDLEKERTMGRKHIV 178

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
           +Y +Y  +G I+   A+KTGFS+ D E++ + L+++F++D S+AR  G M  R+++ F H
Sbjct: 179 NYALYRAEGFISAPLADKTGFSEEDLELLWDALINMFDHDRSAAR--GKMSSRKLYVFKH 236

Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKD--SYEDYAIHLNQEELAE 277
            +KLGN  ++ +FD +   K   +K   ++ DY I +   E+ +
Sbjct: 237 DSKLGNAPASILFDAITVKKINGNKPVRNFSDYEISVTGNEIPQ 280
>gi|75676129|ref|YP_318550.1| CRISPR-associated TM1801 family protein [Nitrobacter winogradskyi
           Nb-255]
 gi|74420999|gb|ABA05198.1| CRISPR-associated protein TM1801 [Nitrobacter winogradskyi Nb-255]
          Length = 316

 Score =  137 bits (344), Expect = 9e-31,   Method: Composition-based stats.
 Identities = 104/325 (32%), Positives = 171/325 (52%), Gaps = 49/325 (15%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQD 54
           ML  + DF++  +V + N NGDP + N+PR D + ++GL+SDVS+KRK+RN     R   
Sbjct: 3   MLTNRYDFVLLFDVTKGNPNGDPDAGNLPRLDPETNHGLVSDVSLKRKVRNYVDLVRSGT 62

Query: 55  MGQPIFVQARDRVDDCIYSLKQRLE------NKEFFDTVKEGSKKKKSVENFVKQINAEW 108
            G  I+V+    ++D      + L       +KE     ++  + KK  E   K     +
Sbjct: 63  DGHHIYVEEAAILNDKHRQAYKALRPDDPKVDKEAKLNPRDDVEAKKLREFMCKN----F 118

Query: 109 LDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE-- 163
            DVR+FG V +  G +A  VRGPV +++A S+E +V Q + IT+   +  +E   + E  
Sbjct: 119 FDVRTFGAVMS-TGINAGQVRGPVQMTFANSVEPIVPQEISITRMAATNEAEKKQRAEGG 177

Query: 164 ------LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDAS 217
                 +++ TMG K+ V YG+Y   G ++   AE+TGFS+AD E+  E L S+FE+D S
Sbjct: 178 EEGNDRVDNRTMGRKYIVPYGLYRAHGFVSAKLAERTGFSEADLELTFEALTSMFEHDRS 237

Query: 218 SARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDK---------EKQDK----DSYE 264
           +AR  G M  R++  F H N LG+  +  +F+ +   +         ++ D      ++ 
Sbjct: 238 AAR--GEMTTRKLVVFKHGNALGSAPAHALFERVRIGRNIDGQFRRIDRSDNYPPARAFS 295

Query: 265 DYAIHLNQEELAEYEAKGLQVEIIE 289
           DYA+ ++++   +       VEIIE
Sbjct: 296 DYAVEIDRDNPPD------GVEIIE 314
>gi|83589359|ref|YP_429368.1| CRISPR-associated TM1801 family protein [Moorella thermoacetica
           ATCC 39073]
 gi|83572273|gb|ABC18825.1| CRISPR-associated protein TM1801 [Moorella thermoacetica ATCC
           39073]
          Length = 297

 Score =  136 bits (343), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 90/291 (30%), Positives = 155/291 (53%), Gaps = 30/291 (10%)

Query: 3   EQKIDFMVTVEVREANANGDPLSVNMPRTDAKDY-GLMSDVSIKRKIRN-----RMQDMG 56
           E + DF++  +VR+ N NGDP + N+PR D +   GL++DV +KRKIR+     R  +  
Sbjct: 8   EVRHDFVLLFDVRDGNPNGDPDAGNLPRLDPETMQGLVTDVCLKRKIRDWVDMTRGSEAN 67

Query: 57  QPIFVQARDRVDDCIYSLKQRLENKEFFDTVKE---GSKKKKSVENFVKQ-INAEWLDVR 112
             I+VQ    ++          +++  +D + E   GSK+ + + +  +Q +   + D+R
Sbjct: 68  MKIYVQHHGILN---------AQHQRAYDAIGEKSTGSKQNREIVDKARQWMCQNFYDIR 118

Query: 113 SFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELES------ 166
            FG V    G +   VRGP+ +++A+S++ +V   + IT+   + +      E       
Sbjct: 119 MFGAVMT-TGVNCGQVRGPMQLTFARSIDPIVPLDISITRVAITRVEDAATSEQGEGGKV 177

Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
           + MG K  V YG+Y+  G  NP+FA  TG S AD EI  E L  +++ D S++R  G M 
Sbjct: 178 TEMGRKTLVPYGLYLGYGFFNPHFAADTGVSAADLEIFWEALQRMWDVDRSASR--GMMA 235

Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDK--DSYEDYAIHLNQEEL 275
            R ++ F+H++ LGN  +  +F L+   +    K   S+ DY + +N+E+L
Sbjct: 236 CRGLYIFSHASALGNAPADNLFKLITVKRRDGVKAARSFADYQVTINEEDL 286
>gi|150007823|ref|YP_001302566.1| uncharacterized protein predicted to be involved in DNA repair
           [Parabacteroides distasonis ATCC 8503]
 gi|149936247|gb|ABR42944.1| uncharacterized protein predicted to be involved in DNA repair
           [Parabacteroides distasonis ATCC 8503]
          Length = 283

 Score =  135 bits (341), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 90/284 (31%), Positives = 160/284 (56%), Gaps = 24/284 (8%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQ-----DM 55
           ++ +IDF+   +V++ N NGDP + N+PR DA+   GL++DV +KRK+RN +Q     + 
Sbjct: 4   IKNRIDFVYIFDVQDGNPNGDPDAGNLPRVDAETGMGLVTDVCLKRKVRNYVQVAKGLED 63

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
           G  IF++ +  ++  I       E       VK    K  +   F+ +    + DVR+FG
Sbjct: 64  GYDIFIKEKAVLNTLIDKAHDDSE-------VKNAKDKTDAARRFMCK---NYFDVRTFG 113

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTK 172
            V +  G +A  VRGP+  ++A+S++ +      IT+   +T++E   ++  ++ TMG K
Sbjct: 114 AVMS-TGKNAGQVRGPIQFTFARSVDPIAAAEHSITRMAVATDAEAKKQSG-DNRTMGRK 171

Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
             V YG+Y+  G I+ N A++TGFS+ D  +  E L ++F+ D S+AR  G M  +++  
Sbjct: 172 ATVPYGLYICHGFISANLAQQTGFSEEDLALFWEALKNMFDMDRSAAR--GLMSAQKLIV 229

Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKD-SYEDYAIHLNQEEL 275
           F H + LGN  + ++FDL++ +K       S+ DY + +++E +
Sbjct: 230 FKHDSVLGNAPANKLFDLVKVEKVCDGAPRSFGDYHVTIDKEHV 273
>gi|146284060|ref|YP_001174213.1| CRISPR-associated protein, TM1801 family [Pseudomonas stutzeri
           A1501]
 gi|145572265|gb|ABP81371.1| CRISPR-associated protein, TM1801 family [Pseudomonas stutzeri
           A1501]
          Length = 289

 Score =  135 bits (341), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 87/287 (30%), Positives = 155/287 (54%), Gaps = 18/287 (6%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN--RMQDMGQ 57
           ++  + +F+   +V   N NGDP + N+PR D + + GL++DV +KRK+RN   ++  G 
Sbjct: 3   VIANRYEFVYLFDVTNGNPNGDPDAGNLPRLDPETNQGLVTDVCLKRKLRNYVALEQEGA 62

Query: 58  P---IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSF 114
           P   I++Q +  +++     KQ  E        K+  K +         +   + DVR+F
Sbjct: 63  PGYAIYMQEKSVLNN---QHKQAYEALGIESEAKKLPKDEAKARELTGWMCKNFFDVRAF 119

Query: 115 GQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHF 174
           G V   +  +A  VRGP+ +++A S++ V+   + IT+   +  N K   +  TMG KH 
Sbjct: 120 GAVMTTE-VNAGQVRGPIQLAFATSIDPVLPMEISITRMAVT--NEKDLEKERTMGRKHI 176

Query: 175 VDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFT 234
           V YG+Y   G ++   AE+TGFS+ D  ++   L+++FE+D S+AR  G M  R++  F 
Sbjct: 177 VPYGLYRAHGFVSAKLAERTGFSEEDLGLLWRALINMFEHDRSAAR--GEMAARKLIVFK 234

Query: 235 HSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAE 277
           H + +GN  +  +FD ++ ++ + +      S++DY I +N E L +
Sbjct: 235 HEHPMGNAPAHVLFDSVQIERVEGEAHTPARSFKDYQISVNAEALPQ 281
>gi|134298869|ref|YP_001112365.1| CRISPR-associated protein, Csd2 family [Desulfotomaculum reducens
           MI-1]
 gi|134051569|gb|ABO49540.1| CRISPR-associated protein, Csd2 family [Desulfotomaculum reducens
           MI-1]
          Length = 288

 Score =  135 bits (340), Expect = 3e-30,   Method: Composition-based stats.
 Identities = 90/286 (31%), Positives = 158/286 (55%), Gaps = 23/286 (8%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM----- 55
           ++ + +F++  +V   N NGDP + NMPR DA+   GL++DV +KRKIRN +  +     
Sbjct: 5   IKNRYEFVLFFDVENGNPNGDPDAGNMPRIDAETGLGLVTDVCLKRKIRNYVDIVKNGIE 64

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA----EWLDV 111
           G  I+V+    +++      Q ++     D +K  SKK    E   +++ A     + D+
Sbjct: 65  GFDIYVREGSILNN------QHMKAYSALD-IKPESKKLPKNEEDARRVKAFMCKHFYDI 117

Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGT 171
           R+FG V   +  +   VRGPV I++A+S++ +V Q + IT+   + +    E +   MG 
Sbjct: 118 RTFGAVMTTE-VNCGQVRGPVQINFARSIDPIVQQEVTITRMAVTSVKD-AEKKDREMGR 175

Query: 172 KHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVF 231
           KH V Y +Y  +G ++ + AEK+GF+  D E++ + L ++F++D S+AR  G M  R++ 
Sbjct: 176 KHIVPYALYRAEGYVSAHLAEKSGFTKEDLELLWQSLTNMFDHDRSAAR--GKMATRKLI 233

Query: 232 WFTHSNKLGNVSSARVFDLLEFDKEK--QDKDSYEDYAIHLNQEEL 275
            F H   LGN S+  +FD++E  ++   +   +Y DY + +N   L
Sbjct: 234 IFEHETALGNASAHSLFDMVEVSRKDLIRPPRAYSDYKVTVNMAAL 279
>gi|153091351|gb|EDN73325.1| hypothetical protein MHA_0345 [Mannheimia haemolytica PHL213]
          Length = 287

 Score =  135 bits (340), Expect = 3e-30,   Method: Composition-based stats.
 Identities = 92/290 (31%), Positives = 162/290 (55%), Gaps = 30/290 (10%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQ-----DM 55
           ++ + +F+   +V   N NGDP + NMPR D +   GL++DV +KRKIRN ++     + 
Sbjct: 3   IQNRYEFVFFFDVTNGNPNGDPDAGNMPRLDPETSKGLVTDVCLKRKIRNFIEMSYENEA 62

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDT--VKEGSKKKKSVENFVKQINA----EWL 109
           G  I+V+ +  ++         L+NK  ++   ++  +KK    E   ++I A     + 
Sbjct: 63  GYEIYVKEKSVLN---------LQNKRAYEALGIESEAKKLPKEEAKAREITAWMCKNFF 113

Query: 110 DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTM 169
           D+R+FG V   +  ++  VRGPV +++A+S++ ++   + IT+   +  N K   +  TM
Sbjct: 114 DIRTFGAVMTTE-VNSGQVRGPVQLAFAQSIDPIIPLEVSITRMAVT--NEKDLEKERTM 170

Query: 170 GTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVRE 229
           G K+ V Y +Y + G I+   AEKTGFSD D + + + L  +FE+D S+AR  G M  R+
Sbjct: 171 GRKYIVPYALYRVHGFISAKLAEKTGFSDEDVQKLWQALQLMFEHDRSAAR--GEMAARK 228

Query: 230 VFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDS----YEDYAIHLNQEEL 275
           +  F H ++LGN  + ++FD ++ ++   +KD+    Y+DY I +  E L
Sbjct: 229 LIVFKHDSELGNQPAHKLFDSVKVERINGEKDTPAKDYDDYCISVQTEGL 278
>gi|83645584|ref|YP_434019.1| uncharacterized protein predicted to be involved in DNA repair
           [Hahella chejuensis KCTC 2396]
 gi|83633627|gb|ABC29594.1| uncharacterized protein predicted to be involved in DNA repair
           [Hahella chejuensis KCTC 2396]
          Length = 297

 Score =  135 bits (339), Expect = 4e-30,   Method: Composition-based stats.
 Identities = 92/293 (31%), Positives = 158/293 (53%), Gaps = 28/293 (9%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN----RMQDM- 55
           +  + +F+   +V   N NGDP + N+PR D + ++GL++DV +KRK+RN       DM 
Sbjct: 4   IANRYEFVFLFDVTNGNPNGDPDAGNLPRLDPETNHGLVTDVCLKRKVRNFVALEKSDMN 63

Query: 56  ----GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA----E 107
               G  I+VQ +  +++     KQ  E        K+  KK    E   +++ A     
Sbjct: 64  GSAPGFNIYVQEKSVLNN---QHKQAWEALGIPPDAKDKYKKLPKDEAKARELTAWMCNN 120

Query: 108 WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESS 167
           + D+R+FG V   +  +   VRGPV  ++A S++ V    + IT+     + ++ +LES 
Sbjct: 121 FFDIRAFGAVMTME-VNCGQVRGPVQFAFATSVDPVTPLEISITRMA---VTNERDLESE 176

Query: 168 -TMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
            TMG KH V YG+Y   G I+   AE+TGFS+ D +++   L+++FE+D S+AR  G M 
Sbjct: 177 RTMGRKHIVPYGLYRAHGFISAKLAERTGFSEEDLQLLWRALINMFEHDRSAAR--GEMA 234

Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEEL 275
            R++  F H + +G+  S  +FD ++ ++ + + D    SY DY + ++ E +
Sbjct: 235 ARKLIVFKHEHPMGDAPSHVLFDKVKVERSEGEADTPARSYNDYRVTVDSESI 287
>gi|91774959|ref|YP_544715.1| CRISPR-associated protein TM1801 [Methylobacillus flagellatus KT]
 gi|91708946|gb|ABE48874.1| CRISPR-associated protein TM1801 [Methylobacillus flagellatus KT]
          Length = 293

 Score =  134 bits (336), Expect = 9e-30,   Method: Composition-based stats.
 Identities = 95/304 (31%), Positives = 167/304 (54%), Gaps = 35/304 (11%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDA-KDYGLMSDVSIKRKIRN--RMQDMGQP 58
           +  + +F+   +V   N NGDP + NMPR D     GL++DV +KRKIRN   + + G P
Sbjct: 3   ITNRYEFVYFFDVTNGNPNGDPDAGNMPRLDPDSSKGLVTDVCLKRKIRNFVELTEEGHP 62

Query: 59  ---IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQ------INAEWL 109
              I+V+ +  ++         L+NK+ ++ +    + KK  ++  K       + A + 
Sbjct: 63  GFEIYVKEKGILN---------LQNKKAYEALSITPEPKKLPKDEAKAREVTAWMCANFF 113

Query: 110 DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELES 166
           DVR+FG V   +  ++  VRGPV +++AKS++ ++   + IT+   +T  E  +++   +
Sbjct: 114 DVRTFGAVMTTE-VNSGQVRGPVQLAFAKSIDPIIPLELSITRMAVTTEKEAEAQSG-GN 171

Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
            TMG KH V YG+Y + G ++   +EKTGFSD D   + + L  +FE+D S+AR  G M 
Sbjct: 172 RTMGRKHIVPYGLYRVHGFVSAKLSEKTGFSDDDLAKLWQALTLMFEHDRSAAR--GEMA 229

Query: 227 VREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAEYEAKG 282
            R++  F H++ LG+  +  +FD ++ ++   ++D    ++ DY I L+   L   +A G
Sbjct: 230 ARKLVVFKHADALGSAPAHLLFDRVKVERVSGERDTPATTFSDYRIVLDTNGL---DALG 286

Query: 283 LQVE 286
           + VE
Sbjct: 287 VTVE 290
>gi|154495122|ref|ZP_02034127.1| hypothetical protein PARMER_04169 [Parabacteroides merdae ATCC
           43184]
 gi|154085672|gb|EDN84717.1| hypothetical protein PARMER_04169 [Parabacteroides merdae ATCC
           43184]
          Length = 288

 Score =  134 bits (336), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 90/289 (31%), Positives = 160/289 (55%), Gaps = 26/289 (8%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           +  + DF+   +V++ N NGDP + N+PR D +   GL+SDV +KRK+RN +Q       
Sbjct: 5   INNRYDFIYLFDVQDGNPNGDPDAGNLPRVDPETGEGLVSDVCLKRKVRNFVQ------I 58

Query: 61  VQARDRVDDCIYSLKQRLEN-------KEFFDTVKEGSKKKKSVENFVKQINAEWLDVRS 113
           V+  +R+ D     K  L N       +E    +KE   K ++   ++ +    + D+R+
Sbjct: 59  VKGGERLYDIFIKEKAVLNNLIADAHKQEGVKDIKEKGDKTEAARQWMCR---NFYDIRT 115

Query: 114 FGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMG 170
           FG V +  G +A  VRGP+  ++A+S+  +VT    IT+   +T  E   ++  ++ TMG
Sbjct: 116 FGAVLS-TGENAGQVRGPIQFTFARSISPIVTAEHSITRMAVATEDEAKKQSG-DNRTMG 173

Query: 171 TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREV 230
            K  V YG+Y   G I+ + A +TGF++ D  +  + L ++F++D S+AR  G M  R++
Sbjct: 174 RKFTVPYGLYKANGFISAHLAAQTGFNEDDLNLFWDSLKNMFDHDHSAAR--GMMNARKL 231

Query: 231 FWFTHSNKLGNVSSARVFDL--LEFDKEKQDKDSYEDYAIHLNQEELAE 277
             F HS  LGN S+  +F L  ++   +++   S++DY + +++++L E
Sbjct: 232 IVFKHSTALGNASAHSLFGLVKVQLKDDQRPPRSFDDYIVTIDKDKLPE 280
>gi|154495727|ref|ZP_02034423.1| hypothetical protein BACCAP_00006 [Bacteroides capillosus ATCC
           29799]
 gi|150274925|gb|EDN01973.1| hypothetical protein BACCAP_00006 [Bacteroides capillosus ATCC
           29799]
          Length = 298

 Score =  132 bits (333), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 91/288 (31%), Positives = 157/288 (54%), Gaps = 18/288 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           ++ + +F++  +V   N NGDP + NMPR D +   G+++DV +KRKIRN ++ + +   
Sbjct: 5   IKNRYEFVILFDVENGNPNGDPDAGNMPRVDPETGLGIVTDVCLKRKIRNYVETVKEDA- 63

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWL-----DVRSFG 115
              R  V D +   +   E     D  ++  K+KK     + +   +W+     D+R+FG
Sbjct: 64  TGYRIYVKDGVPLNRSDAEAYAELDVTEKTVKEKKKANPDLDRKIRDWMCANFYDIRTFG 123

Query: 116 QV---FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
            V   F     +   VRGPV + +A+S+E VV Q + IT+   +   +  E + + MG K
Sbjct: 124 AVMTTFVKAALNCGQVRGPVQLGFARSVEPVVPQEVTITRVAITT-EADAEKKGTEMGRK 182

Query: 173 HFVDYGVYVIKGSINPNFAEKT-GFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVF 231
           + V YG+Y  +G I+ N A KT GFS+ D  ++ E ++++FE+D S+AR  G M VRE+ 
Sbjct: 183 YIVPYGLYRCEGYISANLARKTTGFSEEDLSLLWEAILNMFEHDHSAAR--GKMAVRELI 240

Query: 232 WFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEEL 275
            F H ++LG   + ++FD +   ++  D      SY+DY + +++  L
Sbjct: 241 VFKHDSELGCAPAWKLFDAVSVTRKNPDDPAPARSYQDYTVAVDEAAL 288
>gi|67158962|ref|ZP_00419749.1| CRISPR-associated protein TM1801 [Azotobacter vinelandii AvOP]
 gi|67084459|gb|EAM03943.1| CRISPR-associated protein TM1801 [Azotobacter vinelandii AvOP]
          Length = 302

 Score =  132 bits (332), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 89/296 (30%), Positives = 159/296 (53%), Gaps = 29/296 (9%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
           +  + +F+   EV   N NGDP + N+PR D + + GL++DV +KRKIRN     +    
Sbjct: 4   IANRYEFVYLFEVTNGNPNGDPDAGNLPRLDPETNQGLVTDVCLKRKIRNYVALEKSDAE 63

Query: 56  GQP-----IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKK----KKSVENFVKQINA 106
           G+P     I++Q +  ++      +Q  +  E     K G KK        ++    + +
Sbjct: 64  GKPEQSYVIYMQEKAVLNQ---QHEQAWKACEIPPDAKNGYKKLPADTAKAKSLTDWMCS 120

Query: 107 EWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE 163
            + DVR+FG V    G +   VRGP+ +++A S++ ++   + IT+   +T  E   ++ 
Sbjct: 121 NFFDVRAFGAVMT-TGVNCGQVRGPIQLAFATSIDPIIPLEVSITRMAVTTEKEAEEQSG 179

Query: 164 LESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEG 223
            ++ TMG KH + YG+Y   G I+   AE+TGFS+ D  ++   L ++FE+D S+AR  G
Sbjct: 180 -DNRTMGRKHIIPYGLYRAHGFISAKLAERTGFSEDDLALLWRALENMFEHDRSAAR--G 236

Query: 224 SMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEEL 275
            M  R++  F H + +GN  + ++FDL++  + + + D    S+ DY I ++++ L
Sbjct: 237 EMAARKLIVFKHEHPMGNAPAHKLFDLVKVARTEGEADTPARSFADYQISIDRDGL 292
>gi|34496682|ref|NP_900897.1| hypothetical protein CV_1227 [Chromobacterium violaceum ATCC 12472]
 gi|34102537|gb|AAQ58902.1| conserved hypothetical protein [Chromobacterium violaceum ATCC
           12472]
          Length = 288

 Score =  131 bits (330), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 86/284 (30%), Positives = 147/284 (51%), Gaps = 18/284 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
           L  + +F+   +V   N NGDP + N+PR D + + GL++DV +KRK+RN     +  + 
Sbjct: 3   LANRYEFVYLFDVSNGNPNGDPDAGNLPRLDPETNQGLVTDVCLKRKLRNYVALEKENEP 62

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
           G  I++Q +  ++   +  K+  E        K+  K +         +   + DVR+FG
Sbjct: 63  GFAIYMQEKSVLN---HQHKRAYEALSLEPEPKKLPKDQAKARELTAWMCKNFFDVRAFG 119

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
            V   +  +A  VRGP+ +++A S++ VV   + IT+   +  N K   +  TMG KH V
Sbjct: 120 AVMTTE-VNAGQVRGPIQLTFATSIDPVVPLEVSITRMAVT--NDKDLEKERTMGRKHIV 176

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
            YG+Y   G ++   AE+ GF + D +++   L+++FE+D S+AR  G M  R++  F H
Sbjct: 177 PYGLYRAHGFVSAKLAERAGFGEEDLQLLWRGLINMFEHDRSAAR--GEMTARKLIAFKH 234

Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDS----YEDYAIHLNQEEL 275
              LGN  + R+FD +   +     DS    + DY + L++  L
Sbjct: 235 ECALGNAPAHRLFDSVRVSRADGAGDSPARGFTDYRVELDRAVL 278
>gi|118726119|ref|ZP_01574750.1| CRISPR-associated protein, Csd2 family [Clostridium cellulolyticum
           H10]
 gi|118664499|gb|EAV71130.1| CRISPR-associated protein, Csd2 family [Clostridium cellulolyticum
           H10]
          Length = 294

 Score =  131 bits (329), Expect = 6e-29,   Method: Composition-based stats.
 Identities = 88/292 (30%), Positives = 167/292 (57%), Gaps = 28/292 (9%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM---- 55
           +++ + +F V  EV+  N NGDP + NMPR D +  YG+++DV +KRKIRN ++ +    
Sbjct: 4   IIKNRYEFTVLFEVKNGNPNGDPDAGNMPRIDPETGYGIVTDVCLKRKIRNYIETVKADS 63

Query: 56  -GQPIFVQARDRVDDCIYSLKQRLENKEFFDTV--KEGSKKKKSVENFVKQINAE-WLDV 111
            G  I+++      D +   +   E   +F     KE   KK+ V+  +K    + + D+
Sbjct: 64  TGYKIYIK------DGVPLERSDREAFTYFGISDEKEAQSKKEEVDIKIKDFMCKNFFDI 117

Query: 112 RSFGQV---FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK--STNSELNSKTELES 166
           R+FG V   F     +   VRGPV + +A+S++++V Q + IT+  +T  +  +K E E 
Sbjct: 118 RTFGAVMTTFVKAKLNCGQVRGPVQLGFARSIDQIVQQEISITRVVATTEKDAAKKETE- 176

Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEK-TGFSDADAEIIKEVLVSLFENDASSARPEGSM 225
             MG K+ + Y +Y + G I+ N A+K TGF++ D  ++ + ++++FE++ S+AR  G+M
Sbjct: 177 --MGRKYIIPYALYRVDGYISANLAQKTTGFNEDDLSMLWDAVINMFEHEHSAAR--GNM 232

Query: 226 RVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDS--YEDYAIHLNQEEL 275
            VRE+  F H ++ GN  + ++FD +   ++ +      ++DY++ ++++ +
Sbjct: 233 SVRELIVFKHDSEFGNCPAYKLFDAVSVCRKDKSNPPRCFDDYSVDIDEKAI 284
>gi|114331017|ref|YP_747239.1| CRISPR-associated protein, Csd2 family protein [Nitrosomonas
           eutropha C91]
 gi|114308031|gb|ABI59274.1| CRISPR-associated protein, Csd2 family protein [Nitrosomonas
           eutropha C91]
          Length = 304

 Score =  130 bits (326), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 93/303 (30%), Positives = 163/303 (53%), Gaps = 37/303 (12%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           L+ + DF++  +V++ N NGDP + N+PR DA+   GLM+DV++KRK+RN +        
Sbjct: 3   LKNRYDFVLLFDVKDGNPNGDPDAGNLPRVDAETGLGLMTDVALKRKVRNFVS------M 56

Query: 61  VQARDRVDDCIYSLKQRLE------------NKEFFDTVK-----EGSKKKKSVENFVKQ 103
            + +  V + +   K+R E            NK  +  +      EG  KK+   N V++
Sbjct: 57  TRDQSEVTESLNGDKKRFEIYVKEKAILNNQNKRAYVGIGKPELLEGEDKKRKGGNAVEE 116

Query: 104 INAEWL-----DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK-STNSE 157
              +W+     DVR+FG V +  G +   VRGPV +++A+S+  +V     IT+ +  +E
Sbjct: 117 AR-QWMCKNFYDVRTFGAVMS-TGINCGQVRGPVQLTFARSINPIVALEHSITRMAVATE 174

Query: 158 LNS-KTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDA 216
           + + K   ++ TMG K  V YG+YV  G I+ + A +T F + D E++ + L S+FE+D 
Sbjct: 175 VEAEKQSGDNRTMGRKFTVPYGLYVAHGFISAHLANQTDFGEDDLELLWQALESMFEHDR 234

Query: 217 SSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLE--FDKEKQDKDSYEDYAIHLNQEE 274
           S+AR  G M  R ++ F H+++LGN  +  +F  ++     E      + DY + +++ +
Sbjct: 235 SAAR--GEMATRGLYVFKHNSELGNAPAHSLFARIQPKLKNENSIVRDFSDYTVMVDEAD 292

Query: 275 LAE 277
           L +
Sbjct: 293 LPQ 295
>gi|94984344|ref|YP_603708.1| CRISPR-associated protein Csd2 [Deinococcus geothermalis DSM 11300]
 gi|94554625|gb|ABF44539.1| CRISPR-associated protein Csd2 [Deinococcus geothermalis DSM 11300]
          Length = 325

 Score =  130 bits (326), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 86/249 (34%), Positives = 141/249 (56%), Gaps = 27/249 (10%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           L  + +F++  +V   N NGDP S N PR D +D +GL+SDV++KR++RN +Q  G+ IF
Sbjct: 8   LRNRYEFLLLFDVENGNPNGDPDSGNAPRVDPEDGHGLVSDVALKRRVRNYVQAAGEQIF 67

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           +Q    ++  I+  KQ             GSK K+ V+   + +   + DVR+FG V + 
Sbjct: 68  IQHGTNLNRPIFQAKQ---------ASGGGSKGKQDVDAARRWMCEHFYDVRTFGAVMS- 117

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSE--LNSKT------------ELES 166
            G +A  VRGPV +++A+SL+ V      IT+   +E   N+KT            E + 
Sbjct: 118 TGANAGQVRGPVQLTFARSLDPVFAIEASITRGAVAEDIKNAKTLDDFLNWEAQQDEDKL 177

Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
            TMG K  + YG++  KG ++ + A+ TGFS+AD +++ E L++++E+D S+++  G M 
Sbjct: 178 RTMGRKSLIPYGLFATKGFVSAHLAQGTGFSEADLKLLLEALLNMYEHDRSASK--GLMS 235

Query: 227 VREVFWFTH 235
            R +F F H
Sbjct: 236 SRRLFVFRH 244
>gi|53804985|ref|YP_113167.1| CRISPR-associated TM1801 family protein [Methylococcus capsulatus
           str. Bath]
 gi|53758746|gb|AAU93037.1| CRISPR-associated protein, TM1801 family [Methylococcus capsulatus
           str. Bath]
          Length = 309

 Score =  128 bits (322), Expect = 4e-28,   Method: Composition-based stats.
 Identities = 88/289 (30%), Positives = 153/289 (52%), Gaps = 29/289 (10%)

Query: 3   EQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRM---QDMGQP 58
           + + DF++  EV++ N NGDP + N+PR DA+  +GL++DV +KRKIRN +   Q    P
Sbjct: 6   QNRYDFVLLFEVKDGNPNGDPDAGNLPRLDAETGHGLVTDVCLKRKIRNFVGLTQGDAAP 65

Query: 59  IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKK--KSVENFVKQINAEWLDVRSFGQ 116
             +  +++    +    +R       D   +  K+K    V++  + +   + DVR+FG 
Sbjct: 66  YEIYVKEKA--VLNRQHERAYQALGVDLGADEGKRKGGDKVDDARRWMCQNFFDVRTFGA 123

Query: 117 VFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTKH 173
           V +  G +   VRGPV +++A+S+  +V     IT+   +T +E   K   ++ TMG KH
Sbjct: 124 VMS-TGVNCGQVRGPVQLTFARSISPIVALEHSITRMAVATEAEA-EKQGGDNRTMGRKH 181

Query: 174 FVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
            V YG+Y   G ++ + A++TGFS+ D E++ + L  +F++D S+AR  G M  R ++ F
Sbjct: 182 TVPYGLYRAHGFVSAHLAQQTGFSEKDLELLWQALSQMFDHDHSAAR--GEMATRGLYVF 239

Query: 234 THSNK------------LGNVSSARVFDLLEFDKEKQDKDSYE--DYAI 268
            H               LG   + ++FDL+  + +   +   E  DYA+
Sbjct: 240 KHVGTDTDPDQRKQQAMLGCAPAHKLFDLIRVEPKDTGRPPREFGDYAV 288
>gi|21244564|ref|NP_644146.1| hypothetical protein XAC3840 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21110240|gb|AAM38682.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 288

 Score =  125 bits (315), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 84/281 (29%), Positives = 146/281 (51%), Gaps = 18/281 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRN-----RMQDM 55
           +  + +F+   +V   N NGDP + N+PR D + + GL++DV++KRKIRN     +    
Sbjct: 4   IAHRYEFVYLFDVANGNPNGDPDAGNLPRLDPETNRGLVTDVALKRKIRNYVALEKDNAP 63

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
           G  I++Q +  +++     KQ           K+  K+          +   + DVR+FG
Sbjct: 64  GYTIYMQEKSVLNN---QHKQAYTALGIEHEAKKLPKEGDKARQLTAWMCENFFDVRTFG 120

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
            V   +  +   VRGPV +++A S+E V+   + IT+   +  N K   +  TMG KH +
Sbjct: 121 AVMTTE-VNTGQVRGPVQLAFATSVEPVLPLEVSITRVAVT--NEKDLEKERTMGRKHIL 177

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
            YG+Y   G ++   AE+TGFS+ D +++   L +LFE+D S+AR  G M  R++  F H
Sbjct: 178 PYGLYRAHGFVSAKLAERTGFSEEDLQLLWRALTNLFEHDRSAAR--GEMAARKLIVFEH 235

Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQ 272
            + +GN  +  +FD ++ ++  Q       S+ DY + ++ 
Sbjct: 236 EHPMGNAPAHVLFDKVKVERIDQADQGPARSFSDYRVVIDH 276
>gi|58580492|ref|YP_199508.1| hypothetical protein XOO0869 [Xanthomonas oryzae pv. oryzae
           KACC10331]
 gi|84622452|ref|YP_449824.1| hypothetical protein XOO_0795 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58425086|gb|AAW74123.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
           KACC10331]
 gi|84366392|dbj|BAE67550.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 288

 Score =  125 bits (314), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 84/281 (29%), Positives = 148/281 (52%), Gaps = 18/281 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQ-----DM 55
           +  + +F+   +V   N NGDP + N+PR D + + GL++DV++KRKIRN +        
Sbjct: 4   IANRYEFVYLFDVINGNPNGDPDAGNLPRLDPETNRGLVTDVALKRKIRNYVALEQEAQA 63

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
           G  I++Q +  +++     KQ           K+  K +         +   + DVR+FG
Sbjct: 64  GYAIYMQEKSVLNN---QHKQAYAALGIEHEAKKLPKDEAKARELTSWMCKNFFDVRTFG 120

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
            V   +  ++  VRGPV +++A S+E V+   + IT+   +  N K   +  TMG KH +
Sbjct: 121 AVMTTE-VNSGQVRGPVQLAFASSVEPVLPLEVSITRVAVT--NEKDLEKERTMGRKHIL 177

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
            YG+Y   G I+   AE+TGFS+ D +++   L +LFE+D S+AR  G M  R++  F H
Sbjct: 178 PYGLYRAHGFISAKLAERTGFSEDDLQLLWRALTNLFEHDRSAAR--GEMSARKLIVFKH 235

Query: 236 SNKLGNVSSARVFDLLEFDK----EKQDKDSYEDYAIHLNQ 272
            +++GN  +  +FD +  ++    +     S+ DY + +++
Sbjct: 236 EHQMGNAPAHVLFDKVTVERVAEVDAGPARSFADYRVTIDR 276
>gi|126664601|ref|ZP_01735585.1| CRISPR-associated protein, TM1801 family [Marinobacter sp. ELB17]
 gi|126630927|gb|EBA01541.1| CRISPR-associated protein, TM1801 family [Marinobacter sp. ELB17]
          Length = 290

 Score =  124 bits (311), Expect = 7e-27,   Method: Composition-based stats.
 Identities = 80/283 (28%), Positives = 156/283 (55%), Gaps = 15/283 (5%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQP-- 58
           L  + +F+   +V++ N NGDP + N+PR DA+   GL++DV +KRKIRN +  + +   
Sbjct: 3   LNNRYEFVFLFDVKDGNPNGDPDAGNLPRIDAETGQGLVTDVCLKRKIRNYVGMVKEETP 62

Query: 59  ---IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
              I+++ +  ++       + LE K      K+  KK++  +   + +   + D+R+FG
Sbjct: 63  PFEIYIKEKAVLNRSNLRAYEALELKH---ESKKLPKKEEDAKRITQWMCQNFFDIRTFG 119

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKS--TNSELNSKTELESSTMGTKH 173
            V + +  +   VRGP+ +++A+S++ VV+    IT+   T  +   +   ++ TMG K 
Sbjct: 120 AVMSTE-VNTGQVRGPIQMNFARSIDPVVSAEHSITRMAVTTEKEAEQQGGDNRTMGRKF 178

Query: 174 FVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
            + YG+Y   G ++ N A +TGFS+ D E+  + L+++ ++D S++R  G M    ++ F
Sbjct: 179 TIPYGLYRCHGYVSANLAGQTGFSEEDLELFWDALINMLDHDRSASR--GEMSPCALYVF 236

Query: 234 THSNKLGNVSSARVFDLLEFDK-EKQDKDSYEDYAIHLNQEEL 275
            H + LGN  + ++F+L+E  K        + DY + ++++ L
Sbjct: 237 KHESALGNAPARKLFELIEIHKVSDGPARDFRDYEVTVHRDRL 279
>gi|52425041|ref|YP_088178.1| hypothetical protein MS0986 [Mannheimia succiniciproducens MBEL55E]
 gi|52307093|gb|AAU37593.1| unknown [Mannheimia succiniciproducens MBEL55E]
          Length = 321

 Score =  124 bits (311), Expect = 7e-27,   Method: Composition-based stats.
 Identities = 84/284 (29%), Positives = 153/284 (53%), Gaps = 18/284 (6%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDY-GLMSDVSIKRKIRNRMQ-----DM 55
           ++ + +F+   +V   N NGDP + NMPR D +   GL++DV +KRKIRN ++       
Sbjct: 37  IQNRYEFVYFFDVTNGNPNGDPDAGNMPRLDPESSKGLVTDVCLKRKIRNFVELANENQA 96

Query: 56  GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFG 115
           G  I+V+ +  ++      K+  E  E     K+  K +    +    +   + D+RSFG
Sbjct: 97  GYEIYVKEKSVLN---LQNKRAYEALEIEPEAKKLPKDEAKARDITAWMCKNFFDIRSFG 153

Query: 116 QVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
            V   +  ++  VRGPV +++A+S++ ++   + IT+   +  N K   +  TMG K+ V
Sbjct: 154 AVMTTE-VNSGQVRGPVQLAFAQSIDPIIPLEVSITRMAVT--NEKDLEKERTMGRKYIV 210

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
            Y +Y + G I+ N A KTGFS+ D + + + L  +FE+D S+AR  G M  R++  F H
Sbjct: 211 PYALYRVHGFISANLAAKTGFSEEDLQKLWQALQLMFEHDRSAAR--GEMAARKLIVFKH 268

Query: 236 SNKLGNVSSARVFDLLEFDKEKQDKDS----YEDYAIHLNQEEL 275
            + LG+V + ++FD ++ ++   +  +    + DY I + +++ 
Sbjct: 269 DSALGSVPAHKLFDSVKVERINGESGTPATGFADYQISIEKDKF 312
>gi|84703568|ref|ZP_01017396.1| CRISPR-associated protein [Parvularcula bermudensis HTCC2503]
 gi|84690002|gb|EAQ15843.1| CRISPR-associated protein [Parvularcula bermudensis HTCC2503]
          Length = 304

 Score =  120 bits (301), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 76/251 (30%), Positives = 129/251 (51%), Gaps = 8/251 (3%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQPIFVQARD 65
           +F++  +V   N NGDP + NMPR D + + GL+SDV++KRK+RN +             
Sbjct: 10  EFVLYFDVMNGNPNGDPDAGNMPRLDPETNKGLVSDVALKRKVRNYVALASDNRIYMTEG 69

Query: 66  RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
              + ++         E  D      K ++        + A + DVR+FG V +  G +A
Sbjct: 70  STLNLLHKEAWAAVMPEITDKFNILPKDRQKARELTAWMCANFWDVRTFGAVMS-TGVNA 128

Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
             VRGPV  ++A+S+E ++   + IT+   +      +    TMG KH V YG+Y   G 
Sbjct: 129 GQVRGPVQFTFARSVEAILPLEISITRMAATTEKDAEDKHGRTMGRKHIVPYGLYRAHGF 188

Query: 186 INPNFA----EKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGN 241
           ++   A    + TGFS+ D E++ + L ++F++D S+AR  G M  R++  F H + LGN
Sbjct: 189 VSAPLASDETKGTGFSEDDLELLWQALGNMFDHDRSAAR--GEMATRKLVVFRHESALGN 246

Query: 242 VSSARVFDLLE 252
             +  +F+ ++
Sbjct: 247 AQAQSLFERVQ 257
>gi|68549523|ref|ZP_00588986.1| CRISPR-associated protein TM1801 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|68243611|gb|EAN25809.1| CRISPR-associated protein TM1801 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 346

 Score =  118 bits (296), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 90/338 (26%), Positives = 167/338 (49%), Gaps = 66/338 (19%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQ--- 57
           +E + +F++  +V+  N NGDP + N+PR D +  +G+ +DV +KRKIRN ++ + +   
Sbjct: 5   IENRYEFVLLFDVKNGNPNGDPDAGNLPRIDPETGHGITTDVCLKRKIRNYVELLKKNES 64

Query: 58  PIFVQARD--------------RVDDCIY-----SLKQRL------------ENKEFF-- 84
           P  +  R+                D+ IY      L+  L            EN   +  
Sbjct: 65  PYEIHVREGEFLSEHHKRAHNALTDEKIYVFVPADLRGELSSFAEYPEGVGFENDAIYFQ 124

Query: 85  -----DTVKEGSKKKKSVENFVKQ------------INAEWL-----DVRSFGQVFAFDG 122
                D VK+  +K K++ +  K             I  +W+     DVR+FG V +   
Sbjct: 125 LSSDIDKVKKDVEKLKNITDASKAKIKELFVDGKSVIAKKWMCKNFFDVRTFGAVMSTGD 184

Query: 123 YSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVI 182
            +   VRGPV +S+++S+E +V   ++I  S  + +N     +      K  V YG+Y +
Sbjct: 185 KTCGQVRGPVQLSFSRSIEPIV--GLEIAMSRTAAVNVDKSSDKGLGARKSIVPYGLYRV 242

Query: 183 KGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNV 242
            G I+   A++TGF++ D E++   L+++F++D S++R  G M  R++F F H ++LGN 
Sbjct: 243 HGFISAPLAKQTGFTEDDLELLWSALINMFDHDRSASR--GEMASRQLFVFQHESELGNT 300

Query: 243 SSARVFDLLEFDKEKQDKDS---YEDYAIHLNQEELAE 277
            + ++F+ ++ +++         YEDY I +++  L +
Sbjct: 301 PAHKLFERIKVERKPLSNGPARFYEDYQITIDETNLGK 338
>gi|149126262|ref|ZP_01851159.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
 gi|148517078|gb|EDK90356.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
          Length = 359

 Score =  114 bits (285), Expect = 7e-24,   Method: Composition-based stats.
 Identities = 94/353 (26%), Positives = 162/353 (45%), Gaps = 83/353 (23%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           L+++ DF++  +V   N NGDP + NMPR D +  +GL+SDV +KRK+RN ++   +   
Sbjct: 6   LQRRHDFVLYFDVTNGNPNGDPDAGNMPRMDPETGHGLVSDVCLKRKVRNYVEMAAE--- 62

Query: 61  VQARDRVDDCIYSLKQRLEN---KEFFDTVKEGSKKKKSVENFVKQINAE---------- 107
              RD + + IY  +  + N   +E +  ++    K ++ +    + + E          
Sbjct: 63  ADGRDPIRNRIYVTEGAVLNEKHREAYLALRPDDPKARTDKKLTPKSDEEAVLIRRFMCD 122

Query: 108 -WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE 163
            + D+R+FG V +  G +A  VRGPV +S+A+S+E V+   + IT+   +  +E N + +
Sbjct: 123 NFFDIRTFGAVLS-TGINAGQVRGPVQVSFARSVEPVLPLEVSITRMAATNEAERNERQD 181

Query: 164 LESS----------------------------------------TMGTKHFVDYGVYVIK 183
            E                                          TMG KH V YG+Y   
Sbjct: 182 GEDKAGKRGDKRTMGRKHMAATNEAERNERQDGDDEAEKRGDKRTMGRKHIVPYGLYRAH 241

Query: 184 GSINPNFA----EKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           G ++   A    + TGFSD D  ++ E L ++FE+D S+ R  G M  R +  F H++ L
Sbjct: 242 GYVSAPLASHPVKGTGFSDGDLALLFEALRNMFEHDRSATR--GEMATRRLVVFRHASAL 299

Query: 240 GNVSSARVFDLLEFDKEKQDK---------------DSYEDYAIHLNQEELAE 277
           GN  +  +F+ +   +  +                  S+ DYAI +++E L +
Sbjct: 300 GNAPAQSLFERVRTLRAHKGSVHEIGAPGTDNWPPARSFADYAITVDREGLPQ 352
>gi|45658744|ref|YP_002830.1| hypothetical protein LIC12914 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|45601988|gb|AAS71467.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
          Length = 306

 Score =  113 bits (283), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 88/301 (29%), Positives = 153/301 (50%), Gaps = 37/301 (12%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           ++ + +F+   +V++ N NGDP + N PR D +   GL++DVS+KRKIRN +        
Sbjct: 5   IQNRYEFVYLFDVKDGNPNGDPDAGNQPRVDPETGNGLITDVSLKRKIRNYVT------I 58

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA--------EWL--- 109
           V++    +D     K  L        V  G+K + S +   ++           EW+   
Sbjct: 59  VKSATPPNDIYIKEKAVLIETHEKAYVAVGAKLETSKKEEKEKRTGGDQVGKAREWMCKN 118

Query: 110 --DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTEL 164
             DVR+FG V A    +A  V+GP+  ++A+S++ V+     IT+   +T  E   + + 
Sbjct: 119 FYDVRTFGAVMALK-VNAGVVKGPIQFTFARSIDPVINLEHSITRMAVATKKEAEDQ-DG 176

Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGS 224
           ++ TMG KH + YG+Y   G I+ +FA  TGFS+ D E+    L ++F++D S+AR  G 
Sbjct: 177 DNRTMGRKHTISYGLYRAHGFISAHFANDTGFSEEDLELFWSSLQNMFDHDRSAAR--GE 234

Query: 225 MRVREVFWFTH--------SNKLGNVSSARVFDLLEFDKEKQDKDS--YEDYAIHLNQEE 274
           M  R ++ F H          KLG   + ++F+L+   K+     +  + DY++ + + +
Sbjct: 235 MNCRGLYVFKHVGDGKNTNQAKLGVAPAHKLFNLISVSKKDNSTPARDFSDYSVKIQESD 294

Query: 275 L 275
           L
Sbjct: 295 L 295
>gi|24213386|ref|NP_710867.1| hypothetical protein LA0686 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|24194142|gb|AAN47885.1|AE011255_1 conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. 56601]
          Length = 306

 Score =  112 bits (281), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 88/301 (29%), Positives = 153/301 (50%), Gaps = 37/301 (12%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           ++ + +F+   +V++ N NGDP + N PR D +   GL++DVS+KRKIRN +        
Sbjct: 5   IQNRYEFVYLFDVKDGNPNGDPDAGNQPRVDPETGNGLITDVSLKRKIRNYVT------I 58

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINA--------EWL--- 109
           V++    +D     K  L        V  G+K + S +   ++           EW+   
Sbjct: 59  VKSATPPNDIYIKEKAVLIETHEKAYVAVGAKLETSKKEEKEKRTGGDQVGKAREWMCKN 118

Query: 110 --DVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTEL 164
             DVR+FG V A    +A  V+GP+  ++A+S++ V+     IT+   +T  E   + + 
Sbjct: 119 FYDVRTFGAVMALK-VNAGVVKGPIQFTFARSIDPVINLEHSITRMAVATKKEAEVQ-DG 176

Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGS 224
           ++ TMG KH + YG+Y   G I+ +FA  TGFS+ D E+    L ++F++D S+AR  G 
Sbjct: 177 DNRTMGRKHTISYGLYRAHGFISAHFANDTGFSEEDLELFWSSLQNMFDHDRSAAR--GE 234

Query: 225 MRVREVFWFTH--------SNKLGNVSSARVFDLLEFDKEKQDKDS--YEDYAIHLNQEE 274
           M  R ++ F H          KLG   + ++F+L+   K+     +  + DY++ + + +
Sbjct: 235 MNCRGLYVFKHVGDEKNTNQAKLGVAPAHKLFNLISVSKKDNSTPARDFSDYSVKIQESD 294

Query: 275 L 275
           L
Sbjct: 295 L 295
>gi|86742030|ref|YP_482430.1| CRISPR-associated protein TM1801 [Frankia sp. CcI3]
 gi|86568892|gb|ABD12701.1| CRISPR-associated protein TM1801 [Frankia sp. CcI3]
          Length = 295

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 78/254 (30%), Positives = 127/254 (50%), Gaps = 29/254 (11%)

Query: 3   EQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQP--- 58
           E+K D ++  +V + N NGDP + N PRTD +  +GL++DV+IKRK+R+ +    +    
Sbjct: 8   EKKHDMVLLFDVTDGNPNGDPDNGNRPRTDDETGHGLVTDVAIKRKVRDTIGLAAEAEGL 67

Query: 59  ------IFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWL--- 109
                 IFV+A        ++L  RLE       ++ G K    +++       EWL   
Sbjct: 68  DLTRYQIFVEAG-------HALNTRLEESYLVKGLELGKK----IDDAKAAKAREWLANR 116

Query: 110 --DVRSFGQVFAF-DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELES 166
             D+R FG V +     S   +RGP+ +  A+SL+ V+     IT+ T +      + E 
Sbjct: 117 YVDIRLFGAVLSTGKTQSLGQIRGPIQVGMARSLDPVLPVDHAITRVTQTTQADIDKGER 176

Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR 226
           + MG K  V YG+Y  +   +     +TG S AD ++    LV++F++D S+ R  G M 
Sbjct: 177 TEMGGKWTVPYGLYRAEIHYSAPRGRQTGVSAADLDLFLCTLVNMFDHDRSATR--GEMA 234

Query: 227 VREVFWFTHSNKLG 240
            R ++ F+H N  G
Sbjct: 235 TRGLYVFSHHNAFG 248
>gi|119357241|ref|YP_911885.1| CRISPR-associated protein, Csd2 family [Chlorobium phaeobacteroides
           DSM 266]
 gi|119354590|gb|ABL65461.1| CRISPR-associated protein, Csd2 family [Chlorobium phaeobacteroides
           DSM 266]
          Length = 366

 Score = 96.3 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 64/228 (28%), Positives = 129/228 (56%), Gaps = 18/228 (7%)

Query: 64  RDRVDDCIYSLKQRLEN---KEFFDTVKEGSKKKKSVENFVK---QINAEWLDVRSFGQV 117
           ++ + D + + K+ L     K   + +KE   +K + E   +   ++  ++ D+R+FG V
Sbjct: 135 KNEIKDWMKAEKESLSKNVIKVISEALKEAKPRKPTAEETSRGKEKMCQDYYDIRTFGAV 194

Query: 118 FAFDGY-SAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTELESSTMGTKH 173
            +     +   VRGP+ +++A+S+E +V     IT+   +T +E   ++  ++ TMG K+
Sbjct: 195 MSLKSAPNCGQVRGPIQMTFARSVEPIVALEHSITRMAVATEAEAEKQSG-DNRTMGRKY 253

Query: 174 FVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
            V YG+Y   G ++ N A +TGFS+ D ++    L+++F++D S+AR  G M  R ++ F
Sbjct: 254 TVPYGLYRAHGFVSANLAHQTGFSENDLDLFWNALLNMFDHDRSAAR--GLMSTRGLYVF 311

Query: 234 THSNKLGNVSSARVFDLLEFDKEKQDKD----SYEDYAIHLNQEELAE 277
            HS+ LGN  ++++F+ +   K K+D +    S+++Y + +++  L E
Sbjct: 312 EHSSVLGNAPASQLFERITV-KRKEDSEGPARSFKEYDVLIDESSLGE 358
>gi|46255206|ref|YP_006118.1| hypothetical protein TT_P0135 [Thermus thermophilus HB27]
 gi|46198055|gb|AAS82465.1| hypothetical conserved protein [Thermus thermophilus HB27]
          Length = 383

 Score = 95.9 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 71/222 (31%), Positives = 110/222 (49%), Gaps = 14/222 (6%)

Query: 68  DDCIYSLKQRLEN--KEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
           ++ +   K+ LE   KE    VK     ++  E    ++   + D+R FG V +  G +A
Sbjct: 148 EEAVAPFKKELEKLAKELAKAVKGRKITEEDRERAQAKLLERFFDIRMFGAVLS-TGLNA 206

Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKS--TNSELNSKTELESSTMGTKHFVDYGVYVIK 183
             VRGPV +++A+SL+ +    + IT+   T  E  ++ E E   MG K  V YG+Y   
Sbjct: 207 GQVRGPVQLTFARSLDPIAPLEVSITRVAITREEDRARKETE---MGRKPLVPYGLYRAH 263

Query: 184 GSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVS 243
           G  NP  A KTG    D E + + L  LFE D S+AR  G M VR +  F+H +  GN  
Sbjct: 264 GFFNPFLAAKTGVQPEDLEALWDALQHLFELDRSAAR--GEMTVRGLAVFSHEDAKGNAP 321

Query: 244 SARVFDLLEFDKEK--QDKDSYEDYAIHLNQEELAEYEAKGL 283
           + R+F L+  ++ +  +   S+ DY +   +E     EA G 
Sbjct: 322 AHRLFGLIRVERREGVEAPRSFADYRVRAPKE--GSLEAHGF 361
>gi|51891450|ref|YP_074141.1| hypothetical protein STH312 [Symbiobacterium thermophilum IAM
           14863]
 gi|51855139|dbj|BAD39297.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM
           14863]
          Length = 370

 Score = 91.7 bits (226), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 54/180 (30%), Positives = 94/180 (52%), Gaps = 4/180 (2%)

Query: 88  KEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQS 147
           ++G    ++ +   K +   + D+R FG V +  G +A  VRGPV +++A+S   +    
Sbjct: 167 RKGELTAETQDKARKWLCQTYYDIRMFGAVLS-TGLNAGQVRGPVQLTFARSQHPITPLD 225

Query: 148 MQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEV 207
           + IT+   +    +     + MG K  V YG+Y   G  NP  AEKTG +D D  I+ + 
Sbjct: 226 LSITRQARTT-TVRMATGPTEMGRKPIVPYGLYRAHGFFNPFLAEKTGVTDDDLRILWDA 284

Query: 208 LVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYA 267
           L  LF+ D S+ R  G M +R ++ FTH +  G   + ++F+L++ DK +   +   D++
Sbjct: 285 LQHLFDYDRSAVR--GEMNMRGLWVFTHDDAKGCAPTHKLFELIQTDKLRNGVEVPRDFS 342

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 49/65 (75%), Gaps = 2/65 (3%)

Query: 2  LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQD-MGQPI 59
          + ++ +F++  +VR+ N NGDP + N+PR D +  +GL++DV++KRK+R+ +   +G+PI
Sbjct: 6  VSKRHEFVLLFDVRDGNPNGDPDAGNLPRIDPETMHGLVTDVALKRKVRDYVSGVLGKPI 65

Query: 60 FVQAR 64
          F+Q++
Sbjct: 66 FIQSK 70
>gi|59801386|ref|YP_208098.1| hypothetical protein, putative phage associated protein [Neisseria
           gonorrhoeae FA 1090]
 gi|59718281|gb|AAW89686.1| hypothetical protein, putative phage associated protein [Neisseria
           gonorrhoeae FA 1090]
          Length = 180

 Score = 90.9 bits (224), Expect = 7e-17,   Method: Composition-based stats.
 Identities = 52/148 (35%), Positives = 88/148 (59%), Gaps = 5/148 (3%)

Query: 108 WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESS 167
           + D+R+FG V   +  ++  VRGPV +++A+S++ +V   + IT+   +  N K   +  
Sbjct: 5   FFDIRTFGAVMTTE-VNSGQVRGPVQLAFAQSIDPIVPPEVSITRMAVT--NEKDLEKER 61

Query: 168 TMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRV 227
           TMG K+ V Y VY + G I+ N A KTGFSD D   + + L  +FE+D S+AR  G M  
Sbjct: 62  TMGRKYIVPYVVYRVHGFISANLAAKTGFSDDDLAKLWQALTLMFEHDRSAAR--GEMAA 119

Query: 228 REVFWFTHSNKLGNVSSARVFDLLEFDK 255
           R++  F H + LG+  + ++FD ++ ++
Sbjct: 120 RKLVVFKHDSALGSQPAHKLFDAVKVER 147
>gi|147678326|ref|YP_001212541.1| hypothetical protein PTH_1991 [Pelotomaculum thermopropionicum SI]
 gi|146274423|dbj|BAF60172.1| Uncharacterized protein [Pelotomaculum thermopropionicum SI]
          Length = 380

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 58/203 (28%), Positives = 105/203 (51%), Gaps = 15/203 (7%)

Query: 82  EFFDTVKEGSKKKKSVENFVKQINA-EWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSL 140
           E   +  +G K    V + +K   A E+ D+R FG V    G +A  V GPV I++A+S+
Sbjct: 171 EKLQSATKGQKLTAEVRSKIKTTMASEFYDIRMFGAVLTM-GTNAGQVLGPVQITFARSV 229

Query: 141 EKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTG----- 195
             V   ++ IT++  +  + +   + + MG K  + YG+YV  G  NP  AEK       
Sbjct: 230 SPVFPMNLTITRTAITRESDRLR-KQTEMGQKPIIPYGLYVAHGFYNPKLAEKLNPGSEL 288

Query: 196 -FSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNK--LGNVSSARVFDLLE 252
              + D +++ E L ++FE D S++R  G M  R ++ FTH ++   G   + ++F+L++
Sbjct: 289 LVKEDDLKLLWEALCNMFEYDRSASR--GEMACRGLYIFTHEDEKGYGKAPAHKLFELVK 346

Query: 253 FDKEK--QDKDSYEDYAIHLNQE 273
             +    + + S++DY + L  +
Sbjct: 347 ITERDPGRPQRSFDDYTVMLEDK 369
>gi|109646593|ref|ZP_01370497.1| Uncharacterized protein predicted to be involved in DNA repair-like
           [Desulfitobacterium hafniense DCB-2]
 gi|109641839|gb|EAT51393.1| Uncharacterized protein predicted to be involved in DNA repair-like
           [Desulfitobacterium hafniense DCB-2]
          Length = 294

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 67/244 (27%), Positives = 115/244 (47%), Gaps = 22/244 (9%)

Query: 9   MVTVEVREANANGDPLSVNMPRTDAKDYGLMSDVSIKRKIRNRMQDMGQPIF-----VQA 63
           ++ +EV  +NANGDP   + PR      G +S VS KRK+R+ + D     +     V  
Sbjct: 11  LMVIEVVNSNANGDPDRESDPRQRPNGIGEISPVSFKRKLRDLVGDHDSVFYQNLPEVYV 70

Query: 64  RDRVDDCIYSLKQRLE---NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           ++    CI   + R       E    +K   +K      FVK+    + D R FG  F  
Sbjct: 71  KNSDHYCILESRGRDRKSIQSEMSKDIKNFEQKSFLESTFVKK----YWDARIFGNTFLE 126

Query: 121 DGYSAANVR-GPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHF--VDY 177
           +G +   ++ G V      S+  V      I + TN+      E +++ M    F  V++
Sbjct: 127 EGANKGFIKTGVVQFGVGTSISPV-----NIIRHTNTNKAGVQEGKNAGMAPLAFRIVEH 181

Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSN 237
           GVY +   +NPN+A KTG +  D +++K ++   ++ + S+ RP+  +R+R  ++  H N
Sbjct: 182 GVYCMPFFVNPNYAAKTGCTQEDIDLLKLLIPKAYDLNRSAIRPD--VRIRHAWYIEHLN 239

Query: 238 KLGN 241
            LG+
Sbjct: 240 ALGS 243
>gi|149125882|ref|ZP_01850852.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
 gi|148517451|gb|EDK90656.1| CRISPR-associated protein, Csd2 family [Methylobacterium sp. 4-46]
          Length = 242

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 59/198 (29%), Positives = 103/198 (52%), Gaps = 30/198 (15%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIF 60
           L+++ DF++  +V   N NGDP + NMPR D +  +GL+SDV +KRK+RN ++   +   
Sbjct: 6   LQRRHDFVLYFDVTNGNPNGDPDAGNMPRMDPETGHGLVSDVCLKRKVRNYVEMAAE--- 62

Query: 61  VQARDRVDDCIYSLKQRLEN---KEFFDTVKEGSKKKKSVENFVKQINAE---------- 107
              RD + + IY  +  + N   +E +  ++    K ++ +    + + E          
Sbjct: 63  ADGRDPIRNRIYVTEGAVLNEKHREAYLALRPDDPKARTDKKLTPKSDEEAVLIRRFMCD 122

Query: 108 -WLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITK---STNSELNSKTE 163
            + D+R+FG V +  G +A  VRGPV +S+A+S+E V+   + IT+   +  +E N + +
Sbjct: 123 NFFDIRTFGAVLS-TGINAGQVRGPVQVSFARSVEPVLPLEVSITRMAATNEAERNERQD 181

Query: 164 LESS--------TMGTKH 173
            E          TMG KH
Sbjct: 182 GEDKAGKRGDKRTMGRKH 199
>gi|125975680|ref|YP_001039590.1| CRISPR-associated protein, Csh2 family [Clostridium thermocellum
           ATCC 27405]
 gi|125715905|gb|ABN54397.1| CRISPR-associated protein, Csh2 family [Clostridium thermocellum
           ATCC 27405]
          Length = 305

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 76/293 (25%), Positives = 147/293 (50%), Gaps = 36/293 (12%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM---- 55
           M++ + + +   +V +AN NGDPL  N PR D +    +++DV +KR IR+ + D     
Sbjct: 1   MIKNRQEILFLYDVTDANPNGDPLDENKPRIDEETGINIVTDVRLKRTIRDYLYDYKGFD 60

Query: 56  ---GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVR 112
              G+ IFV              + +E+++    +K+G  + K     V +I  + +D+R
Sbjct: 61  GSNGKDIFV--------------REIESEK--GGIKDGKARAKDFNENVDEILQKAIDIR 104

Query: 113 SFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
            FG V   D  ++    GPV  +  +SL KV   +++  K T +  + + + +  T   +
Sbjct: 105 LFGGVIPLDK-ASITFTGPVQFNMGRSLNKV---NLKHIKGTGAFASGEGKAQ-KTFREE 159

Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMR--VREV 230
           + V Y +    G IN N A++TG +D D +++ + + +  +N  + ++     R  +R V
Sbjct: 160 YIVPYSIIAFHGIINENAAKRTGLTDEDVDLLDDAMWNGTKNLITRSKMGHMPRLMLRVV 219

Query: 231 FWFTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQ--EELAEYEAK 281
           +    +  +G++ + R+   L FD E++   S +D++I L++  +ELA Y  K
Sbjct: 220 YKPGENFFIGDLQN-RIS--LNFDVEEEKIRSIKDFSIKLDELIDELANYGDK 269
>gi|154249620|ref|YP_001410445.1| CRISPR-associated protein, Csh2 family [Fervidobacterium nodosum
           Rt17-B1]
 gi|154153556|gb|ABS60788.1| CRISPR-associated protein, Csh2 family [Fervidobacterium nodosum
           Rt17-B1]
          Length = 302

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 63/215 (29%), Positives = 102/215 (47%), Gaps = 28/215 (13%)

Query: 18  NANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPIFVQARDRVDDCIYSLKQ 76
           N NGDP   N PR D  ++  L+SD+ +KR IR+ + + G  IFV+   ++DD   + ++
Sbjct: 24  NPNGDPDEENRPRMDNEREINLVSDLRLKRYIRDYLYEKGYDIFVR---KIDDKPVTAEK 80

Query: 77  RLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSAANVRGPVSISW 136
           R+E+   F   KE             +I ++ +DVR FG      G + A + GPV  +W
Sbjct: 81  RMED---FKNSKE------------DEILSKLIDVRLFGATMPVKGNNRAYI-GPVQFNW 124

Query: 137 AKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGSINPNFAEKTGF 196
             SL KV      IT    S+  +    +  T+G    V Y +    G ++   AEKT  
Sbjct: 125 GYSLNKVELLEASIT----SQFATSENAQQGTIGKDFRVKYSLIAFFGVVSGRRAEKTKL 180

Query: 197 SDADAEIIKEVLVSLFENDASSAR----PEGSMRV 227
           ++ D +++ E +V      A+ ++    P   MRV
Sbjct: 181 TNEDLKLLDEAMVKAIPLQATRSKIGQYPRLYMRV 215
>gi|145622619|ref|ZP_01778576.1| CRISPR-associated protein, Csh2 family [Petrotoga mobilis SJ95]
 gi|144946978|gb|EDJ82013.1| CRISPR-associated protein, Csh2 family [Petrotoga mobilis SJ95]
          Length = 302

 Score = 68.9 bits (167), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 60/234 (25%), Positives = 101/234 (43%), Gaps = 23/234 (9%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPI 59
           M   + + +    V++AN NGDPL+ N PR D +    L+SDV IKR IR+ +  MG+ +
Sbjct: 1   MFNGRKELLFVYSVKDANPNGDPLNANHPRYDEETGQVLVSDVRIKRTIRDELMRMGEDV 60

Query: 60  FVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
           F+    +      +LK+R E  +   +   G +  K             +D R FG  FA
Sbjct: 61  FIDGEPK------TLKERFEELKTKLSTTNGDETLKKC-----------IDTRLFGVTFA 103

Query: 120 FDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGV 179
             G  +    GPV   W +SL K   + +Q T +      +K   E  T+  ++ V + +
Sbjct: 104 L-GKESFAWTGPVQFKWGRSLHKTKVEFVQGTGA----FVTKEGGEQRTIRNEYIVPFAL 158

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWF 233
                  N   +E+T  +D D   + +   +   N  + ++ E   R     W+
Sbjct: 159 IGTYAIGNQYASERTQATDEDFNKLTQAAWNGTNNLITRSKTEHRSRFLMEIWY 212
>gi|78043120|ref|YP_360970.1| CRISPR-associated protein, Csh2 family [Carboxydothermus
           hydrogenoformans Z-2901]
 gi|77995235|gb|ABB14134.1| CRISPR-associated protein, Csh2 family [Carboxydothermus
           hydrogenoformans Z-2901]
          Length = 313

 Score = 66.2 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 63/214 (29%), Positives = 99/214 (46%), Gaps = 19/214 (8%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQPI 59
           +++   + +   + +  N NGDP   N PR D +    L+SDV +KR +R+ +Q+ G+ I
Sbjct: 4   LIKNNSEILFIYDAKLTNPNGDPDDENRPRMDYETKTNLVSDVRLKRYVRDYLQEKGKEI 63

Query: 60  FVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
           FV    +V+    +  +RLE K    T K  SK    V      +  + +DVR FG    
Sbjct: 64  FVA---KVEGETVNATERLE-KLLGKTSKNISKDDVPV------LLEKLVDVRLFGATMP 113

Query: 120 F---DGYSAANVRGPVSISWAKSLEKV-VTQSMQITKSTNSELNSKTELESSTMGTKHFV 175
               DG S+    GPV  +W  SL KV + +S  IT    S  +S +  E  TMG  + +
Sbjct: 114 IKSEDGGSSLTFTGPVQFNWGYSLNKVELVESNTIT----SRFSSTSGNEQGTMGKDYRL 169

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLV 209
            Y +    G +  + A+ T        ++ E LV
Sbjct: 170 YYSLIAFHGIVAAHRAKFTQLDMETLSLLDEALV 203
>gi|114844588|ref|ZP_01455032.1| CRISPR-associated protein TM1801 [Thermoanaerobacter ethanolicus
           X514]
 gi|114805397|gb|EAU57194.1| CRISPR-associated protein TM1801 [Thermoanaerobacter ethanolicus
           X514]
          Length = 293

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 63/229 (27%), Positives = 99/229 (43%), Gaps = 28/229 (12%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPIFVQARD 65
           + +   + +  N NGDP   N PR D  ++  L+SD+ +KR IR+ +   G  IFV+   
Sbjct: 7   EILYIYDAKLTNPNGDPDEENRPRMDYEREINLVSDLRLKRYIRDYLMLKGYDIFVRL-- 64

Query: 66  RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
            VDD   +  +R+++ E        S  +  +EN        W+DVR FG        + 
Sbjct: 65  -VDDKPVTADKRVKDLE-------DSSNEWILEN--------WIDVRMFGATMTVQKDTK 108

Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
             + GP+  +W  SL KV      IT    S  +S       T+G    V Y +    G 
Sbjct: 109 TFI-GPIQFNWGYSLNKVELLEASIT----SHFSSSETFAQGTIGKDFRVKYSLIAFSGV 163

Query: 186 INPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR----PEGSMRVREV 230
           ++ + AEKT   + D  ++ E L     N  + ++    P   MRV  V
Sbjct: 164 VSGHRAEKTKLKEDDLYLLDEALKHAIPNLVTRSKIGQYPRIYMRVEYV 212
>gi|89895517|ref|YP_519004.1| hypothetical protein DSY2771 [Desulfitobacterium hafniense Y51]
 gi|89334965|dbj|BAE84560.1| hypothetical protein [Desulfitobacterium hafniense Y51]
          Length = 334

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 67/220 (30%), Positives = 103/220 (46%), Gaps = 30/220 (13%)

Query: 7   DFMVTVEVREANANGDPLSVN-----MPRTDAKDYGLMSDVSIKRKIRNRM----QDMGQ 57
           + +    V++   N DPL+ +      P  D +    +SDVSIKR +R+ +    QD G+
Sbjct: 23  EILFVKSVKDGIPNRDPLNDSDARRLFPEEDGRIS--LSDVSIKRDVRDFVIALEQDGGK 80

Query: 58  P----IFVQARDRVDDCIYSLKQRLENKEFF--DTVKEGSKKKKSVENFVKQINAEWLDV 111
                IFVQ  ++V+D     K +L  +        K   K+KK+ E+    +     DV
Sbjct: 81  DQKNHIFVQ--EKVND-----KGKLLGRGSLAEGIAKSVGKEKKAKEDMKSVLIEHCFDV 133

Query: 112 RSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG- 170
           R+FG V++       N+ GPV   WA SL  V TQ +Q T    S ++S  E E  T G 
Sbjct: 134 RTFGIVYSVK--PKFNLTGPVQFGWAHSLHPVDTQYVQGTVVMPS-MDSTAEGEGKTQGT 190

Query: 171 --TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVL 208
             T + V + V+ +   IN   AE +G +  D E++   L
Sbjct: 191 IWTSYTVPFAVFAMPAVINAKAAEHSGMTAEDQELLLRAL 230
>gi|150398985|ref|YP_001322752.1| CRISPR-associated protein, Csh2 family [Methanococcus vannielii SB]
 gi|150011688|gb|ABR54140.1| CRISPR-associated protein, Csh2 family [Methanococcus vannielii SB]
          Length = 291

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 68/275 (24%), Positives = 132/275 (48%), Gaps = 25/275 (9%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTDAKDYGL-MSDVSIKRKIRNRMQDMGQPIFVQARD 65
           +F++  +   AN NGD L+ N PR D     L +SDV IKR IR+     G+ + VQ + 
Sbjct: 6   EFLLIWDSTMANPNGDMLNDNKPRQDEATGQLEVSDVRIKRFIRDHWISNGKNVLVQTKT 65

Query: 66  RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
             +  + + +  ++     +   E + K + + N++ +   +++DV+ FG V     Y  
Sbjct: 66  DKNGKVMTCQGIVK-----EMASENNLKDEEIPNYLLE---KYIDVKLFGAVITKPKY-- 115

Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
            ++ GP+ I+W+KS+ +   + MQ     N+   S    + +++ +K    Y ++     
Sbjct: 116 -DITGPLQIAWSKSVHEADVKFMQ----GNAAYASGEGKDQASIWSKFISPYALFKTYAV 170

Query: 186 INPNFAEKTGF--SDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKLGNVS 243
            N   AEK G   SD D    K+ L++  +N  S+++ +    + EV +  ++NKL    
Sbjct: 171 YNDKVAEKQGINVSDDDLNDFKDALLNGLKNYRSTSKNQMPRLLIEVIY--NNNKLDG-- 226

Query: 244 SARVFDLLEFDKEKQDKDSYEDYAIHLNQEELAEY 278
                + ++   E QD +  +   + +N  +L+EY
Sbjct: 227 ---ELNYVDITYENQDLELRDISQVVINLGKLSEY 258
>gi|109645854|ref|ZP_01369774.1| CRISPR-associated protein, CT1132 family [Desulfitobacterium
           hafniense DCB-2]
 gi|109643803|gb|EAT53356.1| CRISPR-associated protein, CT1132 family [Desulfitobacterium
           hafniense DCB-2]
          Length = 316

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 66/213 (30%), Positives = 101/213 (47%), Gaps = 30/213 (14%)

Query: 14  VREANANGDPLSVN-----MPRTDAKDYGLMSDVSIKRKIRNRM----QDMGQP----IF 60
           V++   N DPL+ +      P  D +    +SDVSIKR +R+ +    QD G+     IF
Sbjct: 12  VKDGIPNRDPLNDSDARRLFPEEDGRIS--LSDVSIKRDVRDFVIALEQDGGKDQKNHIF 69

Query: 61  VQARDRVDDCIYSLKQRLENKEFF--DTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVF 118
           VQ  ++V+D     K +L  +        K   K+KK+ E+    +     DVR+FG V+
Sbjct: 70  VQ--EKVND-----KGKLLGRGSLAEGIAKSVGKEKKAKEDMKSVLIEHCFDVRTFGIVY 122

Query: 119 AFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG---TKHFV 175
           +       N+ GPV   WA SL  V TQ +Q T    S ++S  + E  T G   T + V
Sbjct: 123 SVK--PKFNLTGPVQFGWAHSLHPVDTQYVQGTVVMPS-MDSTADGEGKTQGTIWTSYTV 179

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVL 208
            + V+ +   IN   AE +G +  D E++   L
Sbjct: 180 PFAVFAMPAVINAKAAEHSGMTAEDQELLLRAL 212
>gi|114568025|ref|YP_755179.1| hypothetical protein Swol_2520 [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
 gi|114338960|gb|ABI69808.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei
           str. Goettingen]
          Length = 301

 Score = 64.3 bits (155), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 72/277 (25%), Positives = 117/277 (42%), Gaps = 30/277 (10%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLM-SDVSIKRKIRNRMQDMGQPIF 60
            +Q+ +++    V++AN NGDPL+ N PR D      M SDV +KR  R+     G+ +F
Sbjct: 3   FKQRREYLFLYTVKDANPNGDPLNENHPRYDGDTAQAMASDVRVKRTTRDEWVRSGEIVF 62

Query: 61  VQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAF 120
           V    +      SLK R E             KK + ++  ++I  + LDVR FG  FA 
Sbjct: 63  VDGEPK------SLKTRFE-----------ELKKITGKSDAREIMKQCLDVRLFGVTFAL 105

Query: 121 DGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDY--- 177
            G  A    GPV   W +SL     + +Q T +  +E       + S    ++ V +   
Sbjct: 106 -GKEAFAWTGPVQFKWGRSLHSASFEFVQGTAAFATERGGADNRQRS-FRNEYLVPFALM 163

Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFE-NDASSARPEGSMRVREVFWFTHS 236
           GVY I           + ++DA  E ++ +L  L++  D    R +   + R +   T+ 
Sbjct: 164 GVYAIANQY------ASQYTDAADEDLQRMLDGLWQGTDNLITRSKNEHKSRLLIEITYK 217

Query: 237 NKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQE 273
                   A    +   D+E +  D  +  A+   QE
Sbjct: 218 EDFNGKIGALDDKVTLLDREGKVMDREKQKALRSLQE 254
>gi|89895552|ref|YP_519039.1| hypothetical protein DSY2806 [Desulfitobacterium hafniense Y51]
 gi|89335000|dbj|BAE84595.1| hypothetical protein [Desulfitobacterium hafniense Y51]
          Length = 287

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 58/183 (31%), Positives = 88/183 (48%), Gaps = 23/183 (12%)

Query: 39  MSDVSIKRKIRNRMQDMGQP--------IFVQARDRVDDCIYSLKQRLENKEFF--DTVK 88
           +SDVSIKR +R+ + D+ +         IFVQ  ++V+D     K +L  +        K
Sbjct: 11  LSDVSIKRDVRDFVIDLEEDGGKEQKNHIFVQ--EKVND-----KGKLLGRGSLAEGIAK 63

Query: 89  EGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSM 148
              K+KK+ E+    +     DVR+FG V++       N+ GPV   WA SL  V TQ +
Sbjct: 64  RVGKEKKAKEDMKSVLIEHCFDVRTFGIVYSVK--PKFNLTGPVQFGWAHSLHPVDTQYV 121

Query: 149 QITKSTNSELNSKTELESSTMG---TKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIK 205
           Q T    S  +S  E E  T G   T + V + V+V+   IN   A+ +G +  D E++ 
Sbjct: 122 QGTVVMPST-DSTAEGEGKTQGTIWTSYTVPFAVFVMPAVINAKAAQHSGMTPEDQELLL 180

Query: 206 EVL 208
             L
Sbjct: 181 RAL 183
>gi|153869181|ref|ZP_01998851.1| CRISPR-associated protein TM1801 [Beggiatoa sp. PS]
 gi|152074276|gb|EDN71148.1| CRISPR-associated protein TM1801 [Beggiatoa sp. PS]
          Length = 327

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 59/213 (27%), Positives = 94/213 (44%), Gaps = 23/213 (10%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYG--LMSDVSIKRKIRN--------- 50
           L  + + +   E  + N NGDPL  N PRTD  D G   ++DV IKR +R+         
Sbjct: 4   LTNRYEILFLYECTDCNPNGDPLDENRPRTDP-DTGEATITDVRIKRTVRDYFIAQEPDV 62

Query: 51  --RMQDMGQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEW 108
             R+ + G+ I ++  ++ D  +     R   K+F   + EG KKK   +   + I ++ 
Sbjct: 63  EKRLAN-GKEILIRDTEKPDGTLSQGSDRA--KQFSQELTEGKKKKGDNQKLQEVILSQC 119

Query: 109 LDVRSFGQVFAF-DGYSAANVRGPVSIS-WAKSLEKVVTQSMQITKSTNSELNSKTELES 166
           +D R FG       G S+  + GPV  S + +SL KV    +Q T +      SK   + 
Sbjct: 120 IDARLFGTAVPLGKGESSLKLTGPVQFSAFNRSLHKVSPVMVQQTAA----FASKATAQQ 175

Query: 167 STMGTKHFVDYGVYVIKGSINPNFAEKTGFSDA 199
                +  V Y +    G +N   A+ T  + A
Sbjct: 176 KGFAERWLVPYALIAAYGMVNEGAAQTTHMTKA 208
>gi|109672064|ref|ZP_01374306.1| crispr-associated protein, Csh2 family [Campylobacter concisus
           13826]
 gi|112800899|gb|EAT98243.1| crispr-associated protein, Csh2 family [Campylobacter concisus
           13826]
          Length = 313

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 69/302 (22%), Positives = 132/302 (43%), Gaps = 33/302 (10%)

Query: 4   QKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDMGQPIFVQ 62
           QK + +   +    N NGD L  N PR D +     ++DV IKR IR+ +    +     
Sbjct: 2   QKKEILFLWDGENWNPNGDMLKDNAPRRDDETGVAEVTDVRIKRTIRDEIMKKDEASIFI 61

Query: 63  ARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDG 122
              R++D +   K           +++    K++     K+I ++++D+R+FG V     
Sbjct: 62  KEYRIEDALLDAKT---------AIRQSINIKQNKSELQKEILSKFIDIRAFGGVLPISD 112

Query: 123 -----------YSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNS---KTELESST 168
                       +     GPV    +KSL KV  + ++ T + +S+ +    K + + +T
Sbjct: 113 KDKMKQDKEIKTAGVQFTGPVQFRLSKSLNKVEVEHVKGTGAFSSDYDPNDPKKQKDQAT 172

Query: 169 MGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVR 228
              + F+ Y ++   G I+   A KTGFS+AD   I + L    +N  + ++   + R  
Sbjct: 173 FREEEFIKYAIFATYGIIDNYNAAKTGFSEADEAKILKALWHGTKNLTTRSKIGQTPRFM 232

Query: 229 EVFWFTHSNKLGNVSSARVFDLLEFDKEKQDK--DSYEDYAIHLN--QEELAEYEAKGLQ 284
            +  +      G+++++     +    EK+D+   S  +Y I     + +LA Y A   +
Sbjct: 233 LIITYKDDTFAGDLNNS-----ISLKSEKEDRVIRSINEYTIDFTNLKNKLARYAANIEK 287

Query: 285 VE 286
           +E
Sbjct: 288 IE 289
>gi|21226665|ref|NP_632587.1| hypothetical protein MM_0563 [Methanosarcina mazei Go1]
 gi|20904948|gb|AAM30259.1| hypothetical protein MM_0563 [Methanosarcina mazei Go1]
          Length = 291

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 60/232 (25%), Positives = 104/232 (44%), Gaps = 18/232 (7%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTDAKDYGL-MSDVSIKRKIRNRMQDMGQPIFVQARD 65
           ++++  +   AN NGD L+ N PR D     L +SDV IKR +R+  Q  G  + V+ + 
Sbjct: 6   EYLLVWDSTMANPNGDMLNDNKPRHDEITGQLEVSDVRIKRFVRDEWQSRGHNVLVRTKK 65

Query: 66  RVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYSA 125
             D  + S    ++       V E +K K++       +  E++DVR FG V     Y  
Sbjct: 66  GDDGKVMSCTALIKE------VMEKAKVKEA--ELPSHLLNEYIDVRLFGAVITKPKY-- 115

Query: 126 ANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKGS 185
            ++ GP+ + W+KS+     + MQ     NS          ST+ +K+   Y ++     
Sbjct: 116 -DITGPLQVMWSKSVNPAEIKFMQ----GNSAYAGGEGKSQSTIWSKYISPYAIFKTYAV 170

Query: 186 INPNFAEKTGF--SDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTH 235
            N N A++ G   S+ D       L++   N  S+++ +    + EV +  H
Sbjct: 171 YNDNAAKRQGIETSEKDLNEFTAALINGLINYRSTSKNQMPRLLVEVIYKEH 222
>gi|146295132|ref|YP_001178903.1| CRISPR-associated protein, Csh2 family [Caldicellulosiruptor
           saccharolyticus DSM 8903]
 gi|145408708|gb|ABP65712.1| CRISPR-associated protein, Csh2 family [Caldicellulosiruptor
           saccharolyticus DSM 8903]
          Length = 294

 Score = 60.8 bits (146), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 57/234 (24%), Positives = 98/234 (41%), Gaps = 34/234 (14%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPI 59
           ++++  + + T + +  N NGDP   N PR D  K+  L+SDV +KR IR+   D G PI
Sbjct: 5   IIDKNSEILFTYDAKLCNPNGDPDEENRPRMDWEKEINLVSDVRVKRYIRDYADDQGIPI 64

Query: 60  FVQARDRVDDCIYSLKQRLENKEFF--DTVKEGSKKKKSVENFVKQINAEWLDVRSFGQV 117
           +V              +++E K     + +K   +    +E F+        D+R FG  
Sbjct: 65  YV--------------RKIEGKSVKPEEVIKSVGEDIDELETFI--------DIRLFGAT 102

Query: 118 FAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDY 177
                 +   + GPV  +W  SL KV      IT    S   S  + +   +G  + V Y
Sbjct: 103 IPIKKETRTYI-GPVQFNWGYSLNKVELLEASIT----SHFASDEKKQQGAIGKDYRVKY 157

Query: 178 GVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSAR----PEGSMRV 227
                 G ++   A++T  ++ D + +   +       A+ ++    P   MRV
Sbjct: 158 SFIAFSGIVSARRAKETRLTEDDLKFLDRAMKEAIPLQATRSKIGQYPRLYMRV 211
>gi|134045809|ref|YP_001097295.1| CRISPR-associated protein, Csh2 family [Methanococcus maripaludis
           C5]
 gi|132663434|gb|ABO35080.1| CRISPR-associated protein, Csh2 family [Methanococcus maripaludis
           C5]
          Length = 292

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 55/206 (26%), Positives = 95/206 (46%), Gaps = 34/206 (16%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQD-MGQPIFVQAR 64
           + +   +  + N NGD  + N PR D   +  L+SDV +KR IR+  +  +G+ IF+   
Sbjct: 14  EILFIYDAEKTNPNGDMDNQNKPRMDWDTNTNLVSDVRLKRYIRDYFEKYLGEEIFIT-- 71

Query: 65  DRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYS 124
                               +  K+   + K +++  KQ + + +DVR FG VFA +G S
Sbjct: 72  --------------------ENAKDSKDRAKQLDSNKKQ-HTDLIDVRLFGAVFAEEG-S 109

Query: 125 AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKG 184
            +++ GPV  +W  SL +V  Q      S+ S            +G  + V Y V    G
Sbjct: 110 NSHISGPVQFNWGYSLNEVELQESTTITSSFSSGTG--------VGKDYRVKYSVIAFNG 161

Query: 185 SINPNFAEKTGFSDADAEIIKEVLVS 210
           +IN N A+ +  S+ D  ++ E +++
Sbjct: 162 AINGNAAKTSTLSEKDIVLLDEAILN 187
>gi|116753951|ref|YP_843069.1| CRISPR-associated protein, Csh2 family [Methanosaeta thermophila
           PT]
 gi|116665402|gb|ABK14429.1| CRISPR-associated protein, Csh2 family [Methanosaeta thermophila
           PT]
          Length = 321

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 54/222 (24%), Positives = 100/222 (45%), Gaps = 35/222 (15%)

Query: 2   LEQKIDFMVTVEVREANANGDPLSVNMPRTDAKD-YGLMSDVSIKRKIRNRMQDM-GQPI 59
           +  + + +   ++R+ N NGDP+  N PR D +    L++DV +KR IR+ + +  G  I
Sbjct: 4   VSNRSELLFIYDIRDGNPNGDPMDENKPRMDEETGVNLVTDVRLKRTIRDYLHNFKGLEI 63

Query: 60  FVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
           FV+      + IY         E    +++  ++ K      ++I +E +DVR FG V  
Sbjct: 64  FVR------EIIYD--------EENGYIQDAKRRAKDFGEDQERILSECIDVRLFGGVIP 109

Query: 120 F------------DGYSAAN---VRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTEL 164
                        +G S  +     GPV     +SL +V  + ++ T +      SK  +
Sbjct: 110 LEKRRQNKQKDEGEGSSKGDSITYTGPVQFKMGRSLHRVALKHIKGTGA----FASKEGM 165

Query: 165 ESSTMGTKHFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKE 206
             +T   ++ + Y + +  G IN N A+ T  ++ D  ++ E
Sbjct: 166 TQATFREEYVLPYSLILFYGIINENAAKHTALTEEDVRLLLE 207
>gi|124004088|ref|ZP_01688935.1| crispr-associated protein, Csh2 family [Microscilla marina ATCC
           23134]
 gi|123990667|gb|EAY30147.1| crispr-associated protein, Csh2 family [Microscilla marina ATCC
           23134]
          Length = 303

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 67/280 (23%), Positives = 129/280 (46%), Gaps = 29/280 (10%)

Query: 1   MLEQKIDFMVTVEVREANANGDPLSVNMPRTDAKDYGLM-SDVSIKRKIRNRMQDM---- 55
           +++ + + +   E+  AN NG+PL  N PR D++D  ++ SDV +KR +R+   +     
Sbjct: 4   VIQNRSEILFLYEIENANPNGNPLDENRPRFDSEDSTIIVSDVRLKRTVRDYWYEYEGFN 63

Query: 56  ---GQPIFVQARDRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVR 112
              G+ IFV+             Q  E  + +  V +G ++ K+     +++    +DVR
Sbjct: 64  GEGGKDIFVRE-----------TQYQEGDKSY--VSDGKRRAKAFGESKEKVLEACIDVR 110

Query: 113 SFGQVFAFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
           +FG V      ++  + GP      +SL KV   + Q T +      S  +   +T  T+
Sbjct: 111 TFGGVIPLTK-ASITLTGPTQFQMGRSLHKVEIATEQGTGA----FASGDKKSQATFRTE 165

Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFW 232
           + V Y +    G IN   A+ +  ++AD  ++ + L    +N  S ++   +  +  V  
Sbjct: 166 YKVPYALIGFNGIINEKAAKYSQMTEADRALLLDGLWEGTKNLISRSKFGQTPVLMLVVN 225

Query: 233 FTHSNKLGNVSSARVFDLLEFDKEKQDKDSYEDYAIHLNQ 272
           +  ++ LGN+   RV   L+ +K +    S  D+ + L+Q
Sbjct: 226 YKDASHLGNLRQ-RV--ALQTEKNELALRSLNDFELDLSQ 262
>gi|108803121|ref|YP_643058.1| CRISPR-associated protein Csh2 [Rubrobacter xylanophilus DSM 9941]
 gi|108764364|gb|ABG03246.1| CRISPR-associated protein Csh2 [Rubrobacter xylanophilus DSM 9941]
          Length = 314

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 54/199 (27%), Positives = 87/199 (43%), Gaps = 16/199 (8%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDMGQPIFVQARD 65
           D +   + +  N NGDP   N PR D A    L+SDV +KR +R+   + G+ I+V+  +
Sbjct: 7   DILYLYDAKLTNPNGDPDDENRPRMDEATGRNLVSDVRLKRYLRDYWLNAGEDIWVRRTE 66

Query: 66  RVDDCIYSLKQRLE------NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFA 119
           + +    S KQR+       N+E    + E  ++ +    F   +     DVR FG    
Sbjct: 67  QEETT--SAKQRMSVLLEDYNRENGTNLNE--RQARQSREFKDWLLGRLRDVRLFGATMP 122

Query: 120 FDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGV 179
            +  ++    GPV  SW  SL +V   +     +T S   +  E E  T G    V Y +
Sbjct: 123 MEN-TSVTFTGPVQFSWGYSLNRVEINN----SATISSHFAGRENEYGTFGKDWRVHYSL 177

Query: 180 YVIKGSINPNFAEKTGFSD 198
               G ++ N A  T  ++
Sbjct: 178 LAFYGIVSRNRARHTRLTE 196
>gi|89211072|ref|ZP_01189450.1| CRISPR-associated protein TM1801 [Halothermothrix orenii H 168]
 gi|89159297|gb|EAR78967.1| CRISPR-associated protein TM1801 [Halothermothrix orenii H 168]
          Length = 302

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 54/205 (26%), Positives = 92/205 (44%), Gaps = 21/205 (10%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTD-AKDYGLMSDVSIKRKIRNRMQDM-GQPIFVQAR 64
           + +   + + +N NGD  + N PR D      L+SDV +KR IR+ +Q + G+ +FV   
Sbjct: 12  EILFLYDAKRSNPNGDMDNENKPRMDWDTGTNLVSDVRLKRYIRDYLQKVKGKNLFVSEE 71

Query: 65  DRVDDCIYSLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDGYS 124
                      ++ EN+      ++ S  K   +  + +I  E  DV  FG V    G  
Sbjct: 72  ----------AEKAENRVHQILGRKPSSNKPVTDEELTKIAEECCDVIYFGAVLGTSG-G 120

Query: 125 AANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTKHFVDYGVYVIKG 184
             ++ GPV  +W  SL KV    +Q +K+  S  +S        +G  + V Y      G
Sbjct: 121 NTHLTGPVQFNWGYSLNKV---ELQESKTITSSFSS-----GEGVGKDYRVKYSFIAFSG 172

Query: 185 SINPNFAEKTGFSDADAEIIKEVLV 209
            IN   A+ T  ++ D +++ E ++
Sbjct: 173 GINGLAAKDTKLTENDVKLLDEAII 197
>gi|154175378|ref|YP_001408164.1| crispr-associated protein, Csh2 family [Campylobacter curvus
           525.92]
 gi|112802844|gb|EAU00188.1| crispr-associated protein, Csh2 family [Campylobacter curvus
           525.92]
          Length = 306

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 69/269 (25%), Positives = 125/269 (46%), Gaps = 40/269 (14%)

Query: 18  NANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMGQ-PIFVQARDR---VDDCIY 72
           N NGD L  N PR D + +    +DV IKR IR+ +    +  IFV+  ++   V DC  
Sbjct: 16  NPNGDMLRENAPRIDDETNIAEATDVRIKRTIRDEIMKKDEGAIFVKEYNKDENVLDCKT 75

Query: 73  SLKQRLENKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVFAFDG---------- 122
           ++++ +  ++          +K ++E   ++I ++++D+R+FG V               
Sbjct: 76  AIREVINIRQ----------EKAAIE---REILSKFIDIRAFGGVLPISDKDEMKADKEI 122

Query: 123 -YSAANVRGPVSISWAKSLEKVVTQSMQITKS--TNSELNSKTELESSTMGTKHFVDYGV 179
             +     GPV    +KSL +V  Q ++ T +  + S+  +KT  E   +G   F  YGV
Sbjct: 123 KTAGVQFTGPVQFRMSKSLHRVQIQHIKGTGAFASGSDKGAKTFREEDFLGYAIFATYGV 182

Query: 180 YVIKGSINPNFAEKTGFSDADAEIIKEVLVSLFENDASSARPEGSMRVREVFWFTHSNKL 239
           +      + N A+KT FS+ DA +I   L +  +N  + ++   + R   +  +      
Sbjct: 183 F------DNNNAKKTNFSEDDANVILSALWNGTKNLITRSKMGQTPRFMLIITYKDDTFA 236

Query: 240 GNVSSARVFDLLEFDKEKQDKDSYEDYAI 268
           G++++     L+  DKE +   S  DY I
Sbjct: 237 GDLNNT--IKLIS-DKEDEAIRSVNDYTI 262
>gi|84489232|ref|YP_447464.1| hypothetical protein Msp_0420 [Methanosphaera stadtmanae DSM 3091]
 gi|84372551|gb|ABC56821.1| conserved hypothetical protein [Methanosphaera stadtmanae DSM 3091]
          Length = 319

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 27/208 (12%)

Query: 7   DFMVTVEVREANANGDPLSVNMPRTDAK-DYGLMSDVSIKRKIRNRMQDMG-QPIFVQAR 64
           + +   ++ +AN NGDPL  N PR D + +  +++DV +KR IR+ +++   + +FV+ +
Sbjct: 5   ELLFLYDISDANPNGDPLDENKPRIDEETEINIVTDVRLKRTIRDYLEEFANEELFVKEK 64

Query: 65  DRVDDCIYSLKQRLE------NKEFFDTVKEGSKKKKSVENFVKQINAEWLDVRSFGQVF 118
              +  +   K R E      N E FD  K   K           I  + +D R FG   
Sbjct: 65  AGKEGGLQDAKTRAEDYLPEGNYESFDEAKNALK---------NNILEKCIDARLFGGTI 115

Query: 119 AFD------GYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMGTK 172
             +         +  + GPV     +SL KV    MQ  K T +   SK      T   +
Sbjct: 116 PLELKLKKKQTGSITLTGPVQFRMGRSLHKV---DMQYIKGTGA-FASKDGKSQKTFREE 171

Query: 173 HFVDYGVYVIKGSINPNFAEKTGFSDAD 200
           + + Y +    G IN N A+ T   + D
Sbjct: 172 YILPYSLIAFYGVINENAAKSTNLREDD 199
>gi|124521530|ref|ZP_01696443.1| CRISPR-associated protein, CT1132 family [Bacillus coagulans 36D1]
 gi|124496739|gb|EAY44321.1| CRISPR-associated protein, CT1132 family [Bacillus coagulans 36D1]
          Length = 318

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 60/213 (28%), Positives = 99/213 (46%), Gaps = 30/213 (14%)

Query: 14  VREANANGDPLSVN-----MPRTDAKDYGLMSDVSIKRKIRNRMQD--------MGQPIF 60
           V++   N DPL+ +      P  D +    +SDVSIKR +R+ + D            IF
Sbjct: 14  VKDGIPNRDPLNDSDARRIFPEEDGRIS--LSDVSIKRDVRDFVIDYQADGGSQQKNYIF 71

Query: 61  VQARDRVDDCIYSLKQRLENK-EFFDTVKEGSKKKKSVENFVKQINAEW-LDVRSFGQVF 118
           VQ +       ++ K +L  +    + + +   K+K  +  +K +  E   DVR+FG V+
Sbjct: 72  VQEK-------FNEKGKLLGRGSLAEGIAKAVGKEKESKTDMKSVLLEHSFDVRTFGVVY 124

Query: 119 AFDGYSAANVRGPVSISWAKSLEKVVTQSMQITKSTNSELNSKTELESSTMG---TKHFV 175
           +       N+ GPV   WA S+  V +Q +Q T    S  +SK + E  T G   T + V
Sbjct: 125 SVK--PKFNLTGPVQFGWAHSMHPVDSQYVQGTVVMPST-DSKGDEEGKTQGTIWTSYTV 181

Query: 176 DYGVYVIKGSINPNFAEKTGFSDADAEIIKEVL 208
            + V+ + G IN   AE +  ++ D E++   L
Sbjct: 182 PFAVFAMPGIINAKNAEHSQMTEEDQELLLRAL 214
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.314    0.130    0.360 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 978,677,796
Number of Sequences: 5470121
Number of extensions: 38056191
Number of successful extensions: 91415
Number of sequences better than 1.0e-05: 93
Number of HSP's better than  0.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 91097
Number of HSP's gapped (non-prelim): 104
length of query: 291
length of database: 1,894,087,724
effective HSP length: 131
effective length of query: 160
effective length of database: 1,177,501,873
effective search space: 188400299680
effective search space used: 188400299680
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 129 (54.3 bits)
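
The effective lengths and bit-score thresholds printed above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the BLAST output. It assumes the effective HSP length is subtracted once from the query and once per database sequence, and that bit scores follow the usual Karlin-Altschul normalization (lambda * S - ln K) / ln 2. Pairing S1 with the first Lambda/K line and S2 with the second is an inference from the printed values, not something the report states. The function names are hypothetical. Because Lambda and K are printed rounded, agreement is expected only to the printed precision.

```python
import math

def effective_lengths(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumes the effective HSP length is removed once from the query
    and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db

def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Values copied from the statistics footer above.
eff_q, eff_db, space = effective_lengths(291, 1_894_087_724, 5_470_121, 131)
print(eff_q, eff_db, space)

# S2 uses the second Lambda/K pair, S1 the first (assumed pairing).
print(round(bit_score(129, 0.267, 0.0410), 1))
print(round(bit_score(42, 0.314, 0.130), 1))
```

With the footer values, the three lengths reproduce the reported 160, 1,177,501,873 and 188400299680, and the two thresholds round to the reported 54.3 and 22.0 bits.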