BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= PG1774 
         (150 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|34541619|ref|NP_906098.1|  hypothetical protein PG2031 [P...   288   7e-77
gi|150010075|ref|YP_001304818.1|  hypothetical protein BDI_3...   104   1e-21
gi|154491976|ref|ZP_02031602.1|  hypothetical protein PARMER...   103   2e-21
gi|156109355|gb|EDO11100.1|  hypothetical protein BACOVA_030...    87   2e-16
gi|153808345|ref|ZP_01961013.1|  hypothetical protein BACCAC...    86   5e-16
gi|29347828|ref|NP_811331.1|  hypothetical protein BT_2418 [...    85   1e-15
gi|150002747|ref|YP_001297491.1|  hypothetical protein BVU_0...    84   2e-15
gi|53711883|ref|YP_097875.1|  hypothetical protein BF0592 [B...    84   3e-15
gi|86131885|ref|ZP_01050482.1|  hypothetical protein MED134_...    80   3e-14
gi|116246348|ref|XP_001230367.1|  ENSANGP00000030003 [Anophe...    80   4e-14
gi|149372189|ref|ZP_01891459.1|  hypothetical protein SCB49_...    77   2e-13
gi|156862767|gb|EDO56198.1|  hypothetical protein BACUNI_001...    75   9e-13
gi|89891492|ref|ZP_01202997.1|  conserved hypothetical prote...    75   1e-12
gi|120435794|ref|YP_861480.1|  hypothetical protein GFO_1440...    72   7e-12
gi|146297908|ref|YP_001192499.1|  hypothetical protein Fjoh_...    72   7e-12
gi|150026044|ref|YP_001296870.1|  hypothetical protein FP200...    72   8e-12
gi|88714202|ref|ZP_01108278.1|  hypothetical protein FB2170_...    69   1e-10
gi|126663446|ref|ZP_01734443.1|  hypothetical protein FBBAL3...    68   2e-10
gi|86141621|ref|ZP_01060167.1|  hypothetical protein MED217_...    67   5e-10
gi|91214899|ref|ZP_01251872.1|  hypothetical protein P700755...    65   1e-09
gi|88803471|ref|ZP_01118997.1|  hypothetical protein PI23P_1...    65   1e-09
gi|86133081|ref|ZP_01051663.1|  hypothetical protein MED152_...    65   1e-09
gi|88805384|ref|ZP_01120903.1|  hypothetical protein RB2501_...    65   1e-09
gi|83857050|ref|ZP_00950578.1|  hypothetical protein CA2559_...    63   6e-09
gi|126645441|ref|ZP_01717985.1|  hypothetical protein ALPR1_...    60   3e-08
gi|124006212|ref|ZP_01691047.1|  conserved hypothetical prot...    58   2e-07
gi|94502325|ref|ZP_01308799.1|  Hypothetical protein SMU_153...    57   4e-07
>gi|34541619|ref|NP_906098.1| hypothetical protein PG2031 [Porphyromonas gingivalis W83]
 gi|34397937|gb|AAQ66997.1| hypothetical protein PG_2031 [Porphyromonas gingivalis W83]
          Length = 150

 Score =  288 bits (737), Expect = 7e-77,   Method: Composition-based stats.
 Identities = 149/150 (99%), Positives = 150/150 (100%)

Query: 1   LPLRQKRLYWPLFFRDDLIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLE 60
           +PLRQKRLYWPLFFRDDLIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLE
Sbjct: 1   MPLRQKRLYWPLFFRDDLIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLE 60

Query: 61  DNGDFYFQWDKNSGEKASVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVI 120
           DNGDFYFQWDKNSGEKASVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVI
Sbjct: 61  DNGDFYFQWDKNSGEKASVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVI 120

Query: 121 DYIAPEEVADMEHFWNNLISRLRLSLGAPE 150
           DYIAPEEVADMEHFWNNLISRLRLSLGAPE
Sbjct: 121 DYIAPEEVADMEHFWNNLISRLRLSLGAPE 150
>gi|150010075|ref|YP_001304818.1| hypothetical protein BDI_3494 [Parabacteroides distasonis ATCC
           8503]
 gi|149938499|gb|ABR45196.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 127

 Score =  104 bits (260), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 56/128 (43%), Positives = 85/128 (66%), Gaps = 2/128 (1%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M+K K  +EY+FD+VS RSLW +L+T  GLS WFA +V + DN  + F+W K + E A+V
Sbjct: 1   MKKEKFHIEYIFDKVSRRSLWNHLTTALGLSAWFADEVIINDNL-YTFKWSKEAQE-ATV 58

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           ++ +    IR++W  E+D+NA FEF+IH  ELTG  +LE+ D+  P E  D  + W++ +
Sbjct: 59  IDSKPENFIRYRWVDEEDENAYFEFIIHTIELTGSTALEITDFSEPGEKKDSINLWDSQV 118

Query: 140 SRLRLSLG 147
             L+ +LG
Sbjct: 119 EDLKRTLG 126
>gi|154491976|ref|ZP_02031602.1| hypothetical protein PARMER_01607 [Parabacteroides merdae ATCC
           43184]
 gi|154088217|gb|EDN87262.1| hypothetical protein PARMER_01607 [Parabacteroides merdae ATCC
           43184]
          Length = 127

 Score =  103 bits (258), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 58/129 (44%), Positives = 84/129 (65%), Gaps = 4/129 (3%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKAS 78
           M+K K  +EY+FD+VS RSLW +L+TP GLS WFA DV +  NG+ Y F+W+K + ++A 
Sbjct: 1   MKKEKFHIEYIFDKVSRRSLWNHLTTPPGLSAWFADDVII--NGNIYVFKWNK-AEQEAE 57

Query: 79  VLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNL 138
           VL  +    IR++W  E+D+NA FEF IH  ELTG  SL++ D+   +E  D    W+  
Sbjct: 58  VLSIKPEMSIRYRWMDEEDENAYFEFQIHSHELTGSTSLQITDFAEQDEKKDSIDLWDTQ 117

Query: 139 ISRLRLSLG 147
           +  L+ +LG
Sbjct: 118 VEELKRTLG 126
>gi|156109355|gb|EDO11100.1| hypothetical protein BACOVA_03002 [Bacteroides ovatus ATCC 8483]
          Length = 128

 Score = 87.4 bits (215), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 46/128 (35%), Positives = 73/128 (57%), Gaps = 1/128 (0%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M+K K+ LEY+ +  S   LW  +STP GL +WFA D  + D+    F W K    KA +
Sbjct: 1   MKKEKIHLEYLLNATSKNILWGAISTPTGLEDWFA-DKVISDDKIVEFHWGKTEQRKAEI 59

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           +  R    IRF+W  ++++   FE  + ++ELT    LE+ D+  P+EV DM+  W + +
Sbjct: 60  IAIRSFSFIRFRWQDDENERDYFEIKMTYNELTSDYVLEITDFAEPDEVDDMKELWESQV 119

Query: 140 SRLRLSLG 147
           ++LR + G
Sbjct: 120 AKLRRTCG 127
>gi|153808345|ref|ZP_01961013.1| hypothetical protein BACCAC_02639 [Bacteroides caccae ATCC 43185]
 gi|149129248|gb|EDM20464.1| hypothetical protein BACCAC_02639 [Bacteroides caccae ATCC 43185]
          Length = 128

 Score = 86.3 bits (212), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 46/128 (35%), Positives = 72/128 (56%), Gaps = 1/128 (0%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M+K K+ LEY+ +  S   LW  +STP GL +WFA D  + D+    F W K    KA +
Sbjct: 1   MKKEKIHLEYLLNATSKNILWAAISTPTGLEDWFA-DKVISDDKIVEFHWGKTEQRKAEI 59

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
              R    IRF+W  ++++   FE  + ++ELT    LE+ D+  P+EV DM+  W + +
Sbjct: 60  TAIRSFSFIRFRWEDDENERDYFEIKMTYNELTSDYVLEITDFAEPDEVDDMKELWESQV 119

Query: 140 SRLRLSLG 147
           ++LR + G
Sbjct: 120 AKLRRTCG 127
>gi|29347828|ref|NP_811331.1| hypothetical protein BT_2418 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29339730|gb|AAO77525.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 128

 Score = 85.1 bits (209), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 1/128 (0%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M+K K+ LEY+ +  S   LW  +STP GL +WFA D  + D+    F W K     A +
Sbjct: 1   MKKEKIHLEYLLNATSKNILWAAISTPTGLEDWFA-DKVVSDDKIVEFHWGKTEQRNAEI 59

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           +  R    IRF+W  ++++   FE  + ++ELT    LE+ D+   +EVADM+  W + +
Sbjct: 60  IAIRSFSFIRFRWQDDENERDYFEIKMTYNELTSDYVLEITDFAEADEVADMKELWESQV 119

Query: 140 SRLRLSLG 147
           ++LR + G
Sbjct: 120 AKLRRTCG 127
>gi|150002747|ref|YP_001297491.1| hypothetical protein BVU_0141 [Bacteroides vulgatus ATCC 8482]
 gi|149931171|gb|ABR37869.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 128

 Score = 84.3 bits (207), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 49/124 (39%), Positives = 69/124 (55%), Gaps = 1/124 (0%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M K K++LEY+    S   +W  +STP GL  WFA  V  +D   F F W K    +A V
Sbjct: 1   MRKEKIRLEYMLKAGSGNIVWSIISTPSGLETWFADKVIFKDKV-FTFYWGKTETRQAEV 59

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
              R    IRF+W  ++D  A FE  + ++ELT    LEVID+ AP+EV D +  W++ I
Sbjct: 60  TNFRVNSFIRFRWLDDEDPKAYFELKMVYNELTSDYMLEVIDWAAPDEVEDTKELWDSEI 119

Query: 140 SRLR 143
            +L+
Sbjct: 120 EKLK 123
>gi|53711883|ref|YP_097875.1| hypothetical protein BF0592 [Bacteroides fragilis YCH46]
 gi|60680111|ref|YP_210255.1| hypothetical protein BF0542 [Bacteroides fragilis NCTC 9343]
 gi|52214748|dbj|BAD47341.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
 gi|60491545|emb|CAH06297.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
          Length = 128

 Score = 84.0 bits (206), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 1/128 (0%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M+K K+ LEY+ +  S   LW  +STP GL +WFA D  + D+    F W K    +A +
Sbjct: 1   MKKEKIHLEYLLNATSKNILWSAISTPTGLEDWFA-DKVVSDDKTVTFCWGKTEQRQAGI 59

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           +  R    IRF W  ++++   FE  + ++ELTG   LE+ D+   +E  D++  W++ +
Sbjct: 60  VAIRAYSFIRFHWLDDENERDYFEIKMSYNELTGDYVLEITDFSEADEADDLKELWDSQV 119

Query: 140 SRLRLSLG 147
           S+LR + G
Sbjct: 120 SKLRRTCG 127
>gi|86131885|ref|ZP_01050482.1| hypothetical protein MED134_02760 [Cellulophaga sp. MED134]
 gi|85817707|gb|EAQ38881.1| hypothetical protein MED134_02760 [Dokdonia donghaensis MED134]
          Length = 128

 Score = 80.5 bits (197), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 50/129 (38%), Positives = 77/129 (59%), Gaps = 5/129 (3%)

Query: 21  EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
           +K K  +E+V  R S   L++Y++TP G+SEWFA +V     G+FY F WD  S EKA +
Sbjct: 3   DKKKYVIEFVV-RASPSLLYQYMATPSGMSEWFADNV--NSRGEFYTFIWD-GSEEKAKL 58

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           L ++ GE I+FQW  ++ +   F+  I   E+T   SL + DY   +E+ + + FW N I
Sbjct: 59  LSKKSGEYIKFQWLDDEGEEYFFQLRIQVDEITKDVSLMITDYAEEDEIDEGKMFWENQI 118

Query: 140 SRLRLSLGA 148
           S L+  +G+
Sbjct: 119 SELKQVIGS 127
>gi|116246348|ref|XP_001230367.1| ENSANGP00000030003 [Anopheles gambiae str. PEST]
          Length = 128

 Score = 79.7 bits (195), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 44/129 (34%), Positives = 72/129 (55%), Gaps = 2/129 (1%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M K K+  EY      +  L+ YL++ +GL+EWFA DV +E   DFYF W+    EKA++
Sbjct: 1   MAKTKVHFEYPM-HCQSEILYEYLASAEGLAEWFADDV-VEKGDDFYFSWNGGEPEKATM 58

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           +  +    +R++W  ++     FE  I   E+T   SL V D+    +  +++ +W+NLI
Sbjct: 59  IRYKPESFVRYRWEADEGTKNFFELTIVIDEITNDLSLNVTDFADEGDEEEVQQYWDNLI 118

Query: 140 SRLRLSLGA 148
             L++ LGA
Sbjct: 119 ENLQIKLGA 127
>gi|149372189|ref|ZP_01891459.1| hypothetical protein SCB49_00675 [unidentified eubacterium SCB49]
 gi|149354956|gb|EDM43518.1| hypothetical protein SCB49_00675 [unidentified eubacterium SCB49]
          Length = 128

 Score = 77.4 bits (189), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 50/128 (39%), Positives = 75/128 (58%), Gaps = 5/128 (3%)

Query: 22  KNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASVL 80
           K K ++E+V   VS   L+ Y+STP GLSEW+A +V     G+F+ F W+  S EKA +L
Sbjct: 4   KIKYEMEFVI-YVSPAMLYNYISTPSGLSEWYADNV--NSRGEFFTFIWE-GSEEKAKLL 59

Query: 81  EQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLIS 140
            ++   RI+FQW  ++D    FE  I   E+T   SL +ID+   +EV + +  W N+I 
Sbjct: 60  SKKSPHRIKFQWMEDEDTEYFFELRIQVDEITKDVSLMIIDFAEEDEVEEGKLLWENMIG 119

Query: 141 RLRLSLGA 148
            L+  LG+
Sbjct: 120 NLKQILGS 127
>gi|156862767|gb|EDO56198.1| hypothetical protein BACUNI_00138 [Bacteroides uniformis ATCC 8492]
          Length = 128

 Score = 75.5 bits (184), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 43/128 (33%), Positives = 67/128 (52%), Gaps = 1/128 (0%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           ME+ K+ LEY+ +  S   LW  +STP GL  WFA  V+ +D    +F W K     A +
Sbjct: 1   MERKKIHLEYLLNATSKSILWAAISTPTGLEGWFADRVQSDDKTVTFF-WGKTEKRDAEI 59

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           +  R    IRF+W  ++++   FE  + ++ELT    LE+ D+   +EV D    W + +
Sbjct: 60  IAVRAYSFIRFRWLDDENEREYFELKMTNNELTNDFVLEITDFADIDEVGDSRELWESQV 119

Query: 140 SRLRLSLG 147
             LR + G
Sbjct: 120 DTLRRTCG 127
>gi|89891492|ref|ZP_01202997.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
 gi|89516266|gb|EAS18928.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
          Length = 129

 Score = 74.7 bits (182), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 52/132 (39%), Positives = 75/132 (56%), Gaps = 6/132 (4%)

Query: 19  IMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKA 77
           + E  K  LE+    VS   L++Y++TP G+SEWFA +V     G+++ F WD  S EKA
Sbjct: 1   MQEPIKFNLEFPI-HVSPALLYQYIATPSGMSEWFADNV--NSRGEYFRFIWD-GSEEKA 56

Query: 78  SVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPE-EVADMEHFWN 136
            ++++  GER RFQW  ++D    FE  I   E+T   SL + D+   E EV D + FW 
Sbjct: 57  KIVKRVSGERARFQWDYDEDTKKYFEMSIQVDEITKDVSLMISDFGDDEDEVEDQKMFWE 116

Query: 137 NLISRLRLSLGA 148
           N I  L+  LG+
Sbjct: 117 NQIGELKKVLGS 128
>gi|120435794|ref|YP_861480.1| hypothetical protein GFO_1440 [Gramella forsetii KT0803]
 gi|117577944|emb|CAL66413.1| conserved hypothetical protein [Gramella forsetii KT0803]
          Length = 128

 Score = 72.4 bits (176), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 46/129 (35%), Positives = 77/129 (59%), Gaps = 5/129 (3%)

Query: 21  EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
           EK K ++E+   + S   L++Y+STP GLSEW+A +V     G+F+ F W+  S EKA +
Sbjct: 3   EKIKYEMEFPI-QASPSLLYQYISTPSGLSEWYADNV--NSRGEFFTFIWE-GSEEKAKL 58

Query: 80  LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
           + ++  ERI+F+W  ++D    FE  I   ++T   SL + D+   +EV + +  W N++
Sbjct: 59  VSKKSDERIKFKWTDDEDTPYFFELRIQVDDITKDVSLMITDFAEDDEVDEGKMLWENMV 118

Query: 140 SRLRLSLGA 148
           S L+  LG+
Sbjct: 119 SDLKQILGS 127
>gi|146297908|ref|YP_001192499.1| hypothetical protein Fjoh_0142 [Flavobacterium johnsoniae UW101]
 gi|146152326|gb|ABQ03180.1| hypothetical protein Fjoh_0142 [Flavobacterium johnsoniae UW101]
          Length = 130

 Score = 72.4 bits (176), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 46/117 (39%), Positives = 71/117 (60%), Gaps = 6/117 (5%)

Query: 35  STRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASVLEQREGERIRFQWH 93
           S + L++Y+STP GLSEWFA +V     G+F+ F W+ +S EKA +  ++ GE+++F+W 
Sbjct: 16  SPQLLYQYISTPSGLSEWFADNV--NSRGEFFTFIWN-DSQEKARLASKKSGEKVKFKWV 72

Query: 94  RE--QDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLISRLRLSLGA 148
            E  +D    FE  I   ELT   SL V+D+   EE+ + +  W N IS L+  +G+
Sbjct: 73  DESSKDTEYFFELHILVDELTKDVSLMVVDFAEKEEIGEAKQLWENQISDLKHLIGS 129
>gi|150026044|ref|YP_001296870.1| hypothetical protein FP2005 [Flavobacterium psychrophilum JIP02/86]
 gi|149772585|emb|CAL44068.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
          Length = 130

 Score = 72.4 bits (176), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 48/131 (36%), Positives = 77/131 (58%), Gaps = 7/131 (5%)

Query: 22  KNKLQLEYVFD-RVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
           +NK++ E  F    S + L++Y+STP GL EWFA +V     G+FY F W+ +S EKA +
Sbjct: 2   QNKIRYELEFPINSSPQLLYQYISTPSGLQEWFADNV--NSRGEFYTFIWN-DSEEKARL 58

Query: 80  LEQREGERIRFQW--HREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
             ++ GE+I+F+W  +  +D    FE  I   E+T   SL ++DY  P ++ + +  W N
Sbjct: 59  YSKKTGEKIKFKWMDNDNKDTEYYFELKILVDEITKDVSLMIVDYAEPNDIQESKLLWEN 118

Query: 138 LISRLRLSLGA 148
            IS L+  +G+
Sbjct: 119 QISDLKHVIGS 129
>gi|88714202|ref|ZP_01108278.1| hypothetical protein FB2170_10364 [Flavobacteriales bacterium
           HTCC2170]
 gi|88707465|gb|EAQ99709.1| hypothetical protein FB2170_10364 [Flavobacteriales bacterium
           HTCC2170]
          Length = 173

 Score = 68.6 bits (166), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 46/131 (35%), Positives = 76/131 (58%), Gaps = 4/131 (3%)

Query: 18  LIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKA 77
           ++ +K K ++E+V  + S + L++YLSTP GLSEWFA +V       F F WD  + E A
Sbjct: 46  VMSDKIKFEIEFVI-QSSPQLLYQYLSTPSGLSEWFADNVNSRGE-KFSFIWD-GTEEDA 102

Query: 78  SVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
            +L+++  E ++F W   +D +A FE  I   E+T   SL + D+   +E+ + +  W N
Sbjct: 103 ILLKKKSDEFVKFAWEEVED-DAYFEMKIIVDEITKDVSLFITDFAEEDELDEAKMLWEN 161

Query: 138 LISRLRLSLGA 148
            I+ L+  LG+
Sbjct: 162 QITDLKHVLGS 172
>gi|126663446|ref|ZP_01734443.1| hypothetical protein FBBAL38_08859 [Flavobacteria bacterium BAL38]
 gi|126624394|gb|EAZ95085.1| hypothetical protein FBBAL38_08859 [Flavobacteria bacterium BAL38]
          Length = 130

 Score = 67.8 bits (164), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 48/130 (36%), Positives = 77/130 (59%), Gaps = 7/130 (5%)

Query: 22  KNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASVL 80
           K K +LE+   + S + L++Y+STP GLSEWFA +V     G+F+ F W+ +S EKA + 
Sbjct: 4   KVKYELEFPI-QSSPQLLYQYISTPSGLSEWFADNV--NSRGEFFTFIWN-DSEEKAKLA 59

Query: 81  EQREGERIRFQWHREQ--DQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNL 138
            ++ GERI+F+W  E   + +  FE  I   E+T   S+ + D+   +E+ + +  W N 
Sbjct: 60  SKKSGERIKFRWLEEDNTETDYFFEIKIMEDEITKDVSIVISDFAHEDELDESKLLWENQ 119

Query: 139 ISRLRLSLGA 148
           IS L+  LG+
Sbjct: 120 ISDLKHVLGS 129
>gi|86141621|ref|ZP_01060167.1| hypothetical protein MED217_06367 [Flavobacterium sp. MED217]
 gi|85832180|gb|EAQ50635.1| hypothetical protein MED217_06367 [Leeuwenhoekiella blandensis
           MED217]
          Length = 130

 Score = 66.6 bits (161), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 48/131 (36%), Positives = 79/131 (60%), Gaps = 7/131 (5%)

Query: 21  EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
           EK K ++E+   + S   L++Y+S+P G+SEWFA +V     G++Y F W+  S EKA +
Sbjct: 3   EKVKFEIEFPI-QASPSLLYQYMSSPSGMSEWFADNV--NSRGEYYTFIWN-GSEEKAKL 58

Query: 80  LEQREGERIRFQWHRE-QDQNA-CFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
           L ++  ERI+F+W  + +D+N   FE  I   E+T   SL V D+   +E+ + + FW N
Sbjct: 59  LGKKSDERIKFRWMDDVEDENLYFFELRIVVDEITKDVSLIVTDFAEEDEIDEGKMFWEN 118

Query: 138 LISRLRLSLGA 148
            I+ L+  +G+
Sbjct: 119 QIAELKHIIGS 129
>gi|91214899|ref|ZP_01251872.1| hypothetical protein P700755_18579 [Psychroflexus torquis ATCC
           700755]
 gi|91187326|gb|EAS73696.1| hypothetical protein P700755_18579 [Psychroflexus torquis ATCC
           700755]
          Length = 139

 Score = 65.5 bits (158), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 45/131 (34%), Positives = 77/131 (58%), Gaps = 7/131 (5%)

Query: 21  EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGD-FYFQWDKNSGEKASV 79
           +K K ++E+   +VS   L++Y+STP GLSEWFA +V     G+ F F W+  S E+A +
Sbjct: 12  DKVKYEMEFPI-QVSPSLLYQYISTPSGLSEWFADNV--NSRGELFTFMWE-GSEEEAKL 67

Query: 80  LEQREGERIRFQWHREQDQ--NACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
           L ++  +RI+ +W  ++++  N  FE  I   E+T   SL + D+    E  + +  W+N
Sbjct: 68  LSKKADDRIKLRWTEDEEESLNYFFEMKIQVDEITKDVSLMITDFAEEGEEDEGKMLWDN 127

Query: 138 LISRLRLSLGA 148
           ++S L+  LG+
Sbjct: 128 MVSDLKQILGS 138
>gi|88803471|ref|ZP_01118997.1| hypothetical protein PI23P_12802 [Polaribacter irgensii 23-P]
 gi|88781037|gb|EAR12216.1| hypothetical protein PI23P_12802 [Polaribacter irgensii 23-P]
          Length = 127

 Score = 65.1 bits (157), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 50/131 (38%), Positives = 69/131 (52%), Gaps = 6/131 (4%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKAS 78
           MEK K +LE V    S   L++Y+STP  L EWFA  V     G  Y F W+    E A 
Sbjct: 1   MEKVKFELE-VPIHASPNMLYQYISTPSNLQEWFADKV--NSRGKIYSFVWEGEE-ELAE 56

Query: 79  VLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIA-PEEVADMEHFWNN 137
           ++ ++ GERIR++W   +D  + FE  I    LT   +L V DY    +EV + +  W N
Sbjct: 57  LITKKAGERIRWKWLESEDDESYFEIRIDLDALTKDVTLVVTDYAEDADEVEESKQLWEN 116

Query: 138 LISRLRLSLGA 148
            I  LR ++GA
Sbjct: 117 QIEELRHTIGA 127
>gi|86133081|ref|ZP_01051663.1| hypothetical protein MED152_00215 [Tenacibaculum sp. MED152]
 gi|85819944|gb|EAQ41091.1| hypothetical protein MED152_00215 [Polaribacter dokdonensis MED152]
          Length = 127

 Score = 64.7 bits (156), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 46/131 (35%), Positives = 71/131 (54%), Gaps = 6/131 (4%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKAS 78
           M+K K ++E +    S   L++Y+S+P  L EWFA  V     G  Y F WD  + E+A 
Sbjct: 1   MDKVKYEIE-IPVHASPNMLYQYISSPSNLQEWFADTV--NSRGKIYTFSWD-GTEEQAE 56

Query: 79  VLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPE-EVADMEHFWNN 137
           ++ ++  ERIRF+W   +D ++ FE  I    LT   SL + D+   E EV + +  W N
Sbjct: 57  LVTKKSEERIRFKWLESEDDDSFFEIKIQVDPLTKDVSLIITDFADDEDEVEEAKQLWEN 116

Query: 138 LISRLRLSLGA 148
            I  L+ ++GA
Sbjct: 117 QIDELKHTIGA 127
>gi|88805384|ref|ZP_01120903.1| hypothetical protein RB2501_13629 [Robiginitalea biformata
           HTCC2501]
 gi|88784202|gb|EAR15372.1| hypothetical protein RB2501_13629 [Robiginitalea biformata
           HTCC2501]
          Length = 127

 Score = 64.7 bits (156), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 45/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 21  EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASVL 80
           EK K ++E+V  + S + L++YL+TP GLSEWFA +V       F F W+  + E A +L
Sbjct: 3   EKTKFEIEFVI-QASPQLLFQYLATPSGLSEWFADNVNSRGE-RFTFIWE-GTEEVARLL 59

Query: 81  EQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLIS 140
           +++  E +RF W    D ++ FE  I   E+T   SL + D+   +E+ + +  W N +S
Sbjct: 60  KRKTDEFVRFAWEY-NDDDSYFEMRIIVDEITKDVSLFITDFAEEDELEEAKMLWTNQVS 118

Query: 141 RLRLSLGA 148
            L+  LG+
Sbjct: 119 DLKQILGS 126
>gi|83857050|ref|ZP_00950578.1| hypothetical protein CA2559_09638 [Croceibacter atlanticus
           HTCC2559]
 gi|83848417|gb|EAP86286.1| hypothetical protein CA2559_09638 [Croceibacter atlanticus
           HTCC2559]
          Length = 131

 Score = 62.8 bits (151), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 47/132 (35%), Positives = 77/132 (58%), Gaps = 8/132 (6%)

Query: 21  EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGD-FYFQWDKNSGEKASV 79
           ++ K ++E+V  + S   L+ YLSTP GLSEW+A +V     G+ F F W+  S E+A +
Sbjct: 3   DRIKYEMEFVI-QSSPSLLYNYLSTPSGLSEWYADNV--NSRGELFTFIWE-GSEEQAKL 58

Query: 80  LEQREGERIRFQWHREQD--QNACFEFVIHHSELTGGKSLEVIDYIAPE-EVADMEHFWN 136
           L ++  ER++F+W  + D  ++  FE  I   E+T   SL + D+   E EV + +  W 
Sbjct: 59  LSKKTDERVKFRWMDDVDNEESYFFELRIQVDEITKDVSLMITDFADDEDEVEEGKMLWE 118

Query: 137 NLISRLRLSLGA 148
           N++S L+  LG+
Sbjct: 119 NMVSNLKQVLGS 130
>gi|126645441|ref|ZP_01717985.1| hypothetical protein ALPR1_12385 [Algoriphagus sp. PR1]
 gi|126578852|gb|EAZ83016.1| hypothetical protein ALPR1_12385 [Algoriphagus sp. PR1]
          Length = 131

 Score = 60.5 bits (145), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 40/133 (30%), Positives = 70/133 (52%), Gaps = 6/133 (4%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
           M KNK   +Y  +  S + +++YLST  GL EWFA +VR+ ++ +F F +D N    A +
Sbjct: 1   MVKNKFVADYQIN-ASKKIVFQYLSTASGLEEWFADEVRINEDKNFIFNFD-NEDHYARL 58

Query: 80  LEQREGERIRFQWHREQD----QNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFW 135
              R    ++F+++  ++     N+  EF +  +ELT    L+VIDY    +  ++   W
Sbjct: 59  ASIRTNSHVKFEFYDPKNPDASDNSYIEFKLEENELTQTLFLKVIDYSDGYDDEELIAIW 118

Query: 136 NNLISRLRLSLGA 148
             L+  L+  +G 
Sbjct: 119 GGLVGSLKEIIGG 131
>gi|124006212|ref|ZP_01691047.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
 gi|123988136|gb|EAY27794.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
          Length = 129

 Score = 57.8 bits (138), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 4/131 (3%)

Query: 20  MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGD-FYFQWDKNSGEKAS 78
           M K K   EY     S + ++ Y+ T +GL +WFA  V ++     F F WD N    A 
Sbjct: 1   MTKYKFSSEYEI-HASPKRIYSYVHTANGLEQWFATKVEVKGKEKVFNFVWD-NEDHFAR 58

Query: 79  VLEQREGERIRFQW-HREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
           ++  R  + ++F++   E+ ++   EF + H+E+T    L+V DY   E+  D+   W+ 
Sbjct: 59  IVAHRTNKLVKFEFISDEEGKSNYIEFRLEHNEMTDSTFLKVTDYSEMEDEDDLNGLWDG 118

Query: 138 LISRLRLSLGA 148
           L++ LR  +G 
Sbjct: 119 LVADLREVVGG 129
>gi|94502325|ref|ZP_01308799.1| Hypothetical protein SMU_153 [Candidatus Sulcia muelleri str. Hc
           (Homalodisca coagulata)]
 gi|94451121|gb|EAT14072.1| Hypothetical protein SMU_153 [Candidatus Sulcia muelleri str. Hc
           (Homalodisca coagulata)]
          Length = 125

 Score = 56.6 bits (135), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 32/109 (29%), Positives = 58/109 (53%), Gaps = 2/109 (1%)

Query: 35  STRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASVLEQREGERIRFQWHR 94
           ST  L+ Y++ P  LSEWFA +V +     + F W  N  E   +++++  + +R++W +
Sbjct: 14  STELLYYYITAPVKLSEWFADNV-ISIGNRYIFTW-YNYDETCFLIKKKPYQYVRYKWEQ 71

Query: 95  EQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLISRLR 143
           + D    FEF I  + LT    L++ D+    E+   + +W N I +L+
Sbjct: 72  DVDNKYYFEFFIKKNNLTKNVYLKITDFAIKTEIKKSKMWWKNRIKKLK 120
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.322    0.139    0.442 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 600,300,352
Number of Sequences: 5470121
Number of extensions: 24457775
Number of successful extensions: 48766
Number of sequences better than 1.0e-05: 28
Number of HSP's better than  0.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 48717
Number of HSP's gapped (non-prelim): 28
length of query: 150
length of database: 1,894,087,724
effective HSP length: 113
effective length of query: 37
effective length of database: 1,275,964,051
effective search space: 47210669887
effective search space used: 47210669887
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 124 (52.4 bits)