BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= PI0019 
         (211 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|156862379|gb|EDO55810.1|  hypothetical protein BACUNI_004...   209   8e-53
gi|156109887|gb|EDO11632.1|  hypothetical protein BACOVA_028...   207   3e-52
gi|29348575|ref|NP_812078.1|  hypothetical protein BT_3166 [...   203   4e-51
gi|153807073|ref|ZP_01959741.1|  hypothetical protein BACCAC...   201   2e-50
gi|60679609|ref|YP_209753.1|  hypothetical protein BF0012 [B...   200   4e-50
gi|53711304|ref|YP_097296.1|  hypothetical protein BF0013 [B...   199   6e-50
gi|154493344|ref|ZP_02032664.1|  hypothetical protein PARMER...   168   2e-40
gi|150009603|ref|YP_001304346.1|  hypothetical protein BDI_3...   162   8e-39
gi|34541618|ref|NP_906097.1|  hypothetical protein PG2030 [P...   125   2e-27
gi|150003711|ref|YP_001298455.1|  hypothetical protein BVU_1...   110   5e-23
gi|149280728|ref|ZP_01886837.1|  hypothetical protein PBAL39...    84   5e-15
gi|83857071|ref|ZP_00950599.1|  hypothetical protein CA2559_...    84   6e-15
gi|86132729|ref|ZP_01051321.1|  hypothetical protein MED134_...    83   1e-14
gi|120436247|ref|YP_861933.1|  conserved hypothetical protei...    79   2e-13
gi|88807072|ref|ZP_01122586.1|  hypothetical protein RB2501_...    78   3e-13
gi|149372768|ref|ZP_01891789.1|  hypothetical protein SCB49_...    75   2e-12
gi|126662341|ref|ZP_01733340.1|  hypothetical protein FBBAL3...    75   2e-12
gi|91214862|ref|ZP_01251835.1|  hypothetical protein P700755...    75   2e-12
gi|86143123|ref|ZP_01061545.1|  hypothetical protein MED217_...    75   3e-12
gi|88803632|ref|ZP_01119156.1|  hypothetical protein PI23P_0...    71   4e-11
gi|88711921|ref|ZP_01106008.1|  hypothetical protein FB2170_...    68   3e-10
gi|150025150|ref|YP_001295976.1|  hypothetical protein FP107...    66   1e-09
gi|146300487|ref|YP_001195078.1|  hypothetical protein Fjoh_...    65   2e-09
gi|86135718|ref|ZP_01054299.1|  hypothetical protein MED152_...    65   3e-09
gi|89890844|ref|ZP_01202353.1|  hypothetical protein BBFL7_0...    60   7e-08
>gi|156862379|gb|EDO55810.1| hypothetical protein BACUNI_00476 [Bacteroides uniformis ATCC 8492]
          Length = 204

 Score =  209 bits (532), Expect = 8e-53,   Method: Composition-based stats.
 Identities = 97/159 (61%), Positives = 130/159 (81%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           D+I YV+   +YIF P++FKN +    Y +L+ ++KKVLP++KE N+ I+ET  Y+ T+P
Sbjct: 46  DTIPYVKLPTVYIFKPLKFKNKRDMNKYYKLIRDVKKVLPISKEINRAIIETYEYMMTLP 105

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
           T+K R  HMKAVEK LKE+YTPR+KKLT++QGKLLIKL+DR+T+STGYEL++AF+GP +A
Sbjct: 106 TEKARQKHMKAVEKSLKEQYTPRMKKLTFAQGKLLIKLVDRQTNSTGYELVKAFMGPFKA 165

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
           GFYQ FA +FGASLKK+YDP G D L ER++L VE+GQL
Sbjct: 166 GFYQTFAALFGASLKKQYDPMGDDALTERVILMVESGQL 204
>gi|156109887|gb|EDO11632.1| hypothetical protein BACOVA_02842 [Bacteroides ovatus ATCC 8483]
          Length = 192

 Score =  207 bits (527), Expect = 3e-52,   Method: Composition-based stats.
 Identities = 96/159 (60%), Positives = 130/159 (81%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           D+I  V+   +YIF P++FKN+K+RQ Y RL+ N+KKV P+++E NQ I+ET  YLQT+P
Sbjct: 34  DTIPCVQLRTVYIFRPLKFKNEKERQEYYRLIRNVKKVYPISREINQAIIETYEYLQTLP 93

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
            +K R  H+K VEKGLK++YTPR+KKL+++QGKLLIKLIDR+++ST YEL++AF+GP +A
Sbjct: 94  NEKARQKHIKRVEKGLKDQYTPRMKKLSFAQGKLLIKLIDRQSNSTSYELVKAFMGPFKA 153

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
           GFYQ FA +FG SLKK YDP+G D+L ER+VL VE GQ+
Sbjct: 154 GFYQTFAALFGVSLKKEYDPQGEDKLTERVVLMVENGQI 192
>gi|29348575|ref|NP_812078.1| hypothetical protein BT_3166 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340480|gb|AAO78272.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 190

 Score =  203 bits (517), Expect = 4e-51,   Method: Composition-based stats.
 Identities = 98/159 (61%), Positives = 128/159 (80%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           D+I  V+   +YIF P++FKN+K+R  Y RLV N+KKV P++KE NQ I+ET  YLQT+P
Sbjct: 32  DTIPCVQLRTVYIFRPLKFKNEKERLEYYRLVRNVKKVYPISKEINQAIIETYEYLQTLP 91

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
            +K R  H+K VEKGLKE+YT R+KKL+++QGKLLIKLIDR+++ST YEL++AF+GP +A
Sbjct: 92  NEKARQKHLKRVEKGLKEQYTARMKKLSFTQGKLLIKLIDRQSNSTSYELVKAFMGPFKA 151

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
           GFYQ FA +FGASLKK YDP G D+L ER+VL VE GQ+
Sbjct: 152 GFYQTFAALFGASLKKEYDPLGEDKLTERVVLLVENGQI 190
>gi|153807073|ref|ZP_01959741.1| hypothetical protein BACCAC_01350 [Bacteroides caccae ATCC 43185]
 gi|149130193|gb|EDM21403.1| hypothetical protein BACCAC_01350 [Bacteroides caccae ATCC 43185]
          Length = 192

 Score =  201 bits (512), Expect = 2e-50,   Method: Composition-based stats.
 Identities = 95/159 (59%), Positives = 130/159 (81%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           D+I  V+   +YIF P++FKN+K+R+ Y +LV N+KKV P++KE NQ I+ET  YLQT+P
Sbjct: 34  DTIPCVQLRTVYIFRPLKFKNEKERREYYKLVRNVKKVYPISKEINQAIIETYEYLQTLP 93

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
            +K R  H+K VEKGLK++YT R+KKL+++QGKLLIKL+DR+++ T YEL++AF+GP +A
Sbjct: 94  NEKARQKHIKRVEKGLKDQYTARMKKLSFAQGKLLIKLVDRQSNQTSYELVKAFMGPFKA 153

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
           GFYQAFA +FGASLKK YDP+G D+L ER+VL VE GQ+
Sbjct: 154 GFYQAFAALFGASLKKEYDPEGEDKLTERVVLLVENGQI 192
>gi|60679609|ref|YP_209753.1| hypothetical protein BF0012 [Bacteroides fragilis NCTC 9343]
 gi|60491043|emb|CAH05791.1| conserved hypothetical exported protein [Bacteroides fragilis NCTC
           9343]
          Length = 199

 Score =  200 bits (509), Expect = 4e-50,   Method: Composition-based stats.
 Identities = 93/159 (58%), Positives = 129/159 (81%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           D+I   +   I+IF P++F+N K++  Y +LV N+KKV P+A+E N+ I+ET  YLQT+P
Sbjct: 41  DTIPAFQIPTIHIFKPLKFRNRKEQMEYYKLVRNVKKVYPIAREINRTIIETYEYLQTLP 100

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
            +K R  H+K VEKGLKE+YTPR+KKL+++QGKLLIKLIDR++H + YEL++AF+GP +A
Sbjct: 101 NEKARQRHIKRVEKGLKEQYTPRMKKLSFAQGKLLIKLIDRQSHQSSYELVKAFMGPFKA 160

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
           GFYQ FA +FGASLKK+YDP+G D+L ER++L VE+GQL
Sbjct: 161 GFYQTFAALFGASLKKQYDPEGEDKLTERVILLVESGQL 199
>gi|53711304|ref|YP_097296.1| hypothetical protein BF0013 [Bacteroides fragilis YCH46]
 gi|52214169|dbj|BAD46762.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
          Length = 191

 Score =  199 bits (507), Expect = 6e-50,   Method: Composition-based stats.
 Identities = 93/159 (58%), Positives = 129/159 (81%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           D+I   +   I+IF P++F+N K++  Y +LV N+KKV P+A+E N+ I+ET  YLQT+P
Sbjct: 33  DTIPAFQIPTIHIFKPLKFRNRKEQMEYYKLVRNVKKVYPIAREINRTIIETYEYLQTLP 92

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
            +K R  H+K VEKGLKE+YTPR+KKL+++QGKLLIKLIDR++H + YEL++AF+GP +A
Sbjct: 93  NEKTRQRHIKRVEKGLKEQYTPRMKKLSFAQGKLLIKLIDRQSHQSSYELVKAFMGPFKA 152

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
           GFYQ FA +FGASLKK+YDP+G D+L ER++L VE+GQL
Sbjct: 153 GFYQTFAALFGASLKKQYDPEGEDKLTERVILLVESGQL 191
>gi|154493344|ref|ZP_02032664.1| hypothetical protein PARMER_02681 [Parabacteroides merdae ATCC
           43184]
 gi|154086554|gb|EDN85599.1| hypothetical protein PARMER_02681 [Parabacteroides merdae ATCC
           43184]
          Length = 201

 Score =  168 bits (426), Expect = 2e-40,   Method: Composition-based stats.
 Identities = 80/159 (50%), Positives = 116/159 (72%)

Query: 51  GSDSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQT 110
           G D++  V   +IYI+P ++FKN +++  Y +LV ++K+ LP AK   + ++ET  Y++T
Sbjct: 41  GKDTVPIVNLREIYIYPQVKFKNKREQARYTKLVRDVKRTLPYAKMVYETLIETYEYIET 100

Query: 111 IPTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPV 170
           +P +K R AH+K +EK L +EY P++KKLT+SQGKLLIKLIDRE + + Y L++A+LG  
Sbjct: 101 LPDEKSRQAHLKRMEKELFQEYKPQLKKLTFSQGKLLIKLIDRECNQSSYNLLKAYLGSF 160

Query: 171 RAGFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAG 209
           RAGF+  FA +FGASLK  YDPKG D + ER+V+ VE G
Sbjct: 161 RAGFWNIFAGMFGASLKTEYDPKGKDAMTERVVVLVENG 199
>gi|150009603|ref|YP_001304346.1| hypothetical protein BDI_3016 [Parabacteroides distasonis ATCC
           8503]
 gi|149938027|gb|ABR44724.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 201

 Score =  162 bits (411), Expect = 8e-39,   Method: Composition-based stats.
 Identities = 76/157 (48%), Positives = 112/157 (71%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           D++  V   +++I+P ++FKN +++  YN+LV ++K+ LP AK     ++ET  Y++T+P
Sbjct: 43  DTMAVVNLREVFIYPQVKFKNKREQAKYNKLVRDVKRTLPYAKMVYDTLIETYEYMETLP 102

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
             K R AH+K +EK L  +Y P +KKL++SQGKLLIKLIDRE + + Y L++A+LG  RA
Sbjct: 103 NDKARQAHLKRMEKELFAQYKPELKKLSFSQGKLLIKLIDRECNQSSYNLLKAYLGSFRA 162

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAG 209
           GF+  FA +FGASLK  YDPKG D + ER+V+ VE G
Sbjct: 163 GFWNFFAGMFGASLKSEYDPKGKDAMTERVVVLVENG 199
>gi|34541618|ref|NP_906097.1| hypothetical protein PG2030 [Porphyromonas gingivalis W83]
 gi|34397936|gb|AAQ66996.1| hypothetical protein PG_2030 [Porphyromonas gingivalis W83]
          Length = 193

 Score =  125 bits (313), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 64/139 (46%), Positives = 95/139 (68%), Gaps = 2/139 (1%)

Query: 75  KQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLKEEYTP 134
           ++RQ   RL+ ++K+ LP AK     ++ET  Y++T+P +K R  H+K +EK +K++Y P
Sbjct: 55  QERQELWRLIRDVKRTLPYAKMIAATLIETYEYMETMPDEKSRQKHLKRMEKEMKQQYMP 114

Query: 135 RIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRYDPKG 194
            +K+LT  QGKLLIKLIDR+   + Y+L++AFLG  +AG++  FA  +GASLK RY+P  
Sbjct: 115 EMKRLTLRQGKLLIKLIDRQCAQSSYQLLKAFLGGWKAGWWNLFARFYGASLKTRYEPDK 174

Query: 195 --ADRLVERIVLQVEAGQL 211
              D L ERIVL VE  ++
Sbjct: 175 NPEDALTERIVLLVEEKRI 193
>gi|150003711|ref|YP_001298455.1| hypothetical protein BVU_1142 [Bacteroides vulgatus ATCC 8482]
 gi|149932135|gb|ABR38833.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 76

 Score =  110 bits (275), Expect = 5e-23,   Method: Composition-based stats.
 Identities = 51/76 (67%), Positives = 65/76 (85%)

Query: 136 IKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRYDPKGA 195
           +KKLT+SQGKLLIKL++R+T S+ YEL++AF+GP +AGFYQ FA +FGASLKK Y P+G 
Sbjct: 1   MKKLTFSQGKLLIKLVNRQTDSSSYELVKAFMGPFKAGFYQTFAALFGASLKKEYHPEGE 60

Query: 196 DRLVERIVLQVEAGQL 211
           DRL ER+VL VE GQ+
Sbjct: 61  DRLTERVVLLVENGQI 76
>gi|149280728|ref|ZP_01886837.1| hypothetical protein PBAL39_06021 [Pedobacter sp. BAL39]
 gi|149228511|gb|EDM33921.1| hypothetical protein PBAL39_06021 [Pedobacter sp. BAL39]
          Length = 205

 Score = 84.3 bits (207), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 53/152 (34%), Positives = 83/152 (54%), Gaps = 1/152 (0%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           + I ++  N++ I+    FK    R A+NRL  N+ KV+P A    +   +    L    
Sbjct: 49  EMIPWIPLNEVVIYGYRIFKTPADRAAFNRLRYNVMKVMPYALYAKRRYEQLEKDLAMTA 108

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
            KKE    +K  +K +K+ +   IK+LT SQG++L KLIDRE   T Y++++   G V A
Sbjct: 109 DKKEHKRLVKQCDKEIKDMFNREIKELTISQGQILTKLIDRELGRTTYDIVKQTKGGVTA 168

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVL 204
             YQ+ A V G +LK  Y+ +  DR +E I++
Sbjct: 169 FLYQSVARVVGHNLKSTYN-REEDRDIESIIV 199
>gi|83857071|ref|ZP_00950599.1| hypothetical protein CA2559_09743 [Croceibacter atlanticus
           HTCC2559]
 gi|83848438|gb|EAP86307.1| hypothetical protein CA2559_09743 [Croceibacter atlanticus
           HTCC2559]
          Length = 210

 Score = 83.6 bits (205), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 49/150 (32%), Positives = 86/150 (57%), Gaps = 2/150 (1%)

Query: 56  QYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKK 115
           +++   ++ I   ++F +D +R+ Y  L    +KV P AK  ++ + E  A L +I +K 
Sbjct: 33  EFIDLEEVIILNKIKFTSDLERRRYLILRRKTRKVWPYAKLASERLSELNARLASIKSKS 92

Query: 116 ERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFY 175
           ER  + K V+K ++E++   +KKLT ++G++L+KLI R+T  T ++LI+      RA +Y
Sbjct: 93  ERRKYTKIVQKYIEEQFAAELKKLTRTEGQILVKLIHRQTGVTTFQLIKNLRSGWRAFWY 152

Query: 176 QAFAWVFGASLKKRYDP--KGADRLVERIV 203
              A +F   LK  Y+P     D L+E I+
Sbjct: 153 DTTAGLFDIELKSEYNPVNNREDFLIEDIL 182
>gi|86132729|ref|ZP_01051321.1| hypothetical protein MED134_14051 [Cellulophaga sp. MED134]
 gi|85816683|gb|EAQ37869.1| hypothetical protein MED134_14051 [Dokdonia donghaensis MED134]
          Length = 226

 Score = 83.2 bits (204), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 57/196 (29%), Positives = 97/196 (49%), Gaps = 14/196 (7%)

Query: 12  FLLIAVGGFAQEHINPEDRVVDLNSPTFVPMVHIGKAKVGSDSI--QYVRTNKIYIFPPM 69
           ++++ + G  Q    P D             V I    +  D+I    +  N++ +F  +
Sbjct: 8   YIIVFIAGLTQAQETPVDST----------EVEIEYYIIQGDTIPRSAIDLNEVIVFKRL 57

Query: 70  EFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLK 129
           +F N + R+ Y  L    +KV P AK     ++     L +I  K+ R  + K + K L+
Sbjct: 58  KFDNKQDRRRYLILRRKTRKVFPYAKLAADRLVALNNRLDSIEGKRARKKYTKIIHKYLE 117

Query: 130 EEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKR 189
            E++  +KKLT ++G++LIKLI R+T  T +ELI+      RA +Y   A  F  SLK+ 
Sbjct: 118 GEFSAELKKLTRTEGQILIKLIHRQTGETAFELIKRLRSGWRAFWYNTTASAFDISLKRE 177

Query: 190 YDP--KGADRLVERIV 203
           ++P  +  D L+E I+
Sbjct: 178 FNPEQEQEDYLIEDIL 193
>gi|120436247|ref|YP_861933.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
 gi|117578397|emb|CAL66866.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
          Length = 238

 Score = 79.0 bits (193), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 47/158 (29%), Positives = 90/158 (56%), Gaps = 4/158 (2%)

Query: 50  VGSDSI--QYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAY 107
           +  DSI  + +   ++ +   ++F ++  R+ Y  L    +KV P AK  ++ ++E  + 
Sbjct: 43  IAGDSIPREAIDLEEVVLLRKLKFDSNTDRKRYLILRRKTRKVYPYAKLASERLIELNSR 102

Query: 108 LQTIPTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFL 167
           L  I +K+ +  + K V+  ++EE++  +KKLT ++G++L+KLI R+T  T ++L++   
Sbjct: 103 LDDIKSKRAQKKYTKIVQNYIEEEFSVELKKLTRTEGQILVKLIHRQTGITTFDLVKELK 162

Query: 168 GPVRAGFYQAFAWVFGASLKKRYDPKG--ADRLVERIV 203
              RA +Y   A +F  SLK+ Y P+    D L+E I+
Sbjct: 163 SGWRAFWYNTTASMFDISLKEEYHPESDQEDFLIEDIL 200
>gi|88807072|ref|ZP_01122586.1| hypothetical protein RB2501_00025 [Robiginitalea biformata
           HTCC2501]
 gi|88782855|gb|EAR14030.1| hypothetical protein RB2501_00025 [Robiginitalea biformata
           HTCC2501]
          Length = 232

 Score = 78.2 bits (191), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 43/156 (27%), Positives = 91/156 (58%), Gaps = 2/156 (1%)

Query: 58  VRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKER 117
           +  + +YIF  ++F +   +  Y  L     KV P AK  ++ ++E  + L++I +++++
Sbjct: 50  IELDAVYIFGKLKFDSYDDKLRYLILRRKTIKVYPYAKLASERLVELNSRLESITSRRKQ 109

Query: 118 DAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQA 177
             + + V++ +++E+   +KKLT ++G++L+KLI R+T  T ++L++      RA +YQ 
Sbjct: 110 KRYTRIVQRFIEDEFAAELKKLTRTEGQILVKLIYRQTGITAFDLVKQLRSGWRAFWYQT 169

Query: 178 FAWVFGASLKKRYDPKGA--DRLVERIVLQVEAGQL 211
            A +F  S+K+ + P+    D L+E I+ +  A  +
Sbjct: 170 TASLFDISIKEEFHPESVHEDYLIEDILQRAFAANI 205
>gi|149372768|ref|ZP_01891789.1| hypothetical protein SCB49_12434 [unidentified eubacterium SCB49]
 gi|149354465|gb|EDM43030.1| hypothetical protein SCB49_12434 [unidentified eubacterium SCB49]
          Length = 219

 Score = 75.5 bits (184), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 53/195 (27%), Positives = 101/195 (51%), Gaps = 12/195 (6%)

Query: 13  LLIAVGGFAQEHINPEDRVVDLNSPTFVPMVHIGKAKVGSDSI--QYVRTNKIYIFPPME 70
           +L  V  FAQ+ +   D+ VD  +  +          +  D+I  +++   ++ +   + 
Sbjct: 1   MLFGVFVFAQKDLGHTDKKVDSTNTLYY--------IIEGDTIPREFIDLEEVVLLNKLS 52

Query: 71  FKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLKE 130
           F   + R+ Y  L    +KV P AK  ++ +      L+ I  K ++  + K ++K ++E
Sbjct: 53  FNTKEDRKRYLILRRKTRKVYPYAKLASERLNTMYRRLEEIENKGDKRRYTKRIQKYIEE 112

Query: 131 EYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRY 190
           E++ ++KKLT ++G++L+KLI R+T  T ++L++      RA +Y   A +F  S+KK Y
Sbjct: 113 EFSEKLKKLTRTEGQILVKLIHRQTGITAFDLVKDLRTGWRAFWYNTTANMFDISIKKEY 172

Query: 191 DP--KGADRLVERIV 203
            P     D L+E I+
Sbjct: 173 KPFEVKEDYLIEDIL 187
>gi|126662341|ref|ZP_01733340.1| hypothetical protein FBBAL38_03280 [Flavobacteria bacterium BAL38]
 gi|126625720|gb|EAZ96409.1| hypothetical protein FBBAL38_03280 [Flavobacteria bacterium BAL38]
          Length = 228

 Score = 75.5 bits (184), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 45/141 (31%), Positives = 78/141 (55%)

Query: 52  SDSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTI 111
           S  I  +   +I   P   +  D+ ++A   L   I KV P AK     + +  A +  +
Sbjct: 34  SSIIYSIELREIIFTPDNVYSMDEDKKAKLILKRRIFKVYPYAKMTADKLTQLNATMAKL 93

Query: 112 PTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVR 171
            T +E+  + K VEK L+EE+ PR+KKL+   G++L+KLI R+T +T ++LI+ +    +
Sbjct: 94  KTNREKKKYFKIVEKYLEEEFEPRLKKLSRKDGQILVKLIYRQTGNTTFDLIKEYKSGWK 153

Query: 172 AGFYQAFAWVFGASLKKRYDP 192
           A +  + A++F  +LK +Y P
Sbjct: 154 AFWANSTAYLFDINLKTQYKP 174
>gi|91214862|ref|ZP_01251835.1| hypothetical protein P700755_18394 [Psychroflexus torquis ATCC
           700755]
 gi|91187289|gb|EAS73659.1| hypothetical protein P700755_18394 [Psychroflexus torquis ATCC
           700755]
          Length = 237

 Score = 75.1 bits (183), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 42/145 (28%), Positives = 81/145 (55%), Gaps = 2/145 (1%)

Query: 50  VGSDS--IQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAY 107
           +G D+  +  V   ++ I+P ++FK+    + Y  L    KKV P  K  ++        
Sbjct: 36  IGHDASWVSEVELEEVMIYPKLKFKSRDDFRDYLILKRKTKKVWPYVKLASERFETLNKR 95

Query: 108 LQTIPTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFL 167
           L +I +K  +  + + +++ ++EE+T  +KKLT ++G++L+KLI R+T  T ++L++   
Sbjct: 96  LSSIDSKPSKRRYTRVIQRYVEEEFTEELKKLTKTEGQILVKLIHRQTGVTTFDLVKDLR 155

Query: 168 GPVRAGFYQAFAWVFGASLKKRYDP 192
              RA +Y   A +F  SLK+ ++P
Sbjct: 156 SGWRAFWYNTTASLFNISLKEEFNP 180
>gi|86143123|ref|ZP_01061545.1| hypothetical protein MED217_10772 [Flavobacterium sp. MED217]
 gi|85830568|gb|EAQ49027.1| hypothetical protein MED217_10772 [Leeuwenhoekiella blandensis
           MED217]
          Length = 265

 Score = 75.1 bits (183), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 42/145 (28%), Positives = 83/145 (57%), Gaps = 2/145 (1%)

Query: 61  NKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAH 120
           + + +   ++F ++  R+ Y  L    +KV P AK   + ++E    L ++ TK++R  +
Sbjct: 86  DDVVVLGRLKFDDNTARRRYLILRRKTRKVWPYAKLAGERLVELNERLDSMETKRDRKRY 145

Query: 121 MKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAW 180
            K +++ +++E+   +KKLT ++G++L+KL+ R+T  T ++LI+ +   ++A      A 
Sbjct: 146 TKIIQRYVEDEFKEELKKLTRTEGQILVKLMYRQTGQTTFDLIKNYRSGLKAFLLNTTAS 205

Query: 181 VFGASLKKRYDPKGA--DRLVERIV 203
            F  SLK+ YDP     D L+E I+
Sbjct: 206 FFDISLKEIYDPAEVQEDYLIEDIL 230
>gi|88803632|ref|ZP_01119156.1| hypothetical protein PI23P_00030 [Polaribacter irgensii 23-P]
 gi|88780365|gb|EAR11546.1| hypothetical protein PI23P_00030 [Polaribacter irgensii 23-P]
          Length = 229

 Score = 70.9 bits (172), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 45/153 (29%), Positives = 82/153 (53%), Gaps = 4/153 (2%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           DSI  ++ N+I + P  +F   +  + Y      + K  P AK   + +    A L  I 
Sbjct: 27  DSI-VIQLNEIALLPKPKFSAKEDIRYYLWFRKKVYKAYPFAKLAAERLDSLNARLDRID 85

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
           +K++R  + + ++  ++ E+T +IKK+T ++G++LIKLI R+T  T ++ I+      +A
Sbjct: 86  SKRKRRKYTRLIQNYIEGEFTTQIKKMTTTEGRVLIKLIHRQTGKTAFDNIRGLRSGWKA 145

Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQ 205
            +Y   A VF  SL+  Y P   DR+ E  +++
Sbjct: 146 FWYNTTANVFKLSLRTEYQP---DRINEDFLIE 175
>gi|88711921|ref|ZP_01106008.1| hypothetical protein FB2170_13568 [Flavobacteriales bacterium
           HTCC2170]
 gi|88709327|gb|EAR01560.1| hypothetical protein FB2170_13568 [Flavobacteriales bacterium
           HTCC2170]
          Length = 203

 Score = 68.2 bits (165), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 51/154 (33%), Positives = 85/154 (55%), Gaps = 3/154 (1%)

Query: 61  NKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAH 120
           +++Y+F  ++F + K +  Y  L     KV P AK   + +LE    L  I   ++R  +
Sbjct: 24  DEVYVFSKLKFPSYKDKLRYYILRRKTIKVYPYAKMAAERLLELNDSLTKIKKSRKRKKY 83

Query: 121 MKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAW 180
            K V+K ++ E++  +KKLT ++G++LIKLI R+T  T + L++      RA +Y   A 
Sbjct: 84  TKKVQKYIEGEFSEELKKLTRTEGQILIKLIYRQTGKTAFGLVKELRSGWRAFWYSTTAK 143

Query: 181 VFGASLKKRYDPK--GADRLVERIVLQV-EAGQL 211
           +F  SLK+ Y P     D L+E I+ +   AG+L
Sbjct: 144 MFKISLKEEYRPDVVQEDYLIEDILQRAFAAGRL 177
>gi|150025150|ref|YP_001295976.1| hypothetical protein FP1077 [Flavobacterium psychrophilum JIP02/86]
 gi|149771691|emb|CAL43165.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
          Length = 225

 Score = 66.2 bits (160), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 37/135 (27%), Positives = 73/135 (54%)

Query: 58  VRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKER 117
           +   ++YI    +FK+++ ++ +  L   + KV P AK     +      +Q++ +++++
Sbjct: 44  IELQEVYISNKRDFKSEEDQKRFYILQRRVLKVYPYAKTAADRLTTLNIGMQSLKSERDK 103

Query: 118 DAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQA 177
             + K VE  L  E+  ++KKL+   G++L+KLI R+T S+ + LI+      +A +   
Sbjct: 104 RKYFKLVENYLTSEFEDQLKKLSRKDGQVLVKLIHRQTGSSTFNLIKELKSGWKAFWSNQ 163

Query: 178 FAWVFGASLKKRYDP 192
            A +F  +LK  YDP
Sbjct: 164 TAKIFDINLKTTYDP 178
>gi|146300487|ref|YP_001195078.1| hypothetical protein Fjoh_2737 [Flavobacterium johnsoniae UW101]
 gi|146154905|gb|ABQ05759.1| hypothetical protein Fjoh_2737 [Flavobacterium johnsoniae UW101]
          Length = 227

 Score = 65.5 bits (158), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 43/140 (30%), Positives = 72/140 (51%), Gaps = 3/140 (2%)

Query: 75  KQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLKEEYTP 134
           ++ + +  L   + KV P A+     +      +  + T +E+  + K VE  L  E+  
Sbjct: 60  EELKQFQILQMRVYKVYPYARLAADRLTALNNGMARLKTSREKKKYFKIVEDYLNNEFED 119

Query: 135 RIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRYDP-- 192
           R+KKL+  QG++L+KL+ R+T  T YELI+      +A      A +F  SLK  Y P  
Sbjct: 120 RLKKLSRKQGQILVKLVHRQTGKTTYELIKTLKSGFKAFVSNTTANLFDISLKTEYKPFE 179

Query: 193 KGADRLVERIVLQV-EAGQL 211
              D L+E I+++  E+G+L
Sbjct: 180 VNEDYLIETILVRAFESGRL 199
>gi|86135718|ref|ZP_01054299.1| hypothetical protein MED152_13459 [Tenacibaculum sp. MED152]
 gi|85819891|gb|EAQ41048.1| hypothetical protein MED152_13459 [Polaribacter dokdonensis MED152]
          Length = 231

 Score = 64.7 bits (156), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 47/153 (30%), Positives = 82/153 (53%), Gaps = 3/153 (1%)

Query: 53  DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
           DSI  +  N+I + P  +FK+    + Y      + K  P A   ++ +      L  I 
Sbjct: 32  DSIT-INLNEITLLPKQKFKSKDDIRYYLWFRRKVFKAYPYAILASKRLDSLNVRLSKIK 90

Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
           +K+++  + + V+K ++ E+T +IKK+T ++G++LIKLI R+T  T +  I+      +A
Sbjct: 91  SKRKKRKYTRQVQKYIEGEFTDQIKKMTKTEGRILIKLIHRQTGETAFNNIKELRSGWKA 150

Query: 173 GFYQAFAWVFGASLKKRYDPKGA--DRLVERIV 203
            +Y   A +F  SLK  YDP+    D L+E I+
Sbjct: 151 FWYNTTANLFKLSLKDEYDPENVNEDYLIEDIL 183
>gi|89890844|ref|ZP_01202353.1| hypothetical protein BBFL7_00193 [Flavobacteria bacterium BBFL7]
 gi|89516989|gb|EAS19647.1| hypothetical protein BBFL7_00193 [Flavobacteria bacterium BBFL7]
          Length = 275

 Score = 60.5 bits (145), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 52/197 (26%), Positives = 100/197 (50%), Gaps = 7/197 (3%)

Query: 10  FWFLLIAVGGFAQEHINPEDRVVDLNSPTFVPMVHIGKAKVGSDSIQYVRTNKIYIFPPM 69
           F  +LI V  FA+   +P    V  +S T  P        +  DS+  +  +++ +   +
Sbjct: 7   FITILIWVSAFAKAQTDPIILPVKTDS-TKTPEYFF----IDGDSLSAIELDRVMLLQSL 61

Query: 70  EFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLK 129
           +F +  +   Y  +   +KKV P AK   + + E    L ++  + ++  + K V++ ++
Sbjct: 62  KFDSRYENIRYQIIKRKVKKVWPYAKLAAERLTELDRRLASLEYESDKKKYTKIVQRYVE 121

Query: 130 EEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKR 189
           EE+T  +KKLT ++G++LIKL+ R+T  T +EL++       A +++  A +F   LK  
Sbjct: 122 EEFTTELKKLTTTEGQILIKLLHRQTGMTTFELLKQLRSGWSAFWFKNAASIFDIDLKAE 181

Query: 190 YDPKG--ADRLVERIVL 204
           Y P+    D  +E ++L
Sbjct: 182 YLPESNIEDFYIEDVLL 198
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.323    0.139    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 762,786,386
Number of Sequences: 5470121
Number of extensions: 30465818
Number of successful extensions: 65815
Number of sequences better than 1.0e-05: 25
Number of HSP's better than  0.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 65789
Number of HSP's gapped (non-prelim): 25
length of query: 211
length of database: 1,894,087,724
effective HSP length: 127
effective length of query: 84
effective length of database: 1,199,382,357
effective search space: 100748117988
effective search space used: 100748117988
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 127 (53.5 bits)