BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0019
(211 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|156862379|gb|EDO55810.1| hypothetical protein BACUNI_004... 209 8e-53
gi|156109887|gb|EDO11632.1| hypothetical protein BACOVA_028... 207 3e-52
gi|29348575|ref|NP_812078.1| hypothetical protein BT_3166 [... 203 4e-51
gi|153807073|ref|ZP_01959741.1| hypothetical protein BACCAC... 201 2e-50
gi|60679609|ref|YP_209753.1| hypothetical protein BF0012 [B... 200 4e-50
gi|53711304|ref|YP_097296.1| hypothetical protein BF0013 [B... 199 6e-50
gi|154493344|ref|ZP_02032664.1| hypothetical protein PARMER... 168 2e-40
gi|150009603|ref|YP_001304346.1| hypothetical protein BDI_3... 162 8e-39
gi|34541618|ref|NP_906097.1| hypothetical protein PG2030 [P... 125 2e-27
gi|150003711|ref|YP_001298455.1| hypothetical protein BVU_1... 110 5e-23
gi|149280728|ref|ZP_01886837.1| hypothetical protein PBAL39... 84 5e-15
gi|83857071|ref|ZP_00950599.1| hypothetical protein CA2559_... 84 6e-15
gi|86132729|ref|ZP_01051321.1| hypothetical protein MED134_... 83 1e-14
gi|120436247|ref|YP_861933.1| conserved hypothetical protei... 79 2e-13
gi|88807072|ref|ZP_01122586.1| hypothetical protein RB2501_... 78 3e-13
gi|149372768|ref|ZP_01891789.1| hypothetical protein SCB49_... 75 2e-12
gi|126662341|ref|ZP_01733340.1| hypothetical protein FBBAL3... 75 2e-12
gi|91214862|ref|ZP_01251835.1| hypothetical protein P700755... 75 2e-12
gi|86143123|ref|ZP_01061545.1| hypothetical protein MED217_... 75 3e-12
gi|88803632|ref|ZP_01119156.1| hypothetical protein PI23P_0... 71 4e-11
gi|88711921|ref|ZP_01106008.1| hypothetical protein FB2170_... 68 3e-10
gi|150025150|ref|YP_001295976.1| hypothetical protein FP107... 66 1e-09
gi|146300487|ref|YP_001195078.1| hypothetical protein Fjoh_... 65 2e-09
gi|86135718|ref|ZP_01054299.1| hypothetical protein MED152_... 65 3e-09
gi|89890844|ref|ZP_01202353.1| hypothetical protein BBFL7_0... 60 7e-08
>gi|156862379|gb|EDO55810.1| hypothetical protein BACUNI_00476 [Bacteroides uniformis ATCC 8492]
Length = 204
Score = 209 bits (532), Expect = 8e-53, Method: Composition-based stats.
Identities = 97/159 (61%), Positives = 130/159 (81%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
D+I YV+ +YIF P++FKN + Y +L+ ++KKVLP++KE N+ I+ET Y+ T+P
Sbjct: 46 DTIPYVKLPTVYIFKPLKFKNKRDMNKYYKLIRDVKKVLPISKEINRAIIETYEYMMTLP 105
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
T+K R HMKAVEK LKE+YTPR+KKLT++QGKLLIKL+DR+T+STGYEL++AF+GP +A
Sbjct: 106 TEKARQKHMKAVEKSLKEQYTPRMKKLTFAQGKLLIKLVDRQTNSTGYELVKAFMGPFKA 165
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
GFYQ FA +FGASLKK+YDP G D L ER++L VE+GQL
Sbjct: 166 GFYQTFAALFGASLKKQYDPMGDDALTERVILMVESGQL 204
>gi|156109887|gb|EDO11632.1| hypothetical protein BACOVA_02842 [Bacteroides ovatus ATCC 8483]
Length = 192
Score = 207 bits (527), Expect = 3e-52, Method: Composition-based stats.
Identities = 96/159 (60%), Positives = 130/159 (81%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
D+I V+ +YIF P++FKN+K+RQ Y RL+ N+KKV P+++E NQ I+ET YLQT+P
Sbjct: 34 DTIPCVQLRTVYIFRPLKFKNEKERQEYYRLIRNVKKVYPISREINQAIIETYEYLQTLP 93
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
+K R H+K VEKGLK++YTPR+KKL+++QGKLLIKLIDR+++ST YEL++AF+GP +A
Sbjct: 94 NEKARQKHIKRVEKGLKDQYTPRMKKLSFAQGKLLIKLIDRQSNSTSYELVKAFMGPFKA 153
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
GFYQ FA +FG SLKK YDP+G D+L ER+VL VE GQ+
Sbjct: 154 GFYQTFAALFGVSLKKEYDPQGEDKLTERVVLMVENGQI 192
>gi|29348575|ref|NP_812078.1| hypothetical protein BT_3166 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340480|gb|AAO78272.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 190
Score = 203 bits (517), Expect = 4e-51, Method: Composition-based stats.
Identities = 98/159 (61%), Positives = 128/159 (80%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
D+I V+ +YIF P++FKN+K+R Y RLV N+KKV P++KE NQ I+ET YLQT+P
Sbjct: 32 DTIPCVQLRTVYIFRPLKFKNEKERLEYYRLVRNVKKVYPISKEINQAIIETYEYLQTLP 91
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
+K R H+K VEKGLKE+YT R+KKL+++QGKLLIKLIDR+++ST YEL++AF+GP +A
Sbjct: 92 NEKARQKHLKRVEKGLKEQYTARMKKLSFTQGKLLIKLIDRQSNSTSYELVKAFMGPFKA 151
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
GFYQ FA +FGASLKK YDP G D+L ER+VL VE GQ+
Sbjct: 152 GFYQTFAALFGASLKKEYDPLGEDKLTERVVLLVENGQI 190
>gi|153807073|ref|ZP_01959741.1| hypothetical protein BACCAC_01350 [Bacteroides caccae ATCC 43185]
gi|149130193|gb|EDM21403.1| hypothetical protein BACCAC_01350 [Bacteroides caccae ATCC 43185]
Length = 192
Score = 201 bits (512), Expect = 2e-50, Method: Composition-based stats.
Identities = 95/159 (59%), Positives = 130/159 (81%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
D+I V+ +YIF P++FKN+K+R+ Y +LV N+KKV P++KE NQ I+ET YLQT+P
Sbjct: 34 DTIPCVQLRTVYIFRPLKFKNEKERREYYKLVRNVKKVYPISKEINQAIIETYEYLQTLP 93
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
+K R H+K VEKGLK++YT R+KKL+++QGKLLIKL+DR+++ T YEL++AF+GP +A
Sbjct: 94 NEKARQKHIKRVEKGLKDQYTARMKKLSFAQGKLLIKLVDRQSNQTSYELVKAFMGPFKA 153
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
GFYQAFA +FGASLKK YDP+G D+L ER+VL VE GQ+
Sbjct: 154 GFYQAFAALFGASLKKEYDPEGEDKLTERVVLLVENGQI 192
>gi|60679609|ref|YP_209753.1| hypothetical protein BF0012 [Bacteroides fragilis NCTC 9343]
gi|60491043|emb|CAH05791.1| conserved hypothetical exported protein [Bacteroides fragilis NCTC
9343]
Length = 199
Score = 200 bits (509), Expect = 4e-50, Method: Composition-based stats.
Identities = 93/159 (58%), Positives = 129/159 (81%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
D+I + I+IF P++F+N K++ Y +LV N+KKV P+A+E N+ I+ET YLQT+P
Sbjct: 41 DTIPAFQIPTIHIFKPLKFRNRKEQMEYYKLVRNVKKVYPIAREINRTIIETYEYLQTLP 100
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
+K R H+K VEKGLKE+YTPR+KKL+++QGKLLIKLIDR++H + YEL++AF+GP +A
Sbjct: 101 NEKARQRHIKRVEKGLKEQYTPRMKKLSFAQGKLLIKLIDRQSHQSSYELVKAFMGPFKA 160
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
GFYQ FA +FGASLKK+YDP+G D+L ER++L VE+GQL
Sbjct: 161 GFYQTFAALFGASLKKQYDPEGEDKLTERVILLVESGQL 199
>gi|53711304|ref|YP_097296.1| hypothetical protein BF0013 [Bacteroides fragilis YCH46]
gi|52214169|dbj|BAD46762.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 191
Score = 199 bits (507), Expect = 6e-50, Method: Composition-based stats.
Identities = 93/159 (58%), Positives = 129/159 (81%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
D+I + I+IF P++F+N K++ Y +LV N+KKV P+A+E N+ I+ET YLQT+P
Sbjct: 33 DTIPAFQIPTIHIFKPLKFRNRKEQMEYYKLVRNVKKVYPIAREINRTIIETYEYLQTLP 92
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
+K R H+K VEKGLKE+YTPR+KKL+++QGKLLIKLIDR++H + YEL++AF+GP +A
Sbjct: 93 NEKTRQRHIKRVEKGLKEQYTPRMKKLSFAQGKLLIKLIDRQSHQSSYELVKAFMGPFKA 152
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAGQL 211
GFYQ FA +FGASLKK+YDP+G D+L ER++L VE+GQL
Sbjct: 153 GFYQTFAALFGASLKKQYDPEGEDKLTERVILLVESGQL 191
>gi|154493344|ref|ZP_02032664.1| hypothetical protein PARMER_02681 [Parabacteroides merdae ATCC
43184]
gi|154086554|gb|EDN85599.1| hypothetical protein PARMER_02681 [Parabacteroides merdae ATCC
43184]
Length = 201
Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats.
Identities = 80/159 (50%), Positives = 116/159 (72%)
Query: 51 GSDSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQT 110
G D++ V +IYI+P ++FKN +++ Y +LV ++K+ LP AK + ++ET Y++T
Sbjct: 41 GKDTVPIVNLREIYIYPQVKFKNKREQARYTKLVRDVKRTLPYAKMVYETLIETYEYIET 100
Query: 111 IPTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPV 170
+P +K R AH+K +EK L +EY P++KKLT+SQGKLLIKLIDRE + + Y L++A+LG
Sbjct: 101 LPDEKSRQAHLKRMEKELFQEYKPQLKKLTFSQGKLLIKLIDRECNQSSYNLLKAYLGSF 160
Query: 171 RAGFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAG 209
RAGF+ FA +FGASLK YDPKG D + ER+V+ VE G
Sbjct: 161 RAGFWNIFAGMFGASLKTEYDPKGKDAMTERVVVLVENG 199
>gi|150009603|ref|YP_001304346.1| hypothetical protein BDI_3016 [Parabacteroides distasonis ATCC
8503]
gi|149938027|gb|ABR44724.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 201
Score = 162 bits (411), Expect = 8e-39, Method: Composition-based stats.
Identities = 76/157 (48%), Positives = 112/157 (71%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
D++ V +++I+P ++FKN +++ YN+LV ++K+ LP AK ++ET Y++T+P
Sbjct: 43 DTMAVVNLREVFIYPQVKFKNKREQAKYNKLVRDVKRTLPYAKMVYDTLIETYEYMETLP 102
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
K R AH+K +EK L +Y P +KKL++SQGKLLIKLIDRE + + Y L++A+LG RA
Sbjct: 103 NDKARQAHLKRMEKELFAQYKPELKKLSFSQGKLLIKLIDRECNQSSYNLLKAYLGSFRA 162
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQVEAG 209
GF+ FA +FGASLK YDPKG D + ER+V+ VE G
Sbjct: 163 GFWNFFAGMFGASLKSEYDPKGKDAMTERVVVLVENG 199
>gi|34541618|ref|NP_906097.1| hypothetical protein PG2030 [Porphyromonas gingivalis W83]
gi|34397936|gb|AAQ66996.1| hypothetical protein PG_2030 [Porphyromonas gingivalis W83]
Length = 193
Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats.
Identities = 64/139 (46%), Positives = 95/139 (68%), Gaps = 2/139 (1%)
Query: 75 KQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLKEEYTP 134
++RQ RL+ ++K+ LP AK ++ET Y++T+P +K R H+K +EK +K++Y P
Sbjct: 55 QERQELWRLIRDVKRTLPYAKMIAATLIETYEYMETMPDEKSRQKHLKRMEKEMKQQYMP 114
Query: 135 RIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRYDPKG 194
+K+LT QGKLLIKLIDR+ + Y+L++AFLG +AG++ FA +GASLK RY+P
Sbjct: 115 EMKRLTLRQGKLLIKLIDRQCAQSSYQLLKAFLGGWKAGWWNLFARFYGASLKTRYEPDK 174
Query: 195 --ADRLVERIVLQVEAGQL 211
D L ERIVL VE ++
Sbjct: 175 NPEDALTERIVLLVEEKRI 193
>gi|150003711|ref|YP_001298455.1| hypothetical protein BVU_1142 [Bacteroides vulgatus ATCC 8482]
gi|149932135|gb|ABR38833.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 76
Score = 110 bits (275), Expect = 5e-23, Method: Composition-based stats.
Identities = 51/76 (67%), Positives = 65/76 (85%)
Query: 136 IKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRYDPKGA 195
+KKLT+SQGKLLIKL++R+T S+ YEL++AF+GP +AGFYQ FA +FGASLKK Y P+G
Sbjct: 1 MKKLTFSQGKLLIKLVNRQTDSSSYELVKAFMGPFKAGFYQTFAALFGASLKKEYHPEGE 60
Query: 196 DRLVERIVLQVEAGQL 211
DRL ER+VL VE GQ+
Sbjct: 61 DRLTERVVLLVENGQI 76
>gi|149280728|ref|ZP_01886837.1| hypothetical protein PBAL39_06021 [Pedobacter sp. BAL39]
gi|149228511|gb|EDM33921.1| hypothetical protein PBAL39_06021 [Pedobacter sp. BAL39]
Length = 205
Score = 84.3 bits (207), Expect = 5e-15, Method: Composition-based stats.
Identities = 53/152 (34%), Positives = 83/152 (54%), Gaps = 1/152 (0%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
+ I ++ N++ I+ FK R A+NRL N+ KV+P A + + L
Sbjct: 49 EMIPWIPLNEVVIYGYRIFKTPADRAAFNRLRYNVMKVMPYALYAKRRYEQLEKDLAMTA 108
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
KKE +K +K +K+ + IK+LT SQG++L KLIDRE T Y++++ G V A
Sbjct: 109 DKKEHKRLVKQCDKEIKDMFNREIKELTISQGQILTKLIDRELGRTTYDIVKQTKGGVTA 168
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVL 204
YQ+ A V G +LK Y+ + DR +E I++
Sbjct: 169 FLYQSVARVVGHNLKSTYN-REEDRDIESIIV 199
>gi|83857071|ref|ZP_00950599.1| hypothetical protein CA2559_09743 [Croceibacter atlanticus
HTCC2559]
gi|83848438|gb|EAP86307.1| hypothetical protein CA2559_09743 [Croceibacter atlanticus
HTCC2559]
Length = 210
Score = 83.6 bits (205), Expect = 6e-15, Method: Composition-based stats.
Identities = 49/150 (32%), Positives = 86/150 (57%), Gaps = 2/150 (1%)
Query: 56 QYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKK 115
+++ ++ I ++F +D +R+ Y L +KV P AK ++ + E A L +I +K
Sbjct: 33 EFIDLEEVIILNKIKFTSDLERRRYLILRRKTRKVWPYAKLASERLSELNARLASIKSKS 92
Query: 116 ERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFY 175
ER + K V+K ++E++ +KKLT ++G++L+KLI R+T T ++LI+ RA +Y
Sbjct: 93 ERRKYTKIVQKYIEEQFAAELKKLTRTEGQILVKLIHRQTGVTTFQLIKNLRSGWRAFWY 152
Query: 176 QAFAWVFGASLKKRYDP--KGADRLVERIV 203
A +F LK Y+P D L+E I+
Sbjct: 153 DTTAGLFDIELKSEYNPVNNREDFLIEDIL 182
>gi|86132729|ref|ZP_01051321.1| hypothetical protein MED134_14051 [Cellulophaga sp. MED134]
gi|85816683|gb|EAQ37869.1| hypothetical protein MED134_14051 [Dokdonia donghaensis MED134]
Length = 226
Score = 83.2 bits (204), Expect = 1e-14, Method: Composition-based stats.
Identities = 57/196 (29%), Positives = 97/196 (49%), Gaps = 14/196 (7%)
Query: 12 FLLIAVGGFAQEHINPEDRVVDLNSPTFVPMVHIGKAKVGSDSI--QYVRTNKIYIFPPM 69
++++ + G Q P D V I + D+I + N++ +F +
Sbjct: 8 YIIVFIAGLTQAQETPVDST----------EVEIEYYIIQGDTIPRSAIDLNEVIVFKRL 57
Query: 70 EFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLK 129
+F N + R+ Y L +KV P AK ++ L +I K+ R + K + K L+
Sbjct: 58 KFDNKQDRRRYLILRRKTRKVFPYAKLAADRLVALNNRLDSIEGKRARKKYTKIIHKYLE 117
Query: 130 EEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKR 189
E++ +KKLT ++G++LIKLI R+T T +ELI+ RA +Y A F SLK+
Sbjct: 118 GEFSAELKKLTRTEGQILIKLIHRQTGETAFELIKRLRSGWRAFWYNTTASAFDISLKRE 177
Query: 190 YDP--KGADRLVERIV 203
++P + D L+E I+
Sbjct: 178 FNPEQEQEDYLIEDIL 193
>gi|120436247|ref|YP_861933.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
gi|117578397|emb|CAL66866.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
Length = 238
Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats.
Identities = 47/158 (29%), Positives = 90/158 (56%), Gaps = 4/158 (2%)
Query: 50 VGSDSI--QYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAY 107
+ DSI + + ++ + ++F ++ R+ Y L +KV P AK ++ ++E +
Sbjct: 43 IAGDSIPREAIDLEEVVLLRKLKFDSNTDRKRYLILRRKTRKVYPYAKLASERLIELNSR 102
Query: 108 LQTIPTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFL 167
L I +K+ + + K V+ ++EE++ +KKLT ++G++L+KLI R+T T ++L++
Sbjct: 103 LDDIKSKRAQKKYTKIVQNYIEEEFSVELKKLTRTEGQILVKLIHRQTGITTFDLVKELK 162
Query: 168 GPVRAGFYQAFAWVFGASLKKRYDPKG--ADRLVERIV 203
RA +Y A +F SLK+ Y P+ D L+E I+
Sbjct: 163 SGWRAFWYNTTASMFDISLKEEYHPESDQEDFLIEDIL 200
>gi|88807072|ref|ZP_01122586.1| hypothetical protein RB2501_00025 [Robiginitalea biformata
HTCC2501]
gi|88782855|gb|EAR14030.1| hypothetical protein RB2501_00025 [Robiginitalea biformata
HTCC2501]
Length = 232
Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats.
Identities = 43/156 (27%), Positives = 91/156 (58%), Gaps = 2/156 (1%)
Query: 58 VRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKER 117
+ + +YIF ++F + + Y L KV P AK ++ ++E + L++I +++++
Sbjct: 50 IELDAVYIFGKLKFDSYDDKLRYLILRRKTIKVYPYAKLASERLVELNSRLESITSRRKQ 109
Query: 118 DAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQA 177
+ + V++ +++E+ +KKLT ++G++L+KLI R+T T ++L++ RA +YQ
Sbjct: 110 KRYTRIVQRFIEDEFAAELKKLTRTEGQILVKLIYRQTGITAFDLVKQLRSGWRAFWYQT 169
Query: 178 FAWVFGASLKKRYDPKGA--DRLVERIVLQVEAGQL 211
A +F S+K+ + P+ D L+E I+ + A +
Sbjct: 170 TASLFDISIKEEFHPESVHEDYLIEDILQRAFAANI 205
>gi|149372768|ref|ZP_01891789.1| hypothetical protein SCB49_12434 [unidentified eubacterium SCB49]
gi|149354465|gb|EDM43030.1| hypothetical protein SCB49_12434 [unidentified eubacterium SCB49]
Length = 219
Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats.
Identities = 53/195 (27%), Positives = 101/195 (51%), Gaps = 12/195 (6%)
Query: 13 LLIAVGGFAQEHINPEDRVVDLNSPTFVPMVHIGKAKVGSDSI--QYVRTNKIYIFPPME 70
+L V FAQ+ + D+ VD + + + D+I +++ ++ + +
Sbjct: 1 MLFGVFVFAQKDLGHTDKKVDSTNTLYY--------IIEGDTIPREFIDLEEVVLLNKLS 52
Query: 71 FKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLKE 130
F + R+ Y L +KV P AK ++ + L+ I K ++ + K ++K ++E
Sbjct: 53 FNTKEDRKRYLILRRKTRKVYPYAKLASERLNTMYRRLEEIENKGDKRRYTKRIQKYIEE 112
Query: 131 EYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRY 190
E++ ++KKLT ++G++L+KLI R+T T ++L++ RA +Y A +F S+KK Y
Sbjct: 113 EFSEKLKKLTRTEGQILVKLIHRQTGITAFDLVKDLRTGWRAFWYNTTANMFDISIKKEY 172
Query: 191 DP--KGADRLVERIV 203
P D L+E I+
Sbjct: 173 KPFEVKEDYLIEDIL 187
>gi|126662341|ref|ZP_01733340.1| hypothetical protein FBBAL38_03280 [Flavobacteria bacterium BAL38]
gi|126625720|gb|EAZ96409.1| hypothetical protein FBBAL38_03280 [Flavobacteria bacterium BAL38]
Length = 228
Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats.
Identities = 45/141 (31%), Positives = 78/141 (55%)
Query: 52 SDSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTI 111
S I + +I P + D+ ++A L I KV P AK + + A + +
Sbjct: 34 SSIIYSIELREIIFTPDNVYSMDEDKKAKLILKRRIFKVYPYAKMTADKLTQLNATMAKL 93
Query: 112 PTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVR 171
T +E+ + K VEK L+EE+ PR+KKL+ G++L+KLI R+T +T ++LI+ + +
Sbjct: 94 KTNREKKKYFKIVEKYLEEEFEPRLKKLSRKDGQILVKLIYRQTGNTTFDLIKEYKSGWK 153
Query: 172 AGFYQAFAWVFGASLKKRYDP 192
A + + A++F +LK +Y P
Sbjct: 154 AFWANSTAYLFDINLKTQYKP 174
>gi|91214862|ref|ZP_01251835.1| hypothetical protein P700755_18394 [Psychroflexus torquis ATCC
700755]
gi|91187289|gb|EAS73659.1| hypothetical protein P700755_18394 [Psychroflexus torquis ATCC
700755]
Length = 237
Score = 75.1 bits (183), Expect = 2e-12, Method: Composition-based stats.
Identities = 42/145 (28%), Positives = 81/145 (55%), Gaps = 2/145 (1%)
Query: 50 VGSDS--IQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAY 107
+G D+ + V ++ I+P ++FK+ + Y L KKV P K ++
Sbjct: 36 IGHDASWVSEVELEEVMIYPKLKFKSRDDFRDYLILKRKTKKVWPYVKLASERFETLNKR 95
Query: 108 LQTIPTKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFL 167
L +I +K + + + +++ ++EE+T +KKLT ++G++L+KLI R+T T ++L++
Sbjct: 96 LSSIDSKPSKRRYTRVIQRYVEEEFTEELKKLTKTEGQILVKLIHRQTGVTTFDLVKDLR 155
Query: 168 GPVRAGFYQAFAWVFGASLKKRYDP 192
RA +Y A +F SLK+ ++P
Sbjct: 156 SGWRAFWYNTTASLFNISLKEEFNP 180
>gi|86143123|ref|ZP_01061545.1| hypothetical protein MED217_10772 [Flavobacterium sp. MED217]
gi|85830568|gb|EAQ49027.1| hypothetical protein MED217_10772 [Leeuwenhoekiella blandensis
MED217]
Length = 265
Score = 75.1 bits (183), Expect = 3e-12, Method: Composition-based stats.
Identities = 42/145 (28%), Positives = 83/145 (57%), Gaps = 2/145 (1%)
Query: 61 NKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAH 120
+ + + ++F ++ R+ Y L +KV P AK + ++E L ++ TK++R +
Sbjct: 86 DDVVVLGRLKFDDNTARRRYLILRRKTRKVWPYAKLAGERLVELNERLDSMETKRDRKRY 145
Query: 121 MKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAW 180
K +++ +++E+ +KKLT ++G++L+KL+ R+T T ++LI+ + ++A A
Sbjct: 146 TKIIQRYVEDEFKEELKKLTRTEGQILVKLMYRQTGQTTFDLIKNYRSGLKAFLLNTTAS 205
Query: 181 VFGASLKKRYDPKGA--DRLVERIV 203
F SLK+ YDP D L+E I+
Sbjct: 206 FFDISLKEIYDPAEVQEDYLIEDIL 230
>gi|88803632|ref|ZP_01119156.1| hypothetical protein PI23P_00030 [Polaribacter irgensii 23-P]
gi|88780365|gb|EAR11546.1| hypothetical protein PI23P_00030 [Polaribacter irgensii 23-P]
Length = 229
Score = 70.9 bits (172), Expect = 4e-11, Method: Composition-based stats.
Identities = 45/153 (29%), Positives = 82/153 (53%), Gaps = 4/153 (2%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
DSI ++ N+I + P +F + + Y + K P AK + + A L I
Sbjct: 27 DSI-VIQLNEIALLPKPKFSAKEDIRYYLWFRKKVYKAYPFAKLAAERLDSLNARLDRID 85
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
+K++R + + ++ ++ E+T +IKK+T ++G++LIKLI R+T T ++ I+ +A
Sbjct: 86 SKRKRRKYTRLIQNYIEGEFTTQIKKMTTTEGRVLIKLIHRQTGKTAFDNIRGLRSGWKA 145
Query: 173 GFYQAFAWVFGASLKKRYDPKGADRLVERIVLQ 205
+Y A VF SL+ Y P DR+ E +++
Sbjct: 146 FWYNTTANVFKLSLRTEYQP---DRINEDFLIE 175
>gi|88711921|ref|ZP_01106008.1| hypothetical protein FB2170_13568 [Flavobacteriales bacterium
HTCC2170]
gi|88709327|gb|EAR01560.1| hypothetical protein FB2170_13568 [Flavobacteriales bacterium
HTCC2170]
Length = 203
Score = 68.2 bits (165), Expect = 3e-10, Method: Composition-based stats.
Identities = 51/154 (33%), Positives = 85/154 (55%), Gaps = 3/154 (1%)
Query: 61 NKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAH 120
+++Y+F ++F + K + Y L KV P AK + +LE L I ++R +
Sbjct: 24 DEVYVFSKLKFPSYKDKLRYYILRRKTIKVYPYAKMAAERLLELNDSLTKIKKSRKRKKY 83
Query: 121 MKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAW 180
K V+K ++ E++ +KKLT ++G++LIKLI R+T T + L++ RA +Y A
Sbjct: 84 TKKVQKYIEGEFSEELKKLTRTEGQILIKLIYRQTGKTAFGLVKELRSGWRAFWYSTTAK 143
Query: 181 VFGASLKKRYDPK--GADRLVERIVLQV-EAGQL 211
+F SLK+ Y P D L+E I+ + AG+L
Sbjct: 144 MFKISLKEEYRPDVVQEDYLIEDILQRAFAAGRL 177
>gi|150025150|ref|YP_001295976.1| hypothetical protein FP1077 [Flavobacterium psychrophilum JIP02/86]
gi|149771691|emb|CAL43165.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 225
Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats.
Identities = 37/135 (27%), Positives = 73/135 (54%)
Query: 58 VRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKER 117
+ ++YI +FK+++ ++ + L + KV P AK + +Q++ +++++
Sbjct: 44 IELQEVYISNKRDFKSEEDQKRFYILQRRVLKVYPYAKTAADRLTTLNIGMQSLKSERDK 103
Query: 118 DAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQA 177
+ K VE L E+ ++KKL+ G++L+KLI R+T S+ + LI+ +A +
Sbjct: 104 RKYFKLVENYLTSEFEDQLKKLSRKDGQVLVKLIHRQTGSSTFNLIKELKSGWKAFWSNQ 163
Query: 178 FAWVFGASLKKRYDP 192
A +F +LK YDP
Sbjct: 164 TAKIFDINLKTTYDP 178
>gi|146300487|ref|YP_001195078.1| hypothetical protein Fjoh_2737 [Flavobacterium johnsoniae UW101]
gi|146154905|gb|ABQ05759.1| hypothetical protein Fjoh_2737 [Flavobacterium johnsoniae UW101]
Length = 227
Score = 65.5 bits (158), Expect = 2e-09, Method: Composition-based stats.
Identities = 43/140 (30%), Positives = 72/140 (51%), Gaps = 3/140 (2%)
Query: 75 KQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLKEEYTP 134
++ + + L + KV P A+ + + + T +E+ + K VE L E+
Sbjct: 60 EELKQFQILQMRVYKVYPYARLAADRLTALNNGMARLKTSREKKKYFKIVEDYLNNEFED 119
Query: 135 RIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKRYDP-- 192
R+KKL+ QG++L+KL+ R+T T YELI+ +A A +F SLK Y P
Sbjct: 120 RLKKLSRKQGQILVKLVHRQTGKTTYELIKTLKSGFKAFVSNTTANLFDISLKTEYKPFE 179
Query: 193 KGADRLVERIVLQV-EAGQL 211
D L+E I+++ E+G+L
Sbjct: 180 VNEDYLIETILVRAFESGRL 199
>gi|86135718|ref|ZP_01054299.1| hypothetical protein MED152_13459 [Tenacibaculum sp. MED152]
gi|85819891|gb|EAQ41048.1| hypothetical protein MED152_13459 [Polaribacter dokdonensis MED152]
Length = 231
Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats.
Identities = 47/153 (30%), Positives = 82/153 (53%), Gaps = 3/153 (1%)
Query: 53 DSIQYVRTNKIYIFPPMEFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIP 112
DSI + N+I + P +FK+ + Y + K P A ++ + L I
Sbjct: 32 DSIT-INLNEITLLPKQKFKSKDDIRYYLWFRRKVFKAYPYAILASKRLDSLNVRLSKIK 90
Query: 113 TKKERDAHMKAVEKGLKEEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRA 172
+K+++ + + V+K ++ E+T +IKK+T ++G++LIKLI R+T T + I+ +A
Sbjct: 91 SKRKKRKYTRQVQKYIEGEFTDQIKKMTKTEGRILIKLIHRQTGETAFNNIKELRSGWKA 150
Query: 173 GFYQAFAWVFGASLKKRYDPKGA--DRLVERIV 203
+Y A +F SLK YDP+ D L+E I+
Sbjct: 151 FWYNTTANLFKLSLKDEYDPENVNEDYLIEDIL 183
>gi|89890844|ref|ZP_01202353.1| hypothetical protein BBFL7_00193 [Flavobacteria bacterium BBFL7]
gi|89516989|gb|EAS19647.1| hypothetical protein BBFL7_00193 [Flavobacteria bacterium BBFL7]
Length = 275
Score = 60.5 bits (145), Expect = 7e-08, Method: Composition-based stats.
Identities = 52/197 (26%), Positives = 100/197 (50%), Gaps = 7/197 (3%)
Query: 10 FWFLLIAVGGFAQEHINPEDRVVDLNSPTFVPMVHIGKAKVGSDSIQYVRTNKIYIFPPM 69
F +LI V FA+ +P V +S T P + DS+ + +++ + +
Sbjct: 7 FITILIWVSAFAKAQTDPIILPVKTDS-TKTPEYFF----IDGDSLSAIELDRVMLLQSL 61
Query: 70 EFKNDKQRQAYNRLVANIKKVLPLAKECNQIILETGAYLQTIPTKKERDAHMKAVEKGLK 129
+F + + Y + +KKV P AK + + E L ++ + ++ + K V++ ++
Sbjct: 62 KFDSRYENIRYQIIKRKVKKVWPYAKLAAERLTELDRRLASLEYESDKKKYTKIVQRYVE 121
Query: 130 EEYTPRIKKLTYSQGKLLIKLIDRETHSTGYELIQAFLGPVRAGFYQAFAWVFGASLKKR 189
EE+T +KKLT ++G++LIKL+ R+T T +EL++ A +++ A +F LK
Sbjct: 122 EEFTTELKKLTTTEGQILIKLLHRQTGMTTFELLKQLRSGWSAFWFKNAASIFDIDLKAE 181
Query: 190 YDPKG--ADRLVERIVL 204
Y P+ D +E ++L
Sbjct: 182 YLPESNIEDFYIEDVLL 198
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.323 0.139 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 762,786,386
Number of Sequences: 5470121
Number of extensions: 30465818
Number of successful extensions: 65815
Number of sequences better than 1.0e-05: 25
Number of HSP's better than 0.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 65789
Number of HSP's gapped (non-prelim): 25
length of query: 211
length of database: 1,894,087,724
effective HSP length: 127
effective length of query: 84
effective length of database: 1,199,382,357
effective search space: 100748117988
effective search space used: 100748117988
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 127 (53.5 bits)