BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF2681
(167 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|154491956|ref|ZP_02031582.1| hypothetical protein PARMER... 196 3e-49
gi|150010112|ref|YP_001304855.1| hypothetical protein BDI_3... 190 2e-47
gi|53711552|ref|YP_097544.1| hypothetical protein BF0261 [B... 170 3e-41
gi|156862555|gb|EDO55986.1| hypothetical protein BACUNI_002... 164 2e-39
gi|150003920|ref|YP_001298664.1| hypothetical protein BVU_1... 161 1e-38
gi|153809149|ref|ZP_01961817.1| hypothetical protein BACCAC... 160 2e-38
gi|156110660|gb|EDO12405.1| hypothetical protein BACOVA_019... 155 8e-37
gi|29348811|ref|NP_812314.1| hypothetical protein BT_3402 [... 154 1e-36
gi|34540098|ref|NP_904577.1| hypothetical protein PG0253 [P... 120 3e-26
gi|86140763|ref|ZP_01059322.1| hypothetical protein MED217_... 107 2e-22
gi|86131974|ref|ZP_01050570.1| hypothetical protein MED134_... 99 6e-20
gi|88803928|ref|ZP_01119448.1| hypothetical protein RB2501_... 92 7e-18
gi|89889479|ref|ZP_01200990.1| conserved hypothetical prote... 91 3e-17
gi|88711066|ref|ZP_01105154.1| hypothetical protein FB2170_... 91 3e-17
gi|149370940|ref|ZP_01890535.1| hypothetical protein SCB49_... 88 2e-16
gi|149277311|ref|ZP_01883453.1| hypothetical protein PBAL39... 86 6e-16
gi|86133948|ref|ZP_01052530.1| hypothetical protein MED152_... 86 8e-16
gi|146299388|ref|YP_001193979.1| protein of unknown functio... 86 9e-16
gi|116246257|ref|XP_001230419.1| ENSANGP00000030013 [Anophe... 84 3e-15
gi|126663077|ref|ZP_01734075.1| hypothetical protein FBBAL3... 81 2e-14
gi|83855690|ref|ZP_00949219.1| hypothetical protein CA2559_... 80 3e-14
gi|91217186|ref|ZP_01254148.1| hypothetical protein P700755... 76 5e-13
gi|120434525|ref|YP_860221.1| protein containing DUF150 [Gr... 75 1e-12
gi|88802099|ref|ZP_01117627.1| hypothetical protein PI23P_0... 74 4e-12
gi|150024571|ref|YP_001295397.1| hypothetical protein FP046... 72 1e-11
gi|156319550|ref|XP_001618133.1| hypothetical protein NEMVE... 72 1e-11
gi|124002856|ref|ZP_01687708.1| 15 kDa protein [Microscilla... 69 1e-10
gi|126647232|ref|ZP_01719742.1| hypothetical protein ALPR1_... 67 5e-10
gi|146283632|ref|YP_001173785.1| hypothetical protein PST_3... 56 8e-07
gi|83816343|ref|YP_445896.1| Uncharacterized BCR, YhbC fami... 55 2e-06
gi|110639789|ref|YP_679999.1| hypothetical protein CHU_3420... 54 3e-06
gi|114562166|ref|YP_749679.1| protein of unknown function D... 54 3e-06
gi|116214708|ref|ZP_01480944.1| hypothetical protein VchoR_... 54 4e-06
gi|15640661|ref|NP_230290.1| hypothetical protein VC0641 [V... 54 4e-06
gi|67158742|ref|ZP_00419603.1| Protein of unknown function ... 54 5e-06
>gi|154491956|ref|ZP_02031582.1| hypothetical protein PARMER_01586 [Parabacteroides merdae ATCC
43184]
gi|154088197|gb|EDN87242.1| hypothetical protein PARMER_01586 [Parabacteroides merdae ATCC
43184]
Length = 157
Score = 196 bits (499), Expect = 3e-49, Method: Composition-based stats.
Identities = 104/155 (67%), Positives = 124/155 (80%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MIE +++ L E LA S YLVDV + P NLI++EID+D+ V I+DC LSRY+E LD
Sbjct: 3 MIEKDVISQLVEEKLASSGNYLVDVVIKPGNLIIIEIDNDEGVCIDDCAELSRYVEGHLD 62
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
RDVED+ELEV S G+TSPFK LRQY+KNIGNEVE+LLKSG KL+G LKSAD+ G VVTV
Sbjct: 63 RDVEDFELEVGSAGITSPFKVLRQYVKNIGNEVEMLLKSGTKLTGVLKSADENGVVVTVE 122
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VKPEGAKRKVTV D+SY +DEIKYTK +IRFK
Sbjct: 123 KQVKPEGAKRKVTVREDQSYTFDEIKYTKYLIRFK 157
>gi|150010112|ref|YP_001304855.1| hypothetical protein BDI_3532 [Parabacteroides distasonis ATCC
8503]
gi|149938536|gb|ABR45233.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 155
Score = 190 bits (482), Expect = 2e-47, Method: Composition-based stats.
Identities = 109/155 (70%), Positives = 126/155 (81%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MIE K+V+ L E LA S YLVDV + P NLIVVEID+D+AV I+DC LSRY+E LD
Sbjct: 1 MIEKKVVSQLIEEKLASSSNYLVDVVIKPGNLIVVEIDNDEAVSIDDCAELSRYLEEHLD 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
RDVEDYELEV S G+TSPFK LRQY+KNIGNEVE+LLK+G KL+G LKSAD+ G VV+V
Sbjct: 61 RDVEDYELEVGSAGITSPFKVLRQYVKNIGNEVEMLLKNGSKLTGVLKSADENGVVVSVE 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VKPEGAKRKVTV DESY +DEIKYTK +IRFK
Sbjct: 121 KKVKPEGAKRKVTVVEDESYTFDEIKYTKYLIRFK 155
>gi|53711552|ref|YP_097544.1| hypothetical protein BF0261 [Bacteroides fragilis YCH46]
gi|60679811|ref|YP_209955.1| hypothetical protein BF0217 [Bacteroides fragilis NCTC 9343]
gi|81317116|sp|Q5LIN3|Y217_BACFN UPF0090 protein BF0217
gi|81383870|sp|Q64ZR6|Y261_BACFR UPF0090 protein BF0261
gi|52214417|dbj|BAD47010.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60491245|emb|CAH05993.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 155
Score = 170 bits (430), Expect = 3e-41, Method: Composition-based stats.
Identities = 92/155 (59%), Positives = 114/155 (73%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MIE + V + E +L D + +LV+VTV+PD+ IVVEIDH + V IEDCV LSR+IE+ L+
Sbjct: 1 MIEKRTVCQIVEEWLEDKDYFLVEVTVSPDDKIVVEIDHAEGVWIEDCVELSRFIESKLN 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
R+ EDYELEV S G+ PFK L+QY +IG EVEVL K G KLSG LK AD+ FVVTV
Sbjct: 61 REEEDYELEVGSAGIGQPFKVLQQYYNHIGLEVEVLTKGGRKLSGVLKDADEEKFVVTVQ 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VKPEGAKR V DE++ YD+IKYTK +I FK
Sbjct: 121 KKVKPEGAKRPQLVEEDETFTYDDIKYTKYLISFK 155
>gi|156862555|gb|EDO55986.1| hypothetical protein BACUNI_00272 [Bacteroides uniformis ATCC 8492]
Length = 155
Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats.
Identities = 88/155 (56%), Positives = 113/155 (72%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MIE K V + + +LA + +LV+VT++PD+ I+VEIDH + V IEDCV LSRYIE+ L+
Sbjct: 1 MIEKKTVCQIVDEWLAGKDYFLVEVTISPDDKILVEIDHKEGVWIEDCVELSRYIESKLN 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
R+ EDYELEV S G+ PFK L+QY+ +IG EVEVL K G KLSG LK AD+ FVVT+
Sbjct: 61 REDEDYELEVGSAGIGQPFKVLQQYINHIGKEVEVLAKDGRKLSGVLKEADEQHFVVTIQ 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VK EGAKR V D ++ Y+EIKYTK +I FK
Sbjct: 121 KKVKEEGAKRPKLVDEDLTFTYEEIKYTKYLISFK 155
>gi|150003920|ref|YP_001298664.1| hypothetical protein BVU_1353 [Bacteroides vulgatus ATCC 8482]
gi|149932344|gb|ABR39042.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 155
Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats.
Identities = 87/155 (56%), Positives = 112/155 (72%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MI+ +V + + +L + +LVDVTV+PD+ IVVEIDH + V I+DCV LSRYIE+ LD
Sbjct: 1 MIDKNVVTRIVDEWLEGKDYFLVDVTVSPDDKIVVEIDHAEGVWIDDCVELSRYIESKLD 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
R+ EDYELEV S G+ PFK L+QYL +IG EVE+L K G+KL G LK A++ F VT+
Sbjct: 61 REEEDYELEVGSAGIGQPFKVLQQYLIHIGKEVEILTKEGKKLEGVLKDANEENFTVTIE 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VKPEGAKR V D ++ YDEIKYTK +I FK
Sbjct: 121 KKVKPEGAKRPKLVEEDITFAYDEIKYTKYLISFK 155
>gi|153809149|ref|ZP_01961817.1| hypothetical protein BACCAC_03459 [Bacteroides caccae ATCC 43185]
gi|149128125|gb|EDM19345.1| hypothetical protein BACCAC_03459 [Bacteroides caccae ATCC 43185]
Length = 155
Score = 160 bits (405), Expect = 2e-38, Method: Composition-based stats.
Identities = 87/155 (56%), Positives = 112/155 (72%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MIE K V + E +L + +LV+VTV+PD+ IVVEIDH + V IEDCV LSRYIE+ L+
Sbjct: 1 MIEKKTVCQIVEEWLEGKDYFLVEVTVSPDDKIVVEIDHAEGVWIEDCVELSRYIESKLN 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
R+ EDYELEV S G+ PFK L+QY +IG EVEV+ K G+KL+G LK AD+ F VTV
Sbjct: 61 REEEDYELEVGSAGIGQPFKVLQQYYIHIGQEVEVMTKGGQKLTGILKDADEEKFTVTVQ 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VK EG+KR V DE++ Y++IKYTK +I FK
Sbjct: 121 KKVKLEGSKRPKLVEEDETFTYEQIKYTKYLISFK 155
>gi|156110660|gb|EDO12405.1| hypothetical protein BACOVA_01904 [Bacteroides ovatus ATCC 8483]
Length = 155
Score = 155 bits (392), Expect = 8e-37, Method: Composition-based stats.
Identities = 85/155 (54%), Positives = 109/155 (70%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MIE K V + E +L + +LV+VTV+PD+ IVVEIDH + V IEDCV LSR+IE+ L+
Sbjct: 1 MIEKKTVCQIVEEWLEGKDYFLVEVTVSPDDKIVVEIDHAEGVWIEDCVELSRFIESKLN 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
R+ EDYELEV S G+ PFK L+QY +IG EVEVL G KL+G LK AD+ F V V
Sbjct: 61 REEEDYELEVGSAGIGQPFKVLQQYYIHIGQEVEVLTGDGRKLAGILKDADEEKFTVGVQ 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VK EG+KR V DE++ Y++IKYTK +I FK
Sbjct: 121 KKVKTEGSKRPKLVEEDETFTYEQIKYTKYLISFK 155
>gi|29348811|ref|NP_812314.1| hypothetical protein BT_3402 [Bacteroides thetaiotaomicron
VPI-5482]
gi|34223111|sp|Q8A2A3|Y3402_BACTN UPF0090 protein BT_3402
gi|29340717|gb|AAO78508.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 155
Score = 154 bits (390), Expect = 1e-36, Method: Composition-based stats.
Identities = 83/155 (53%), Positives = 110/155 (70%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MIE K V + E +L + +LV+VTV+PD+ IVVEIDH + V IEDCV LSR+IE+ L+
Sbjct: 1 MIEKKTVCQIVEEWLEGKDYFLVEVTVSPDDKIVVEIDHAEGVWIEDCVELSRFIESKLN 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
R+ EDYELEV S G+ PFK L+QY +IG EVEV+ + G KL+G LK AD+ F V V
Sbjct: 61 REEEDYELEVGSAGIGQPFKVLQQYYIHIGQEVEVVTRDGRKLAGILKDADEEKFTVGVQ 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
+ VK EG+KR + DE++ Y++IKYTK +I FK
Sbjct: 121 KKVKLEGSKRPKLIEEDETFTYEQIKYTKYLISFK 155
>gi|34540098|ref|NP_904577.1| hypothetical protein PG0253 [Porphyromonas gingivalis W83]
gi|51316749|sp|Q7MXE6|Y253_PORGI UPF0090 protein PG_0253
gi|34396409|gb|AAQ65476.1| conserved hypothetical protein [Porphyromonas gingivalis W83]
Length = 152
Score = 120 bits (300), Expect = 3e-26, Method: Composition-based stats.
Identities = 69/152 (45%), Positives = 97/152 (63%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
MI+ + + + YL+ E +LV+V + P N I+VE+D Q V I++CVALSR+IE+ +D
Sbjct: 1 MIDREQIVRIVSDYLSTGETFLVEVAIHPGNRILVELDSAQGVCIDECVALSRHIESQVD 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
RD+EDYELEV STGLTSP K +RQ+ I +E+ VLL +G K +G L + + V
Sbjct: 61 RDIEDYELEVGSTGLTSPLKVMRQWENCIDSELSVLLTNGMKETGRLITVAPEAIKLEVV 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNII 164
R VKPEGAKRK + + + +IK II
Sbjct: 121 RMVKPEGAKRKKPETQELTIVMADIKQAVRII 152
>gi|86140763|ref|ZP_01059322.1| hypothetical protein MED217_16465 [Flavobacterium sp. MED217]
gi|85832705|gb|EAQ51154.1| hypothetical protein MED217_16465 [Leeuwenhoekiella blandensis
MED217]
Length = 154
Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats.
Identities = 64/154 (41%), Positives = 89/154 (57%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
M++ K+ A L EA+ YL+D+ + +N IVV ID D+ V ++DC+ +SR +E LD
Sbjct: 1 MLKEKVAALLDEAFKEYPSLYLIDLKIKGNNEIVVIIDGDEGVSVQDCINVSRKVEHNLD 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
R+ ED+ LEV S G T P RQY KN G ++EV+L+ G K+ G L D G V+
Sbjct: 61 REEEDFSLEVMSAGATEPLVNKRQYKKNEGRDLEVILQDGAKIKGNLIQVHDEGIVLFWK 120
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
V E K K+TV +E YD IK K I+F
Sbjct: 121 ERVPKEVGKGKMTVEKEEVIAYDAIKQAKVKIKF 154
>gi|86131974|ref|ZP_01050570.1| hypothetical protein MED134_11346 [Cellulophaga sp. MED134]
gi|85817308|gb|EAQ38488.1| hypothetical protein MED134_11346 [Dokdonia donghaensis MED134]
Length = 154
Score = 99.4 bits (246), Expect = 6e-20, Method: Composition-based stats.
Identities = 63/155 (40%), Positives = 88/155 (56%), Gaps = 2/155 (1%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
M++ K+ L EA + +L+ + + +N I + ID D V +EDC+A+SR +E LD
Sbjct: 1 MLKEKVTNLLQEALDENPNLFLISLDIQGNNEIKIIIDGDDGVKVEDCIAVSRKVEHNLD 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-V 131
R+ ED+ LEV S G TSP RQY KN+G +EV + G K+ G + A DT +T
Sbjct: 61 REEEDFSLEVMSAGATSPLAIPRQYKKNVGRHLEVKKEDGTKIEGLMTDATDTDVTLTWK 120
Query: 132 GRPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
R KP G K KVTV E+ Y EI K +I+F
Sbjct: 121 TREPKPVG-KGKVTVQKTETISYSEIVQAKVMIKF 154
>gi|88803928|ref|ZP_01119448.1| hypothetical protein RB2501_03740 [Robiginitalea biformata
HTCC2501]
gi|88784807|gb|EAR15976.1| hypothetical protein RB2501_03740 [Robiginitalea biformata
HTCC2501]
Length = 157
Score = 92.4 bits (228), Expect = 7e-18, Method: Composition-based stats.
Identities = 56/154 (36%), Positives = 91/154 (59%), Gaps = 3/154 (1%)
Query: 14 IEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDR 73
+E ++ L EA E +L+D+ V PD I V +D D+ + ++DC+ +SR +E LDR
Sbjct: 6 LEKRVSELLDEALAGRPELFLIDLKVNPDQSIQVTLDGDEGISLQDCMDISRAVEHSLDR 65
Query: 74 DVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-VG 132
D ++ LEV S G T+P RQY K+IG ++ V ++G+++ G L +ADD ++
Sbjct: 66 DEFNFSLEVGSAGATAPLLMPRQYRKHIGRKLAV-RRAGKEVQGTLTAADDKDITLSWKA 124
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
R KP G K K TV +E+ + EI+ K +++F
Sbjct: 125 REPKPVG-KGKHTVRKEETIPHSEIEQAKVVLKF 157
>gi|89889479|ref|ZP_01200990.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89517752|gb|EAS20408.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 155
Score = 90.9 bits (224), Expect = 3e-17, Method: Composition-based stats.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 2/146 (1%)
Query: 22 LAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVEDYELE 81
L +A+ + +L+++++ N I V ID D+ V + DC+ +SR +E LDRD D+ LE
Sbjct: 10 LDDAFEERPDLFLMEMSIDGANEIKVIIDGDEGVTVADCIFISRAVEHNLDRDELDFSLE 69
Query: 82 VTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-VGRPVKPEGA 140
V S G ++P RQ+ +N+G ++EV+ + K+ G L SADD + R KP G
Sbjct: 70 VASAGASAPLTLPRQFKRNVGRQLEVINREKRKVVGQLVSADDESIAIQWKAREPKPIG- 128
Query: 141 KRKVTVCTDESYLYDEIKYTKNIIRF 166
K KVTV + S YD+IK K +I F
Sbjct: 129 KGKVTVEKEWSLKYDDIKQAKVVITF 154
>gi|88711066|ref|ZP_01105154.1| hypothetical protein FB2170_03110 [Flavobacteriales bacterium
HTCC2170]
gi|88710007|gb|EAR02239.1| hypothetical protein FB2170_03110 [Flavobacteriales bacterium
HTCC2170]
Length = 153
Score = 90.5 bits (223), Expect = 3e-17, Method: Composition-based stats.
Identities = 62/156 (39%), Positives = 88/156 (56%), Gaps = 5/156 (3%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
M++ K+ + L EA D +L+D T+ DN I V +D D V ++DC+ +SR IE LD
Sbjct: 1 MLKDKVKSLLNEALSQDESLFLIDFTMGADNSINVVLDGDNGVSVQDCMKVSRGIEHNLD 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKL-SGCLKSADDTGFVVT- 130
R+ ED+ L VTS G SP RQY KNIG +V+V ++ E + G L +A G V+
Sbjct: 61 REEEDFSLTVTSAGAASPMVNPRQYQKNIGRKVKV--QTLENVYEGNLTAASTNGIVLEW 118
Query: 131 VGRPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
R KP G K K TV + + +IK K I++F
Sbjct: 119 KAREPKPIG-KGKTTVQKKKEITFSDIKEAKVILKF 153
>gi|149370940|ref|ZP_01890535.1| hypothetical protein SCB49_04625 [unidentified eubacterium SCB49]
gi|149355726|gb|EDM44284.1| hypothetical protein SCB49_04625 [unidentified eubacterium SCB49]
Length = 153
Score = 87.8 bits (216), Expect = 2e-16, Method: Composition-based stats.
Identities = 57/146 (39%), Positives = 82/146 (56%), Gaps = 3/146 (2%)
Query: 22 LAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVEDYELE 81
L +A + +L+D+ V DN I V ID D V +EDC+A+SR +E LD ++ D+ ++
Sbjct: 9 LDKALKENPSLFLIDLEVTADNQIRVTIDGDDGVKVEDCIAVSRAVEHNLDEEL-DFSID 67
Query: 82 VTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-VGRPVKPEGA 140
V S G + P RQY+KN+G V V GEK+ L +AD+ + R KP G
Sbjct: 68 VMSCGASEPLTMQRQYVKNVGRNVLVKTDKGEKIEATLVAADEENITLNWKAREPKPVG- 126
Query: 141 KRKVTVCTDESYLYDEIKYTKNIIRF 166
K KVTV + Y+EI TK +I+F
Sbjct: 127 KGKVTVKKEAVIPYNEIVQTKVMIKF 152
>gi|149277311|ref|ZP_01883453.1| hypothetical protein PBAL39_10486 [Pedobacter sp. BAL39]
gi|149232188|gb|EDM37565.1| hypothetical protein PBAL39_10486 [Pedobacter sp. BAL39]
Length = 154
Score = 86.3 bits (212), Expect = 6e-16, Method: Composition-based stats.
Identities = 58/155 (37%), Positives = 88/155 (56%), Gaps = 9/155 (5%)
Query: 17 KMVAGLAEAYLADS-ECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRD- 74
K V L E ++D E +LV+V + P+N +++ +D D+ + I+DC A+SR++ L+ +
Sbjct: 5 KRVTELIEEKISDRPELFLVEVKMLPNNKLIIHVDGDEGISIQDCAAISRHVGFHLEEEN 64
Query: 75 --VEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVG 132
+ Y LEV+S G+ P K RQY KNIG EV V L GE G L S DD G ++
Sbjct: 65 TIEKAYNLEVSSPGVGEPLKLKRQYDKNIGREVSVKLSGGEVKEGKLLSVDDKGIIIEA- 123
Query: 133 RPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRFK 167
VK +G K ++ + S ++ I TK +I FK
Sbjct: 124 -KVKEKGKKAQL---VETSVDFNSITETKVLISFK 154
>gi|86133948|ref|ZP_01052530.1| hypothetical protein MED152_04550 [Tenacibaculum sp. MED152]
gi|85820811|gb|EAQ41958.1| hypothetical protein MED152_04550 [Polaribacter dokdonensis MED152]
Length = 153
Score = 85.9 bits (211), Expect = 8e-16, Method: Composition-based stats.
Identities = 57/144 (39%), Positives = 82/144 (56%), Gaps = 3/144 (2%)
Query: 24 EAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVEDYELEVT 83
EA + +L+D+T++ +N I V +D D V + +C+ +SR +E LDR+ ED+ LEVT
Sbjct: 12 EALALNESLFLIDLTISENNKIQVTVDGDNGVPLSECIRISRNVEHNLDREEEDFSLEVT 71
Query: 84 STGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-VGRPVKPEGAKR 142
+ ++ P K RQY+KNI N + + + E+L G L ADD V+ R KP G K
Sbjct: 72 TPDISHPLKEKRQYVKNI-NRILKVKTAEEELEGTLVEADDEKIVLNWKAREPKPIG-KG 129
Query: 143 KVTVCTDESYLYDEIKYTKNIIRF 166
KVTV + Y EIK K I F
Sbjct: 130 KVTVKKSATLTYTEIKEAKVKIVF 153
>gi|146299388|ref|YP_001193979.1| protein of unknown function DUF150 [Flavobacterium johnsoniae
UW101]
gi|146153806|gb|ABQ04660.1| protein of unknown function DUF150 [Flavobacterium johnsoniae
UW101]
Length = 166
Score = 85.5 bits (210), Expect = 9e-16, Method: Composition-based stats.
Identities = 56/158 (35%), Positives = 86/158 (54%), Gaps = 3/158 (1%)
Query: 2 GASSPHLSIIEMIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCV 61
G S P I + K+ + EA L +L+D+ V+ I V +D D V ++DC+
Sbjct: 3 GLSVPSFYKIMTFKEKVNELITEALLEKPSIFLIDLAVSDSFKISVGLDGDNGVALQDCI 62
Query: 62 ALSRYIEAGLDRDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKS 121
+SR IE LDR+ +D+ LEV S G+ SP K +RQY KNIG + ++ + EK+ L
Sbjct: 63 DISRAIENNLDREEQDFSLEVASVGVGSPLKLIRQYKKNIGRTL-IVTTNNEKIEAELIE 121
Query: 122 ADDTGFVVTVGRPVKPEG-AKRKVTVCTDESYLYDEIK 158
A+D F++ + +P+ K K TV ++ Y EIK
Sbjct: 122 ANDV-FIILSWKAREPKKVGKGKETVQKEQQIPYTEIK 158
>gi|116246257|ref|XP_001230419.1| ENSANGP00000030013 [Anopheles gambiae str. PEST]
Length = 155
Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats.
Identities = 54/149 (36%), Positives = 82/149 (55%), Gaps = 1/149 (0%)
Query: 19 VAGLAEAYL-ADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVED 77
V L + YL A + +L+++ ++ D+ I V ID DQ+V ++DC+ +SR +E LDR+ D
Sbjct: 7 VQELVDQYLEAREDLFLIELKISADSNITVIIDGDQSVSLQDCLDVSRAVEFQLDREEHD 66
Query: 78 YELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVGRPVKP 137
+ L+V S GL+ P K RQ+ KNIG E++VLL K+ G LK A + + +
Sbjct: 67 FSLQVMSPGLSEPLKLPRQFAKNIGRELDVLLNDDTKIQGELKIAGEDSITLELKYRRPK 126
Query: 138 EGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
K K V D EIK +I+F
Sbjct: 127 LVGKGKEDVVEDRIIPLTEIKKALVVIKF 155
>gi|126663077|ref|ZP_01734075.1| hypothetical protein FBBAL38_06985 [Flavobacteria bacterium BAL38]
gi|126624735|gb|EAZ95425.1| hypothetical protein FBBAL38_06985 [Flavobacteria bacterium BAL38]
Length = 155
Score = 80.9 bits (198), Expect = 2e-14, Method: Composition-based stats.
Identities = 54/149 (36%), Positives = 81/149 (54%), Gaps = 1/149 (0%)
Query: 19 VAGLAEAYLADSE-CYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVED 77
V + +A LA+ E +L+D+++ N I V +D D V ++DC+ +SR +E LDR+ +D
Sbjct: 7 VQEVLDAALAEREQLFLIDLSINEANKISVILDGDSGVNLQDCIDISRAVENNLDREEQD 66
Query: 78 YELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVGRPVKP 137
+ LEV S G++SP K +RQY KNIG ++V S E++ L ADD +
Sbjct: 67 FSLEVASAGVSSPLKLVRQYKKNIGRTIKVKTISLEEIEAKLTMADDEKITLEWQDREPK 126
Query: 138 EGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
+ K K TV Y+ IK II F
Sbjct: 127 KIGKGKETVDKKLQLSYENIKEAIVIISF 155
>gi|83855690|ref|ZP_00949219.1| hypothetical protein CA2559_01345 [Croceibacter atlanticus
HTCC2559]
gi|83849490|gb|EAP87358.1| hypothetical protein CA2559_01345 [Croceibacter atlanticus
HTCC2559]
Length = 153
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 60/150 (40%), Positives = 83/150 (55%), Gaps = 4/150 (2%)
Query: 19 VAGLAEAYLADS-ECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVED 77
V L +A L D+ +L+D + DN I + ID D V +EDC+ +SR IE LDR+ ED
Sbjct: 6 VDKLLQAVLDDNPSLFLIDYNITNDNSIGILIDGDNGVTVEDCITVSRAIEHNLDREEED 65
Query: 78 YELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-VGRPVK 136
+ LEV S G T+ RQY+KNIG + V ++ EK+ L SADD + R K
Sbjct: 66 FSLEVGSIGATASLTLPRQYVKNIGRVLSVKTET-EKIEAELVSADDNEIALKWKAREPK 124
Query: 137 PEGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
P G K K TV + Y++I K +I+F
Sbjct: 125 PVG-KGKHTVEKEAVIPYEKIVEAKVMIKF 153
>gi|91217186|ref|ZP_01254148.1| hypothetical protein P700755_04298 [Psychroflexus torquis ATCC
700755]
gi|91184786|gb|EAS71167.1| hypothetical protein P700755_04298 [Psychroflexus torquis ATCC
700755]
Length = 157
Score = 76.3 bits (186), Expect = 5e-13, Method: Composition-based stats.
Identities = 52/139 (37%), Positives = 76/139 (54%), Gaps = 3/139 (2%)
Query: 29 DSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVEDYELEVTSTGLT 88
D +L+D+ ++ N I V ID D+ + I DCV +SR IE LDR++ED+ LEV S G T
Sbjct: 19 DEALFLIDLQISDKNNINVIIDGDRDLSITDCVNMSRAIEHNLDREIEDFSLEVASCGAT 78
Query: 89 SPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-VGRPVKPEGAKRKVTVC 147
P K RQ+ KNIG ++ V ++ E + L +A D + + KP G K K V
Sbjct: 79 EPLKFPRQFKKNIGRKLAVESQN-ESIEATLTAASDEDIELKWTAKEPKPIG-KGKHKVE 136
Query: 148 TDESYLYDEIKYTKNIIRF 166
+ Y++I + II F
Sbjct: 137 KEAKIAYEDIDKAQVIINF 155
>gi|120434525|ref|YP_860221.1| protein containing DUF150 [Gramella forsetii KT0803]
gi|117576675|emb|CAL65144.1| protein containing DUF150 [Gramella forsetii KT0803]
Length = 153
Score = 75.1 bits (183), Expect = 1e-12, Method: Composition-based stats.
Identities = 53/155 (34%), Positives = 83/155 (53%), Gaps = 3/155 (1%)
Query: 13 MIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
M++ K+ + + ++ +L+ + + N I + +D D+ V + DC+ +SR IE LD
Sbjct: 1 MLKEKVEKLAEKVFEENNSLFLISLDINSANHIKIVLDGDEGVSVNDCIMVSRGIEHNLD 60
Query: 73 RDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVT-V 131
R+ ED+ LEVTS G++ P RQY KNIG ++V ++ +K L SAD ++
Sbjct: 61 REEEDFSLEVTSAGVSEPLSMPRQYKKNIGRRLQVKTEN-DKFEADLLSADQNEIKLSWK 119
Query: 132 GRPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
R KP G K KVTV + Y +I K I F
Sbjct: 120 AREPKPVG-KGKVTVQKEVVLPYTDIVEAKVKITF 153
>gi|88802099|ref|ZP_01117627.1| hypothetical protein PI23P_05532 [Polaribacter irgensii 23-P]
gi|88782757|gb|EAR13934.1| hypothetical protein PI23P_05532 [Polaribacter irgensii 23-P]
Length = 153
Score = 73.6 bits (179), Expect = 4e-12, Method: Composition-based stats.
Identities = 42/123 (34%), Positives = 71/123 (57%), Gaps = 1/123 (0%)
Query: 24 EAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVEDYELEVT 83
EA + YL++++++ ++ I V +D D V + +C+ +S+ I+A LDR+ ED+ LEVT
Sbjct: 12 EALALNDSLYLIELSISVNDKIKVVVDGDNGVPLSECIRISKNIDANLDRESEDFSLEVT 71
Query: 84 STGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVGRPVKPEGAKRK 143
+ + P K RQYLKN+ N + + + E+ G L +AD+ ++ E K K
Sbjct: 72 TPDIAHPLKVKRQYLKNL-NRILKVKTAAEEFEGTLTAADEDKIILQWKAREPKEIGKGK 130
Query: 144 VTV 146
VTV
Sbjct: 131 VTV 133
>gi|150024571|ref|YP_001295397.1| hypothetical protein FP0468 [Flavobacterium psychrophilum JIP02/86]
gi|149771112|emb|CAL42579.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 154
Score = 72.0 bits (175), Expect = 1e-11, Method: Composition-based stats.
Identities = 51/159 (32%), Positives = 82/159 (51%), Gaps = 5/159 (3%)
Query: 8 LSIIEMIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYI 67
++ E ++ + GLAE +L+D+ + N I++ +D D V ++DC+ +SR I
Sbjct: 1 MAFKEKVKELLEQGLAEY----PNLFLIDLNINDSNKIIITLDGDNGVQLQDCINISRSI 56
Query: 68 EAGLDRDVEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGF 127
+ LDR+ D+ LEV S G++ P K +RQY KNIG +++ + + L+ +D
Sbjct: 57 DNNLDREEVDFALEVASAGVSLPLKLVRQYKKNIGRTLKIKTATQTIEALLLEVSDQDIT 116
Query: 128 VVTVGRPVKPEGAKRKVTVCTDESYLYDEIKYTKNIIRF 166
V R K G K K TV +E Y I+ II F
Sbjct: 117 VEWSSREPKKIG-KGKETVVHNEKIAYAAIQEAIVIIIF 154
>gi|156319550|ref|XP_001618133.1| hypothetical protein NEMVEDRAFT_v1g225486 [Nematostella vectensis]
gi|156197569|gb|EDO26033.1| predicted protein [Nematostella vectensis]
Length = 271
Score = 72.0 bits (175), Expect = 1e-11, Method: Composition-based stats.
Identities = 36/85 (42%), Positives = 53/85 (62%)
Query: 22 LAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVEDYELE 81
L A L +L+D+ + N I + +D D V ++DCV +SR +E LDR+ D+ LE
Sbjct: 11 LDAAILERDHLFLIDLKIDEANKINIVLDGDNGVSLQDCVDISRLVEQDLDREENDFSLE 70
Query: 82 VTSTGLTSPFKTLRQYLKNIGNEVE 106
V S GL+SP K +RQY KNIG +++
Sbjct: 71 VASAGLSSPLKLVRQYKKNIGRKLK 95
>gi|124002856|ref|ZP_01687708.1| 15 kDa protein [Microscilla marina ATCC 23134]
gi|123992084|gb|EAY31471.1| 15 kDa protein [Microscilla marina ATCC 23134]
Length = 152
Score = 68.9 bits (167), Expect = 1e-10, Method: Composition-based stats.
Identities = 50/144 (34%), Positives = 77/144 (53%), Gaps = 14/144 (9%)
Query: 29 DSECYLVDVTVA---PDNLIVVEIDHDQAVGIEDCVALSRYIEAGLDRD--VED-YELEV 82
+ E +L+DV + P + V ID DQ V I+ C LSR++ ++ + +E Y LEV
Sbjct: 16 EPEYFLIDVILKNQKPKAKLTVLIDGDQGVSIDRCATLSRWLGKYIEEENLIEGAYTLEV 75
Query: 83 TSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVGRPVKPEGAKR 142
+S G+ P K RQY+KNIG +V+V L G +G L+ +D V ++PE K+
Sbjct: 76 SSPGVDLPLKNQRQYIKNIGRKVKVSLNEGGNKTGVLEKVEDKLIV------IQPE--KK 127
Query: 143 KVTVCTDESYLYDEIKYTKNIIRF 166
+ E +D+I TK +I F
Sbjct: 128 GKIIPEKEEIPFDQIMKTKVMISF 151
>gi|126647232|ref|ZP_01719742.1| hypothetical protein ALPR1_20868 [Algoriphagus sp. PR1]
gi|126577280|gb|EAZ81528.1| hypothetical protein ALPR1_20868 [Algoriphagus sp. PR1]
Length = 157
Score = 66.6 bits (161), Expect = 5e-10, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 66/116 (56%), Gaps = 6/116 (5%)
Query: 22 LAEAYLADSECYLVDVTVAPDN---LIVVEIDHDQAVGIEDCVALSRYIEAGLDRD---V 75
+ E +L D ++V+V + P + ++ + ID DQ + ++ C +SR + L+
Sbjct: 11 IVEKHLPDESHFVVEVNLVPKSGKTVLSILIDADQGLNVQTCANVSRAVAEELEAKELMS 70
Query: 76 EDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTV 131
E Y LEV+S G+ P + RQ+ KNIG E++VLL SG++L G L D TG + V
Sbjct: 71 EAYILEVSSPGVDYPLSSRRQFQKNIGRELKVLLTSGQELKGKLLEVDTTGVKMLV 126
>gi|146283632|ref|YP_001173785.1| hypothetical protein PST_3312 [Pseudomonas stutzeri A1501]
gi|145571837|gb|ABP80943.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
Length = 182
Score = 55.8 bits (133), Expect = 8e-07, Method: Composition-based stats.
Identities = 39/131 (29%), Positives = 72/131 (54%), Gaps = 11/131 (8%)
Query: 9 SIIEMIEAKMVAGLAEAYLADSECYLVD-VTVAPDNLIVVEIDHDQAVGIEDCVALSRYI 67
S +E ++A ++A + EA +C+ ++ ++ +L+ V IDH + I+DC +SR +
Sbjct: 33 SKLEQLQA-LLAPVVEAL--GYQCWGIEFISQGRHSLLRVYIDHANGILIDDCEKVSRQL 89
Query: 68 EAGLDRD---VEDYELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSG----EKLSGCLK 120
LD + DY LEV+S G+ P T+ QY+ ++G++V++ L+S G L+
Sbjct: 90 SGVLDVEDPISVDYTLEVSSPGMDRPLFTIEQYVAHVGDQVKIKLRSPFEGRRNFQGLLR 149
Query: 121 SADDTGFVVTV 131
++ VV V
Sbjct: 150 GVEEQDVVVLV 160
>gi|83816343|ref|YP_445896.1| Uncharacterized BCR, YhbC family COG0779 [Salinibacter ruber DSM
13855]
gi|83757737|gb|ABC45850.1| Uncharacterized BCR, YhbC family COG0779 [Salinibacter ruber DSM
13855]
Length = 217
Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats.
Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 13/120 (10%)
Query: 16 AKMVAGLAEAYLADSECYLVDVTV---APDNLIVVEIDHDQAVGIEDCVALSRYIEAGLD 72
A V+GL E + ++ +LVDV V ++ V ID ++ VG +D +S+ E G
Sbjct: 71 ADRVSGLTEEVIVGTDYFLVDVEVRGHKGTRVVEVYIDSEEEVGHDDLALISK--EIGFL 128
Query: 73 RDVED-----YELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSG---EKLSGCLKSADD 124
DVED Y+LE++S G+ P QY KN+G + V +S E + G L ADD
Sbjct: 129 LDVEDVVDGSYKLELSSPGIKRPLTMPAQYRKNVGRTLRVRFESDGDEEIVVGDLTDADD 188
>gi|110639789|ref|YP_679999.1| hypothetical protein CHU_3420 [Cytophaga hutchinsonii ATCC 33406]
gi|110282470|gb|ABG60656.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 167
Score = 54.3 bits (129), Expect = 3e-06, Method: Composition-based stats.
Identities = 48/150 (32%), Positives = 74/150 (49%), Gaps = 14/150 (9%)
Query: 30 SECYLVDVTVAPDNL---IVVEIDHDQAVGIEDCVALSRYI-----EAGLDRDVEDYELE 81
+E +LVD+ + + I+V +D D + I++C SR + E L D Y LE
Sbjct: 20 AELFLVDIAFSGTHSRRKILVILDKDSGILIDECGEFSRALGNLIEENNLFGD-NAYVLE 78
Query: 82 VTSTGLTSPFKTLRQYLKNIGNEVEVLLKSGEKLSGCLKSADDTGFVVTVGRPVKPEGAK 141
V+S G+ P RQY + IGN + LL G + L+S + G VV + P K + +
Sbjct: 79 VSSPGMDRPLLVSRQYKRRIGNTLSFLLNDGTQFDAVLESVSEEG-VVVMPAPQKVKKSN 137
Query: 142 RK---VTVCTDESYL-YDEIKYTKNIIRFK 167
+K V V + L ++EIK I+ FK
Sbjct: 138 KKEAVVDVAIEPRKLRFEEIKKCNLIVSFK 167
>gi|114562166|ref|YP_749679.1| protein of unknown function DUF150 [Shewanella frigidimarina NCIMB
400]
gi|114333459|gb|ABI70841.1| protein of unknown function DUF150 [Shewanella frigidimarina NCIMB
400]
Length = 165
Score = 53.9 bits (128), Expect = 3e-06, Method: Composition-based stats.
Identities = 36/98 (36%), Positives = 48/98 (48%), Gaps = 11/98 (11%)
Query: 43 NLIVVEIDHDQAVGIEDCVALSRYIEAGLDRDVED-----YELEVTSTGLTSPFKTLRQY 97
+++ V IDH+ + IEDC SR + A + DVED Y LEV+S G+ P T QY
Sbjct: 49 SILRVFIDHENGINIEDCAEASRQVSAVM--DVEDPISTEYTLEVSSPGVDRPLFTAEQY 106
Query: 98 LKNIGNEVEVLLK----SGEKLSGCLKSADDTGFVVTV 131
IG E +V L L G + S D +TV
Sbjct: 107 RAYIGEETKVQLTMPVAGSRNLKGVISSVDGQMLTITV 144
>gi|116214708|ref|ZP_01480944.1| hypothetical protein VchoR_02003207 [Vibrio cholerae RC385]
gi|150419842|gb|EDN12154.1| conserved hypothetical protein [Vibrio cholerae RC385]
Length = 151
Score = 53.9 bits (128), Expect = 4e-06, Method: Composition-based stats.
Identities = 44/131 (33%), Positives = 62/131 (47%), Gaps = 17/131 (12%)
Query: 10 IIEMIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEA 69
+ EM+EA +VA E L V + + + IDH+ + +EDC +SR + A
Sbjct: 8 LTEMLEAPVVAAGYEL------VGLEFVRAGQHSTLRIFIDHENGITVEDCAEVSRQVSA 61
Query: 70 GLDRDVED-----YELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSG----EKLSGCLK 120
LD VED Y LEV+S GL P Y + IG+EV ++LK K G ++
Sbjct: 62 VLD--VEDPISVVYNLEVSSPGLEKPLFKAAHYEQFIGHEVSIVLKMAVGNRRKWKGVIQ 119
Query: 121 SADDTGFVVTV 131
S D V V
Sbjct: 120 SIDGETVAVMV 130
>gi|15640661|ref|NP_230290.1| hypothetical protein VC0641 [Vibrio cholerae O1 biovar eltor str.
N16961]
gi|116188476|ref|ZP_01478249.1| hypothetical protein VchoM_02002572 [Vibrio cholerae MO10]
gi|116218419|ref|ZP_01484117.1| hypothetical protein VchoV5_02003320 [Vibrio cholerae V51]
gi|121587845|ref|ZP_01677602.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
gi|121728393|ref|ZP_01681421.1| conserved hypothetical protein [Vibrio cholerae V52]
gi|147673580|ref|YP_001216135.1| hypothetical protein VC0395_A0172 [Vibrio cholerae O395]
gi|153215845|ref|ZP_01950177.1| conserved hypothetical protein [Vibrio cholerae 1587]
gi|153801997|ref|ZP_01956583.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
gi|153818682|ref|ZP_01971349.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
gi|153822534|ref|ZP_01975201.1| conserved hypothetical protein [Vibrio cholerae B33]
gi|153826540|ref|ZP_01979207.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
gi|153829155|ref|ZP_01981822.1| conserved hypothetical protein [Vibrio cholerae 623-39]
gi|34223077|sp|Q9KU82|Y641_VIBCH UPF0090 protein VC_0641
gi|9655077|gb|AAF93807.1| conserved hypothetical protein [Vibrio cholerae O1 biovar eltor
str. N16961]
gi|121547881|gb|EAX57965.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
gi|121629327|gb|EAX61759.1| conserved hypothetical protein [Vibrio cholerae V52]
gi|124114562|gb|EAY33382.1| conserved hypothetical protein [Vibrio cholerae 1587]
gi|124122456|gb|EAY41199.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
gi|125617769|gb|EAZ46323.1| conserved hypothetical protein [Vibrio cholerae MO10]
gi|126510762|gb|EAZ73356.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
gi|126519952|gb|EAZ77175.1| conserved hypothetical protein [Vibrio cholerae B33]
gi|146315463|gb|ABQ20002.1| conserved hypothetical protein [Vibrio cholerae O395]
gi|148875344|gb|EDL73479.1| conserved hypothetical protein [Vibrio cholerae 623-39]
gi|149739720|gb|EDM53927.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
gi|150421888|gb|EDN13867.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
Length = 151
Score = 53.5 bits (127), Expect = 4e-06, Method: Composition-based stats.
Identities = 44/131 (33%), Positives = 62/131 (47%), Gaps = 17/131 (12%)
Query: 10 IIEMIEAKMVAGLAEAYLADSECYLVDVTVAPDNLIVVEIDHDQAVGIEDCVALSRYIEA 69
+ EM+EA +VA E L V + + + IDH+ + +EDC +SR + A
Sbjct: 8 LTEMLEAPVVAAGYEL------VGLEFVRAGQHSTLRIFIDHENGITVEDCAEVSRQVSA 61
Query: 70 GLDRDVED-----YELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKSG----EKLSGCLK 120
LD VED Y LEV+S GL P Y + IG+EV ++LK K G ++
Sbjct: 62 VLD--VEDPISVVYNLEVSSPGLERPLFKAAHYEQFIGHEVSIVLKMAVGNRRKWKGVIQ 119
Query: 121 SADDTGFVVTV 131
S D V V
Sbjct: 120 SIDGETVAVMV 130
>gi|67158742|ref|ZP_00419603.1| Protein of unknown function DUF150 [Azotobacter vinelandii AvOP]
gi|67084615|gb|EAM04097.1| Protein of unknown function DUF150 [Azotobacter vinelandii AvOP]
Length = 182
Score = 53.5 bits (127), Expect = 5e-06, Method: Composition-based stats.
Identities = 38/109 (34%), Positives = 63/109 (57%), Gaps = 11/109 (10%)
Query: 9 SIIEMIEAKMVAGLAEAYLADSECYLVD-VTVAPDNLIVVEIDHDQAVGIEDCVALSRYI 67
S +E ++A ++A + EA EC+ ++ ++ +L+ V IDH + I+DC +SR I
Sbjct: 33 SKLEQLQA-LLAPVVEAL--GYECWGIEFLSQGRHSLLRVYIDHADGILIDDCEKVSRQI 89
Query: 68 EAGLDRDVED-----YELEVTSTGLTSPFKTLRQYLKNIGNEVEVLLKS 111
L DVED Y LEV+S G+ P TL Q+++ G +V++ L+S
Sbjct: 90 SGVL--DVEDPISSEYTLEVSSPGMDRPLFTLEQFVRCAGEQVKIRLRS 136
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.316 0.135 0.377
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 579,140,727
Number of Sequences: 5470121
Number of extensions: 22580319
Number of successful extensions: 58076
Number of sequences better than 1.0e-05: 42
Number of HSP's better than 0.0 without gapping: 26
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 58036
Number of HSP's gapped (non-prelim): 43
length of query: 167
length of database: 1,894,087,724
effective HSP length: 123
effective length of query: 44
effective length of database: 1,221,262,841
effective search space: 53735565004
effective search space used: 53735565004
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 124 (52.4 bits)