BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PG0379
(626 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540237|ref|NP_904716.1| hypothetical protein PG0414 [P... 1224 0.0
gi|150008993|ref|YP_001303736.1| hypothetical protein BDI_2... 425 e-117
gi|154494372|ref|ZP_02033692.1| hypothetical protein PARMER... 417 e-115
gi|153809077|ref|ZP_01961745.1| hypothetical protein BACCAC... 410 e-112
gi|29349257|ref|NP_812760.1| hypothetical protein BT_3849 [... 407 e-111
gi|156112206|gb|EDO13951.1| hypothetical protein BACOVA_003... 405 e-111
gi|156859578|gb|EDO53009.1| hypothetical protein BACUNI_030... 399 e-109
gi|150003017|ref|YP_001297761.1| hypothetical protein BVU_0... 397 e-109
gi|53715355|ref|YP_101347.1| hypothetical protein BF4071 [B... 396 e-108
gi|110639393|ref|YP_679601.1| hypothetical protein CHU_3019... 233 2e-59
gi|88712817|ref|ZP_01106902.1| hypothetical protein FB2170_... 231 1e-58
gi|89891372|ref|ZP_01202878.1| conserved hypothetical prote... 231 2e-58
gi|124006426|ref|ZP_01691260.1| conserved hypothetical prot... 230 2e-58
gi|146300347|ref|YP_001194938.1| OstA family protein [Flavo... 230 2e-58
gi|150025820|ref|YP_001296646.1| hypothetical protein FP177... 227 2e-57
gi|83856772|ref|ZP_00950301.1| hypothetical protein CA2559_... 226 4e-57
gi|149370671|ref|ZP_01890360.1| hypothetical protein SCB49_... 226 5e-57
gi|86133424|ref|ZP_01052006.1| hypothetical protein MED152_... 225 6e-57
gi|120435129|ref|YP_860815.1| conserved hypothetical protei... 224 1e-56
gi|88802837|ref|ZP_01118364.1| hypothetical protein PI23P_0... 222 6e-56
gi|88806998|ref|ZP_01122513.1| hypothetical protein RB2501_... 219 4e-55
gi|91215797|ref|ZP_01252767.1| hypothetical protein P700755... 219 5e-55
gi|126662025|ref|ZP_01733024.1| hypothetical protein FBBAL3... 218 7e-55
gi|126646936|ref|ZP_01719446.1| hypothetical protein ALPR1_... 216 5e-54
gi|86143092|ref|ZP_01061514.1| hypothetical protein MED217_... 209 5e-52
gi|86132235|ref|ZP_01050830.1| hypothetical protein MED134_... 206 3e-51
gi|149280297|ref|ZP_01886419.1| hypothetical protein PBAL39... 166 4e-39
gi|21674905|ref|NP_662970.1| hypothetical protein CT2096 [C... 58 2e-06
>gi|34540237|ref|NP_904716.1| hypothetical protein PG0414 [Porphyromonas gingivalis W83]
gi|34396549|gb|AAQ65615.1| hypothetical protein PG_0414 [Porphyromonas gingivalis W83]
Length = 626
Score = 1224 bits (3166), Expect = 0.0, Method: Composition-based stats.
Identities = 626/626 (100%), Positives = 626/626 (100%)
Query: 1 MRKGEKRESRLGSRQLGAIILIVTLSFSALASLQGPPPKGSKGKTHVILEHADELRYDRL 60
MRKGEKRESRLGSRQLGAIILIVTLSFSALASLQGPPPKGSKGKTHVILEHADELRYDRL
Sbjct: 1 MRKGEKRESRLGSRQLGAIILIVTLSFSALASLQGPPPKGSKGKTHVILEHADELRYDRL 60
Query: 61 YNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDG 120
YNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDG
Sbjct: 61 YNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDG 120
Query: 121 NIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTT 180
NIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTT
Sbjct: 121 NIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTT 180
Query: 181 SDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDV 240
SDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDV
Sbjct: 181 SDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDV 240
Query: 241 GILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKK 300
GILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKK
Sbjct: 241 GILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKK 300
Query: 301 DYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIAD 360
DYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIAD
Sbjct: 301 DYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIAD 360
Query: 361 SMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDVLTKALAVRRIDSVMY 420
SMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDVLTKALAVRRIDSVMY
Sbjct: 361 SMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDVLTKALAVRRIDSVMY 420
Query: 421 DQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQL 480
DQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQL
Sbjct: 421 DQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQL 480
Query: 481 KKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSVLQVHRSLSD 540
KKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSVLQVHRSLSD
Sbjct: 481 KKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSVLQVHRSLSD 540
Query: 541 LRRFSGALAALRAYTALAEEERKDSLTIAALQTDSIPPTPAAGKEATDPTDRLSPYIARP 600
LRRFSGALAALRAYTALAEEERKDSLTIAALQTDSIPPTPAAGKEATDPTDRLSPYIARP
Sbjct: 541 LRRFSGALAALRAYTALAEEERKDSLTIAALQTDSIPPTPAAGKEATDPTDRLSPYIARP 600
Query: 601 TTDTKEEGFFDLFFTPFIFNREKLWD 626
TTDTKEEGFFDLFFTPFIFNREKLWD
Sbjct: 601 TTDTKEEGFFDLFFTPFIFNREKLWD 626
>gi|150008993|ref|YP_001303736.1| hypothetical protein BDI_2389 [Parabacteroides distasonis ATCC
8503]
gi|149937417|gb|ABR44114.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 533
Score = 425 bits (1092), Expect = e-117, Method: Composition-based stats.
Identities = 224/482 (46%), Positives = 317/482 (65%), Gaps = 6/482 (1%)
Query: 44 KTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSM 103
KT V LEHA+ L +D+ N + Q L G+V +H+ + M CDSA+ ++ N+ EAF V M
Sbjct: 40 KTKVFLEHANTLSFDKERNAEAQVLNGDVCFRHDSSYMYCDSAYFFEQTNSLEAFSNVRM 99
Query: 104 QQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIV 163
+QGDT+ ++ YL YDGN + A LR VR+EN TLFTDSL+Y+R+ ++GYYF+GG IV
Sbjct: 100 EQGDTLFVYGNYLFYDGNTQIAYLRENVRMENGQVTLFTDSLNYERIPDIGYYFDGGLIV 159
Query: 164 DSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSD 223
DSLN L+S YG+YSP+T AIF D+V LEN+ +T+ ++ LHYNTD+KI+ ILGP+ + SD
Sbjct: 160 DSLNQLSSFYGQYSPSTKLAIFNDSVRLENEQFTLYSDTLHYNTDSKIATILGPSIIVSD 219
Query: 224 SGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDT 283
SG I S+RG YD+ + +LLDRS V S G + LTGDSI Y+R GFGEAFGNM L DT
Sbjct: 220 SGTIYSSRGWYDTVNNTSLLLDRSQVVS--GDRILTGDSIAYNRELGFGEAFGNMSLQDT 277
Query: 284 VNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIAR 343
L G+YG+Y+EK +YAFAT + ++FS+ DTL+ DTL+M T V R +
Sbjct: 278 AQHVMLEGQYGFYNEKSEYAFATDSARFLEFSQGDTLFLHGDTLKMTT---VDSLYREVK 334
Query: 344 GYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYV 403
Y VR YRTD+Q + DSMQ+++RDS+LY+Y +PI+WNE Q+ GDTI + S+D+
Sbjct: 335 AYYGVRFYRTDMQGVCDSMQFNTRDSILYMYTDPIVWNEQYQIYGDTILIFMNDSSIDFA 394
Query: 404 DVLTKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLM 463
V A A+++IDS ++QL G ++AY + +V QI V GNAE I + K M
Sbjct: 395 HVKQFAFAIQQIDSTAFNQLKGNDLKAYFEGQVVNQIDVSGNAESIFFPLEKDGSM-VGM 453
Query: 464 NRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDL 523
N ++ + ++ +L K+ + +G PI L PD + L F W + +RPK K+D+
Sbjct: 454 NETKSGFLTIWLKDNKLDKLKIWPTPTGTMTPIPDLKPDQKYLKDFYWFDYIRPKDKDDI 513
Query: 524 FR 525
++
Sbjct: 514 YQ 515
>gi|154494372|ref|ZP_02033692.1| hypothetical protein PARMER_03727 [Parabacteroides merdae ATCC
43184]
gi|154085816|gb|EDN84861.1| hypothetical protein PARMER_03727 [Parabacteroides merdae ATCC
43184]
Length = 559
Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats.
Identities = 220/488 (45%), Positives = 317/488 (64%), Gaps = 10/488 (2%)
Query: 37 PPKGSKGKTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFE 96
PPK KT V L H++ L +D+ PD Q L G+V +H+ + M CDSA+ ++ N+ E
Sbjct: 63 PPK----KTKVYLIHSNTLSFDKAVKPDAQILNGDVCFRHDSSYMYCDSAYFFEQTNSLE 118
Query: 97 AFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYY 156
AF V M+QGDT+ ++ YL YDGN + A LR VR+EN TLFTDSL+Y+R+ N+GYY
Sbjct: 119 AFSNVRMEQGDTLFVYGDYLFYDGNTQVAYLRENVRMENGQVTLFTDSLNYERIPNIGYY 178
Query: 157 FEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILG 216
FEGG IVDSLN L+S YG+YSP T A+F D+V +EN D+T+ ++ LHY+T++K++ ILG
Sbjct: 179 FEGGLIVDSLNQLSSFYGQYSPETKLAVFNDSVQVENPDFTLYSDTLHYDTESKVATILG 238
Query: 217 PTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFG 276
P+ + SDSG I ++RG YD+ + +LLD+S V S G K L GDSIFY+R TG GE +G
Sbjct: 239 PSVIVSDSGTIHTSRGWYDTVNNTSLLLDQSQVES--GEKILIGDSIFYNRDTGMGEVYG 296
Query: 277 NMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVP 336
NM L DT +L GEYGYY+E+ YAFAT + +++S+ DTL+ ADTL+M+T V
Sbjct: 297 NMSLIDTAQHVTLQGEYGYYNEQTGYAFATDSARFLEYSQGDTLFLHADTLQMVTVDSV- 355
Query: 337 EDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFR 396
R + Y VR YR D+Q + DSMQ+++RDS+LY+Y P++WNE QL GDTI
Sbjct: 356 --YREIKAYYGVRFYRIDMQGVCDSMQFNTRDSVLYMYTEPVLWNEQYQLYGDTIAIYMN 413
Query: 397 NDSLDYVDVLTKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKR 456
+ +++Y V+ A A + +DS Y+QL G ++AY + +R+I V+GNAE+ Y K
Sbjct: 414 DSTIEYAHVIQFAFAAQHVDSSYYNQLKGNDLKAYFEGQELRRIDVNGNAELNYYPLEKD 473
Query: 457 SKRWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVR 516
+ MN + + +L+++ L ASG PI L PD + L F W + +R
Sbjct: 474 GSK-VGMNNAKGSHLSMWIRNNKLERMGLYVNASGTLTPIPDLKPDQKMLKDFYWFDYLR 532
Query: 517 PKSKEDLF 524
PK+++D++
Sbjct: 533 PKNRDDIY 540
>gi|153809077|ref|ZP_01961745.1| hypothetical protein BACCAC_03385 [Bacteroides caccae ATCC 43185]
gi|149128410|gb|EDM19629.1| hypothetical protein BACCAC_03385 [Bacteroides caccae ATCC 43185]
Length = 572
Score = 410 bits (1054), Expect = e-112, Method: Composition-based stats.
Identities = 207/482 (42%), Positives = 320/482 (66%), Gaps = 4/482 (0%)
Query: 44 KTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSM 103
KT V L HADE + D+L PDVQ L+GNV ++H+ M CDSA + ++ N+ EAF V M
Sbjct: 64 KTKVYLLHADEGQADKLARPDVQVLIGNVKMRHDSMYMYCDSALIFEKTNSVEAFSNVRM 123
Query: 104 QQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIV 163
+QGDT+ ++ YL+YDG + A+LR V++ NR+ TL TDSL+YDR+ +LGYYFEGG+++
Sbjct: 124 EQGDTLFIYGDYLYYDGMTQIAQLRENVKMINRNTTLLTDSLNYDRLYDLGYYFEGGTLM 183
Query: 164 DSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSD 223
D N LTS +GEYSP T ++F +V L N + + ++ L YNT+ KI+ ILGP+ + SD
Sbjct: 184 DEENVLTSDWGEYSPATKQSVFNHDVKLVNPKFVLTSDTLRYNTENKIAVILGPSNIVSD 243
Query: 224 SGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDT 283
+ +I S RG Y++ T+ LLDRSI+ +N K+L GDS+FYDR G+GEAF N+ +TDT
Sbjct: 244 NNHIYSERGFYNTMTEQAELLDRSIL--TNQGKKLVGDSLFYDRLVGYGEAFDNVKMTDT 301
Query: 284 VNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPED-RRIA 342
+N++ L G+Y +Y+E D AFAT+R+ ID+S+ D+L+ DTL+M++ + R+
Sbjct: 302 INKNMLTGDYCFYNELTDSAFATKRAVAIDYSQGDSLFMHGDTLQMVSYNLNTDSLYRLM 361
Query: 343 RGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDY 402
+ Y VR+YRTDVQ + DS+ Y+S+DS + +Y +PI+WNE QL G+ I+ + ++D+
Sbjct: 362 KAYHKVRMYRTDVQGVCDSLVYNSKDSCMTMYVDPILWNEGQQLLGEQIKIYMNDSTIDW 421
Query: 403 VDVLTKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYL 462
++ +AL V DS+ Y+Q++G+ ++AY ++ +R I+V GN Y + K S
Sbjct: 422 AHIINQALTVEMKDSIHYNQVSGKEMKAYFENGDMRHIEVIGNVLTAFYPEEKDSTMTGF 481
Query: 463 MNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKED 522
N +E + ++ +++K L G ++G YP+ + PD RL +F W + VRP +KED
Sbjct: 482 -NCLEGSLLHLYMKDKKMEKGLFVGKSNGTMYPMDQIPPDKLRLPTFSWFDYVRPLNKED 540
Query: 523 LF 524
+F
Sbjct: 541 IF 542
>gi|29349257|ref|NP_812760.1| hypothetical protein BT_3849 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341165|gb|AAO78954.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 576
Score = 407 bits (1045), Expect = e-111, Method: Composition-based stats.
Identities = 204/482 (42%), Positives = 322/482 (66%), Gaps = 4/482 (0%)
Query: 44 KTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSM 103
KT V L HA++ + D+L PDVQ L+GNV ++H+ M CDSA + ++ N+ EAF V M
Sbjct: 68 KTKVYLLHANQGQADKLARPDVQVLIGNVKLRHDSMYMFCDSALIYEKTNSVEAFSNVRM 127
Query: 104 QQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIV 163
+QGDT+ ++ YL+YDG + A++R V++ NR+ TL TDSL+YDR+ +LGYYFEGG+++
Sbjct: 128 EQGDTLFIYGDYLYYDGMTQIAQIRENVKMINRNTTLLTDSLNYDRLYDLGYYFEGGTLM 187
Query: 164 DSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSD 223
D N LTS +GEYSP T ++F +V L N + + ++ L YNT +KI+ ILGP+ + SD
Sbjct: 188 DEENVLTSDWGEYSPATKQSVFNHDVKLVNPKFVLTSDTLKYNTFSKIATILGPSNIVSD 247
Query: 224 SGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDT 283
+ +I S RG Y++ ++ LLDRSI+ +N K+L GDS+FYDR+ G+GEAF N+ +TDT
Sbjct: 248 NNHIYSERGFYNTLSEQAELLDRSIL--TNEGKKLIGDSLFYDRKVGYGEAFDNIRMTDT 305
Query: 284 VNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDR-RIA 342
+N++ L G+Y +Y+E D AFAT+R+ ID+S+ D+L+ DTL++I+ + R+
Sbjct: 306 INKNMLTGDYCFYNELADSAFATKRAVAIDYSQGDSLFMHGDTLQLISYNLNTDSVFRLM 365
Query: 343 RGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDY 402
+ Y VR+YRTDVQ + DS+ Y+S+DS L +Y +PI+WNE QL G+ I+ + ++++
Sbjct: 366 KAYHKVRMYRTDVQGVCDSLVYNSKDSCLTMYTDPILWNEGQQLLGEEIKIYMNDSTINW 425
Query: 403 VDVLTKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYL 462
++ +AL V DSV Y+Q++G+ ++AY ++ +R I+V GN Y + K S
Sbjct: 426 AHIINQALTVEMKDSVHYNQVSGKEMKAYFENGDMRHIEVIGNVMTAFYPEEKDSTMTGF 485
Query: 463 MNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKED 522
N +E + +E +++K + G ++G YP+ + PD RL++F W + VRP +KED
Sbjct: 486 -NNMEGSVLHLYMKEKKMEKGMFVGKSNGTLYPMDQIPPDKLRLSTFAWFDYVRPLNKED 544
Query: 523 LF 524
+F
Sbjct: 545 IF 546
>gi|156112206|gb|EDO13951.1| hypothetical protein BACOVA_00342 [Bacteroides ovatus ATCC 8483]
Length = 587
Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats.
Identities = 205/500 (41%), Positives = 325/500 (65%), Gaps = 4/500 (0%)
Query: 44 KTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSM 103
KT V L HADE + D+L PDVQ L+GNV ++H+ M CDSA + ++ N+ EAF V M
Sbjct: 79 KTKVYLLHADEGQADKLARPDVQVLIGNVKLRHDSMYMYCDSALIFEKTNSVEAFSNVRM 138
Query: 104 QQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIV 163
+QGDT+ ++ YL+YDG + A+LR V++ NR+ TL TDSL+YDR+ +LGYYFEGG+++
Sbjct: 139 EQGDTLFIYGDYLYYDGMTQIAQLRENVKMINRNTTLLTDSLNYDRLYDLGYYFEGGTLM 198
Query: 164 DSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSD 223
D N LTS +GEYSP T ++F +V L N + + ++ L YNT+ KI+ ILGP+ + SD
Sbjct: 199 DEENVLTSDWGEYSPATKQSVFNHDVKLVNPKFVLTSDTLRYNTENKIAVILGPSNIVSD 258
Query: 224 SGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDT 283
+ +I S RG Y++ T+ LLDRS++ +N K+L GDS+FYDR G+GEAF N+ +TD+
Sbjct: 259 NNHIYSERGFYNTLTEQAELLDRSVL--TNQGKKLVGDSLFYDRIIGYGEAFDNVKMTDS 316
Query: 284 VNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPED-RRIA 342
+N++ L G+Y +Y+E D AFAT+R+ ID+S+ D+L+ DTL+M++ + R+
Sbjct: 317 INKNMLTGDYCFYNELTDSAFATKRAVAIDYSQGDSLYMHGDTLQMVSYNLNTDSLYRLM 376
Query: 343 RGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDY 402
+ Y VR+YRTDVQ + DS+ Y+S+DS + +Y +PI+WN+ QL G+ I+ + ++D+
Sbjct: 377 KAYHKVRMYRTDVQGVCDSLVYNSKDSCMTMYTDPILWNDGQQLLGEQIKIYMNDSTIDW 436
Query: 403 VDVLTKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYL 462
++ +AL V DS+ Y+Q++G+ ++AY + +R I+V GN Y + K S
Sbjct: 437 AHIINQALTVEMKDSIHYNQVSGKEMKAYFVNGDMRHIEVIGNVLTAFYPEEKDSTMTGF 496
Query: 463 MNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKED 522
N +E + ++ +++K L G ++G YP+ + PD RL +F W + VRP +K+D
Sbjct: 497 -NCLEGSMLHLYMKDKRMEKGLFIGKSNGTMYPMDQIPPDKLRLPTFAWFDYVRPLNKDD 555
Query: 523 LFRRQPDSVLQVHRSLSDLR 542
+F + + + +D R
Sbjct: 556 IFNWRSKRAGETLKPTTDRR 575
>gi|156859578|gb|EDO53009.1| hypothetical protein BACUNI_03022 [Bacteroides uniformis ATCC 8492]
Length = 573
Score = 399 bits (1026), Expect = e-109, Method: Composition-based stats.
Identities = 204/479 (42%), Positives = 314/479 (65%), Gaps = 3/479 (0%)
Query: 47 VILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQG 106
V L +ADE + D+ PDVQ L+G+V +KH+ M CDSA + ++ N+ EAFG V M+QG
Sbjct: 65 VDLLYADEAQADKQLRPDVQVLIGSVRMKHDSMYMFCDSALIFEKINSVEAFGNVRMEQG 124
Query: 107 DTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSL 166
DT+ ++ YL+YDG + A LR VR+ NR+ L TDSL+YDR+ +LGYYFEGG++ D
Sbjct: 125 DTLFIYGDYLYYDGMSQLAMLRENVRMINRNTVLTTDSLNYDRLYDLGYYFEGGTLTDED 184
Query: 167 NTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGY 226
N LTS +GEYSP T A+F +V L N + + ++ L Y+T TKI+ ILGP+++ SD +
Sbjct: 185 NVLTSEWGEYSPATKLAVFNHDVKLVNPKFVLTSDTLKYSTATKIATILGPSDIVSDQNH 244
Query: 227 IVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNR 286
I S RGVY++ T+ LLDRS++ +N K+LTGDS+FYDR G+GEAF N+ + DTVNR
Sbjct: 245 IYSERGVYNTTTEQAELLDRSVL--TNEGKKLTGDSLFYDRILGYGEAFDNVQMNDTVNR 302
Query: 287 SSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDR-RIARGY 345
+ L G+Y +Y+E A AT+R+ ID+S+ D+L+ DTL +IT + R R Y
Sbjct: 303 NMLTGDYCFYNELTGSAVATKRAVAIDYSQGDSLFMHGDTLRLITYHMNTDSMYREMRAY 362
Query: 346 RHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDV 405
VR YRTDVQA+ DS+ Y+S+DS + +Y +PI+W+ + QL G+ I+ + ++D+ +
Sbjct: 363 HKVRAYRTDVQAVCDSLVYNSKDSCMTMYTDPILWHGEQQLLGEEIKIYMNDSTIDWAHI 422
Query: 406 LTKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNR 465
+ +AL V + DS+ Y+Q++G+ ++A+ D +R ++V+GN V+ Y ++ +MN
Sbjct: 423 INQALTVEKKDSIHYNQVSGKEMKAFFIDGDMRLVEVNGNVLVVYYPVEEKDSSLIMMNY 482
Query: 466 IEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLF 524
E + +E ++++ + G +G YP+ + PD RL SF W + +RP +KED+F
Sbjct: 483 SEGGLLKMYLKERRMERGVFVGKTTGTAYPLDQIPPDKSRLPSFVWFDYIRPLNKEDIF 541
>gi|150003017|ref|YP_001297761.1| hypothetical protein BVU_0424 [Bacteroides vulgatus ATCC 8482]
gi|149931441|gb|ABR38139.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 555
Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats.
Identities = 210/529 (39%), Positives = 332/529 (62%), Gaps = 8/529 (1%)
Query: 1 MRKGEKRESRLGSRQLGAIILIVTLSFSALASLQGPPPKGSK--GKTHVILEHADELRYD 58
M K + G ++ + ++ F LA ++ P KG + K+ V L H+D L+
Sbjct: 1 MLGKRKNKYSSGRHRILVVSVLCLFGFCLLAQVR-PAKKGEQKPAKSKVYLLHSDVLKKS 59
Query: 59 RLY-NPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLH 117
L +PD Q L+GNV +H+ M CDSA ++ N+ EAF V M QGDT+ ++ YL
Sbjct: 60 PLNPDPDAQILIGNVAFRHDSVYMYCDSACFYEKTNSLEAFDNVKMVQGDTLFLYGDYLF 119
Query: 118 YDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYS 177
YDGN + A++R+ VR+EN++ TL TDSL+YDR+ NLGYYF+GG+++D N LTS +GEYS
Sbjct: 120 YDGNTQIAQVRYNVRMENKNTTLLTDSLNYDRIYNLGYYFDGGTLMDEENVLTSEWGEYS 179
Query: 178 PTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSN 237
P T ++F +V L N +T+ ++ L Y+T TKI++ILGP+++ SD+ +I S G Y++
Sbjct: 180 PATKISVFNYDVKLVNPKFTLTSDTLRYSTATKIANILGPSDIVSDANHIYSELGFYNTQ 239
Query: 238 TDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYD 297
LLDRS++ +N K+LTGDS+FYDR G+GEAF N+I+TDTVN++ L G+Y YY+
Sbjct: 240 IGQAELLDRSVL--TNEGKRLTGDSLFYDRVKGYGEAFDNVIMTDTVNKNMLTGDYCYYN 297
Query: 298 EKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDR-RIARGYRHVRVYRTDVQ 356
E YAFAT+++ +D+S+ D+L+ ADTL+M T + R R Y VR+YRTDVQ
Sbjct: 298 ELTKYAFATKKAVAVDYSQGDSLFMHADTLQMYTYYLNTDSMFRETRAYHKVRMYRTDVQ 357
Query: 357 AIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDVLTKALAVRRID 416
+ DS+ + S+DS L +Y +PI+WN + QL G+ I + ++D+ + +AL+V ++D
Sbjct: 358 GVCDSLVFSSKDSCLTMYYDPILWNNNQQLLGEKIMIYMNDSTIDWAHIQNQALSVEQLD 417
Query: 417 SVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFE 476
S Y+Q+ G+ ++A+ Q +R++ V G+ ++ Y S MN E + E
Sbjct: 418 STSYNQVTGKEMKAWFQGGEMRKVDVIGSVRLVYYPMESDST-LIGMNVSETSLLNMFLE 476
Query: 477 EGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFR 525
++KK+++ ++G YP+ P+ +L +F W + +RP KED+F+
Sbjct: 477 NRKMKKMIMSPKSNGTLYPMLQRPPEKMKLDNFVWFDYIRPLDKEDIFK 525
>gi|53715355|ref|YP_101347.1| hypothetical protein BF4071 [Bacteroides fragilis YCH46]
gi|60683324|ref|YP_213468.1| hypothetical protein BF3887 [Bacteroides fragilis NCTC 9343]
gi|52218220|dbj|BAD50813.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60494758|emb|CAH09564.1| putative exported protein [Bacteroides fragilis NCTC 9343]
Length = 567
Score = 396 bits (1018), Expect = e-108, Method: Composition-based stats.
Identities = 201/491 (40%), Positives = 321/491 (65%), Gaps = 6/491 (1%)
Query: 36 PPPKGSKGKTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTF 95
P K KT V L HAD+ + D+L PDVQ L+G+V ++H+ M CDSA + ++ N+F
Sbjct: 51 PEKAQGKKKTRVDLLHADQGQADKLARPDVQVLIGSVKLRHDSMYMYCDSALIYEKTNSF 110
Query: 96 EAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGY 155
EAF V M+QGDT+ ++ YL YDG + A+LR V++ NR+ TL TDSL+YDR+ NLGY
Sbjct: 111 EAFSNVRMEQGDTLFIYGDYLFYDGMTQIAQLRENVKMINRNTTLLTDSLNYDRLYNLGY 170
Query: 156 YFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHIL 215
YF+GG+++D N LTS +GEYSP T ++F +V L N + + ++ L Y+TDTKI+ IL
Sbjct: 171 YFDGGTLMDEENVLTSDWGEYSPATKLSVFNHDVKLVNPRFVLTSDTLKYSTDTKIATIL 230
Query: 216 GPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAF 275
GP+++ S+ +I S RG+Y++ + LLDRS++ +N K+L GDS+FYDR+ G+GEAF
Sbjct: 231 GPSDIVSEQNHIYSERGIYNTVSGQAELLDRSVL--TNDGKRLIGDSLFYDRKAGYGEAF 288
Query: 276 GNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRV 335
N+ + DTVN++ L G+Y YYDE K A AT+R+ +D+S+ D+L+ ADTL ++ +
Sbjct: 289 DNVQMNDTVNKNMLTGDYCYYDELKQNALATKRAVAVDYSRGDSLFMHADTL-LMNSYNL 347
Query: 336 PEDR--RIARGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRF 393
D R R + VR+Y D+Q + DS+ ++++DS L +Y +PI+WNE QL G+ I+
Sbjct: 348 DTDSLFREMRAFHKVRMYSIDLQGVCDSLVFNTKDSCLTMYRDPILWNEGQQLLGEEIKV 407
Query: 394 KFRNDSLDYVDVLTKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQ 453
+ ++D+ ++ +AL V + DS+ ++Q++G+ I+AY + R++ V GN V+ Y Q
Sbjct: 408 YMNDSTIDWAHIINQALTVEQKDSIHFNQISGKEIKAYFAEGEARKVDVIGNVLVVYYPQ 467
Query: 454 HKRSKRWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEE 513
+ S MN E + ++ +++++++ ++G YP+ + PD +L +F W +
Sbjct: 468 EQDSTM-IGMNTSETSLLNMYLKDRKMERMVMSPKSNGTLYPMNQIPPDKMKLPTFSWFD 526
Query: 514 AVRPKSKEDLF 524
VRP SKED+F
Sbjct: 527 YVRPLSKEDIF 537
>gi|110639393|ref|YP_679601.1| hypothetical protein CHU_3019 [Cytophaga hutchinsonii ATCC 33406]
gi|110282074|gb|ABG60260.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 525
Score = 233 bits (595), Expect = 2e-59, Method: Composition-based stats.
Identities = 142/455 (31%), Positives = 238/455 (52%), Gaps = 11/455 (2%)
Query: 67 RLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYAR 126
+L +V+ K + CDSA + N EAFG V + QGDT++M L YDGN K A+
Sbjct: 67 KLKDHVIFKQGEMFLYCDSAFQYAKTNYVEAFGHVRLVQGDTLTMTCNKLEYDGNTKKAK 126
Query: 127 LRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFR 186
+V L ++ L T +L+YDR YF G +I D N LTS+ G Y+ T F+
Sbjct: 127 AIGDVILIDKQTILKTTALNYDREGKNVSYFSGANISDKGNNLTSTIGVYNTGTKIFTFK 186
Query: 187 DNVHLEN--KDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILL 244
NVH+ N + + +D + L YN+ ++++ G T++ + G I S G Y++ T V
Sbjct: 187 KNVHITNPGQGFLLDADTLQYNSQSRLATFRGETKITTKDGVIKSKEGSYNTATSVMYFG 246
Query: 245 DRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAF 304
R+ V+S G ++G+ I YD +T G G + + + + ++ G++ Y K Y+
Sbjct: 247 GRAQVFS--GDNTISGNKIDYDEKTKLGVVTGEVKIENKKDSITVLGQHAKYTGKNGYSI 304
Query: 305 ATQRSYMIDFSKPDTLWAAADTLEMI--TQRRVPEDRRIARGYRHVRVYRTDVQAIADSM 362
+ M + DTL+ ADTL I T +++ ++ + Y HV+++R D+QA DS+
Sbjct: 305 VSGNPLMYQVNNTDTLFLKADTLVSINDTIKKI----KLLKAYYHVQLFRKDMQARCDSL 360
Query: 363 QYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDVLTKALAVRRIDSVMYDQ 422
Y+ DS +YLY NP++WN ++QL D+I +N + + + A + + ++Q
Sbjct: 361 VYNFYDSTIYLYTNPVLWNGENQLVADSIWMVQKNGKMHTMHMHVNAFVISKDTIDNFNQ 420
Query: 423 LAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKK 482
+ GR I A+ ++ + +I V GNAE I Y + K+ +N+ EA SI+ F++ +L
Sbjct: 421 IKGRQITAFFANNHISKILVEGNAESI-YHALEGEKKLMGVNKAEAGSIVVLFKDDKLST 479
Query: 483 VLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRP 517
+ P + L P+ +L F+W RP
Sbjct: 480 ITYVTKPDAAFIPPQELKPEDVKLKGFKWRIKERP 514
>gi|88712817|ref|ZP_01106902.1| hypothetical protein FB2170_09271 [Flavobacteriales bacterium
HTCC2170]
gi|88708715|gb|EAR00950.1| hypothetical protein FB2170_09271 [Flavobacteriales bacterium
HTCC2170]
Length = 566
Score = 231 bits (589), Expect = 1e-58, Method: Composition-based stats.
Identities = 157/525 (29%), Positives = 271/525 (51%), Gaps = 32/525 (6%)
Query: 72 VVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEV 131
+ +HEGA + CD A Q+EN +A G + +QQGD+V M + + Y+GN A+ V
Sbjct: 57 IQFEHEGADLFCDLAIYYQQENRLKAIGNIRLQQGDSVEMTSGKIDYNGNENLAKAWENV 116
Query: 132 RLENRSA-TLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVH 190
L ++S TL TD+L ++R+ YY + G++VDS+NTLTS G+Y T F D+VH
Sbjct: 117 VLTSKSQMTLTTDTLRFNRIEQQAYYQDFGTVVDSVNTLTSEIGKYFLETKKLQFLDSVH 176
Query: 191 LENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVY 250
L N DY +D+E+L Y +K +++ GP+ + ++ I RG YD+ + G + + +
Sbjct: 177 LTNPDYILDSEQLDYYETSKNAYLYGPSTITGNTYKIYCERGFYDTKVESGYFIKNTKID 236
Query: 251 SSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSY 310
+N + + GDS+++++ F A N+ + DT+N + Y + KD FAT+R+
Sbjct: 237 YNN--RIIKGDSVYFNKAREFASATNNIKVIDTINNGLIKAHYAEVFKAKDSVFATKRAV 294
Query: 311 MIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSL 370
I + D+L+ DTL M+T + PE+ RI R +R+ +V++TD+ DS+ Y+ + L
Sbjct: 295 SIGLMEQDSLYIHGDTL-MLTGK--PEN-RILRAFRNAKVFKTDLSGKCDSIHYNEKTGL 350
Query: 371 LYLYDNPIMWNEDSQLSGDTIRFK--FRNDSLDYVDVLTKALAVRRIDSVM---YDQLAG 425
+ NPI+WN +Q++GD+I K + + +D + VL A + +DSV Y+Q G
Sbjct: 351 TQMITNPILWNGPNQMTGDSIHLKSNLKTEKMDSLKVLNNAFVI-SLDSVSMEGYNQAKG 409
Query: 426 RHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKKVLL 485
+ +D+ ++ I + N EV+ Y + +N+ + I ++ +
Sbjct: 410 IDLFGKFEDNQLKVIDLIKNTEVVYY-VYNDDDELVGINKTKCSKIRITMANNDIEDLTF 468
Query: 486 RGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDS-VLQVHRSLSDLRRF 544
G +P L+ + + L F W R SK+D+F ++ VL + R +S+
Sbjct: 469 FTDPEGDIFPETELSVNERILKGFIWRGDERIMSKDDIFDYDDNNIVLPIIRGISNPIDI 528
Query: 545 SGALAALRAYTALAEEERKDSLTIAALQTDSIPPTPAAGKEATDP 589
AEEE ++S + D + P + +A DP
Sbjct: 529 D------------AEEEERNS-----NEGDPVNNIPKSNDQAIDP 556
>gi|89891372|ref|ZP_01202878.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89516403|gb|EAS19064.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 548
Score = 231 bits (588), Expect = 2e-58, Method: Composition-based stats.
Identities = 140/465 (30%), Positives = 247/465 (53%), Gaps = 12/465 (2%)
Query: 72 VVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEV 131
V + H+G M CD A +++N A G V M QGD+V M ++Y Y+GN + A +V
Sbjct: 43 VYVVHQGIKMWCDQAVFYKKDNFLRALGSVRMNQGDSVLMNSKYAEYNGNTQLAFAAGKV 102
Query: 132 RLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHL 191
+ + TL TD+L +DR YY GG++ D+ +TLTS G + F D+V +
Sbjct: 103 NMRSPETTLSTDTLYFDRNKQQAYYRSGGTVRDTASTLTSRVGRFFMQEKKYQFIDDVVI 162
Query: 192 ENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYS 251
N DYT+++ ++++ T+T +++ GP+ ++ + + RG YD+ D G + +S +
Sbjct: 163 VNPDYTINSSQVNFYTETGHAYLYGPSTIKGKASTVYCERGFYDTRNDYGHFVKKSRIDY 222
Query: 252 SNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYM 311
+N + +TGDS++++R T F A N+++TDT+N S + G Y KD F T+R+
Sbjct: 223 NN--RTVTGDSLYFNRVTNFASATNNIVVTDTINNSIIKGHYAEVFRDKDSVFITERAVA 280
Query: 312 IDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSLL 371
I + D+++ ADTL M+T PED RI RG+R VR++++D+ DS+ L
Sbjct: 281 ISLQEADSVYIHADTL-MVTG---PEDNRIVRGFRDVRLFKSDLSGRCDSIHTRQSTGLT 336
Query: 372 YLYDNPIMWNEDSQLSGDTIRFK--FRNDSLDYVDVLTKALAVRRIDSVM--YDQLAGRH 427
+ P++W+ SQ++GD+I + + LD + V A V + D++ Y+Q+ G+
Sbjct: 337 KMIKKPVLWSGKSQITGDSIHLQSNVETEKLDSLRVFYNAFIVDK-DTIHDGYNQIKGKE 395
Query: 428 IRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKKVLLRG 487
+ +D+ + ++++ N E + Y ++ S +N+ + + F G + + G
Sbjct: 396 LIGLFKDNELNKVKIDKNVENLLYVSNE-SDELVGINKGTSSKLEITFNNGDIAIIKPIG 454
Query: 488 VASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSVL 532
+ P L + +RL F W + S +DLF +P VL
Sbjct: 455 NPKDETIPPDELPENARRLRGFNWRGEEQLNSVDDLFMGKPKPVL 499
>gi|124006426|ref|ZP_01691260.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123988083|gb|EAY27754.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 536
Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats.
Identities = 150/463 (32%), Positives = 244/463 (52%), Gaps = 9/463 (1%)
Query: 72 VVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEV 131
VVI+H+G + CDSA N +NT +A G M D ++ +R + YDGN K A V
Sbjct: 72 VVIRHKGNTLYCDSAIQNITKNTVQAIGNARMLGSDGTTVNSRTMFYDGNKKVANASGNV 131
Query: 132 RLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHL 191
L ++ TL T+ LDYD V + +Y+ GG IVDS N LTS G Y + A F+++VHL
Sbjct: 132 VLVDKKGTLTTEVLDYDVVSQVAHYYTGGKIVDSENILTSKEGTYDTNSKVAFFKNDVHL 191
Query: 192 ENKDYTMD--TEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIV 249
+K + ++ + YN +K+++ G T++ S G + + G Y++ T V +
Sbjct: 192 VSKKDKQEIFSDNIQYNMVSKMAYFRGKTKILSKDGTVYANEGEYNTKTKVSHFRTKGNA 251
Query: 250 YSSNGAKQ--LTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQ 307
++ L GDS+FYD G A GN LT + + G+ G + KK +
Sbjct: 252 RPKAETQEYILQGDSLFYDNTNRIGFAKGNARLTSKKDSLIIDGDIGRFWGKKGISKVYG 311
Query: 308 RSYMIDFSKPDTLWAAADTLEMI-TQRRVPEDR-RIARGYRHVRVYRTDVQAIADSMQYD 365
+ M S DTL+ ADTL I T++ ED +I + Y + +++R ++Q DS+ Y+
Sbjct: 312 AALMRSISNKDTLYLVADTLISIQTKKENSEDSVKILKAYHNTKIFRKELQGKCDSLVYN 371
Query: 366 SRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDVLTKALAVRRIDSVMYDQLAG 425
DS +Y+Y++P++W+ SQLSGD+IR N+ + + + T A +++ ++QL G
Sbjct: 372 FGDSSIYMYNDPVLWDRKSQLSGDSIRVLMANNKIHRMLLRTNAFVIQQDTLNNFNQLKG 431
Query: 426 RHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADF-EEGQLKKVL 484
R+++A+ + S +R++ V GN E I + + MNR+ I F E+ ++K +
Sbjct: 432 RNMKAFFEKSEIRRVDVRGNGESIFFALEGDT-LLTGMNRVICSDIDIRFKEKNKVKTIT 490
Query: 485 LRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQ 527
+ GK P L +RL F W RPK K D+ R++
Sbjct: 491 FKSKPDGKFIPPHELAEPDKRLKGFLWRINERPK-KIDMLRQR 532
>gi|146300347|ref|YP_001194938.1| OstA family protein [Flavobacterium johnsoniae UW101]
gi|146154765|gb|ABQ05619.1| OstA family protein [Flavobacterium johnsoniae UW101]
Length = 549
Score = 230 bits (586), Expect = 2e-58, Method: Composition-based stats.
Identities = 152/517 (29%), Positives = 262/517 (50%), Gaps = 18/517 (3%)
Query: 19 IILIVTLSFSALASLQGPPPKGSKGKTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEG 78
++L V +F+ A PK + HV EHAD + P L GNV + H+G
Sbjct: 2 LVLSVQSTFAQAAKSAKTAPK----QIHV--EHADNFERNEPLVPGAVLLSGNVKVDHDG 55
Query: 79 AVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSA 138
V+ C+ A++ + EN +AFG V + QGDT+ + ++Y Y GN+K A + + + A
Sbjct: 56 IVLTCNKAYIFEGENYLKAFGNVQLVQGDTLFLNSKYAEYSGNLKQAFATGDAVMTSPDA 115
Query: 139 TLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTM 198
TL TD++ +DR + YY G+IV+ NTL S G Y F V + N Y +
Sbjct: 116 TLQTDTIHFDRNIQQAYYNTKGTIVNKENTLVSKSGRYFAAEKKFQFLTEVTITNPKYVI 175
Query: 199 DTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQL 258
+ L Y +++ +++LGP+ + S + YI + RG YD+ ++ L +S Y + +
Sbjct: 176 KSNHLDYYSNSGHTYLLGPSTITSKANYIYTERGFYDTKKNLAHFLRKS--YIKYDDRLI 233
Query: 259 TGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPD 318
GDS++Y+R T F A N+ +TD++N+ + G Y + KD F T+R+ I+ + D
Sbjct: 234 EGDSLYYNRNTEFASATRNVKITDSINKGIVKGHYAEIFKLKDSMFVTKRAVAINLVEND 293
Query: 319 TLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPI 378
+++ L M+T + E RI R + +VR Y+TD+ DS+ +S+ +L L NPI
Sbjct: 294 SVYIHGKKL-MVTGK---EGERILRAFNNVRFYKTDMSGKCDSIHSNSKTALTKLIGNPI 349
Query: 379 MWNEDSQLSGDTIRFKFRNDS--LDYVDVLTKALAVRRIDSV--MYDQLAGRHIRAYMQD 434
+WN +SQ++GD + N++ LD + VL + + D++ Y+Q+ G ++ ++
Sbjct: 350 IWNGESQITGDIMHLIGDNNTKKLDSLKVLNNTFIISK-DTLGTGYNQVKGINLFGKFKE 408
Query: 435 SLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGKGY 494
+ + V N EV+ Y +N+ + I E ++ + G Y
Sbjct: 409 GKLHDVDVIKNTEVV-YFMRNDDNELIGINKNVSSKINLILENNLIETITFFNKVDGDIY 467
Query: 495 PIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSV 531
P L + ++L F W R KSK+D+F + + +
Sbjct: 468 PETDLPENARKLRGFVWRGDERIKSKDDIFTEEDNEL 504
>gi|150025820|ref|YP_001296646.1| hypothetical protein FP1772 [Flavobacterium psychrophilum JIP02/86]
gi|149772361|emb|CAL43839.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 551
Score = 227 bits (578), Expect = 2e-57, Method: Composition-based stats.
Identities = 146/492 (29%), Positives = 257/492 (52%), Gaps = 14/492 (2%)
Query: 41 SKGKTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQ 100
S+ ++++H+D L P L GNVVI HEG M C+ A+ + N + FG
Sbjct: 22 SQDAKQIVIQHSDFLDISEKEVPGAIVLTGNVVIIHEGVRMTCNKAYHFTKSNFVKIFGN 81
Query: 101 VSMQQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGG 160
V+M QGDT+SM ++Y Y+GN K+A +V L + TL TD++++DR YY G
Sbjct: 82 VNMVQGDTLSMNSKYAEYNGNTKFAYATGDVLLRDPKMTLATDTINFDRNSQQAYYNSKG 141
Query: 161 SIVDSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEM 220
+I D NTL S+ G+Y F D V + N T+ T L Y T++ +++ GP+ +
Sbjct: 142 TIRDPENTLVSNSGKYYLNQKKFQFSDAVTVTNPRQTIKTNHLDYYTNSGHAYVFGPSTI 201
Query: 221 RSDSGYIVSTRGVYDSNTDVGILLDRS-IVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMI 279
S + I +T+G YD+ D G L S I Y + + GD I+YDR+ F A N+
Sbjct: 202 TSATNTIYTTKGFYDTKKDEGKLQKGSKITYKD---RLIEGDDIYYDRKLDFARAKNNVK 258
Query: 280 LTDTVNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDR 339
+TDT+N + G Y ++KD F T+++ I + D+++ A + ++T + PE
Sbjct: 259 VTDTINHFVVKGNYAEVYKQKDSMFITKKAVAITLFEKDSVYFHAKKI-LVTGK--PES- 314
Query: 340 RIARGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRF--KFRN 397
RI RG + R ++ D+ DS+ YD + L + P++WN +Q++GD + +
Sbjct: 315 RIIRGSNNARFFKKDISGKCDSIHYDKKKGLTQMIGKPVLWNGKNQMTGDVMHLVSNQKT 374
Query: 398 DSLDYVDVLTKALAVRRIDSV--MYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHK 455
+ +D + VL A + + D++ ++Q+ G+++ + + + ++ V N EVI Y +++
Sbjct: 375 EKIDSLKVLNNAFIISK-DTIGEGFNQVKGQNLFGKFKKNKLHEVNVIKNTEVIFYMRNE 433
Query: 456 RSKRWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAV 515
+++ +N+ + I E ++ + G YP + L + ++L F W
Sbjct: 434 KNE-LIGINKNVSSKINMILEANNIETITFFTDVEGIIYPEEELPENARKLKGFIWRGDE 492
Query: 516 RPKSKEDLFRRQ 527
+ +KEDLF ++
Sbjct: 493 QILTKEDLFPKE 504
>gi|83856772|ref|ZP_00950301.1| hypothetical protein CA2559_06755 [Croceibacter atlanticus
HTCC2559]
gi|83850572|gb|EAP88440.1| hypothetical protein CA2559_06755 [Croceibacter atlanticus
HTCC2559]
Length = 582
Score = 226 bits (576), Expect = 4e-57, Method: Composition-based stats.
Identities = 146/454 (32%), Positives = 244/454 (53%), Gaps = 12/454 (2%)
Query: 76 HEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEVRLEN 135
HEG + CD A +E+N F+A+G V ++QGDTV+M + Y Y+GN K+A VRL
Sbjct: 48 HEGIDVWCDQAVFYKEDNFFKAYGNVRIKQGDTVNMNSTYAEYNGNTKFAFASTGVRLST 107
Query: 136 RSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHLENKD 195
S TL TDSL +DR +Y GG++ D+ +T+TS G Y F+ +V + N +
Sbjct: 108 PSQTLTTDSLFFDRTKQQAFYRSGGTVRDTASTITSKIGRYYMELDKYSFKRDVVVNNPE 167
Query: 196 YTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYSSNGA 255
Y +++E+L + T + +++ G + + S++ + RG YD+ D G + S + N
Sbjct: 168 YVINSEQLDFYTKSGHAYLYGESTIESETSTVYCERGFYDTRGDTGYFVKNSQIDYDN-- 225
Query: 256 KQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYMIDFS 315
++L GDS+F+DR F A N+ +TDT N+S + G Y KD F T+R+ I
Sbjct: 226 RKLEGDSLFFDRAKNFASATNNIKVTDTANQSIIKGHYAEVFRDKDSVFITKRALAITVQ 285
Query: 316 KPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYD 375
D+++ +DTL MIT + PE+R I RG+ R++++++ DS+ + L +
Sbjct: 286 DKDSVYIHSDTL-MITGK--PENRVI-RGFYDTRLFKSNMSGKCDSVHIQQKTGLTKMLG 341
Query: 376 NPIMWNEDSQLSGDTIRF--KFRNDSLDYVDVLTKALAVRRIDSVMYDQLAGRH-IRAYM 432
NP++W+ +SQL+GDTI + ++LD + V A +++ Y+Q+ G+ I +
Sbjct: 342 NPVLWSSNSQLTGDTIHLLNNPKTETLDTLKVFNNAFMIQKDSIEGYNQVKGKELIGLFN 401
Query: 433 QDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGK 492
D+ + Q+ V N E I Y ++++ K +N A SI E ++ V G
Sbjct: 402 DDNDLYQVDVLKNTETIYYLRNEQ-KELIGLNNTLASSISILMENREIVDVYYYKQIDGT 460
Query: 493 GYPIKMLTPDLQ-RLASFRWEEAVRPKSKEDLFR 525
P + PD++ +L F W + +KEDLF+
Sbjct: 461 INP-DLNKPDVEKKLTGFNWRGTEQLITKEDLFK 493
>gi|149370671|ref|ZP_01890360.1| hypothetical protein SCB49_14445 [unidentified eubacterium SCB49]
gi|149356222|gb|EDM44779.1| hypothetical protein SCB49_14445 [unidentified eubacterium SCB49]
Length = 574
Score = 226 bits (575), Expect = 5e-57, Method: Composition-based stats.
Identities = 140/471 (29%), Positives = 241/471 (51%), Gaps = 16/471 (3%)
Query: 70 GNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRH 129
G V I+HEG M CD A+L ++N +A+G+V + QGDT+ M + Y Y+GN K+A
Sbjct: 53 GQVYIEHEGIEMWCDQAYLYTKDNFVKAYGEVKVTQGDTIKMNSSYAEYNGNTKFAFASG 112
Query: 130 EVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNV 189
+V + N TL TD+L +DR+ +Y GG+++D+ +TLTS G Y T F NV
Sbjct: 113 DVVMNNPQTTLKTDTLYFDRIKQQAFYRSGGTVIDTASTLTSRVGRYFAETKKYQFLSNV 172
Query: 190 HLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIV 249
++N +Y +++E+L + ++ +++ G + + S++ + RG YD+ D G + S +
Sbjct: 173 KIDNPEYIVNSEQLDFYSENGDAYLYGESTIVSETSTVYCERGYYDTRGDTGYFVKNSRI 232
Query: 250 YSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRS 309
+N + L GDSI++DR GF A N+ + DT N+S++ G Y KKD F T+R+
Sbjct: 233 DYNN--RILHGDSIYFDRNKGFASATNNIKVIDTANQSTIKGHYAEVFRKKDSVFITKRA 290
Query: 310 YMIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRT------DVQAIADSMQ 363
D+++ ADTL + + D RI RGY R+++ + +DS+
Sbjct: 291 IAATLRDTDSIYIHADTLRITGK----TDHRILRGYYKARLFKRGTPEEGNTSGKSDSIY 346
Query: 364 YDSRDSLLYLYDNPIMWNEDSQLSGDTIRF--KFRNDSLDYVDVLTKALAVRRIDSVMYD 421
D + L +P++W ++Q++GDTI + +D + V + V++ DS+ Y+
Sbjct: 347 IDENIGITKLLTDPVLWMGENQMTGDTIHILSNTVTEKVDTLKVFNNSFLVQK-DSLGYN 405
Query: 422 QLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLK 481
Q+ G + ++ + + ++ N EVI Y + + ++ A + E ++
Sbjct: 406 QVKGERLIGLFTNNELDTVNINKNVEVIFY-LYGDNGNLTGIDLTTASQLQLTLENQEIV 464
Query: 482 KVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSVL 532
GK YP L + L+ F W R KEDLF +P +L
Sbjct: 465 GTRFIKQVPGKIYPPSKLPEGDRILSKFNWRGEERLNRKEDLFSGKPTPIL 515
>gi|86133424|ref|ZP_01052006.1| hypothetical protein MED152_01930 [Tenacibaculum sp. MED152]
gi|85820287|gb|EAQ41434.1| hypothetical protein MED152_01930 [Polaribacter dokdonensis MED152]
Length = 545
Score = 225 bits (574), Expect = 6e-57, Method: Composition-based stats.
Identities = 141/491 (28%), Positives = 255/491 (51%), Gaps = 14/491 (2%)
Query: 41 SKGKTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQ 100
S+ K +I+E+A+ D P LLGNV IKH+G + C A ++EN F+A G
Sbjct: 18 SQEKKKIIIENAEIQYADEEKTPGATILLGNVRIKHDGINLTCQQALFYKKENFFKAIGN 77
Query: 101 VSMQQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGG 160
V ++QGDT++ + Y YD N K A V L++ + TL TD+L +DR+ YY
Sbjct: 78 VLIKQGDTITQTSDYADYDANAKQALSWGNVVLKDPTMTLTTDTLQFDRINQKLYYQNYA 137
Query: 161 SIVDSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEM 220
+I D NTL S G Y V + N ++ + + L Y TD+ ++++ GP+ +
Sbjct: 138 TIRDVTNTLKSKNGNYYLENKKFTATTRVTVVNPEHNLASNHLDYYTDSGLAYLYGPSTI 197
Query: 221 RS--DSGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNM 278
+ + + S +G Y++ TD+ + ++ + + GDS++YD+ GF A N+
Sbjct: 198 TNTQNENKLYSEKGFYNTKTDISYFTKNAKLFLKE--RTVEGDSLYYDKNKGFASATNNI 255
Query: 279 ILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPED 338
+ DTV G Y EKKD F +R+ I + D+++ DTL ++T +
Sbjct: 256 KVIDTVQNFISKGNYAELFEKKDSLFIIKRAVAISIIEKDSMFIHGDTL-LVTGK---PK 311
Query: 339 RRIARGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFK--FR 396
+R+ R Y +V+++++D+Q DS+ + +Y NP++W++ +Q++GD+I +
Sbjct: 312 KRVIRTYHNVKIFKSDLQGKCDSIHTNQETGYTKMYRNPVIWSDQNQITGDSIYLQSNLE 371
Query: 397 NDSLDYVDVLTKALAVRRIDSVM---YDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQ 453
+ LD + V A V + DS+ Y+Q+ GR++ + ++ + V GNAE I + +
Sbjct: 372 TEKLDSLKVFNNAFIVSK-DSLAKEDYNQIKGRNMFGKFDANKLKFLLVKGNAESIYFNR 430
Query: 454 HKRSKRWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEE 513
+ ++ + + + +I E G+++ + + GK YP L + +++ F W E
Sbjct: 431 NAETQVLETITKEVSSNIEFTLENGEIQSIKYLKSSDGKTYPPSELDLEGRKIKGFIWRE 490
Query: 514 AVRPKSKEDLF 524
+PK+K D+F
Sbjct: 491 DEQPKTKYDIF 501
>gi|120435129|ref|YP_860815.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
gi|117577279|emb|CAL65748.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
Length = 599
Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats.
Identities = 140/476 (29%), Positives = 252/476 (52%), Gaps = 22/476 (4%)
Query: 72 VVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEV 131
V H+G + CD+A +E N F+A+G + MQQGD+VSM + Y Y+G+ ++A +V
Sbjct: 54 VYFDHDGIEVWCDNAVFYKEANFFKAYGNIRMQQGDSVSMTSNYAEYNGDTEFAFASGKV 113
Query: 132 RLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHL 191
+++ TL TDSL +DR+ YY GG + D+ + LTS+ G Y F D+V +
Sbjct: 114 KMKRPQTTLETDSLFFDRIKQQAYYRSGGKVTDTASVLTSTIGRYFMEEDKYSFVDSVVV 173
Query: 192 ENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYS 251
N +Y +++E+L + +++ +++ GP+ + S++ + RG YD+ D G + S +
Sbjct: 174 TNPEYKINSEQLDFYSNSGHAYLYGPSTIESETSTVYCERGFYDTRADNGYFVKNSQINY 233
Query: 252 SNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYM 311
N + L GDS++++R+ F A N+ + DT+N S + G Y KD F T+++
Sbjct: 234 DN--RILKGDSLYFNRKNSFASATNNIRVIDTINNSRVSGHYAEVYRDKDSVFITKKALA 291
Query: 312 IDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTD------VQAIADSMQYD 365
+ DTL+ +DT+ MIT + PE+R I RG+ R+++ D + +DS+ D
Sbjct: 292 ASLQERDTLFIHSDTI-MITGK--PENRVI-RGFYDARIFKEDQTGENTMSGRSDSIYSD 347
Query: 366 SRDSLLYLYD-------NPIMWNEDSQLSGDTIRFKF--RNDSLDYVDVLTKALAVRRID 416
+ L L + P++W+ ++Q++GD+I + + + LD + V A +++
Sbjct: 348 QQSGLTKLINLTSRGNGKPVLWSGENQMTGDSIHLQSNPKTEQLDSLLVFDNAFLIQKDS 407
Query: 417 SVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFE 476
Y+QL G+ + Y +D+ + ++ + N E + Y ++ ++ +N+ A SI FE
Sbjct: 408 IEGYNQLKGKILTGYFKDNQLHEVVIDKNTETLNYMRNSENE-LIGINKTLASSIKILFE 466
Query: 477 EGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSVL 532
Q++ + G P P+ ++L F W R SKE LF+ QP+ L
Sbjct: 467 NQQIQDIYYYNQVDGNLTPEADFPPNARQLQGFNWRGEDRILSKEGLFKGQPEPEL 522
>gi|88802837|ref|ZP_01118364.1| hypothetical protein PI23P_09605 [Polaribacter irgensii 23-P]
gi|88781695|gb|EAR12873.1| hypothetical protein PI23P_09605 [Polaribacter irgensii 23-P]
Length = 509
Score = 222 bits (565), Expect = 6e-56, Method: Composition-based stats.
Identities = 141/485 (29%), Positives = 252/485 (51%), Gaps = 19/485 (3%)
Query: 49 LEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDT 108
L++ DE +Y P L+GNV + H GAV+ C A Q+EN F+A V + QGDT
Sbjct: 7 LQYVDEDQY-----PGATVLIGNVKMIHAGAVLTCKQALFYQKENFFKALENVVVNQGDT 61
Query: 109 VSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNT 168
++ + YL YD N K + V L++ TL +D+L +DR+ YY +I D NT
Sbjct: 62 ITQTSDYLDYDANAKQSLSWGNVVLKDPEITLTSDTLQFDRLNQKLYYQSYATIKDKTNT 121
Query: 169 LTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRS--DSGY 226
L S G+Y T V + N ++ +++ L Y +T ++++ GP+ + + +
Sbjct: 122 LKSKKGQYYLETKKFTATTRVTVVNPEHHLESNHLDYYANTGLTYLFGPSTITNTQNENK 181
Query: 227 IVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNR 286
I RG Y++ TD+ + + ++ + + GDS+FYD++ GF A + + DTV
Sbjct: 182 IYCERGFYNTKTDISYFVKEAKLFLKE--RTIEGDSLFYDKKRGFASATNKIQIIDTVKN 239
Query: 287 SSLYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYR 346
+ G Y EK+D F +++ I K D+ + DTL ++T +PE +RI R Y
Sbjct: 240 FVIRGNYAEIFEKEDSLFIIKKAVAISIVKKDSTFIHGDTL-LVTG--IPE-KRIVRSYH 295
Query: 347 HVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRF--KFRNDSLDYVD 404
+V+++++D+Q DS+ + + + ++ NP++W++ +Q++GDTI + LD +
Sbjct: 296 NVKIFKSDLQGKCDSLHTNQQSGITKMFINPVLWSDGNQITGDTIHLISDTITEQLDSLK 355
Query: 405 VLTKALAVRRIDSVM---YDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWY 461
VL A V + DS+ Y+Q+ GR + + + + + V GNAE + Y + + +
Sbjct: 356 VLNNAFIVSK-DSLSIQEYNQIKGRDMFGKFKANKLELLLVKGNAESLYYNRSEETGSIE 414
Query: 462 LMNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKE 521
+ + + +I E GQ+ + + GK +P ++L F W +PK+KE
Sbjct: 415 TITKEISSNIEFTLENGQIISMKYLKSSDGKTHPPSQFPEVERKLKGFIWRAKEQPKTKE 474
Query: 522 DLFRR 526
D+F++
Sbjct: 475 DIFKK 479
>gi|88806998|ref|ZP_01122513.1| hypothetical protein RB2501_01790 [Robiginitalea biformata
HTCC2501]
gi|88782944|gb|EAR14118.1| hypothetical protein RB2501_01790 [Robiginitalea biformata
HTCC2501]
Length = 556
Score = 219 bits (558), Expect = 4e-55, Method: Composition-based stats.
Identities = 142/474 (29%), Positives = 248/474 (52%), Gaps = 12/474 (2%)
Query: 72 VVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEV 131
V +H+GA + CD A L Q++N EA G V ++QGD+V M + + Y+GN++ A+ V
Sbjct: 50 VQFEHQGADLWCDYAFLYQKDNRLEAIGNVRLEQGDSVLMTSGRVEYNGNLRLAKAYESV 109
Query: 132 RLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHL 191
RLEN+S TL TD+L +DR YY + G IVDS+NTLTS G Y F+D+V +
Sbjct: 110 RLENQSMTLTTDTLYFDRERQEAYYRDFGRIVDSVNTLTSEVGRYFMVPKKYQFQDSVLI 169
Query: 192 ENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYS 251
+N DYT+++ L Y T++K +++ GP+ + + + RG YD+ + G + + ++
Sbjct: 170 KNPDYTLESTRLDYYTNSKNAYMYGPSTITGEDYTLYCERGFYDTRVEQGYGIRNTEIHY 229
Query: 252 SNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYM 311
++ + + GDS+++D+ + F A N+++TDT+N+ + Y +D FAT+R+
Sbjct: 230 ND--RIIEGDSVYFDKASEFASATNNIVITDTINKGVIRAHYAEVHRARDSVFATRRAVS 287
Query: 312 IDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSLL 371
I + D+L+ DTL M+T D RI R YR+ + ++TD+ DS+ ++ R +
Sbjct: 288 ISLVEQDSLYMHGDTL-MVTG---APDARILRAYRNAKFFKTDLSGKCDSIHFEERTGIT 343
Query: 372 YLYDNPIMWNEDSQLSGDTIRF--KFRNDSLDYVDVLTKALAVRR--IDSVMYDQLAGRH 427
L P++WN ++Q++GD+I D + VL A + I Y+Q G+
Sbjct: 344 QLIREPVIWNLENQMTGDSIYLLSDLETQKPDSLKVLGNAFLISEDTIGHRGYNQAKGKD 403
Query: 428 IRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKKVLLRG 487
+ + ++ + + GN EV+ Y + +++ I E +++ +
Sbjct: 404 LFGKFIEDQLKIVDLVGNTEVV-YFMYDDDNELIGIDKTVCSRIRLLMEASEIQDITFFI 462
Query: 488 VASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRRQPDSV-LQVHRSLSD 540
G YP L + + L F W R +D+F +++ L V R LS+
Sbjct: 463 DPDGVIYPEADLPEESRILEGFIWRGDERMYRWQDVFDEDDNNLELPVIRGLSE 516
>gi|91215797|ref|ZP_01252767.1| hypothetical protein P700755_02732 [Psychroflexus torquis ATCC
700755]
gi|91186263|gb|EAS72636.1| hypothetical protein P700755_02732 [Psychroflexus torquis ATCC
700755]
Length = 568
Score = 219 bits (557), Expect = 5e-55, Method: Composition-based stats.
Identities = 148/518 (28%), Positives = 256/518 (49%), Gaps = 12/518 (2%)
Query: 52 ADELRYDRLYNPD---VQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDT 108
+D R D + P + ++ V H+G + CD A Q ++ F AFG V M+QGDT
Sbjct: 29 SDRTRVDEVNYPGAFILSKVNNQVYFLHDGIEVWCDRAIFYQTDDFFRAFGNVRMKQGDT 88
Query: 109 VSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNT 168
V+M ++Y Y+G ++A +V L S L TDSL ++R+ +Y GG++ DS +T
Sbjct: 89 VNMTSKYAEYNGYTQFAFASEDVVLTTPSNRLTTDSLFFNRLKQEAFYRSGGAVKDSAST 148
Query: 169 LTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIV 228
+TS G Y FR +V + N +Y +D++ L + ++ + + GP+ + S++ +
Sbjct: 149 ITSVIGRYFMNQEKFSFRKDVKVRNPEYDIDSDYLDFYSEKGHAFLYGPSTITSETSTVF 208
Query: 229 STRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSS 288
RG YD+ D G + S + N + L GDSI++DR TGF A N+ +TDTVN+S
Sbjct: 209 CERGFYDTRKDNGFFVKNSKIDYEN--RLLEGDSIYFDRPTGFASATNNIKVTDTVNQSV 266
Query: 289 LYGEYGYYDEKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHV 348
+ G Y KD F T+ + D+++ A+DTL M+T + R R +
Sbjct: 267 IKGHYAEVFRMKDSLFITKNPLAAIKQEQDSVFIASDTL-MVTGK---TGNRDIRAFYDA 322
Query: 349 RVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRF--KFRNDSLDYVDVL 406
R+Y++D+ ADS+ + L L +PI+W+ +SQ++GDTI + LD + V
Sbjct: 323 RLYKSDLSGKADSIHSNEATGLTKLIRDPILWSGESQITGDTIHLISNTETEQLDSLKVF 382
Query: 407 TKALAVRRIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRI 466
A +++ Y+Q+ G+ + +++ + ++ + N E + + + S + I
Sbjct: 383 YNAFMIQKDSIDGYNQIKGKELFGLFKNNEIYEVNIIRNTESLFFLRDDTSDLLGINKSI 442
Query: 467 EAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFRR 526
A I FE+ ++ V + +P M + ++L F W R SK DLF+
Sbjct: 443 SAKIKIL-FEKQEISDVYYYNEVDSQTHPSSMFPENARKLKDFSWRGDERLMSKADLFKG 501
Query: 527 QPDSVLQVHRSLSDLRRFSGALAALRAYTALAEEERKD 564
+ VL + L D + +R + + E+ D
Sbjct: 502 RDSLVLTKIKGLEDPDIYGDFFEGIRELNSNSNLEKAD 539
>gi|126662025|ref|ZP_01733024.1| hypothetical protein FBBAL38_01700 [Flavobacteria bacterium BAL38]
gi|126625404|gb|EAZ96093.1| hypothetical protein FBBAL38_01700 [Flavobacteria bacterium BAL38]
Length = 596
Score = 218 bits (556), Expect = 7e-55, Method: Composition-based stats.
Identities = 141/487 (28%), Positives = 247/487 (50%), Gaps = 17/487 (3%)
Query: 47 VILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQG 106
+I+E++D + ++ P GNV I H G M C+ A+ ++EN +AFG V M QG
Sbjct: 58 IIIENSDFVDMNQTEIPGAIVFTGNVRIIHNGVKMFCNKAYHFKDENYIKAFGNVQMNQG 117
Query: 107 DTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSL 166
DT++M +RY Y+G+ + A +V L + + L TD++ +D+ + Y G+I +
Sbjct: 118 DTITMNSRYAEYNGDKELAFATGDVVLRSPESILTTDTVYFDKKNQVASYNTYGTIRNKE 177
Query: 167 NTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGY 226
NTLTS G Y F V ++N + T+ T L Y ++ +++ GP+ + S
Sbjct: 178 NTLTSKSGRYYVDQKKYKFTTAVTVKNPESTIKTNNLDYYENSGHAYVFGPSTITSKENV 237
Query: 227 IVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNR 286
I + G YD+ D+G L S + N K + GD ++YD++ F N+ +TDT+N+
Sbjct: 238 IYTENGFYDTTNDIGKLSKNSKITLDN--KIIEGDDLYYDKKKNFSRGINNVKITDTINK 295
Query: 287 SSLYGEYG--YYD--EKKDYAFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIA 342
G Y Y + KKD T+R+ + + DT++ + + P+D R+
Sbjct: 296 VIATGHYAELYRNAATKKDSMILTKRALVKTLVEKDTMYMHGKKIIV----SGPQDDRVI 351
Query: 343 RGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDS--L 400
R + +VR Y+TD+ DS+ +++ L L PI+WN ++Q++GD + N + L
Sbjct: 352 RAFNNVRFYKTDMSGKCDSLHSNNKTQLTKLIGKPILWNNENQMTGDVMHLIGNNKTQKL 411
Query: 401 DYVDVLTKALAVRRIDSVM---YDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRS 457
D + VL A +++ DS+ Y+Q+ G+++ DS ++++ V NAEVI Y + +
Sbjct: 412 DSLKVLNNAFIIQK-DSLSKNGYNQIKGQNLYGKFVDSKLKEVDVVKNAEVIYY-MYNDA 469
Query: 458 KRWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRP 517
+ +N+ I + EE ++ + YP K + ++L F W R
Sbjct: 470 NEFIGINKTLCSKINLELEENKINSITFFTKTDSNIYPEKEFPENARKLKGFLWRGDERI 529
Query: 518 KSKEDLF 524
SK+D+F
Sbjct: 530 LSKDDIF 536
>gi|126646936|ref|ZP_01719446.1| hypothetical protein ALPR1_19388 [Algoriphagus sp. PR1]
gi|126576984|gb|EAZ81232.1| hypothetical protein ALPR1_19388 [Algoriphagus sp. PR1]
Length = 524
Score = 216 bits (549), Expect = 5e-54, Method: Composition-based stats.
Identities = 136/457 (29%), Positives = 233/457 (50%), Gaps = 13/457 (2%)
Query: 66 QRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQVSM-QQGDTVSMFARYLHYDGNIKY 124
QRL+G+V ++H+ +++ CDSA+ Q N + FG V + Q D + + Y YDGN +
Sbjct: 23 QRLIGDVEMEHQSSLIYCDSAYFYQATNQAKLFGNVRIVDQEDPIQTTSSYAEYDGNTQL 82
Query: 125 ARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAI 184
A+LR V N+ TL+TD LDYDR N+ YYF G ++DS N LTS G Y +
Sbjct: 83 AKLRTNVVFTNQETTLYTDYLDYDRAGNIAYYFNDGRVIDSANVLTSEKGRYDVSIERIT 142
Query: 185 FRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTR--GVYDSNTDVGI 242
F+++V L N DYT+ T +L Y T K + G T + S G + + YD+
Sbjct: 143 FQNDVVLVNPDYTLRTNDLVYMTIPKTAETKGLTNLVSKEGNTLDAQKGSFYDTQNKQFR 202
Query: 243 LLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDY 302
D + ++ K + +FYD G+ E ++ + + ++GE G Y E++ +
Sbjct: 203 FFDGIVETETSRVKAI---ELFYDENLGYYEGKEDVRMLNKEREIEIFGEVGKYWEEEKH 259
Query: 303 AFATQRSYMIDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSM 362
+ + + + + DTL+ AD+L I+ R + + + +R V + + D+ ADS+
Sbjct: 260 SLIYGNALVRKYFEADTLYMTADSL--ISYDREEDSLKYLQAFRDVNLVKADLSGKADSL 317
Query: 363 QYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDSLDYVDVLTKALAVRRIDSVMYDQ 422
Y+ DS ++LY P+MWN+ SQ+S D++ F N+ L+ V + A + + + ++Q
Sbjct: 318 VYNYNDSSIHLYQEPVMWNQKSQISADSMTFFIANEVLERVFLKDNAFIITQDTILNFNQ 377
Query: 423 LAGRHIRAYMQDSLVRQIQVHGNAEVIQY--EQHKRSKRWYLMNRIEAPSIIADFEEGQL 480
+ GR + Y D + ++ + GN E + + E S+ +NR + +I F++G +
Sbjct: 378 MKGRKMTGYFSDGQISKMDIEGNGESLYFVLESDTISQG---VNRTLSATIQLKFKDGAI 434
Query: 481 KKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRP 517
+V G+ P + + D RL F W RP
Sbjct: 435 NRVNYGVKPDGRFIPSQRIDNDNSRLPGFSWRFDERP 471
>gi|86143092|ref|ZP_01061514.1| hypothetical protein MED217_10617 [Flavobacterium sp. MED217]
gi|85830537|gb|EAQ48996.1| hypothetical protein MED217_10617 [Leeuwenhoekiella blandensis
MED217]
Length = 609
Score = 209 bits (532), Expect = 5e-52, Method: Composition-based stats.
Identities = 135/463 (29%), Positives = 236/463 (50%), Gaps = 16/463 (3%)
Query: 72 VVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEV 131
V I+HEG M CD A + EN A G V M+QGDT+SM + Y Y+G+ ++A V
Sbjct: 55 VYIQHEGIEMWCDLAFHYKAENFVRAIGNVRMKQGDTISMRSNYAEYNGDTQFAWASGGV 114
Query: 132 RLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHL 191
L +ATL TD+L ++R+ YY GG++ D+ + + S G Y F NV +
Sbjct: 115 NLRTPTATLDTDTLYFNRIKQQAYYRTGGTLRDTASVIESKIGRYYLDQDKYSFIQNVVV 174
Query: 192 ENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYS 251
N +Y +++++L + +++ + + GP+ + S++ + RG YD+ D G + S +
Sbjct: 175 TNPEYVINSDQLDFYSESGAAFLYGPSTITSETSTVYCERGYYDTRNDNGYFVKNSRIDY 234
Query: 252 SNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYM 311
N + + GDS+++DR T F A N+ + DT N S + G Y + KD F T+R+
Sbjct: 235 DN--RTVYGDSLYFDRPTSFASATNNIRVIDTANNSVIKGHYAEVFKDKDSVFITKRAVA 292
Query: 312 IDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSLL 371
I D+++ D L + VPE +RI RGY +VR++++D+ +DS+ + + L
Sbjct: 293 ITVQDQDSIYVHGDRLVVTG---VPE-KRIVRGYYNVRLFKSDMSGKSDSIHINQANGLT 348
Query: 372 YLYDNPIMWNEDSQLSGDTIRFK--FRNDSLDYVDVLTKALAVRRIDSVMYD-------Q 422
L P++W+ SQ++GD+I + + + LD + V A V++ ++D Q
Sbjct: 349 QLIGRPVLWSGLSQMTGDSIHLQSNTKTEQLDSLHVFDNAFLVQQDTIPLFDQEKSGFNQ 408
Query: 423 LAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKK 482
+ G + QD+ + QI + NAE I Y + + +++ ++ SI A E L
Sbjct: 409 VKGDVLYGTFQDNALHQIDIIKNAESINYMRADDGE-LQGIDKSKSASIRAILENNALVT 467
Query: 483 VLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFR 525
+ G+ +P+ + + W R +KED+F+
Sbjct: 468 ITKFKQVDGEVFPLSKFPKESREFEGLVWRGDERLLTKEDIFK 510
>gi|86132235|ref|ZP_01050830.1| hypothetical protein MED134_03459 [Cellulophaga sp. MED134]
gi|85817154|gb|EAQ38337.1| hypothetical protein MED134_03459 [Dokdonia donghaensis MED134]
Length = 611
Score = 206 bits (525), Expect = 3e-51, Method: Composition-based stats.
Identities = 130/457 (28%), Positives = 235/457 (51%), Gaps = 10/457 (2%)
Query: 72 VVIKHEGAVMRCDSAHLNQEENTFEAFGQVSMQQGDTVSMFARYLHYDGNIKYARLRHEV 131
V I+HEGA M CD A +EEN +A+ V ++QGD+VSM ++Y+ Y+G K+A +V
Sbjct: 57 VYIEHEGAEMWCDLAFFYKEENFVKAYRNVRLKQGDSVSMRSKYIEYNGKTKFAYAAGDV 116
Query: 132 RLENRSATLFTDSLDYDRVMNLGYYFEGGSIVDSLNTLTSSYGEYSPTTSDAIFRDNVHL 191
L+ + T+ TD++ ++R YY GG + + +TS G Y F +V +
Sbjct: 117 FLKKDTTTVTTDTMYFNRTTQQAYYRTGGVVTSPNSKITSRVGRYYIEQDKISFISDVVV 176
Query: 192 ENKDYTMDTEELHYNTDTKISHILGPTEMRSDSGYIVSTRGVYDSNTDVGILLDRSIVYS 251
+N +YT+++E+L + + + +++ GPT + S + + RG YD+ D G + S +
Sbjct: 177 KNPEYTINSEQLDFYSVPEHAYLYGPTTITSKTSKVYCERGFYDTANDYGYFVKNSRIDY 236
Query: 252 SNGAKQLTGDSIFYDRRTGFGEAFGNMILTDTVNRSSLYGEYGYYDEKKDYAFATQRSYM 311
N +Q+ GDS+++DR F A N+ + DT+NRS + G Y KD TQR+
Sbjct: 237 DN--RQVYGDSLYFDRNRNFASATNNIKVLDTLNRSLVKGHYAEVYRAKDSVLITQRAVA 294
Query: 312 IDFSKPDTLWAAADTLEMITQRRVPEDRRIARGYRHVRVYRTDVQAIADSMQYDSRDSLL 371
I D+++ D L + + D RI R +++V++Y++++ +DS+ + R L
Sbjct: 295 ITVEDNDSVYVHGDKLLLTGK----PDNRILRAFKNVKLYKSNMSGKSDSLHSNQRTGLT 350
Query: 372 YLYDNPIMWNEDSQLSGDTIRF--KFRNDSLDYVDVLTKALAVRRIDSVMYDQLAGRHIR 429
+ PI+W+E+SQ++GD+I + +D + V A ++ ++Q+ G+ +
Sbjct: 351 QMIRKPILWSEESQITGDSIHLISNTETEKIDSLKVFNNAFIAQKDTISGFNQIKGQKLY 410
Query: 430 AYMQD-SLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIADFEEGQLKKVLLRGV 488
+ D + ++Q+ + NAE I Y + ++R ++ I F E + ++
Sbjct: 411 GFFDDENALKQVDIINNAETIMYMREDNGD-LTGIDRGKSARIEITFFENTIDEINKIKS 469
Query: 489 ASGKGYPIKMLTPDLQRLASFRWEEAVRPKSKEDLFR 525
G +P + Q F W R SKED+F+
Sbjct: 470 PGGNIFPESQFLDEPQTFEGFNWRGDERLLSKEDIFK 506
>gi|149280297|ref|ZP_01886419.1| hypothetical protein PBAL39_08851 [Pedobacter sp. BAL39]
gi|149228986|gb|EDM34383.1| hypothetical protein PBAL39_08851 [Pedobacter sp. BAL39]
Length = 802
Score = 166 bits (420), Expect = 4e-39, Method: Composition-based stats.
Identities = 97/307 (31%), Positives = 162/307 (52%), Gaps = 17/307 (5%)
Query: 41 SKGKTHVILEHADELRYDRLYNPDVQRLLGNVVIKHEGAVMRCDSAHLNQEENTFEAFGQ 100
++ KT +IL+ + D N R N V + + A + CDSA + N F+AF
Sbjct: 4 AQKKTKIILQSSQRATIDAKANISYLR---NPVFRQDNATLACDSAVFYESRNVFDAFDN 60
Query: 101 VSMQQGDTVSMFARYLHYDGNIKYARLRHEVRLENRSATLFTDSLDYDRVMNLGYYFEGG 160
V + Q DT++++++ L YDGN K A L V++ ++ + L T+ LDY+ +G Y EGG
Sbjct: 61 VHINQADTINIYSKRLTYDGNTKNAHLTQNVKMIDKESILTTEVLDYNLGTKIGTYVEGG 120
Query: 161 SIVDSLNTLTSSYGEYSPTTSDAIFRDNVHLENKDYTMDTEELHYNTDTKISHILGPTEM 220
IV+ TLTS G Y + DA FR NV + + ++ L YNT T ++ GPT +
Sbjct: 121 KIVNKDVTLTSKNGYYFSNSRDAYFRYNVVVVTPQTVIKSDTLRYNTLTNWTYFYGPTNI 180
Query: 221 RSDSGYIVSTRGVYDSNTDVGILLDRSIVYSSNGAKQLTGDSIFYDRRTGFGEAFGNMIL 280
+ + + G Y++ T +++ + G+K L GDS++YD G+G+A N++
Sbjct: 181 KGKDDNLYTENGAYNTKTQYAYFGKKNLY--TQGSKSLKGDSLYYDGVAGYGKAVKNIVF 238
Query: 281 TDTVNRSSLYGEYGYYDEKKDYAFATQRSYM----------IDFSKPDTLWAAADTLE-- 328
DT +++ +YG+ G+Y + T+ Y+ + +PD+LW ADTLE
Sbjct: 239 RDTTDKTVMYGQLGFYYKIDQRTIVTKNPYIGLGTSDSVTVNNKLQPDSLWMGADTLETQ 298
Query: 329 MITQRRV 335
M+ Q+ +
Sbjct: 299 MVLQKSL 305
Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats.
Identities = 60/185 (32%), Positives = 105/185 (56%), Gaps = 2/185 (1%)
Query: 340 RIARGYRHVRVYRTDVQAIADSMQYDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFRNDS 399
R+ + Y HVRV+++++QA ADS+ Y S DS L Y +PI+W E SQ +GDTI + ++ +
Sbjct: 518 RVIKAYHHVRVFKSNMQARADSLFYTSADSTLRWYGSPILWAEGSQQTGDTIYLRLKDKT 577
Query: 400 LDYVDVLTKALAVR-RIDSVMYDQLAGRHIRAYMQDSLVRQIQVHGNAEVIQYEQHKRSK 458
+ V+ K V DS+ Y+Q+ G+ I + ++ + ++ V GNAE I + + +
Sbjct: 578 IRSSQVIEKGFLVNVNADSLRYNQIKGKLITGFFENGKLNRMFVDGNAESIYFNKDEDKN 637
Query: 459 RWYLMNRIEAPSIIADFEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRPK 518
+ MN+ + I F+E ++ +++ GK PI L D+ L F W+ +RP
Sbjct: 638 IYTEMNQTVSSRIKILFKEKEIDQIITIKDPEGKRTPIPELKEDV-FLTGFTWKPELRPL 696
Query: 519 SKEDL 523
SK+++
Sbjct: 697 SKKEV 701
>gi|21674905|ref|NP_662970.1| hypothetical protein CT2096 [Chlorobium tepidum TLS]
gi|21648132|gb|AAM73312.1| hypothetical protein CT2096 [Chlorobium tepidum TLS]
Length = 307
Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats.
Identities = 48/163 (29%), Positives = 77/163 (47%), Gaps = 10/163 (6%)
Query: 364 YDSRDSLLYLYDNPIMWNEDSQLSGDTIRFKFR----NDSLDYVDVLTKA-LAVRRIDS- 417
+D + L+L+D+ + W QLSGD+IR F +D + V A LAVR S
Sbjct: 142 FDQHKNELWLFDDAVAWQLGRQLSGDSIRVHFHEVGGKKKVDEIQVFGHAFLAVRDTLSA 201
Query: 418 --VMYDQLAGRHIRAYMQD-SLVRQIQVHGNAEVIQYEQHKRSKRWYLMNRIEAPSIIAD 474
++DQL+G+ + A + D S ++++ G A + Y + + +N I
Sbjct: 202 SPALHDQLSGKKLTANLDDNSRLQKVIAIGKARSL-YHIYDDKNQPSGVNFTSGERIRMF 260
Query: 475 FEEGQLKKVLLRGVASGKGYPIKMLTPDLQRLASFRWEEAVRP 517
F EG+L ++L+ G GK YP M L FR + +P
Sbjct: 261 FAEGKLDRILVTGGPLGKEYPNYMRNDPEINLPGFRLRDKEKP 303
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.319 0.135 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,322,151,014
Number of Sequences: 5470121
Number of extensions: 98236382
Number of successful extensions: 239792
Number of sequences better than 1.0e-05: 28
Number of HSP's better than 0.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 239536
Number of HSP's gapped (non-prelim): 30
length of query: 626
length of database: 1,894,087,724
effective HSP length: 139
effective length of query: 487
effective length of database: 1,133,740,905
effective search space: 552131820735
effective search space used: 552131820735
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 133 (55.8 bits)