BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= TF0813 
         (475 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|150008515|ref|YP_001303258.1|  putative secreted glycosyl...   693   0.0  
gi|154493862|ref|ZP_02033182.1|  hypothetical protein PARMER...   652   0.0  
gi|150009061|ref|YP_001303804.1|  hypothetical protein BDI_2...   651   0.0  
gi|88711878|ref|ZP_01105966.1|  probable secreted glycosyl h...   252   3e-65
gi|88804756|ref|ZP_01120276.1|  probable secreted glycosyl h...   249   3e-64
gi|126648755|ref|ZP_01721239.1|  probable secreted glycosyl ...   241   8e-62
gi|126648754|ref|ZP_01721238.1|  probable secreted glycosyl ...   228   6e-58
gi|150008483|ref|YP_001303226.1|  hypothetical protein BDI_1...   223   3e-56
gi|88804436|ref|ZP_01119956.1|  probable secreted glycosyl h...   182   3e-44
gi|88712822|ref|ZP_01106907.1|  hypothetical protein FB2170_...   177   2e-42
gi|150008445|ref|YP_001303188.1|  hypothetical protein BDI_1...   170   2e-40
gi|154494235|ref|ZP_02033555.1|  hypothetical protein PARMER...   159   5e-37
gi|149197990|ref|ZP_01875038.1|  probable secreted glycosyl ...   150   2e-34
gi|149173077|ref|ZP_01851708.1|  protein up-regulated by thy...   150   3e-34
gi|83815548|ref|YP_446631.1|  probable secreted glycosyl hyd...   147   1e-33
gi|88712915|ref|ZP_01107000.1|  hypothetical protein FB2170_...   144   1e-32
gi|126646853|ref|ZP_01719363.1|  probable secreted glycosyl ...   139   3e-31
gi|32473821|ref|NP_866815.1|  probable secreted glycosyl hyd...   134   1e-29
gi|120435377|ref|YP_861063.1|  conserved hypothetical protei...   134   2e-29
gi|149197739|ref|ZP_01874789.1|  probable secreted glycosyl ...   132   4e-29
gi|149278985|ref|ZP_01885119.1|  hypothetical protein PBAL39...   132   7e-29
gi|86143589|ref|ZP_01061974.1|  hypothetical protein MED217_...   130   2e-28
gi|86143701|ref|ZP_01062077.1|  probable secreted glycosyl h...   129   6e-28
gi|154493991|ref|ZP_02033311.1|  hypothetical protein PARMER...   127   1e-27
gi|150009463|ref|YP_001304206.1|  hypothetical protein BDI_2...   126   3e-27
gi|150007837|ref|YP_001302580.1|  hypothetical protein BDI_1...   125   8e-27
gi|149196977|ref|ZP_01874030.1|  probable secreted glycosyl ...   124   1e-26
gi|88803973|ref|ZP_01119493.1|  hypothetical protein RB2501_...   123   3e-26
gi|88804319|ref|ZP_01119839.1|  probable secreted glycosyl h...   121   8e-26
gi|150009244|ref|YP_001303987.1|  hypothetical protein BDI_2...   120   2e-25
gi|154492730|ref|ZP_02032356.1|  hypothetical protein PARMER...   120   2e-25
gi|149276529|ref|ZP_01882673.1|  hypothetical protein PBAL39...   119   3e-25
gi|29347567|ref|NP_811070.1|  hypothetical protein BT_2157 [...   118   8e-25
gi|156110542|gb|EDO12287.1|  hypothetical protein BACOVA_021...   118   1e-24
gi|86131181|ref|ZP_01049780.1|  hypothetical protein MED134_...   117   1e-24
gi|149177844|ref|ZP_01856443.1|  probable secreted glycosyl ...   117   1e-24
gi|29349855|ref|NP_813358.1|  hypothetical protein BT_4447 [...   117   1e-24
gi|126645050|ref|ZP_01717594.1|  hypothetical protein ALPR1_...   117   2e-24
gi|153808737|ref|ZP_01961405.1|  hypothetical protein BACCAC...   117   2e-24
gi|88712851|ref|ZP_01106936.1|  probable secreted glycosyl h...   116   3e-24
gi|156112295|gb|EDO14040.1|  hypothetical protein BACOVA_002...   116   3e-24
gi|156109541|gb|EDO11286.1|  hypothetical protein BACOVA_031...   116   3e-24
gi|29348878|ref|NP_812381.1|  hypothetical protein BT_3469 [...   113   2e-23
gi|118073263|ref|ZP_01541446.1|  protein of unknown function...   112   5e-23
gi|149175127|ref|ZP_01853750.1|  probable secreted glycosyl ...   111   1e-22
gi|149174297|ref|ZP_01852924.1|  hypothetical protein PM8797...   110   2e-22
gi|116624768|ref|YP_826924.1|  protein of unknown function D...   109   3e-22
gi|156861848|gb|EDO55279.1|  hypothetical protein BACUNI_009...   106   3e-21
gi|116625197|ref|YP_827353.1|  protein of unknown function D...   105   7e-21
gi|149177059|ref|ZP_01855667.1|  hypothetical oxidoreductase...   105   7e-21
gi|149178817|ref|ZP_01857398.1|  hypothetical protein PM8797...   104   1e-20
gi|88713768|ref|ZP_01107849.1|  hypothetical protein FB2170_...   104   2e-20
gi|116621745|ref|YP_823901.1|  protein of unknown function D...   100   2e-19
gi|32472633|ref|NP_865627.1|  probable secreted glycosyl hyd...   100   4e-19
gi|32475761|ref|NP_868755.1|  probable secreted glycosyl hyd...    99   6e-19
gi|87311549|ref|ZP_01093668.1|  hypothetical protein DSM3645...    99   6e-19
gi|87308076|ref|ZP_01090218.1|  hypothetical protein DSM3645...    98   1e-18
gi|32470813|ref|NP_863806.1|  probable secreted glycosyl hyd...    98   2e-18
gi|149195996|ref|ZP_01873052.1|  hypothetical protein LNTAR_...    97   2e-18
gi|87308201|ref|ZP_01090343.1|  probable secreted glycosyl h...    97   3e-18
gi|149179157|ref|ZP_01857726.1|  hypothetical protein PM8797...    94   1e-17
gi|87308956|ref|ZP_01091094.1|  hypothetical protein DSM3645...    91   2e-16
gi|32471625|ref|NP_864618.1|  hypothetical protein RB1854 [R...    90   4e-16
gi|149199908|ref|ZP_01876936.1|  hypothetical protein LNTAR_...    88   1e-15
gi|149197913|ref|ZP_01874962.1|  hypothetical protein LNTAR_...    88   1e-15
gi|32471039|ref|NP_864032.1|  N-acetyl-galactosamine-6-sulfa...    87   3e-15
gi|87308660|ref|ZP_01090800.1|  probable protein kinase yloP...    85   1e-14
gi|32472822|ref|NP_865816.1|  hypothetical protein RB3944 [R...    84   2e-14
gi|149173679|ref|ZP_01852308.1|  hypothetical protein PM8797...    82   6e-14
gi|149174041|ref|ZP_01852669.1|  probable secreted glycosyl ...    80   3e-13
gi|87311427|ref|ZP_01093547.1|  hypothetical protein DSM3645...    77   3e-12
gi|88712821|ref|ZP_01106906.1|  hypothetical protein FB2170_...    75   7e-12
gi|149179308|ref|ZP_01857869.1|  hypothetical protein PM8797...    75   1e-11
gi|32476109|ref|NP_869103.1|  hypothetical protein RB9849 [R...    75   1e-11
gi|32474668|ref|NP_867662.1|  probable secreted glycosyl hyd...    74   2e-11
gi|149200234|ref|ZP_01877256.1|  hypothetical protein LNTAR_...    73   4e-11
gi|149177017|ref|ZP_01855625.1|  hypothetical protein PM8797...    73   5e-11
gi|126648469|ref|ZP_01720956.1|  hypothetical protein ALPR1_...    72   1e-10
gi|150007795|ref|YP_001302538.1|  hypothetical protein BDI_1...    70   3e-10
gi|154490163|ref|ZP_02030424.1|  hypothetical protein PARMER...    69   6e-10
gi|88713612|ref|ZP_01107694.1|  probable large multifunction...    66   4e-09
gi|116626562|ref|YP_828718.1|  protein of unknown function D...    66   5e-09
gi|116619831|ref|YP_821987.1|  protein of unknown function D...    65   7e-09
gi|87310208|ref|ZP_01092340.1|  hypothetical protein DSM3645...    65   9e-09
gi|87309043|ref|ZP_01091181.1|  hypothetical protein DSM3645...    65   9e-09
gi|150007144|ref|YP_001301887.1|  putative secreted glycosyl...    65   9e-09
gi|146280883|ref|YP_001171036.1|  hypothetical protein PST_0...    65   1e-08
gi|32473471|ref|NP_866465.1|  hypothetical protein-transmemb...    65   1e-08
gi|149177681|ref|ZP_01856282.1|  sulfatase [Planctomyces mar...    65   1e-08
gi|148253179|ref|YP_001237764.1|  putative exported protein ...    64   2e-08
gi|117164776|emb|CAJ88325.1|  putative secreted glycosyl hyd...    64   2e-08
gi|116625196|ref|YP_827352.1|  protein of unknown function D...    64   2e-08
gi|146342960|ref|YP_001208008.1|  hypothetical protein BRADO...    64   2e-08
gi|88713788|ref|ZP_01107869.1|  hypothetical protein FB2170_...    64   3e-08
gi|126648719|ref|ZP_01721203.1|  probable secreted glycosyl ...    63   3e-08
gi|32471313|ref|NP_864306.1|  probable secreted glycosyl hyd...    63   4e-08
gi|116626769|ref|YP_828925.1|  protein of unknown function D...    63   4e-08
gi|109897209|ref|YP_660464.1|  protein of unknown function D...    63   5e-08
gi|149196057|ref|ZP_01873113.1|  hypothetical protein LNTAR_...    62   6e-08
gi|87308845|ref|ZP_01090984.1|  hypothetical protein DSM3645...    62   7e-08
gi|87310061|ref|ZP_01092194.1|  serine/threonine protein kin...    61   1e-07
gi|116619940|ref|YP_822096.1|  protein of unknown function D...    60   3e-07
gi|32474367|ref|NP_867361.1|  probable secreted glycosyl hyd...    60   4e-07
gi|149177640|ref|ZP_01856241.1|  hypothetical protein PM8797...    60   4e-07
gi|145589295|ref|YP_001155892.1|  protein of unknown functio...    60   5e-07
gi|21224873|ref|NP_630652.1|  glycosyl hydrolase (secreted p...    59   5e-07
gi|116624007|ref|YP_826163.1|  protein of unknown function D...    59   9e-07
gi|87310236|ref|ZP_01092367.1|  hypothetical protein DSM3645...    59   9e-07
gi|149198757|ref|ZP_01875800.1|  probable secreted glycosyl ...    58   1e-06
gi|116621620|ref|YP_823776.1|  protein of unknown function D...    58   1e-06
gi|94969351|ref|YP_591399.1|  protein of unknown function DU...    58   2e-06
gi|21225488|ref|NP_631267.1|  secreted glycosyl hydrolase [S...    57   2e-06
gi|32471058|ref|NP_864051.1|  conserved hypothetical protein...    57   3e-06
gi|32470990|ref|NP_863983.1|  conserved hypothetical protein...    56   7e-06
gi|149196925|ref|ZP_01873978.1|  hypothetical protein LNTAR_...    55   1e-05
>gi|150008515|ref|YP_001303258.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
           ATCC 8503]
 gi|149936939|gb|ABR43636.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
           ATCC 8503]
          Length = 422

 Score =  693 bits (1789), Expect = 0.0,   Method: Composition-based stats.
 Identities = 330/408 (80%), Positives = 363/408 (88%), Gaps = 1/408 (0%)

Query: 69  ALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGS 128
            L AC     N LTE+EKA GW+LLFDG+TL+GWRDYNG ALTGPWEVV+G IQADG+GS
Sbjct: 15  GLIACDNTKHNTLTEQEKAEGWELLFDGETLDGWRDYNGTALTGPWEVVNGTIQADGQGS 74

Query: 129 DENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA 188
           D +GYIV D+ YENFEL WDWKISKGGNSGLLYHVVERPQ+ VPYVTGPEYQLIDD  FA
Sbjct: 75  DASGYIVTDKAYENFELSWDWKISKGGNSGLLYHVVERPQFPVPYVTGPEYQLIDDINFA 134

Query: 189 EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
           EPLEDWQRCGVDYAMYLPDF T+ V PAGEWN SKI+FDNGHV Y+MNG KT+EF+AWSD
Sbjct: 135 EPLEDWQRCGVDYAMYLPDFNTIKVHPAGEWNNSKIIFDNGHVTYFMNGHKTVEFDAWSD 194

Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEELFNGKDLT 308
           DWF RKNSGKW NAPEYGLA KGLICLQDHGYPAWFRNIKI+ELPRKT E  LFNG+D+T
Sbjct: 195 DWFSRKNSGKWANAPEYGLAHKGLICLQDHGYPAWFRNIKIKELPRKTREARLFNGEDIT 254

Query: 309 GWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGNSGIFIRSF 368
            WD YGTE WYV+D LLVCESGPDKQYGYLAT +YY+DF+LT +FKQEADGNSG+FIRSF
Sbjct: 255 NWDKYGTELWYVKDGLLVCESGPDKQYGYLATREYYDDFDLTVEFKQEADGNSGVFIRSF 314

Query: 369 VEE-GAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGN 427
           VEE   KVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPD++EN LK+ EWNTMRIRV G+
Sbjct: 315 VEEKDVKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDEKENILKQGEWNTMRIRVQGD 374

Query: 428 QVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
            V TWLNGE+MV+I+DEKIGAGQGRIALQIHDGGGIKVLWRN+ ++TL
Sbjct: 375 NVQTWLNGEEMVNIRDEKIGAGQGRIALQIHDGGGIKVLWRNLHLQTL 422
>gi|154493862|ref|ZP_02033182.1| hypothetical protein PARMER_03206 [Parabacteroides merdae ATCC
           43184]
 gi|154086122|gb|EDN85167.1| hypothetical protein PARMER_03206 [Parabacteroides merdae ATCC
           43184]
          Length = 429

 Score =  652 bits (1683), Expect = 0.0,   Method: Composition-based stats.
 Identities = 308/416 (74%), Positives = 356/416 (85%), Gaps = 3/416 (0%)

Query: 60  CLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDG 119
           C   + LF   ++C+    N LT EE A GWQLLFDG+TL GW+DYNG  LT PW VVDG
Sbjct: 17  CAISLALF---SSCTSVEPNTLTPEEIADGWQLLFDGKTLNGWKDYNGTTLTQPWHVVDG 73

Query: 120 AIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEY 179
            IQA G+GSD +GYIV D+ YENFEL WDWK+SKGGNSG+LYHVVERPQ+ VPYVTGPEY
Sbjct: 74  CIQAKGDGSDASGYIVTDKQYENFELSWDWKLSKGGNSGMLYHVVERPQFAVPYVTGPEY 133

Query: 180 QLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQK 239
           QLID+  F EPLE+WQ+ GVDYAM+LPD A M V P GEWN SKIVFDNGHVE+++NG K
Sbjct: 134 QLIDEPNFPEPLEEWQKLGVDYAMHLPDKAKMKVNPQGEWNNSKIVFDNGHVEHWLNGVK 193

Query: 240 TIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEE 299
            +EFEAW+DDW+ +KNSGKW NAPEYGLA+KG++CLQDHGYPA FRNIKI+ELPRKT+E 
Sbjct: 194 ILEFEAWTDDWYAKKNSGKWANAPEYGLAKKGVLCLQDHGYPASFRNIKIKELPRKTKEV 253

Query: 300 ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
            LFNG DL GW+ YGTE+WYV+D LL+CESGPDK+YGYLAT  YY+DF+LT +FKQEADG
Sbjct: 254 TLFNGTDLKGWEAYGTEKWYVEDGLLICESGPDKKYGYLATRDYYDDFDLTVEFKQEADG 313

Query: 360 NSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNT 419
           NSG+FIRSF+EE  KVNGWQVEVAPKG DTGGIYESYGRGWLIQIPD++EN LKE +WNT
Sbjct: 314 NSGVFIRSFIEEDVKVNGWQVEVAPKGHDTGGIYESYGRGWLIQIPDEKENILKEGDWNT 373

Query: 420 MRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           MRI+V G+ V TWLNG++MV+I DEKIGAGQGRIALQIHDGGGIKVLWRN++VKTL
Sbjct: 374 MRIKVQGDNVQTWLNGQEMVNINDEKIGAGQGRIALQIHDGGGIKVLWRNLKVKTL 429
>gi|150009061|ref|YP_001303804.1| hypothetical protein BDI_2462 [Parabacteroides distasonis ATCC
           8503]
 gi|149937485|gb|ABR44182.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 430

 Score =  651 bits (1679), Expect = 0.0,   Method: Composition-based stats.
 Identities = 308/418 (73%), Positives = 360/418 (86%), Gaps = 4/418 (0%)

Query: 59  NCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVD 118
           +C   + LF   ++C+ + QN LT EE A GW LLFDG+TL+GW+DYNG  LT PW VVD
Sbjct: 16  SCAVSLALF---SSCASQEQNTLTPEEIADGWVLLFDGKTLDGWKDYNGTTLTQPWHVVD 72

Query: 119 GAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPE 178
           G IQA G+GSD +GYIV D+ YENFEL WDWK+SKGGNSG+LYHVVERPQY VPYVTGPE
Sbjct: 73  GCIQAKGDGSDASGYIVTDKEYENFELSWDWKLSKGGNSGMLYHVVERPQYAVPYVTGPE 132

Query: 179 YQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
           YQLID+  F EPLE+WQ+ GVDYAM+LPD + M V P GEWN SKIVFDNGHVE+++NGQ
Sbjct: 133 YQLIDEPNFPEPLEEWQKLGVDYAMHLPDKSKMKVNPQGEWNNSKIVFDNGHVEHWLNGQ 192

Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEE 298
           K +EFEAW+DDW  +KNSGKW NAPEYGLA+KG++CLQDHGYPA FRN+KI+ELPRKT +
Sbjct: 193 KIVEFEAWTDDWHAKKNSGKWANAPEYGLAKKGVLCLQDHGYPASFRNLKIKELPRKTGK 252

Query: 299 E-ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
           E  LFNG DLTGW+ YGTE+WYV+D LLVCESGPDKQYGYLAT  YY+DF+LT +FKQEA
Sbjct: 253 EVNLFNGVDLTGWEPYGTEKWYVKDGLLVCESGPDKQYGYLATRDYYDDFDLTVEFKQEA 312

Query: 358 DGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREW 417
           DGNSG+FIRSFVEEG KVNGWQVEVAPKG DTGGIYESYGRGWL+QIPD++EN LKE +W
Sbjct: 313 DGNSGVFIRSFVEEGVKVNGWQVEVAPKGHDTGGIYESYGRGWLVQIPDEKENILKEGDW 372

Query: 418 NTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           NTMRI+V G+ V TWLNG++MV++ DEKIGAG+GRIALQIHDGGGIKVLWRN+++ TL
Sbjct: 373 NTMRIKVQGDNVQTWLNGQEMVNLNDEKIGAGKGRIALQIHDGGGIKVLWRNLKLTTL 430
>gi|88711878|ref|ZP_01105966.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
           HTCC2170]
 gi|88710819|gb|EAR03051.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
           HTCC2170]
          Length = 200

 Score =  252 bits (644), Expect = 3e-65,   Method: Composition-based stats.
 Identities = 119/176 (67%), Positives = 145/176 (82%), Gaps = 1/176 (0%)

Query: 297 EEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQE 356
           +E+ LFNG+DLTGW +YGTE+W+V+D LLVCESGPD QYGYLAT ++Y DF L  +FKQE
Sbjct: 22  QEKSLFNGEDLTGWTIYGTEKWFVEDGLLVCESGPDAQYGYLATKEHYKDFTLILEFKQE 81

Query: 357 ADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
           A+GNSG+FIRS V+ G KV+GWQVEVAP G  TGG+YESYGRGWLI+    ++  LK  E
Sbjct: 82  ANGNSGVFIRSTVD-GTKVSGWQVEVAPPGHSTGGVYESYGRGWLIKPDPAKDKALKMGE 140

Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
           WN M+IRV G+++T+W+NGE+MV I D KIG G+G IALQIHDGGGIKV WRNIRV
Sbjct: 141 WNEMKIRVYGSKLTSWVNGEEMVTINDAKIGTGEGSIALQIHDGGGIKVKWRNIRV 196
>gi|88804756|ref|ZP_01120276.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
           HTCC2501]
 gi|88785635|gb|EAR16804.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
           HTCC2501]
          Length = 196

 Score =  249 bits (636), Expect = 3e-64,   Method: Composition-based stats.
 Identities = 122/180 (67%), Positives = 146/180 (81%), Gaps = 2/180 (1%)

Query: 293 PRKTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTAD 352
           P   +E ELFNG+DL+GW VYGTE+WYV+D LLVCESGPDK YGYLAT K+Y DF LT +
Sbjct: 15  PLAAQETELFNGEDLSGWTVYGTEKWYVEDGLLVCESGPDKGYGYLATDKHYKDFVLTLE 74

Query: 353 FKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFL 412
           F QE+DGNSG+FIRS V +G KV+GWQVEVAP G DTGG+YESYGRGWLI+ P+  +  +
Sbjct: 75  FLQESDGNSGVFIRSTV-DGTKVSGWQVEVAPPGHDTGGVYESYGRGWLIK-PEAGKPDV 132

Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
           K  EWNTM+I V G+ +T+WLNG +M+ + DEKIG G+G IALQIHDGGGIKV WRNI V
Sbjct: 133 KMGEWNTMKIMVSGDTITSWLNGTEMITLTDEKIGQGEGSIALQIHDGGGIKVKWRNIVV 192
>gi|126648755|ref|ZP_01721239.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
 gi|126575206|gb|EAZ79556.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
          Length = 198

 Score =  241 bits (615), Expect = 8e-62,   Method: Composition-based stats.
 Identities = 116/179 (64%), Positives = 147/179 (82%), Gaps = 1/179 (0%)

Query: 297 EEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQE 356
           ++E+LFNG+DLTGW +YGTE+WYV+D LL+ ESGPDK YGYL T ++Y+DFE+T +FKQE
Sbjct: 21  KKEKLFNGEDLTGWTIYGTEKWYVEDGLLISESGPDKGYGYLGTNEHYDDFEITLEFKQE 80

Query: 357 ADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
           A+GNSG+FIRS V+ G KV+GWQVEVAP G DTGGIYESYGRGWLI+   +++ +LK  +
Sbjct: 81  ANGNSGVFIRSTVD-GTKVSGWQVEVAPPGHDTGGIYESYGRGWLIKPDPEKDKYLKFGK 139

Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           WN MRI V G+ VT++LNG +MV+  D KIG G+G I LQIHDGGGIKV W+NI +K L
Sbjct: 140 WNKMRIVVKGDNVTSYLNGHEMVNFTDAKIGEGKGGICLQIHDGGGIKVYWKNIVLKKL 198
>gi|126648754|ref|ZP_01721238.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
 gi|126575205|gb|EAZ79555.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
          Length = 254

 Score =  228 bits (582), Expect = 6e-58,   Method: Composition-based stats.
 Identities = 122/217 (56%), Positives = 153/217 (70%), Gaps = 2/217 (0%)

Query: 77  PQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVY 136
           P N LTEEEKA GW LLFDG    GWR +NG      W + DGA++A G+G D  G IV+
Sbjct: 39  PDNTLTEEEKATGWMLLFDGSDPSGWRAFNGDTFPEGWTIEDGALKALGKGGDIGGDIVF 98

Query: 137 D-RIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQ 195
               +E FE++WDWKIS+GGNSG+LYHVVE P+Y  PY TGPEYQ+ID  GF EPLE WQ
Sbjct: 99  GPMEFEEFEMEWDWKISEGGNSGVLYHVVEDPKYHAPYETGPEYQVIDQLGFPEPLEKWQ 158

Query: 196 RCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKN 255
             G DYAM  PD+    V+PAGEWN SKI+F      Y++NG+KT+EF  +S++W   +N
Sbjct: 159 SIGADYAMTEPDYEGA-VKPAGEWNHSKIIFSEEGSSYWLNGKKTVEFVPYSEEWTAARN 217

Query: 256 SGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           SGKW + P+Y +A+ GLI LQDHG   WF+NIKI++L
Sbjct: 218 SGKWNDFPDYAIAKTGLISLQDHGAVTWFKNIKIKKL 254
>gi|150008483|ref|YP_001303226.1| hypothetical protein BDI_1869 [Parabacteroides distasonis ATCC
           8503]
 gi|149936907|gb|ABR43604.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 258

 Score =  223 bits (567), Expect = 3e-56,   Method: Composition-based stats.
 Identities = 114/244 (46%), Positives = 159/244 (65%), Gaps = 18/244 (7%)

Query: 64  MVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQA 123
           MV  G+ ++ +E   NVLTEEEKA G+ LLF+G+   GW+ +NG  + G W+V DG I  
Sbjct: 17  MVACGSRSSSTEVKDNVLTEEEKAEGYTLLFNGKDFTGWKMFNGGDVKG-WQVEDGVIVG 75

Query: 124 DGEGSDE--------NGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVT 175
            G G D         +  IV  + Y NF+++WDWKI   GNSG LYHV E P+YK P+ T
Sbjct: 76  YGNGGDVIADTTIKVSTDIVTVKNYHNFQIKWDWKIGAQGNSGFLYHVQEGPKYKAPFET 135

Query: 176 GPEYQLIDDKGF-------AEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDN 228
           GPEYQLIDD  +        E LEDWQ+ G +YAMY+P+  T  V P GEWN+S +++ +
Sbjct: 136 GPEYQLIDDDNYPWVSETGKEGLEDWQKTGCNYAMYVPE--TKQVNPPGEWNSSMVLYKD 193

Query: 229 GHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIK 288
           G+VE+++NG+K   F+  S+DW  R+ SGKWE  P+YG++  G +C QDHG   +F+N+K
Sbjct: 194 GYVEHWLNGEKLFSFQEGSEDWKMRRYSGKWEAFPDYGISTTGKLCFQDHGSKVYFKNVK 253

Query: 289 IREL 292
           I++L
Sbjct: 254 IKDL 257
>gi|88804436|ref|ZP_01119956.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
           HTCC2501]
 gi|88785315|gb|EAR16484.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
           HTCC2501]
          Length = 264

 Score =  182 bits (463), Expect = 3e-44,   Method: Composition-based stats.
 Identities = 100/234 (42%), Positives = 132/234 (56%), Gaps = 31/234 (13%)

Query: 85  EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDG--------AIQADGEGSDENGYIVY 136
           E    WQ LFDG +LEGWR YN + +   W + D          ++ D EG  +  Y   
Sbjct: 36  EDTGEWQYLFDGTSLEGWRGYNAETMPPGWVIEDSVLTFKTELGLEQDYEGGKDILYAAE 95

Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAE------- 189
           +  ++NFEL  +WK+ +GGNSG+ YHV E   Y  P V  PEYQLIDD+ +A        
Sbjct: 96  E--FDNFELYLEWKLPEGGNSGIFYHVKE--GYDGPPVVAPEYQLIDDENYARIHDLTEY 151

Query: 190 -----------PLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
                       L+  Q+ G DYAM+ PD     + P GEWN+SKIVF    VE+++NG+
Sbjct: 152 NLSLGYTENPNELKPLQQTGADYAMHPPD-PGKTLHPVGEWNSSKIVFTPERVEHWLNGE 210

Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
             + F  W D W ++KNS KW+N+P YG  +KG I LQDH  P WFRNIKIR+L
Sbjct: 211 MILSFVPWDDAWEEKKNSDKWKNSPAYGTFKKGYIALQDHASPIWFRNIKIRKL 264
>gi|88712822|ref|ZP_01106907.1| hypothetical protein FB2170_09296 [Flavobacteriales bacterium
           HTCC2170]
 gi|88708720|gb|EAR00955.1| hypothetical protein FB2170_09296 [Flavobacteriales bacterium
           HTCC2170]
          Length = 263

 Score =  177 bits (448), Expect = 2e-42,   Method: Composition-based stats.
 Identities = 99/241 (41%), Positives = 135/241 (56%), Gaps = 26/241 (10%)

Query: 76  KPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY-- 133
           K +  +  + KA  W  LFDG + EGWR YNG+AL   W   DGA+  D E   E  Y  
Sbjct: 25  KSETGIEAQVKANDWITLFDGVSTEGWRAYNGKALPPGWVAKDGALTFDTELGLEQDYKG 84

Query: 134 ---IVYD-RIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA- 188
              I+Y    ++NFE   +WK+ +GGNSG+ YH+ E   Y  P    PEYQLIDD+ +A 
Sbjct: 85  GKDIIYGAEEFDNFEFYVEWKLPEGGNSGIFYHLKE--GYNSPPEVSPEYQLIDDENYAR 142

Query: 189 -----------------EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHV 231
                            E L+  Q+   DYAM+  +     + P GEWN+SKIVF    V
Sbjct: 143 IHDLTEYNLSLGYTEKPEELKPLQQTASDYAMHAANPEGKILHPVGEWNSSKIVFTPEKV 202

Query: 232 EYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRE 291
           E+++NG+  + F  WS+DW ++KNS KW+N+ +YG  + G I  QDH  P WFRNIKI++
Sbjct: 203 EHWLNGKMVLSFVPWSEDWHEKKNSDKWKNSEDYGKFKTGFIGFQDHSSPIWFRNIKIKK 262

Query: 292 L 292
           L
Sbjct: 263 L 263
>gi|150008445|ref|YP_001303188.1| hypothetical protein BDI_1827 [Parabacteroides distasonis ATCC
           8503]
 gi|149936869|gb|ABR43566.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 245

 Score =  170 bits (431), Expect = 2e-40,   Method: Composition-based stats.
 Identities = 99/247 (40%), Positives = 138/247 (55%), Gaps = 11/247 (4%)

Query: 54  MMKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP 113
           M K      ++        C  +  N LT++EK AGW+LLF+G+   GWR  NG A+   
Sbjct: 1   MKKLFKSFLILSAIAMAIPCFAQTPNTLTKKEKKAGWELLFNGKDFSGWRQCNGTAMPAN 60

Query: 114 WEVVDGAIQA-DGEGSD----ENGYIVY-DRIYENFELQWDWKISKGGNSGLLYHVVERP 167
           W + D A++   GEG       NG I+Y ++ ++NFEL  DWK SK GNSG+ Y+V E P
Sbjct: 61  WVIEDNAMKVFTGEGKKPGQGANGDILYQNKKFKNFELSVDWKASKMGNSGIFYYVREVP 120

Query: 168 QYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFD 227
              + Y   PE Q++D+    +        G  Y M   D  T+N  PAGEWNT  I   
Sbjct: 121 GKPI-YYAAPEVQVLDNVDATDNKLANHLAGSLYDMLPADPKTVN--PAGEWNTIVIRVK 177

Query: 228 NGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEY--GLARKGLICLQDHGYPAWFR 285
           +G V +  NG+K +E+  WS +W     + K++N P +  G++++G I LQDHGYP WFR
Sbjct: 178 DGKVTHTQNGKKVVEYTLWSKEWDDLVANSKFKNFPGFTEGISKEGYIGLQDHGYPIWFR 237

Query: 286 NIKIREL 292
           NIKIREL
Sbjct: 238 NIKIREL 244

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 61/215 (28%), Positives = 96/215 (44%), Gaps = 36/215 (16%)

Query: 294 RKTEEEELFNGKDLTGW-DVYGTE---QWYVQDSLLVCESGPDKQYG------YLATCKY 343
           +K   E LFNGKD +GW    GT     W ++D+ +   +G  K+ G       L   K 
Sbjct: 33  KKAGWELLFNGKDFSGWRQCNGTAMPANWVIEDNAMKVFTGEGKKPGQGANGDILYQNKK 92

Query: 344 YNDFELTADFKQEADGNSGIF--IRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWL 401
           + +FEL+ D+K    GNSGIF  +R    +       +V+V      T     ++  G L
Sbjct: 93  FKNFELSVDWKASKMGNSGIFYYVREVPGKPIYYAAPEVQVLDNVDATDNKLANHLAGSL 152

Query: 402 IQ-IPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD-----------IQDEKIG-- 447
              +P D +      EWNT+ IRV   +VT   NG+++V+           + + K    
Sbjct: 153 YDMLPADPKTVNPAGEWNTIVIRVKDGKVTHTQNGKKVVEYTLWSKEWDDLVANSKFKNF 212

Query: 448 -------AGQGRIALQIHDGGGIKVLWRNIRVKTL 475
                  + +G I LQ H   G  + +RNI+++ L
Sbjct: 213 PGFTEGISKEGYIGLQDH---GYPIWFRNIKIREL 244
>gi|154494235|ref|ZP_02033555.1| hypothetical protein PARMER_03585 [Parabacteroides merdae ATCC
           43184]
 gi|154086097|gb|EDN85142.1| hypothetical protein PARMER_03585 [Parabacteroides merdae ATCC
           43184]
          Length = 271

 Score =  159 bits (401), Expect = 5e-37,   Method: Composition-based stats.
 Identities = 97/261 (37%), Positives = 145/261 (55%), Gaps = 13/261 (4%)

Query: 42  FVEFFENFKIVFMMKCM--NCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTL 99
           F+    N K+   MK +  + + ++ +  A+ + ++K  N LTE+EK  GW LLF+G+  
Sbjct: 13  FIFSMSNLKMKCNMKNLFKSFVVLLAVMAAVPSFAQKANNTLTEKEKKQGWTLLFNGKDF 72

Query: 100 EGWRDYNGQALTGPWEVVDGAIQ---ADGE--GSDENGYIVY-DRIYENFELQWDWKISK 153
            GWR  N   +   W + D A++   A G+  G    G I+Y ++ ++NFEL  DWK SK
Sbjct: 73  TGWRQCNSTGMASNWVIEDEAMKVFTAPGKKPGHGAGGDILYKEKKFKNFELSVDWKTSK 132

Query: 154 GGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNV 213
            GNSG+ Y+V E P   + Y   PE Q++D+    +        G  Y M   D  T  V
Sbjct: 133 MGNSGIFYYVREVPGKPI-YYAAPEVQVLDNVDATDNKLANHLAGSLYDMLPADPKT--V 189

Query: 214 RPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEY--GLARKG 271
           +PAGEWNT  I   +G V +  NG+K +++  WS +W     + K+++   +  G++ +G
Sbjct: 190 KPAGEWNTIVIKVKDGKVTHTQNGKKVVQYTLWSKEWDDMVANSKFKDFQGFQEGISHEG 249

Query: 272 LICLQDHGYPAWFRNIKIREL 292
            I LQDHGYP WFRNIKIREL
Sbjct: 250 YIGLQDHGYPIWFRNIKIREL 270

 Score = 59.3 bits (142), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 62/223 (27%), Positives = 97/223 (43%), Gaps = 36/223 (16%)

Query: 286 NIKIRELPRKTEEEELFNGKDLTGWDVYGT----EQWYVQDSLLVCESGPDKQYGY---- 337
           N  + E  +K     LFNGKD TGW    +      W ++D  +   + P K+ G+    
Sbjct: 51  NNTLTEKEKKQGWTLLFNGKDFTGWRQCNSTGMASNWVIEDEAMKVFTAPGKKPGHGAGG 110

Query: 338 --LATCKYYNDFELTADFKQEADGNSGIF--IRSFVEEGAKVNGWQVEVAPKGFDTGGIY 393
             L   K + +FEL+ D+K    GNSGIF  +R    +       +V+V      T    
Sbjct: 111 DILYKEKKFKNFELSVDWKTSKMGNSGIFYYVREVPGKPIYYAAPEVQVLDNVDATDNKL 170

Query: 394 ESYGRGWLIQ-IPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMV------------- 439
            ++  G L   +P D +      EWNT+ I+V   +VT   NG+++V             
Sbjct: 171 ANHLAGSLYDMLPADPKTVKPAGEWNTIVIKVKDGKVTHTQNGKKVVQYTLWSKEWDDMV 230

Query: 440 ------DIQDEKIG-AGQGRIALQIHDGGGIKVLWRNIRVKTL 475
                 D Q  + G + +G I LQ H   G  + +RNI+++ L
Sbjct: 231 ANSKFKDFQGFQEGISHEGYIGLQDH---GYPIWFRNIKIREL 270
>gi|149197990|ref|ZP_01875038.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
 gi|149138902|gb|EDM27307.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
          Length = 218

 Score =  150 bits (379), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 80/203 (39%), Positives = 116/203 (57%), Gaps = 7/203 (3%)

Query: 90  WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDW 149
           WQ LF+GQ L+GWR+YN Q + G W V D AI    +G     +IVY++ +++FEL+  W
Sbjct: 23  WQSLFNGQDLQGWRNYNSQGINGKWIVEDSAIHLTEKGGQ---HIVYNQPFKDFELKLQW 79

Query: 150 KISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFA 209
           KIS+ GNSG+     ER QY  P+++G E Q++DD+          + G  Y +      
Sbjct: 80  KISERGNSGIFIRSSERYQY--PWMSGVEMQILDDEKHPNAKNPLTKAGSCYDLIAAPEG 137

Query: 210 TMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLAR 269
            +N   A  WN   I+    H ++++NG KT +F+  S +W       K++  P +   +
Sbjct: 138 AVN--KAMAWNDVHIIVKGSHYQFFLNGVKTADFDVKSAEWRALIAGSKFKKYPGFSENK 195

Query: 270 KGLICLQDHGYPAWFRNIKIREL 292
           +G ICLQDHG P WFRNIKIREL
Sbjct: 196 QGFICLQDHGDPVWFRNIKIREL 218
>gi|149173077|ref|ZP_01851708.1| protein up-regulated by thyroid hormone-putative PQQ-dependent
           glucose dehydrogenase [Planctomyces maris DSM 8797]
 gi|148847883|gb|EDL62215.1| protein up-regulated by thyroid hormone-putative PQQ-dependent
           glucose dehydrogenase [Planctomyces maris DSM 8797]
          Length = 659

 Score =  150 bits (378), Expect = 3e-34,   Method: Composition-based stats.
 Identities = 93/278 (33%), Positives = 146/278 (52%), Gaps = 28/278 (10%)

Query: 60  CLCVMVLFGA-LTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVD 118
           CL +++   A ++A S+   N L++ E+ +GW+LLFDG+T +GWR+Y  + ++  W + D
Sbjct: 24  CLSLLMTNTAVISADSDTSLNKLSKAEQKSGWKLLFDGKTTDGWRNYKKEGVSDGWTIKD 83

Query: 119 GAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPE 178
           G +    +G+   G I+ D  +  FEL  +++IS  GNSGL++HV E  + K P++TGPE
Sbjct: 84  GVLSRSAKGA---GDIITDDQFGFFELSLEYRISPEGNSGLMFHVTE--EEKTPWMTGPE 138

Query: 179 YQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNV-----------------RPAGEWNT 221
            Q+ D+    +P    Q+ G  Y +Y P      +                 RPAG+WN 
Sbjct: 139 VQIQDNVDGHDP----QKAGWLYQLYKPATPKWMIEAEKAGKKVTPAVVDATRPAGDWNH 194

Query: 222 SKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYP 281
             +       +  MNG K  +F+  S DW +R  + K+   P +G   KG ICLQDH   
Sbjct: 195 LFLRVGPDRSQIIMNGVKYFQFDKGSADWNKRVAASKFSKYPSFGKPTKGHICLQDHNDL 254

Query: 282 AWFRNIKIRELPRKTEEEELFNGK-DLTGWDVYGTEQW 318
             FRNIKIRE+P     ++  +G+  L G   +   +W
Sbjct: 255 VSFRNIKIREIPADGSVQDPSDGELALKGVPAFPNLEW 292
>gi|83815548|ref|YP_446631.1| probable secreted glycosyl hydrolase [Salinibacter ruber DSM 13855]
 gi|83756942|gb|ABC45055.1| probable secreted glycosyl hydrolase [Salinibacter ruber DSM 13855]
          Length = 222

 Score =  147 bits (371), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 93/224 (41%), Positives = 129/224 (57%), Gaps = 16/224 (7%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL--TGPWEVVDGAIQADGEGSDENGY--- 133
           N LT  E+A GW LLFDG+T  GWR YN +    TG W + DG +  +G G   +G    
Sbjct: 5   NTLTPAERADGWTLLFDGETAAGWRGYNDEDFPDTG-WTIEDGVLTIEGAGGGVSGSGGD 63

Query: 134 IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLID-----DKGFA 188
           I+    Y +F L+ +WKIS+GGNSG+ Y  +E+P   + Y + PE Q++D     D G  
Sbjct: 64  IITTETYGDFVLKLEWKISEGGNSGIFYRAIEQPDQPI-YWSAPEMQILDNANHPDAGRG 122

Query: 189 EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
           E     ++ G  Y +   D  T +    GEW    IV + GHVE+++NGQK +E+E W+ 
Sbjct: 123 E--NGNRKAGSLYDLIPADPQTFSGH--GEWQDVMIVVEGGHVEHWLNGQKVLEYETWTP 178

Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
            W++     K+   PE+G AR+G I LQDHG  A FRNIKI+EL
Sbjct: 179 GWYRMIRDSKFRTHPEFGDAREGHIGLQDHGTTAHFRNIKIKEL 222
>gi|88712915|ref|ZP_01107000.1| hypothetical protein FB2170_09761 [Flavobacteriales bacterium
           HTCC2170]
 gi|88708813|gb|EAR01048.1| hypothetical protein FB2170_09761 [Flavobacteriales bacterium
           HTCC2170]
          Length = 266

 Score =  144 bits (364), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 87/228 (38%), Positives = 121/228 (53%), Gaps = 14/228 (6%)

Query: 75  EKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY- 133
           E   N LT  EKA GW +LFDG T EGWR Y  +     WE+VDG +   G G  E G  
Sbjct: 43  EAVMNGLTAAEKADGWVMLFDGTTSEGWRGYKKEHFPAAWEIVDGTMHMMGSGRGEAGAK 102

Query: 134 ----IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAE 189
               I++D+ ++NF L  +WKIS+GGNSG+ Y   E+  Y   + T PE Q++D++    
Sbjct: 103 DGGDIIFDKQFQNFTLSLEWKISEGGNSGIFYLGEEKLDYI--WKTAPEMQILDNE--RH 158

Query: 190 PLEDWQRCGVDYAMYLPDFA---TMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAW 246
           P     + G   A  L D       N +PAGEWN  ++    G V +  NG+  +E+  W
Sbjct: 159 PDAKLGKDGNRQAGSLYDLVPAKPQNAKPAGEWNKIEVTVYKGTVIHSQNGENVVEYHLW 218

Query: 247 SDDWFQRKNSGKWE--NAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           + +W +     K+   NA    +  KG I LQDHG   WFRN+K++EL
Sbjct: 219 TPEWNEMVAGSKFPGLNAEWADVPSKGYIGLQDHGDDVWFRNVKLKEL 266
>gi|126646853|ref|ZP_01719363.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
 gi|126576901|gb|EAZ81149.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
          Length = 246

 Score =  139 bits (351), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 77/224 (34%), Positives = 129/224 (57%), Gaps = 10/224 (4%)

Query: 74  SEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSD---- 129
           +E+P  V  ++E+   +  LF+G T  GW  Y G  +   W++ DG +  D +  D    
Sbjct: 28  TEEPTEVTQKQEE---FTPLFEGNTFAGWHKYGGGEVGKAWKIEDGTVYLDAKNKDGWQT 84

Query: 130 -ENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA 188
            + G IV D  +ENF L++DWKI++ GNSG+++ V E P+Y   + TG E Q++D++G  
Sbjct: 85  GDGGDIVTDEEFENFHLKYDWKIAENGNSGVIFFVQEAPEYPYSWHTGMEMQVLDNEGHP 144

Query: 189 EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
           +      R G  Y + +    T  V+P GEWN ++I+ D G+++  +NG   +E E W+ 
Sbjct: 145 DAKIISHRAGDLYDLIVSSEET--VKPWGEWNHAEIIADQGNLKLRLNGVTVVETELWTP 202

Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           +W       K+++ P +G  +KG I LQDHG   +F+N++I++L
Sbjct: 203 EWEALIADSKFKDMPGFGTFKKGKIALQDHGDLVYFKNVEIKKL 246
>gi|32473821|ref|NP_866815.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
 gi|32444357|emb|CAD74355.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
          Length = 254

 Score =  134 bits (338), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 78/211 (36%), Positives = 116/211 (54%), Gaps = 20/211 (9%)

Query: 90  WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAI-----QADGEGSDENGYIVYDRIYENFE 144
           W+ LFDG  L+ WR+YN  ++T  W++   A+     +  G+ +     I  ++ ++ FE
Sbjct: 56  WETLFDGSNLDAWREYNRDSVTSGWKIEGNALTCISHKDQGDAARGENLITKEK-FDAFE 114

Query: 145 LQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMY 204
           L+ D+K++   NSG+++HVVE    K PY TGPE Q+ D KG  +P    Q+CG  Y +Y
Sbjct: 115 LELDFKVTPAANSGVMFHVVETK--KPPYYTGPEIQIQDHKGGHDP----QKCGWLYQLY 168

Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQR---KNSGKWEN 261
             +  T + +PAGEWN  +++      +  +NG    EF   SDDW +R      GKWE 
Sbjct: 169 PSE--TDSTKPAGEWNHLRVLITPAKCQIEVNGVLYSEFVKGSDDWNERVAKSKFGKWEG 226

Query: 262 APEYGLARKGLICLQDHGYPAWFRNIKIREL 292
              +G    G ICLQDH     +RNI+IR L
Sbjct: 227 ---FGEPTNGHICLQDHNDEVSYRNIRIRRL 254
>gi|120435377|ref|YP_861063.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
 gi|117577527|emb|CAL65996.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
          Length = 251

 Score =  134 bits (336), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 86/248 (34%), Positives = 129/248 (52%), Gaps = 33/248 (13%)

Query: 69  ALTACSEKPQNVLTEEEKAAG--------------WQLLFDGQTLEGWRDYNGQALTGPW 114
           ALTAC  K +N  +EE K A               WQ LF+G+ L+GW+ +N  +++  W
Sbjct: 13  ALTAC--KNENKESEEIKVAENTEMKSSEAKEDQEWQELFNGENLDGWKAFNKDSISDQW 70

Query: 115 EVVDGAI--QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVP 172
           +  +GAI  +   E   +   ++    +ENFEL  +WKIS+ GNSG+++ V E  +Y  P
Sbjct: 71  KAENGAISFKPSAENRSKTENLITKEEFENFELSLEWKISEAGNSGIMWAVQEGEKYNEP 130

Query: 173 YVTGPEYQLIDDKGFAEPLEDWQR-CGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHV 231
           Y+TGPE Q++D++   +      R  G  Y M  P       +PAGEWN   I     H+
Sbjct: 131 YLTGPEIQVLDNQRHPDAKNGLNRTAGALYDMIPPSEDV--TKPAGEWNKEVI-----HI 183

Query: 232 EY-------YMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWF 284
            Y        +NG   +EF    ++W    +  K+     +G ++KG I LQDHG P W+
Sbjct: 184 NYKENKGWVKLNGTTIVEFPVHGEEWKNMVSKSKFSEWEGFGASQKGHIALQDHGDPVWY 243

Query: 285 RNIKIREL 292
           RNIKI++L
Sbjct: 244 RNIKIKQL 251
>gi|149197739|ref|ZP_01874789.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
 gi|149139309|gb|EDM27712.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
          Length = 233

 Score =  132 bits (333), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 74/225 (32%), Positives = 124/225 (55%), Gaps = 9/225 (4%)

Query: 70  LTACSEKP--QNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEG 127
           +T+C+  P   N L++ E   GWQLLF+GQ +  WR++  Q +   W V  G ++  G G
Sbjct: 15  ITSCTSTPIPDNSLSKAEAKEGWQLLFNGQDMSQWRNFKKQDINPKWVVEGGTMKLSGGG 74

Query: 128 SDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF 187
               G I+  + YENF+ + +WKIS+ GNSG+   ++   + K  Y   PE Q++D++  
Sbjct: 75  G---GDIMTKKQYENFDFRMEWKISEAGNSGIF--ILADEKGKRIYSHAPEIQILDNEKH 129

Query: 188 AEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWS 247
            +  +   R G  Y M      +   + AGEWN  +I+ +  H++ + NG +T++    S
Sbjct: 130 NDRKKPNHRSGSLYDMITSPAESH--KKAGEWNQVRILLNKSHLQVWQNGIQTVDIVMHS 187

Query: 248 DDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           D+W +     K++N   +G+ +KG + LQDH    WF+N+K+ EL
Sbjct: 188 DEWKELVGKSKFKNWKGFGMNKKGHLGLQDHNDVVWFKNLKVLEL 232
>gi|149278985|ref|ZP_01885119.1| hypothetical protein PBAL39_03869 [Pedobacter sp. BAL39]
 gi|149230264|gb|EDM35649.1| hypothetical protein PBAL39_03869 [Pedobacter sp. BAL39]
          Length = 230

 Score =  132 bits (331), Expect = 7e-29,   Method: Composition-based stats.
 Identities = 75/216 (34%), Positives = 115/216 (53%), Gaps = 5/216 (2%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG-PWEVVDGAIQADGEGSDENGYIVYD 137
           N +  +    G++ L DG+T  GW  Y GQA  G  W+V DGA   D    +  G ++ D
Sbjct: 18  NTIQAQSVKKGFKALSDGKTTAGWHTY-GQATAGEKWKVEDGAFHLDPSVQNGGGDLITD 76

Query: 138 RIYENFELQWDWKISKGGNSGLLYHVVE-RPQYKVPYVTGPEYQLIDDKGFAEPLEDWQR 196
           + Y NF L +DWK++   NSG++++V E + +Y   Y TG E Q+ID+ G  +      R
Sbjct: 77  KEYTNFHLIYDWKVAPNANSGVIFYVKEDKEKYHATYSTGLEMQVIDNDGHPDAKNVKHR 136

Query: 197 CGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNS 256
               Y +     ++  V+P GEWNT +I+  NG +E  +NG   ++   W+DD+      
Sbjct: 137 AADLYDIIAS--SSEPVKPVGEWNTGEIISKNGKLELKLNGVTVVKTTLWNDDFKALLAK 194

Query: 257 GKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
            K+    ++   + G I LQDHG   W+RNI I+EL
Sbjct: 195 SKFATWKDFAAFKTGKIALQDHGDEVWYRNIMIKEL 230
>gi|86143589|ref|ZP_01061974.1| hypothetical protein MED217_13359 [Flavobacterium sp. MED217]
 gi|85830036|gb|EAQ48497.1| hypothetical protein MED217_13359 [Leeuwenhoekiella blandensis
           MED217]
          Length = 458

 Score =  130 bits (328), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 88/224 (39%), Positives = 125/224 (55%), Gaps = 11/224 (4%)

Query: 78  QNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP---WEVVDG---AIQADGEGSDEN 131
           +N LT+ E A+GWQLL+DG+T EGWR   G+    P   W + DG    +   GE S   
Sbjct: 237 KNNLTQAEVASGWQLLWDGKTTEGWR--GGKLDHFPEKGWVIEDGELIVLSTGGEESAAG 294

Query: 132 GYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP- 190
           G IV    Y++FEL+ D+KI++G NSG+ Y+V            G EYQ++DD    +  
Sbjct: 295 GDIVTTEQYQDFELKIDFKITEGANSGIKYYVDTELNKGEGSSIGLEYQILDDAHHPDAK 354

Query: 191 LEDWQRCGVDYAMYLPDFATMN--VRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
           L + +      ++Y    A  N  V P GEWNT++IV  N HVE+Y+N  K +E++  SD
Sbjct: 355 LGNHEGSRTLASLYDLIQADPNKPVNPIGEWNTARIVSKNKHVEHYLNDVKVLEYDRGSD 414

Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
            + Q     K+++ P +G   KG I LQDHG    F+NIKI+ L
Sbjct: 415 AFLQLVEESKYKDWPGFGTFEKGNILLQDHGDRVAFKNIKIKVL 458

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 58/188 (30%), Positives = 99/188 (52%), Gaps = 14/188 (7%)

Query: 299 EELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
           + LFNG  L GW  + G   + ++   +V  +  +    +L T + Y DF +  D+K + 
Sbjct: 34  QSLFNGTSLDGWKQLNGKAAYRIEGDEIVGTTVANTPNSFLTTTQDYGDFIMELDYKVDP 93

Query: 358 DGNSGIFIRSFVE---EGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQIPDD--RENF 411
             NSGI IRS         +V+G+Q+E+ P     + GIY+   RGWL  + ++   +  
Sbjct: 94  SMNSGIQIRSLSTPKFRNGRVHGYQIEIDPSERAWSAGIYDEARRGWLYSLENNPKAQQA 153

Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH-----DGGGIKVL 466
            K+ EWN  R+  +G+ + TW+NG +   + D++  +  G IALQ+H     D  G ++ 
Sbjct: 154 FKQNEWNHYRVEALGDTLKTWINGVEAAHLVDDQTAS--GFIALQVHAIGAEDEPGKEIR 211

Query: 467 WRNIRVKT 474
           W+NI++ T
Sbjct: 212 WKNIKIIT 219

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 82/353 (23%), Positives = 146/353 (41%), Gaps = 74/353 (20%)

Query: 51  IVFMMKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL 110
           IVF    ++ L ++V     T C  + +   T +E    WQ LF+G +L+GW+  NG+A 
Sbjct: 4   IVFKYLVLSALGLLVF----TGCKNESE---TADEP---WQSLFNGTSLDGWKQLNGKA- 52

Query: 111 TGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYK 170
              + +    I      +  N ++   + Y +F ++ D+K+    NSG+    +  P+++
Sbjct: 53  --AYRIEGDEIVGTTVANTPNSFLTTTQDYGDFIMELDYKVDPSMNSGIQIRSLSTPKFR 110

Query: 171 VPYVTGPEYQL-IDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNG 229
              V G + ++   ++ ++  + D  R G  Y++     A    +   EWN  ++     
Sbjct: 111 NGRVHGYQIEIDPSERAWSAGIYDEARRGWLYSLENNPKAQQAFK-QNEWNHYRVEALGD 169

Query: 230 HVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDH--------GYP 281
            ++ ++NG   +E     DD                     G I LQ H        G  
Sbjct: 170 TLKTWING---VEAAHLVDDQ-----------------TASGFIALQVHAIGAEDEPGKE 209

Query: 282 AWFRNIK-IRELPRKTEEEE----------------------LFNGKDLTGW-----DVY 313
             ++NIK I E P++  ++                       L++GK   GW     D +
Sbjct: 210 IRWKNIKIITENPQQYSKQMSLPAFNTKNNLTQAEVASGWQLLWDGKTTEGWRGGKLDHF 269

Query: 314 GTEQWYVQD-SLLVCESGPDKQY--GYLATCKYYNDFELTADFKQEADGNSGI 363
             + W ++D  L+V  +G ++    G + T + Y DFEL  DFK     NSGI
Sbjct: 270 PEKGWVIEDGELIVLSTGGEESAAGGDIVTTEQYQDFELKIDFKITEGANSGI 322
>gi|86143701|ref|ZP_01062077.1| probable secreted glycosyl hydrolase [Flavobacterium sp. MED217]
 gi|85829744|gb|EAQ48206.1| probable secreted glycosyl hydrolase [Leeuwenhoekiella blandensis
           MED217]
          Length = 248

 Score =  129 bits (323), Expect = 6e-28,   Method: Composition-based stats.
 Identities = 88/253 (34%), Positives = 128/253 (50%), Gaps = 23/253 (9%)

Query: 58  MNCLCVMVLFGA--LTACSEKPQNVLTEEEKAAG----------WQLLFDGQTLEGWRDY 105
           M  L V V F     T+C EK +   TEE   A           W++LFDG  ++ W  Y
Sbjct: 1   MKQLIVSVAFAMALFTSCKEKAEEANTEEVAVATTETTQPEDNEWEVLFDGTNIDKWHAY 60

Query: 106 NGQALTGPWEVVDGAI---QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYH 162
           NG   T  WE+VD  +    A+     EN  +V +  Y +F L+ DW IS+GGNSG+++ 
Sbjct: 61  NGGDPT-QWEIVDDVLVFTPAENRNGSEN--LVTNEAYTSFVLKMDWMISEGGNSGIMWA 117

Query: 163 VVERPQYKVPYVTGPEYQLIDDKGFAEP-LEDWQRCGVDYAMYLPDFATMNVRPAGEWNT 221
           V E P+Y  PY TGPE Q++DD+   +       R G  Y M       +N  PAGEWN+
Sbjct: 118 VEEDPKYHEPYATGPEIQILDDERHPDTNAGPSHRSGALYDMIGAPEGVVN--PAGEWNS 175

Query: 222 SKIVFDNGHVE--YYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHG 279
            +I  D    +    +NG++ + F    + W    ++ K+ +   +     GLI LQDHG
Sbjct: 176 YEITIDYNTNQGIIVLNGEEVVTFPVKGEAWEDLVSNSKFASWEAFAKTDTGLIALQDHG 235

Query: 280 YPAWFRNIKIREL 292
           +   F+NIKI++L
Sbjct: 236 HQVSFKNIKIKKL 248
>gi|154493991|ref|ZP_02033311.1| hypothetical protein PARMER_03336 [Parabacteroides merdae ATCC
           43184]
 gi|154086251|gb|EDN85296.1| hypothetical protein PARMER_03336 [Parabacteroides merdae ATCC
           43184]
          Length = 284

 Score =  127 bits (320), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 77/228 (33%), Positives = 119/228 (52%), Gaps = 19/228 (8%)

Query: 82  TEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY-----IVY 136
           T  + A G   LFDG++  GWR Y    +   WEVVDG I   G G+ E G      +V+
Sbjct: 58  TFPKDADGKVTLFDGKSFNGWRGYGRTDVPAAWEVVDGTIHIKGSGAGEAGAKDGGDLVF 117

Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEPLEDW 194
              ++N+E +++WK+ KG NSG+LY +++  + +  Y++ PEYQ++D+     A+  +D 
Sbjct: 118 AHKFKNYEFEFEWKVGKGSNSGVLY-MIQEVEGQPSYISAPEYQVLDNANHPDAKLGKDG 176

Query: 195 QRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQ-- 252
            R        +P     N +P GEWN  KI+   G V +Y N +  +E+  W+  W +  
Sbjct: 177 NRQSASLYDMIPA-KPQNSKPFGEWNKGKIMCYKGTVVHYQNDEPVVEYHLWTQQWKEML 235

Query: 253 ---RKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
              + +  KW  A E      G  ++G I  QDHG   W+RNI I+EL
Sbjct: 236 DNSKFSKDKWPLAYELLLNCGGENKEGFIGFQDHGDDVWYRNITIKEL 283
>gi|150009463|ref|YP_001304206.1| hypothetical protein BDI_2876 [Parabacteroides distasonis ATCC
           8503]
 gi|149937887|gb|ABR44584.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 270

 Score =  126 bits (317), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 81/225 (36%), Positives = 116/225 (51%), Gaps = 28/225 (12%)

Query: 90  WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDEN---GYIVYDRIYENFELQ 146
           W  +FDG+TL GWR Y  Q +   W V DG+I   G  +  +   G ++YD+ ++NF  +
Sbjct: 52  WITMFDGKTLNGWRGYCRQDVPLGWVVEDGSITYKGSDNKADTGFGDLIYDKKFKNFVFE 111

Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRC------GVD 200
            +WKI K GNSG+ Y   E     + Y + PEYQL+D++   +    W+ C      G  
Sbjct: 112 IEWKIDKAGNSGIFYTAQEIEGTPI-YYSSPEYQLLDNENMPDA---WEGCDGNRQAGAV 167

Query: 201 YAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDW---FQRKNSG 257
           Y M +PD     V+P G WN ++IV  N  V +YMN  K +EF+  +  W          
Sbjct: 168 YDMIMPD--PQPVKPYGNWNKTRIVVYNQRVIHYMNDVKILEFQFGTPVWRALVDHSKFS 225

Query: 258 KWENAPE-----YGL-----ARKGLICLQDHGYPAWFRNIKIREL 292
           K+  +PE     Y L      + G I +QDHGY   FRNI+I+EL
Sbjct: 226 KFSTSPEKCPEAYDLMLQCGKQPGYIGMQDHGYGVCFRNIRIKEL 270
>gi|150007837|ref|YP_001302580.1| hypothetical protein BDI_1195 [Parabacteroides distasonis ATCC
           8503]
 gi|149936261|gb|ABR42958.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 269

 Score =  125 bits (313), Expect = 8e-27,   Method: Composition-based stats.
 Identities = 88/228 (38%), Positives = 124/228 (54%), Gaps = 16/228 (7%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG-PWEVVDGAI---QADGEGSDENGYI 134
           N LT++EKA GW LLFDG+T +GWR  +  A     W V DG +   ++DG  S   G I
Sbjct: 43  NQLTDQEKAEGWALLFDGKTTKGWRGAHKDAFPDHGWMVKDGELIVQKSDGSESTNGGDI 102

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEPLE 192
           V +  Y  FE   D+KI++G NSG+ Y V E+ + K     G E+QL+DD     A+   
Sbjct: 103 VTEGEYSAFEFSVDFKITEGANSGIKYFVTEQEKQK-GSAYGLEFQLLDDAKHPDAKLYT 161

Query: 193 DWQRCGVDYAMY-LPDFATMNVRPAGEWNTSKI-VFDNGHVEYYMNGQKTIEFEAWSDDW 250
            +       ++Y L     ++    GEWNT+ + VF N HVE+++NG K +E+E  S ++
Sbjct: 162 TFPGSRTLGSLYDLKKSENIHFNGVGEWNTAVVKVFPNNHVEHWLNGVKVLEYERGSKEF 221

Query: 251 FQRKNSGKWENAPEY------GLARKGLICLQDHGYPAWFRNIKIREL 292
                  K+ + P Y      G A KG I LQDHG    FRNIK++EL
Sbjct: 222 RDLVKGSKYAD-PSYNAGGAFGEAPKGHILLQDHGDEVAFRNIKVKEL 268
>gi|149196977|ref|ZP_01874030.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
 gi|149140087|gb|EDM28487.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
          Length = 223

 Score =  124 bits (312), Expect = 1e-26,   Method: Composition-based stats.
 Identities = 79/236 (33%), Positives = 120/236 (50%), Gaps = 14/236 (5%)

Query: 58  MNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVV 117
           M    ++ LF      +E   N L++E+K+ GWQLLFDG+T  GW +Y    L   W   
Sbjct: 1   MRSFLILSLFSFSIFAAE--MNTLSDEQKSEGWQLLFDGKTTNGWVNYQSDKLNPLWVAE 58

Query: 118 DGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGP 177
           DG ++   +G    G++     +++FEL+  WKIS GGNSG+   V  +          P
Sbjct: 59  DGCLKLVKKGG---GHMHSTSSFKDFELKLQWKISAGGNSGVFLRVTPKSASG-----SP 110

Query: 178 EYQLIDDKGFAEPLEDWQRCGVDYAMYL-PDFATMNVRPAGEWNTSKIVFDNGHVEYYMN 236
           E Q++D++      +     G  Y +   P  A   V+  GEWN   I+    H ++++N
Sbjct: 111 EMQVLDNEKNGNGKDPKTSAGALYGIIAAPKGA---VKAQGEWNQVHIIAKGKHYQFFLN 167

Query: 237 GQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           G KT +F+  S+++ + K+ GK      YG   +G I LQDHG    FRNI I+EL
Sbjct: 168 GVKTADFDIDSEEFQKLKSQGKMAKKKTYGSNTEGHIGLQDHGKEVCFRNIMIKEL 223
>gi|88803973|ref|ZP_01119493.1| hypothetical protein RB2501_03965 [Robiginitalea biformata
           HTCC2501]
 gi|88784852|gb|EAR16021.1| hypothetical protein RB2501_03965 [Robiginitalea biformata
           HTCC2501]
          Length = 454

 Score =  123 bits (308), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 83/223 (37%), Positives = 125/223 (56%), Gaps = 9/223 (4%)

Query: 78  QNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG-PWEVVDGAIQA---DGEGSDENGY 133
           +N LT+ EK  GWQLL+DG++ EGW     +      WE+ DG +      G  S+  G 
Sbjct: 232 KNNLTQAEKEDGWQLLWDGESTEGWHGARLEDFPDYGWEIEDGVLTVLASGGGESEAGGD 291

Query: 134 IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPL-- 191
           IV D +Y +F+L+ D++I++G NSG+ Y+V            G EYQ++DD+   +    
Sbjct: 292 IVTDSLYGDFDLRVDFRITEGANSGIKYYVDTELNKGEGSAIGLEYQILDDERHPDAKLG 351

Query: 192 --EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDD 249
             E  +     Y +   D A   V P G+WNT++I+  +GHVE+++NG K +E+E  SD 
Sbjct: 352 NHEGSRTMASLYDLIRADPAK-PVNPIGQWNTARILSRDGHVEHWLNGVKVLEYERGSDA 410

Query: 250 WFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           + Q  +  K++  P +G A +G I LQDHG    FRNIKI+ L
Sbjct: 411 YRQLVSESKYKIWPGFGEAGRGHILLQDHGNRVSFRNIKIKTL 453

 Score =  109 bits (272), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 68/192 (35%), Positives = 112/192 (58%), Gaps = 16/192 (8%)

Query: 296 TEEEELFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK 354
           T  +ELFNG+DL+GW   G E  Y V+D  +V  +  D    ++AT + Y DF L  ++ 
Sbjct: 25  TPWQELFNGEDLSGWTQLGGEAEYAVRDGAIVGTTVHDTPNSFMATEQLYEDFILELEYL 84

Query: 355 QEADGNSGIFIRSFVEE---GAKVNGWQVEVAP--KGFDTGGIYESYGRGWLIQIPD--D 407
            ++  NSGI +RS  ++     +V+G+Q+E+ P  +G+ + GIY+   RGWL+ + D  D
Sbjct: 85  VDSTMNSGIQVRSNSQDYYMDGRVHGYQIEIDPSDRGW-SAGIYDEARRGWLVPVTDNPD 143

Query: 408 RENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----G 462
            +   ++ +WN  RI  +G+ + TW+NG     + D+K    +G IALQ+H  G     G
Sbjct: 144 AQAAFRQGDWNHYRIEAIGDTLKTWINGVPAAHLIDDK--TSEGFIALQVHSIGDDAQAG 201

Query: 463 IKVLWRNIRVKT 474
            +++WR+IR+ T
Sbjct: 202 TEIIWRDIRILT 213

 Score = 60.5 bits (145), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 115/461 (24%), Positives = 177/461 (38%), Gaps = 99/461 (21%)

Query: 85  EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFE 144
           E    WQ LF+G+ L GW    G+A    + V DGAI         N ++  +++YE+F 
Sbjct: 22  EDDTPWQELFNGEDLSGWTQLGGEA---EYAVRDGAIVGTTVHDTPNSFMATEQLYEDFI 78

Query: 145 LQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQL-IDDKGFAEPLEDWQRCGVDYAM 203
           L+ ++ +    NSG+      +  Y    V G + ++   D+G++  + D  R G    +
Sbjct: 79  LELEYLVDSTMNSGIQVRSNSQDYYMDGRVHGYQIEIDPSDRGWSAGIYDEARRGWLVPV 138

Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
                A    R  G+WN  +I      ++ ++NG          DD    K S       
Sbjct: 139 TDNPDAQAAFR-QGDWNHYRIEAIGDTLKTWINGVPAAHL---IDD----KTS------- 183

Query: 264 EYGLARKGLICLQDH--------GYPAWFRNIKI------RELPRKTEEEELFNGKDLTG 309
                 +G I LQ H        G    +R+I+I      R   R T  E +    +LT 
Sbjct: 184 ------EGFIALQVHSIGDDAQAGTEIIWRDIRILTDSLARAYSRDTPLEPVVTKNNLTQ 237

Query: 310 ----------WDVYGTEQWY-------------VQDSLLVC---ESGPDKQYGYLATCKY 343
                     WD   TE W+             ++D +L       G  +  G + T   
Sbjct: 238 AEKEDGWQLLWDGESTEGWHGARLEDFPDYGWEIEDGVLTVLASGGGESEAGGDIVTDSL 297

Query: 344 YNDFELTADFKQEADGNSGI--FIRSFVEEG-AKVNGWQVEV--------APKGFDTGGI 392
           Y DF+L  DF+     NSGI  ++ + + +G     G + ++        A  G   G  
Sbjct: 298 YGDFDLRVDFRITEGANSGIKYYVDTELNKGEGSAIGLEYQILDDERHPDAKLGNHEGSR 357

Query: 393 YESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNG-------------EQMV 439
             +     +   P    N +   +WNT RI      V  WLNG              Q+V
Sbjct: 358 TMASLYDLIRADPAKPVNPIG--QWNTARILSRDGHVEHWLNGVKVLEYERGSDAYRQLV 415

Query: 440 DIQDEKI-----GAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
                KI      AG+G I LQ H   G +V +RNI++KTL
Sbjct: 416 SESKYKIWPGFGEAGRGHILLQDH---GNRVSFRNIKIKTL 453
>gi|88804319|ref|ZP_01119839.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
           HTCC2501]
 gi|88785198|gb|EAR16367.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
           HTCC2501]
          Length = 250

 Score =  121 bits (304), Expect = 8e-26,   Method: Composition-based stats.
 Identities = 84/246 (34%), Positives = 122/246 (49%), Gaps = 26/246 (10%)

Query: 67  FGALTACSEKPQN----------VLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEV 116
           F A+ AC E  +             T E +A+ W +LFDG + +GW+ YN + +   W +
Sbjct: 11  FFAVIACKENKETPSEEVAESAETATPENEASDWIVLFDGTSFDGWKGYNQEGVPDTWSI 70

Query: 117 VDGAI-----QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKV 171
            +GA+         EG+  N  +V +  +ENF L  +W+IS+GGNSG+ + V E  Q+  
Sbjct: 71  EEGAMVFTPPAERPEGASYN--LVTESKFENFVLSLEWQISEGGNSGVFWGVEELEQFGQ 128

Query: 172 PYVTGPEYQLIDDKGFAEPLE-DWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFD--- 227
           PY TGPE Q++D++   +       + G  Y M  P       RP GEWNT +I  D   
Sbjct: 129 PYQTGPEIQVLDNEKHPDAKAGTTHQAGALYDMIAPSEDV--TRPVGEWNTMEITIDYAG 186

Query: 228 -NGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRN 286
             G V   MNG + + F   +D W       K++    +G   +G I LQDHG    FRN
Sbjct: 187 ETGKV--VMNGTELLTFPLGNDAWDAMVADSKFDGWEGFGQYHEGKIGLQDHGDRVAFRN 244

Query: 287 IKIREL 292
           IKI+ L
Sbjct: 245 IKIKPL 250
>gi|150009244|ref|YP_001303987.1| hypothetical protein BDI_2646 [Parabacteroides distasonis ATCC
           8503]
 gi|149937668|gb|ABR44365.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 284

 Score =  120 bits (301), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 73/232 (31%), Positives = 123/232 (53%), Gaps = 20/232 (8%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY----- 133
           ++ T  +   G  ++FDG+T  GWR Y+   + G W + DGAI+ +G G+ E G      
Sbjct: 54  DITTFPKDKEGRYVIFDGKTFNGWRGYDRADVPGAWTIEDGAIKINGSGAGEAGASNGGD 113

Query: 134 IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEPL 191
           +++     NFEL+++WK+ KG NSG ++ +++  + +  Y++ PEYQ++D++    A+  
Sbjct: 114 LIFAHKLGNFELEFEWKVGKGSNSG-VFIMIQEVEGQPSYISAPEYQVLDNENHPDAKLG 172

Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNG-QKTIEFEAWSDDW 250
           +D  R        +P     N +P GEWN  KI+   G V +Y+N  +  +E+  W+  W
Sbjct: 173 KDGNRKSSSLYDMIPA-KPQNAKPFGEWNKGKIMCYKGTVVHYLNSDEPVVEYHLWTPQW 231

Query: 251 FQ-----RKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
            +     + +  KW  A E      G  ++G I  QDHG   WFRNI ++ L
Sbjct: 232 KEMLDNSKFSKDKWPLAYELLLNCGGANKEGFIGFQDHGDDVWFRNITVKVL 283
>gi|154492730|ref|ZP_02032356.1| hypothetical protein PARMER_02367 [Parabacteroides merdae ATCC
           43184]
 gi|154087035|gb|EDN86080.1| hypothetical protein PARMER_02367 [Parabacteroides merdae ATCC
           43184]
          Length = 459

 Score =  120 bits (301), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 88/228 (38%), Positives = 117/228 (51%), Gaps = 16/228 (7%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDGAIQADGEGSDE---NGYI 134
           N LTE EKAAGW+LLFDG+T  GWR    +      W++ +G +     G  E    G I
Sbjct: 233 NTLTEAEKAAGWKLLFDGKTSNGWRGAGQETFPENGWKIENGELTVMKNGGPEGKRGGDI 292

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
           +    +  FEL +++K+++G NSG+ Y + E  + K  +V GPEYQ++DDK   +     
Sbjct: 293 LTVDEFGAFELSFEFKLTEGANSGMKYLIQESKKNK-GFVIGPEYQVLDDKQHPDAKLYT 351

Query: 195 QRCGVDYAMYLPDF---ATMNVRPAGEWNTSKI-VFDNGHVEYYMNGQKTIEFEAWSDDW 250
              G      L D            G+WN   I VF N HVE++MNG KT+E++ W+ D 
Sbjct: 352 TYPGSRTVSSLYDIIPAKNKRFNGVGQWNKGVIKVFPNKHVEHWMNGFKTVEYD-WASDA 410

Query: 251 FQRKNSGKWENAPEY------GLARKGLICLQDHGYPAWFRNIKIREL 292
           F     G      EY      G A KG I LQDH     FRNIKIREL
Sbjct: 411 FLEVVKGSKFAKKEYAEFGPFGTAEKGHILLQDHWDEVSFRNIKIREL 458

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 61/189 (32%), Positives = 101/189 (53%), Gaps = 15/189 (7%)

Query: 299 EELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
           ++LFNGKD TG+  + G   + V++  +V ++   +   ++AT + Y DF L  + K   
Sbjct: 26  QQLFNGKDFTGFKQLNGKAPYRVENGCMVGQTVDKEPNSFMATEQTYGDFILEFEVKCHP 85

Query: 358 DGNSGIFIRSFVE---EGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQIPDDREN--F 411
           D NSG+  RS  +      +V+G+Q E+ P     +GG+Y+   RGWL  + ++      
Sbjct: 86  DLNSGVQFRSESKPDYNNGRVHGYQCEIDPSDRAWSGGLYDEARRGWLAPLTNNEAGRAA 145

Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH------DGGGIKV 465
            K+ +WN  RI  +GN +  WLNG    ++ D+     +G IA Q+H      +  G ++
Sbjct: 146 YKKDDWNKYRIEAIGNSIRIWLNGVNTSNVVDDM--TPEGFIAFQVHGIFGKTENVGKEI 203

Query: 466 LWRNIRVKT 474
            WRNIR+KT
Sbjct: 204 WWRNIRIKT 212

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 95/406 (23%), Positives = 160/406 (39%), Gaps = 80/406 (19%)

Query: 89  GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWD 148
           GWQ LF+G+   G++  NG+A   P+ V +G +       + N ++  ++ Y +F L+++
Sbjct: 24  GWQQLFNGKDFTGFKQLNGKA---PYRVENGCMVGQTVDKEPNSFMATEQTYGDFILEFE 80

Query: 149 WKISKGGNSGLLYHVVERPQYKVPYVTGPEYQL-IDDKGFAEPLEDWQRCGVDYAMYLPD 207
            K     NSG+ +    +P Y    V G + ++   D+ ++  L D  R G   A    +
Sbjct: 81  VKCHPDLNSGVQFRSESKPDYNNGRVHGYQCEIDPSDRAWSGGLYDEARRGW-LAPLTNN 139

Query: 208 FATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGL 267
            A        +WN  +I      +  ++NG  T       DD                 +
Sbjct: 140 EAGRAAYKKDDWNKYRIEAIGNSIRIWLNGVNTSNV---VDD-----------------M 179

Query: 268 ARKGLICLQDHGY---------PAWFRNIKIRE------------------LPRKTEEEE 300
             +G I  Q HG            W+RNI+I+                   +P    E E
Sbjct: 180 TPEGFIAFQVHGIFGKTENVGKEIWWRNIRIKTENLEAERMQGPLAPEVNCIPNTLTEAE 239

Query: 301 -------LFNGKDLTGWDVYGTEQ-----WYVQDSLLVC--ESGPD-KQYGYLATCKYYN 345
                  LF+GK   GW   G E      W +++  L      GP+ K+ G + T   + 
Sbjct: 240 KAAGWKLLFDGKTSNGWRGAGQETFPENGWKIENGELTVMKNGGPEGKRGGDILTVDEFG 299

Query: 346 DFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFD-----TGGIYESYGRGW 400
            FEL+ +FK     NSG  ++  ++E  K  G+ +    +  D        +Y +Y    
Sbjct: 300 AFELSFEFKLTEGANSG--MKYLIQESKKNKGFVIGPEYQVLDDKQHPDAKLYTTYPGSR 357

Query: 401 LIQ-----IPDDRENFLKEREWNTMRIRVVGNQ-VTTWLNGEQMVD 440
            +      IP   + F    +WN   I+V  N+ V  W+NG + V+
Sbjct: 358 TVSSLYDIIPAKNKRFNGVGQWNKGVIKVFPNKHVEHWMNGFKTVE 403
>gi|149276529|ref|ZP_01882673.1| hypothetical protein PBAL39_02377 [Pedobacter sp. BAL39]
 gi|149233049|gb|EDM38424.1| hypothetical protein PBAL39_02377 [Pedobacter sp. BAL39]
          Length = 452

 Score =  119 bits (299), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 80/224 (35%), Positives = 121/224 (54%), Gaps = 13/224 (5%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRD-YNGQALTGPWEVVDG---AIQADGEGSDENGYI 134
           N L+ +EKA G+ LL+DG+T EGWR  Y        W + DG    +++DG  S   G I
Sbjct: 231 NDLSAQEKAEGYSLLWDGRTTEGWRGAYKSTFPESGWLIKDGELSVVKSDGSESTHGGDI 290

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
           V ++ Y  FEL++D+K++ G NSG+ Y V      K   + GPEYQ++DD+    P    
Sbjct: 291 VTEKQYGAFELKFDFKLTPGANSGVKYFVTLTEGNKGSAI-GPEYQVLDDE--RHPDAKL 347

Query: 195 QRCGVDYAMYLPDFATMN-----VRPAGEWNTSKI-VFDNGHVEYYMNGQKTIEFEAWSD 248
            + G      L D  T        R  GEWN   I VF +  +EY++NG K +E+E  + 
Sbjct: 348 GKNGNRTLGSLYDLMTSKKIPNAQRKIGEWNRGLIRVFPDNKIEYWLNGYKILEYERGTP 407

Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           ++       K+++   +G+A KG + LQDHG   +FR++KI+ L
Sbjct: 408 EFTALVAGSKYKDWNNFGMAEKGHVLLQDHGDQVFFRSLKIKTL 451

 Score =  101 bits (252), Expect = 9e-20,   Method: Composition-based stats.
 Identities = 70/196 (35%), Positives = 105/196 (53%), Gaps = 16/196 (8%)

Query: 292 LPRKTEEEELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELT 350
           L +  + ++LFNGKDL+GW  + G  ++ V++  ++  +   +   +L T   Y DF L 
Sbjct: 20  LLKAQQWQQLFNGKDLSGWKQLNGKAKYEVRNGEIIGTTVSAEPNSFLCTDVDYGDFILE 79

Query: 351 ADFKQEADGNSGIFIRSFVE---EGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQI-- 404
            +   +   NSGI IRS  +   E  +V+G+QVEV P     +GGIY+   RGWL  +  
Sbjct: 80  VELMADPSMNSGIQIRSESKSDYENGRVHGYQVEVDPSDRQFSGGIYDEARRGWLYPMDI 139

Query: 405 -PDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-- 461
            P  +  F K   WN  RI  +GN + TW+NG    ++ D    AG   IALQ+H  G  
Sbjct: 140 NPKGKLAF-KNGSWNKYRIECIGNSIRTWVNGVPAANVVDNMTPAG--FIALQVHSIGKD 196

Query: 462 ---GIKVLWRNIRVKT 474
              G ++ WRNIR++T
Sbjct: 197 EIAGKQIRWRNIRIQT 212
>gi|29347567|ref|NP_811070.1| hypothetical protein BT_2157 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29339468|gb|AAO77264.1| probable secreted glycosyl hydrolase [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 290

 Score =  118 bits (296), Expect = 8e-25,   Method: Composition-based stats.
 Identities = 74/231 (32%), Positives = 114/231 (49%), Gaps = 26/231 (11%)

Query: 87  AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYE 141
           A G+  +FDG+T  GWR Y    +   W + DG I+ +G G  E      G +++   ++
Sbjct: 60  ADGYITIFDGKTFNGWRGYGKDRVPSKWTIEDGCIKFNGSGGGEAQDGDGGDLIFAHKFK 119

Query: 142 NFELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPL 191
           NFEL+ +WK+SKGGNSG+ Y   E            +  Y++ PEYQ++D+     A+  
Sbjct: 120 NFELEMEWKVSKGGNSGIFYLAQEVTSKDKDGNDVLEPIYISAPEYQVLDNDNHPDAKLG 179

Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
           +D  R        +P     N +P GEWN +KI+   G V +  N +  +E+  W+  W 
Sbjct: 180 KDNNRQSASLYDMIPA-VPQNAKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWT 238

Query: 252 -----QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
                 + +  KW  A E      G   +G I +QDHG   WFRNI+++ L
Sbjct: 239 DLLQASKFSQDKWPLAFELLNNCGGENHEGFIGMQDHGDDVWFRNIRVKVL 289
>gi|156110542|gb|EDO12287.1| hypothetical protein BACOVA_02177 [Bacteroides ovatus ATCC 8483]
          Length = 451

 Score =  118 bits (295), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 75/222 (33%), Positives = 117/222 (52%), Gaps = 11/222 (4%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP-WEVVDG---AIQADGEGSDENGYI 134
           N ++  E   GW LL+DG+T +GWR           W++ DG    +++ G  S   G I
Sbjct: 232 NTISPNEAKEGWTLLWDGKTTDGWRGAKLSTFPAKGWKIEDGILKVMKSGGAESANGGDI 291

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP---L 191
           V  R Y+NF L+ D+KI++G NSG+ Y V            G E+Q++DD    +    +
Sbjct: 292 VTTRKYKNFILKVDFKITEGANSGIKYFVNPDMNKGAGSAIGCEFQILDDDKHPDAKLGV 351

Query: 192 EDWQRCGVDYAMY-LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDW 250
           +  ++ G  Y +   P     N +   E+NT+ I+    HVE+++NG K IE++  +D W
Sbjct: 352 KGNRKLGSLYDLIPAPKNKPFNKK---EFNTATIIVKGNHVEHWLNGVKLIEYDRNNDMW 408

Query: 251 FQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
                  K++N P +G   +G I LQDHG   WF+N+KI+EL
Sbjct: 409 NALVAYSKYKNWPNFGNPEEGNILLQDHGDEVWFKNVKIKEL 450

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 72/189 (38%), Positives = 99/189 (52%), Gaps = 16/189 (8%)

Query: 299 EELFNGKDLTGWD-VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
           E LFNGK+L GW  + G  ++ + D  +V  S       +LAT K Y DF L  DFK + 
Sbjct: 25  EPLFNGKNLKGWKKLNGKAEYKIVDGAIVGVSKMGTPNTFLATTKNYGDFILEFDFKVDD 84

Query: 358 DGNSGIFIRSFVEEGAK---VNGWQVEVAP-KGFDTGGIYESYGRGWLIQI---PDDREN 410
             NSG+ +RS  ++  K   V+G+Q E+ P K   +GGIY+   R WL  +   P  +  
Sbjct: 85  GLNSGVQLRSESKKDYKKGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTLNPSAKTA 144

Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----GIKV 465
           F K   WN  RI  VGN + TW+NG    +I D+      G IALQ+H  G     G  V
Sbjct: 145 F-KNNAWNKARIEAVGNSIRTWINGVPCANIWDDMTPV--GFIALQVHAIGNAADEGKTV 201

Query: 466 LWRNIRVKT 474
            W++IR+ T
Sbjct: 202 SWKDIRICT 210

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 110/457 (24%), Positives = 186/457 (40%), Gaps = 95/457 (20%)

Query: 87  AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQ 146
           A  W+ LF+G+ L+GW+  NG+A    +++VDGAI    +    N ++   + Y +F L+
Sbjct: 21  AQNWEPLFNGKNLKGWKKLNGKA---EYKIVDGAIVGVSKMGTPNTFLATTKNYGDFILE 77

Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDK-GFAEPLEDWQRCGVDYAMYL 205
           +D+K+  G NSG+      +  YK   V G ++++   K  ++  + D  R    Y + L
Sbjct: 78  FDFKVDDGLNSGVQLRSESKKDYKKGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTL 137

Query: 206 PDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEY 265
              A    +    WN ++I      +  ++NG        W D                 
Sbjct: 138 NPSAKTAFK-NNAWNKARIEAVGNSIRTWINGVPCANI--WDD----------------- 177

Query: 266 GLARKGLICLQ--------DHGYPAWFRNIKI-------RELPRKTEEEE---------- 300
            +   G I LQ        D G    +++I+I        + P      E          
Sbjct: 178 -MTPVGFIALQVHAIGNAADEGKTVSWKDIRICTTDVERYQTPEAQAAPEVNLIANTISP 236

Query: 301 ---------LFNGKDLTGW-----DVYGTEQWYVQDSLL-VCESGPDKQY--GYLATCKY 343
                    L++GK   GW       +  + W ++D +L V +SG  +    G + T + 
Sbjct: 237 NEAKEGWTLLWDGKTTDGWRGAKLSTFPAKGWKIEDGILKVMKSGGAESANGGDIVTTRK 296

Query: 344 YNDFELTADFKQEADGNSGI--FIRSFVEEGAKVN---GWQVEVAPKGFDTG-GIYESYG 397
           Y +F L  DFK     NSGI  F+   + +GA       +Q+    K  D   G+  +  
Sbjct: 297 YKNFILKVDFKITEGANSGIKYFVNPDMNKGAGSAIGCEFQILDDDKHPDAKLGVKGNRK 356

Query: 398 RGWLIQ-IPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMV------DIQDEKIGAG- 449
            G L   IP  +     ++E+NT  I V GN V  WLNG +++      D+ +  +    
Sbjct: 357 LGSLYDLIPAPKNKPFNKKEFNTATIIVKGNHVEHWLNGVKLIEYDRNNDMWNALVAYSK 416

Query: 450 -----------QGRIALQIHDGGGIKVLWRNIRVKTL 475
                      +G I LQ H   G +V ++N+++K L
Sbjct: 417 YKNWPNFGNPEEGNILLQDH---GDEVWFKNVKIKEL 450
>gi|86131181|ref|ZP_01049780.1| hypothetical protein MED134_09676 [Cellulophaga sp. MED134]
 gi|85818592|gb|EAQ39752.1| hypothetical protein MED134_09676 [Dokdonia donghaensis MED134]
          Length = 244

 Score =  117 bits (294), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 76/224 (33%), Positives = 119/224 (53%), Gaps = 9/224 (4%)

Query: 74  SEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY 133
           +++ + V T  + +    +LFDG + + W+ Y    +   W + DGA+      S+E G 
Sbjct: 25  TQEVEEVETVTQASTTEIVLFDGSSFDAWKGYGTDGMHENWTIEDGAMAF--TPSEEGGK 82

Query: 134 -IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEP 190
            I+    Y+NFEL  +WK+S+GGNSG+ + V E P++K  Y TGPE Q++DD+    A+ 
Sbjct: 83  NIITKNTYKNFELNLEWKVSEGGNSGIFWGVKESPEFKEAYETGPEIQVLDDERHPDAKV 142

Query: 191 LEDWQRCGVDYAMYLPDFATMNVRPAGEWN--TSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
                + G  Y M  P    +N  PAGEWN  T  I  D+   +  +NG++   F    +
Sbjct: 143 ANGTHKAGSLYDMIKPADGMIN--PAGEWNKVTLYINHDSNLGKVSLNGKEAYTFPVNGE 200

Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           +W       K+ +   +G  ++G I LQDHG   W+RNI I+EL
Sbjct: 201 EWDAMVAKTKFADWKGFGKYQEGHIGLQDHGDKVWYRNITIKEL 244
>gi|149177844|ref|ZP_01856443.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
 gi|148843334|gb|EDL57698.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
          Length = 432

 Score =  117 bits (294), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 111/412 (26%), Positives = 175/412 (42%), Gaps = 56/412 (13%)

Query: 77  PQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVY 136
           P+  LTEEE AAGW  LFDG +L GW+  N       W V +G I+AD     E G ++ 
Sbjct: 59  PETGLTEEEIAAGWIALFDGHSLSGWKPNNDVN----WHVDEGVIKAD---KGEPGLLLT 111

Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQR 196
              + ++EL++++K++   NSG+       P    P     E  L D K          R
Sbjct: 112 TSPFADYELKFEFKLTPETNSGIFLRTTFNPTD--PSKDCYELNLCDQKTEFPTGSLVAR 169

Query: 197 CGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNS 256
             +D  + +          + EW T ++  +  +++  +NG++ + F             
Sbjct: 170 SKIDKPLPV----------SSEWQTCEVNLEGSNIKAIINGKEVLNF------------- 206

Query: 257 GKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEELFNGKDLTGWDVYGTE 316
               N     L + G I LQ +     FR I ++ L   T    +FNG DL GW+V    
Sbjct: 207 ----NDTSKNLRKTGFIGLQKNEGAIEFRKIYLKPLRMST----IFNGVDLAGWNVVPGS 258

Query: 317 QWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG-NSGIFIRSFV-EEGAK 374
           Q   +          +KQ GYL T + + DF   A  K   D  NSG F R+    E   
Sbjct: 259 QSTFEVVDGTIHVTAEKQ-GYLETEEIWGDFLFQATAKSNGDSLNSGYFFRAIKGSEKGM 317

Query: 375 VNGWQVEVAPKGFDTGGIY--ESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTW 432
            NG++V++   G   G     E+ G G + +  + R     + EW T  +   G  +  W
Sbjct: 318 ANGYEVQIH-NGIKEGDRTKPENAGTGAIFRRTEARRVVANDHEWFTTTLSASGPHIAVW 376

Query: 433 LNGEQMVDIQDEK---------IGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           +NG Q+ D  D +         +    G I+LQ HD     + +++++V TL
Sbjct: 377 INGYQVTDWTDTRKPDENPRKGLRLEAGHISLQGHD-PTTDLNFKDLKVSTL 427
>gi|29349855|ref|NP_813358.1| hypothetical protein BT_4447 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29341766|gb|AAO79552.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 240

 Score =  117 bits (294), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 75/231 (32%), Positives = 115/231 (49%), Gaps = 26/231 (11%)

Query: 87  AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYE 141
           A G+  +FDG+T  GWR Y    +   W + DG I+ +G GS E      G +++   ++
Sbjct: 10  ADGYITIFDGKTFNGWRGYGKDRVPSKWTIEDGCIKFNGSGSGEAQNGDGGDLIFAHKFK 69

Query: 142 NFELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPL 191
           NFEL+ +WK+SKGGNSG+ Y   E            +  Y++ PEYQ++D+     A+  
Sbjct: 70  NFELEMEWKVSKGGNSGIFYLAQEVTSKDKDGNDVLEPIYISAPEYQVLDNDNHPDAKLG 129

Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
           +D  R        +P     N +P GEWN +KI+   G V +  N +  +E+  W+  W 
Sbjct: 130 KDNNRQSASLYDMIPA-VPQNAKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWT 188

Query: 252 -----QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
                 + +  KW  A E      G   +G I +QDHG   WFRNI+++ L
Sbjct: 189 DLLQASKFSQDKWPLAFELLNNCGGENHEGFIGMQDHGDDVWFRNIRVKVL 239
>gi|126645050|ref|ZP_01717594.1| hypothetical protein ALPR1_10430 [Algoriphagus sp. PR1]
 gi|126578461|gb|EAZ82625.1| hypothetical protein ALPR1_10430 [Algoriphagus sp. PR1]
          Length = 461

 Score =  117 bits (293), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 78/224 (34%), Positives = 121/224 (54%), Gaps = 13/224 (5%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRD-YNGQALTGPWEVVDGAI---QADGEGSDENGYI 134
           N LT+ EK  GW+LLF+GQ  EGW+  Y  +     W V DG +   ++DG  S   G I
Sbjct: 240 NELTDYEKNTGWKLLFNGQNSEGWKGAYKDEFPDFGWSVNDGILTIAESDGGESTNAGDI 299

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
           V    +  F+L +++++++G NSGL Y V      K   + G EYQ++DD+    P    
Sbjct: 300 VTKEEFSAFDLGFEFRLTEGANSGLKYFVTLSEGNKGSAI-GLEYQILDDE--KHPDAKM 356

Query: 195 QRCGVDYAMYLPDFATMN-----VRPAGEWNTSKIVFD-NGHVEYYMNGQKTIEFEAWSD 248
            + G      L D  T       + P GEWN  ++V + N HV +Y+NG K +E++  S+
Sbjct: 357 GKEGNRTLSSLYDLITAQKQGRFINPIGEWNKGRVVVEPNNHVTHYLNGLKVLEYDRGSE 416

Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           ++ +   + K++  P +G A +G I LQDHG    F+NIK++ L
Sbjct: 417 EFRELVANSKYKIWPNFGEAEQGHILLQDHGNRVSFKNIKLKSL 460

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 66/191 (34%), Positives = 101/191 (52%), Gaps = 19/191 (9%)

Query: 300 ELFNGKDLTGWD-VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQE-A 357
           +LFNGKDL+GW  V GT  + V D ++V  +       +L T + Y D+ L  D K E  
Sbjct: 33  DLFNGKDLSGWKAVAGTANFEVVDGVIVGSAVAGSPNTFLITEETYGDYILELDLKVENL 92

Query: 358 DGNSGIFIRSFVEEGAK-----VNGWQVEVAPKGFD-TGGIYESYGRGWLIQI---PDDR 408
             NSGI  R   +  A+     V G+QVE  P     + GIY+   RGWL  +   P  +
Sbjct: 93  TSNSGIMARGQFDPAARDGNGLVYGYQVEADPSERAWSAGIYDEARRGWLYPLDLNPAAK 152

Query: 409 ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH-----DGGGI 463
             F K  E+N  RI V+G+++ TWLNG+++  + D+     +G + LQ+H     +  G 
Sbjct: 153 TAF-KMGEFNHYRIEVIGDEIKTWLNGQEVAYVVDDM--DSKGFVGLQVHSIRNPEDEGN 209

Query: 464 KVLWRNIRVKT 474
           K  ++N+++KT
Sbjct: 210 KTYFKNVKIKT 220

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 108/450 (24%), Positives = 178/450 (39%), Gaps = 82/450 (18%)

Query: 89  GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWD 148
           GW  LF+G+ L GW+   G   T  +EVVDG I         N +++ +  Y ++ L+ D
Sbjct: 30  GWVDLFNGKDLSGWKAVAG---TANFEVVDGVIVGSAVAGSPNTFLITEETYGDYILELD 86

Query: 149 WKISK-GGNSGLLYHVVERPQYKVPYVTGPEYQLIDD---KGFAEPLEDWQRCGVDYAMY 204
            K+     NSG++      P  +        YQ+  D   + ++  + D  R G  Y + 
Sbjct: 87  LKVENLTSNSGIMARGQFDPAARDGNGLVYGYQVEADPSERAWSAGIYDEARRGWLYPLD 146

Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPE 264
           L   A    +  GE+N  +I      ++ ++NGQ+     A+  D    K          
Sbjct: 147 LNPAAKTAFK-MGEFNHYRIEVIGDEIKTWLNGQEV----AYVVDDMDSKGF-------- 193

Query: 265 YGLARKGLICLQDHGYPAWFRNIKIR------------------------ELPRKTEEEE 300
            GL    +   +D G   +F+N+KI+                        +  + T  + 
Sbjct: 194 VGLQVHSIRNPEDEGNKTYFKNVKIKTTNLDPKPFSSSIYVVNNRLNELTDYEKNTGWKL 253

Query: 301 LFNGKDLTGW-----DVYGTEQWYVQDSLLV---CESGPDKQYGYLATCKYYNDFELTAD 352
           LFNG++  GW     D +    W V D +L     + G     G + T + ++ F+L  +
Sbjct: 254 LFNGQNSEGWKGAYKDEFPDFGWSVNDGILTIAESDGGESTNAGDIVTKEEFSAFDLGFE 313

Query: 353 FKQEADGNSGIFIRSFVEEGAKVN--GWQVEVAPKGFDTGGIYESYGRGWLIQIPD---- 406
           F+     NSG+     + EG K +  G + ++              G   L  + D    
Sbjct: 314 FRLTEGANSGLKYFVTLSEGNKGSAIGLEYQILDDEKHPDAKMGKEGNRTLSSLYDLITA 373

Query: 407 -DRENFLKE-REWNTMRIRV-VGNQVTTWLNG-------------EQMVDIQDEKI---- 446
             +  F+    EWN  R+ V   N VT +LNG              ++V     KI    
Sbjct: 374 QKQGRFINPIGEWNKGRVVVEPNNHVTHYLNGLKVLEYDRGSEEFRELVANSKYKIWPNF 433

Query: 447 -GAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
             A QG I LQ H   G +V ++NI++K+L
Sbjct: 434 GEAEQGHILLQDH---GNRVSFKNIKLKSL 460
>gi|153808737|ref|ZP_01961405.1| hypothetical protein BACCAC_03036 [Bacteroides caccae ATCC 43185]
 gi|149128563|gb|EDM19781.1| hypothetical protein BACCAC_03036 [Bacteroides caccae ATCC 43185]
          Length = 287

 Score =  117 bits (292), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 73/229 (31%), Positives = 114/229 (49%), Gaps = 26/229 (11%)

Query: 89  GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYENF 143
           G+  +FDG+T +GWR Y    +   W + DG I+ +G G  E      G +++   ++NF
Sbjct: 59  GYITIFDGKTFDGWRGYGKDKVPAKWTIEDGCIKFNGTGGGEAQDADGGDLIFAHKFKNF 118

Query: 144 ELQWDWKISKGGNSGLLYHVVE--------RPQYKVPYVTGPEYQLIDDKGF--AEPLED 193
           EL+ +WK++KG NSG+LY   E            +  Y++ PEYQ++D+     A+  +D
Sbjct: 119 ELELEWKVAKGSNSGILYLAQEITSKDKDGNDVLEPIYISAPEYQILDNANHPDAKLGKD 178

Query: 194 WQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF-- 251
             R        +P     N +P GEWN +KI+   G V +  N +  +E+  W+  W   
Sbjct: 179 NNRQSASLYDMIPA-VPQNSKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWTDM 237

Query: 252 ---QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
               + +  KW  A E      G   +G I LQDHG   WFRNI+++ L
Sbjct: 238 LQASKFSEEKWPLAFELLNNCGGDNHEGFIGLQDHGDDVWFRNIRVKVL 286
>gi|88712851|ref|ZP_01106936.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
           HTCC2170]
 gi|88708749|gb|EAR00984.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
           HTCC2170]
          Length = 232

 Score =  116 bits (291), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 75/214 (35%), Positives = 112/214 (52%), Gaps = 14/214 (6%)

Query: 89  GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQAD--GEGSDENGYIVYDRIYENFELQ 146
           G+  LF+G++L+GW  Y    +   W   +G +  D   + S EN  +V D+ Y N+EL 
Sbjct: 24  GFTDLFNGKSLDGWHSYGKDEINDGWYADNGELIFDFQRDKSGENSNLVTDKQYTNYELS 83

Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD--KGFAEPLEDWQRCGVDYAMY 204
            +WKI   GNSG+ + V+E  +++ PY+TGPE Q++DD  + + E   D  R G  Y + 
Sbjct: 84  IEWKIYPHGNSGIFWGVIESEEFEQPYMTGPEIQILDDGWEAYIEERGDINRAGSLYGLI 143

Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYM--NGQKTIEFEAWSDDW---FQRKNSGKW 259
            P     N  PA EWN   I  D+   E ++  NG + + F     +W     +    KW
Sbjct: 144 PPSSIVSN--PAEEWNHYLIHIDHKENEGFVVFNGTEVVRFPVHGPEWKAMIAKSGFAKW 201

Query: 260 ENAPEYGLARKGLICLQDHGYPAWFRNIKIRELP 293
            +   +G A+ G I LQ+ G    FRNIKI+ELP
Sbjct: 202 SS---FGTAKTGHISLQEWGGKVAFRNIKIKELP 232
>gi|156112295|gb|EDO14040.1| hypothetical protein BACOVA_00235 [Bacteroides ovatus ATCC 8483]
          Length = 244

 Score =  116 bits (290), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 76/221 (34%), Positives = 116/221 (52%), Gaps = 9/221 (4%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDGAI---QADGEGSDENGYI 134
           N LT +EK  GW+LL+DG+T  GWR        T  W+++   +   ++ GE S   G I
Sbjct: 25  NTLTNQEKNEGWKLLWDGKTTNGWRGARISTFPTKGWKIIGNDLVVEKSKGEESGNGGDI 84

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
           V  + Y++FEL  D+KI++G NSG+ Y V            G E+Q++DD+    P    
Sbjct: 85  VTIKTYKSFELVADFKITEGANSGIKYFVDPDLNKGKGSAIGCEFQILDDE--KHPDAKA 142

Query: 195 QRCGVDYAMYLPDF---ATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
            R G      L D     +  +    E+NT++I+    HVE+++NG K +E+E  +  W 
Sbjct: 143 GRKGNRTVGSLYDLIPAGSNKLFKKNEFNTARIIVKGNHVEHWLNGIKVVEYERNNQMWK 202

Query: 252 QRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
                 K+ + P +G  ++G I LQDHG    F+NIKI+EL
Sbjct: 203 ALVAGSKYADWPNFGEGKEGHILLQDHGDEVHFKNIKIKEL 243

 Score = 55.8 bits (133), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 63/216 (29%), Positives = 92/216 (42%), Gaps = 52/216 (24%)

Query: 301 LFNGKDLTGW-----DVYGTEQWYVQDSLLVCESGPDKQYGY---LATCKYYNDFELTAD 352
           L++GK   GW       + T+ W +  + LV E    ++ G    + T K Y  FEL AD
Sbjct: 39  LWDGKTTNGWRGARISTFPTKGWKIIGNDLVVEKSKGEESGNGGDIVTIKTYKSFELVAD 98

Query: 353 FKQEADGNSGI--FIRSFVEEG-AKVNGWQVEV-----------APKGFDT-GGIYESYG 397
           FK     NSGI  F+   + +G     G + ++             KG  T G +Y+   
Sbjct: 99  FKITEGANSGIKYFVDPDLNKGKGSAIGCEFQILDDEKHPDAKAGRKGNRTVGSLYD--- 155

Query: 398 RGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDE------------- 444
                 IP       K+ E+NT RI V GN V  WLNG ++V+ +               
Sbjct: 156 -----LIPAGSNKLFKKNEFNTARIIVKGNHVEHWLNGIKVVEYERNNQMWKALVAGSKY 210

Query: 445 ----KIGAG-QGRIALQIHDGGGIKVLWRNIRVKTL 475
                 G G +G I LQ H   G +V ++NI++K L
Sbjct: 211 ADWPNFGEGKEGHILLQDH---GDEVHFKNIKIKEL 243
>gi|156109541|gb|EDO11286.1| hypothetical protein BACOVA_03189 [Bacteroides ovatus ATCC 8483]
          Length = 288

 Score =  116 bits (290), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 73/229 (31%), Positives = 113/229 (49%), Gaps = 26/229 (11%)

Query: 89  GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYENF 143
           G+  +FDG+T  GWR Y    +   W + DG I+ +G G  E      G +++   ++NF
Sbjct: 60  GYITIFDGETFNGWRGYGKDRVPTKWTIEDGCIKFNGSGGGEAQDGDGGDLIFAHKFKNF 119

Query: 144 ELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPLED 193
           EL+ +WK++KG NSG+LY   E            +  Y++ PEYQ++D+     A+  +D
Sbjct: 120 ELELEWKVAKGSNSGILYLAQEVTSKDKDGNDVLEPIYISAPEYQILDNANHPDAKLGKD 179

Query: 194 WQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF-- 251
             R        +P     N +P GEWN +KI+   G V +  N +  +E+  W+  W   
Sbjct: 180 NNRQSASLYDMIPA-VPQNSKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWTDM 238

Query: 252 ---QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
               + +  KW  A E      G   +G I LQDHG   WFRNI+++ L
Sbjct: 239 LQASKFSEDKWPLAFELLNNCGGENHEGFIGLQDHGDDVWFRNIRVKVL 287
>gi|29348878|ref|NP_812381.1| hypothetical protein BT_3469 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340784|gb|AAO78575.1| probable secreted glycosyl hydrolase [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 449

 Score =  113 bits (283), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 75/222 (33%), Positives = 118/222 (53%), Gaps = 11/222 (4%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP-WEVVDG---AIQADGEGSDENGYI 134
           N ++  E   GW LL+DG+T  GWR     A     W++ DG    +++ G  S   G I
Sbjct: 230 NTISPREAKEGWALLWDGKTNNGWRGAKLNAFPEKGWKMEDGILKVMKSGGAESANGGDI 289

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP---L 191
           V  R Y+NF L  D+KI++G NSG+ Y V            G E+Q++DD    +    +
Sbjct: 290 VTTRKYKNFILTVDFKITEGANSGVKYFVNPDLNKGEGSAIGCEFQILDDDKHPDAKLGV 349

Query: 192 EDWQRCGVDYAMY-LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDW 250
           +  ++ G  Y +   P+    N +   ++NT+ I+  + HVE+++NG K IE+   +D W
Sbjct: 350 KGNRKLGSLYDLIPAPEKKPFNKK---DFNTATIIVQDNHVEHWLNGVKLIEYTRNTDMW 406

Query: 251 FQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
                  K++N P +G + +G I LQDHG   WF+N+KI+EL
Sbjct: 407 NALVAYSKYKNWPNFGNSAEGNILLQDHGDEVWFKNVKIKEL 448

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 70/189 (37%), Positives = 100/189 (52%), Gaps = 16/189 (8%)

Query: 299 EELFNGKDLTGWD-VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
           E LFNGK+L GW  + G  ++ + D  +V  S       +LAT K Y DF L  DFK + 
Sbjct: 23  EPLFNGKNLKGWKKLNGKAEYKIVDGAIVGISKMGTPNTFLATTKNYGDFILEFDFKIDD 82

Query: 358 DGNSGIFIRSFVE---EGAKVNGWQVEVAP-KGFDTGGIYESYGRGWLIQI---PDDREN 410
             NSG+ +RS  +   +  +V+G+Q E+ P K   +GGIY+   R WL  +   P  +  
Sbjct: 83  GLNSGVQLRSESKKDYQNGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTLNPAAKTA 142

Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----GIKV 465
           F K   WN  RI  +GN + TW+NG    +I D+   +  G IALQ+H  G     G  V
Sbjct: 143 F-KNNAWNKARIEAIGNSIRTWINGVPCANIWDDMTPS--GFIALQVHAIGNASEEGKTV 199

Query: 466 LWRNIRVKT 474
            W++IR+ T
Sbjct: 200 SWKDIRICT 208

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 114/441 (25%), Positives = 195/441 (44%), Gaps = 63/441 (14%)

Query: 87  AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQ 146
           A  W+ LF+G+ L+GW+  NG+A    +++VDGAI    +    N ++   + Y +F L+
Sbjct: 19  AQTWEPLFNGKNLKGWKKLNGKA---EYKIVDGAIVGISKMGTPNTFLATTKNYGDFILE 75

Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDK-GFAEPLEDWQRCGVDYAMYL 205
           +D+KI  G NSG+      +  Y+   V G ++++   K  ++  + D  R    Y + L
Sbjct: 76  FDFKIDDGLNSGVQLRSESKKDYQNGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTL 135

Query: 206 PDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD---DWFQRKNSGKWENA 262
              A    +    WN ++I      +  ++NG        W D     F         NA
Sbjct: 136 NPAAKTAFK-NNAWNKARIEAIGNSIRTWINGVPCANI--WDDMTPSGFIALQVHAIGNA 192

Query: 263 PEYG--LARKGL-ICLQD-------HGYPAWFRNIKIREL-PRKTEE--EELFNGKDLTG 309
            E G  ++ K + IC  D           A  RN+    + PR+ +E    L++GK   G
Sbjct: 193 SEEGKTVSWKDIRICTTDVERYQTPETEEAPERNMIANTISPREAKEGWALLWDGKTNNG 252

Query: 310 W-----DVYGTEQWYVQDSLL-VCESGPDKQY--GYLATCKYYNDFELTADFKQEADGNS 361
           W     + +  + W ++D +L V +SG  +    G + T + Y +F LT DFK     NS
Sbjct: 253 WRGAKLNAFPEKGWKMEDGILKVMKSGGAESANGGDIVTTRKYKNFILTVDFKITEGANS 312

Query: 362 GIFIRSFVE------EGAKVN-GWQVEVAPKGFDTG-GIYESYGRGWLIQ-IPDDRENFL 412
           G  ++ FV       EG+ +   +Q+    K  D   G+  +   G L   IP   +   
Sbjct: 313 G--VKYFVNPDLNKGEGSAIGCEFQILDDDKHPDAKLGVKGNRKLGSLYDLIPAPEKKPF 370

Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMV------DIQDEKIG------------AGQGRIA 454
            ++++NT  I V  N V  WLNG +++      D+ +  +             + +G I 
Sbjct: 371 NKKDFNTATIIVQDNHVEHWLNGVKLIEYTRNTDMWNALVAYSKYKNWPNFGNSAEGNIL 430

Query: 455 LQIHDGGGIKVLWRNIRVKTL 475
           LQ H   G +V ++N+++K L
Sbjct: 431 LQDH---GDEVWFKNVKIKEL 448
>gi|118073263|ref|ZP_01541446.1| protein of unknown function DUF1080 [Shewanella woodyi ATCC 51908]
 gi|118022335|gb|EAV36156.1| protein of unknown function DUF1080 [Shewanella woodyi ATCC 51908]
          Length = 239

 Score =  112 bits (280), Expect = 5e-23,   Method: Composition-based stats.
 Identities = 73/228 (32%), Positives = 114/228 (50%), Gaps = 7/228 (3%)

Query: 65  VLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQAD 124
           +L   +        N L+++EK AGWQLLF+G+ +  WR++  Q +   W +   +I   
Sbjct: 19  LLLSTVATAGVAADNQLSKKEKEAGWQLLFNGKDMSQWRNFKQQGVNPKWVIDQDSIHLS 78

Query: 125 GEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD 184
           G G  +   ++  + Y+NFEL  DWKIS+ GNSG+ + + +    K+ Y    E Q++D+
Sbjct: 79  GGGGGD---LLTKQAYKNFELTLDWKISRAGNSGI-FVLADELGSKI-YSHAIEVQILDN 133

Query: 185 KGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFE 244
           +  A+   D    G  Y +     A+  V  AGEWN  +I   N  +  + NG  T +  
Sbjct: 134 QRHADNKIDSHLSGSIYDIQASPPASHRV--AGEWNKVRIRMHNNSLSVWQNGILTADLI 191

Query: 245 AWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
             S+ W       K+     +    +G I LQDH  P WF+NIK+REL
Sbjct: 192 VGSEKWNSLVAESKFRTWTGFAQTSQGHIGLQDHSDPVWFKNIKLREL 239
>gi|149175127|ref|ZP_01853750.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
 gi|148846105|gb|EDL60445.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
          Length = 261

 Score =  111 bits (277), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 78/246 (31%), Positives = 125/246 (50%), Gaps = 46/246 (18%)

Query: 57  CMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEV 116
           C+ CL ++      T C+ +P N L+E+E+  G++LLF+G+ L GW+       +G W+V
Sbjct: 9   CLGCLVLLQFHQ--TGCTSEP-NQLSEQEQLQGFKLLFNGKDLSGWQH------SGNWKV 59

Query: 117 VDGAIQADGEGSDENGYIVYD--RIYENFELQWDWKISKGGNSGLLYHVVERP-QYKVPY 173
            DG I   G+G    G +VY+   + +NFEL+++WK+ +G NSG+ Y    RP QY    
Sbjct: 60  EDGIISRAGKG----GSLVYETEHVPDNFELKFEWKVGEGSNSGVYY----RPGQY---- 107

Query: 174 VTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEY 233
               EYQ++D+    +           Y    P       RP G+WNT +IV     +++
Sbjct: 108 ----EYQILDNNKHVDGKNPRTSAASIYFCLPPSHDA--TRPVGDWNTGRIVCQGTVIQH 161

Query: 234 YMNGQKTIEFE------AWSDDWFQRKNSGKWENAPEYGLARKGL-ICLQDHGYPAWFRN 286
           ++NG+K I+ +      AW  +    +            LA +G  + LQDHG P W+R 
Sbjct: 162 WLNGEKVIDLDYTDPRYAWHVELLANRGG---------DLADRGAKLSLQDHGDPVWYRG 212

Query: 287 IKIREL 292
           IK+R +
Sbjct: 213 IKMRSI 218
>gi|149174297|ref|ZP_01852924.1| hypothetical protein PM8797T_03084 [Planctomyces maris DSM 8797]
 gi|148846842|gb|EDL61178.1| hypothetical protein PM8797T_03084 [Planctomyces maris DSM 8797]
          Length = 1079

 Score =  110 bits (276), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 73/200 (36%), Positives = 104/200 (52%), Gaps = 32/200 (16%)

Query: 301 LFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
           LFNGK+L GW  +G +  Y + D+ +V  S P     +L T K Y+DFEL  DFK +   
Sbjct: 30  LFNGKNLDGWVQHGGKAKYDIVDNTIVGTSVPKTPNSFLCTKKMYDDFELQVDFKVDPLL 89

Query: 360 NSGIFIRSFVEE--------------------GAKVNGWQVEVAPKGFD-TGGIYESYGR 398
           NSGI IRS V +                      +V+G+QVE+ P     +GGIY+   R
Sbjct: 90  NSGIQIRSNVYDEDKVLETKGADGKDKKIKIAAGRVHGYQVEIDPSDRAWSGGIYDEGRR 149

Query: 399 GWLIQIPDDR--ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQ 456
           GWL  + D++  +   K+ EWN  RI   G+ + TW+NG    D++D+     +G IALQ
Sbjct: 150 GWLNNLADNKAAQKAFKQNEWNHYRIVCRGDSIKTWINGVPAADLKDDL--TSKGFIALQ 207

Query: 457 IHDGG------GIKVLWRNI 470
           +H  G      G +V WRN+
Sbjct: 208 VHGVGNHPEKVGKQVSWRNV 227
>gi|116624768|ref|YP_826924.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116227930|gb|ABJ86639.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 243

 Score =  109 bits (273), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 74/247 (29%), Positives = 128/247 (51%), Gaps = 39/247 (15%)

Query: 81  LTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDGAIQADGEGSDENGYIVYDRI 139
           +T +EKAAGW+LLFDG++ + W D   ++  +  + + DG I++  + + +       R 
Sbjct: 1   MTAQEKAAGWKLLFDGKSYKNWEDPTKKSPPSNAFTIEDGCIKSLPKANIDEDLFTKQR- 59

Query: 140 YENFELQWDWKISKGGNSGLLYHVV---------------------------ERPQYKVP 172
           +++FEL++DWKIS GGNSG+ Y +                            +RP     
Sbjct: 60  FQDFELEFDWKISPGGNSGIKYRIQDRVMLADEKKGQRFEDRVNASMKDRRKDRPAKGQE 119

Query: 173 YVTGPEYQLIDDKGFAEPLEDW-QRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHV 231
           YV G EYQ++D++   +       + G  Y M  P  A    +P GE+N S+++    HV
Sbjct: 120 YVIGFEYQVLDNEKNPDARRGTNHQAGALYDMISP--AKDATKPVGEFNHSRLLVKGDHV 177

Query: 232 EYYMNGQKTIEFEAWSDDWFQRKNSGKW-ENAPEYGL-----ARKGLICLQDHGYPAWFR 285
           E+++NG+K ++  +  D    + ++ +W  ++P Y L      ++  I +Q+H   AWF+
Sbjct: 178 EHWLNGEKVVD-GSLKDPGVAKGSAARWGTSSPVYDLLVNQPRKECQISVQNHNSDAWFK 236

Query: 286 NIKIREL 292
           NIKIR+L
Sbjct: 237 NIKIRKL 243
>gi|156861848|gb|EDO55279.1| hypothetical protein BACUNI_00951 [Bacteroides uniformis ATCC 8492]
          Length = 291

 Score =  106 bits (265), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 70/231 (30%), Positives = 115/231 (49%), Gaps = 26/231 (11%)

Query: 87  AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQ-----ADGEGSDENGYIVYDRIYE 141
           A G+  +FDG++L+GWR Y    +   W + DG ++          + E G +++   ++
Sbjct: 61  ADGYITIFDGKSLDGWRGYGKDKVPSRWIIEDGCLKFCGTGTGEGQTGEGGDLIFAHKFK 120

Query: 142 NFELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPL 191
           NFEL+ +WKISKGGNSG+ Y   E            +  Y++ PE+Q++D+     A+  
Sbjct: 121 NFELELEWKISKGGNSGIFYLAQEVTSKDKDGNEVLEPIYISAPEFQVLDNANHPDAKLG 180

Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
           +D  R        +P     N +P GEWN +KI+   G V +  N +  +E+  W+  W 
Sbjct: 181 KDNNRQAASLYDMIPA-VPQNAKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWT 239

Query: 252 Q-----RKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
           +     + +  KW  A E      G   +G I +QDHG   W+RNI+++ L
Sbjct: 240 EMLQASKFSEEKWPLAFELLNNCGGENHEGFIGVQDHGDDVWYRNIRVKVL 290
>gi|116625197|ref|YP_827353.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116228359|gb|ABJ87068.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 212

 Score =  105 bits (262), Expect = 7e-21,   Method: Composition-based stats.
 Identities = 65/179 (36%), Positives = 98/179 (54%), Gaps = 5/179 (2%)

Query: 300 ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
           +LFNGKDL+GW   G E+W V+D  +  + G  K+YGYL T K Y DF L+  FK E DG
Sbjct: 33  QLFNGKDLSGWVNVGHEKWTVEDGTIHGQ-GVTKEYGYLRTEKQYKDFWLSIRFKCEDDG 91

Query: 360 NSGIFIRSFVEEGAK--VNGWQVEVAPK-GFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
           NSG++  +  + G      G Q E+        GG+Y   GRGW+     + E  ++  +
Sbjct: 92  NSGVYFHTDFKPGTVDVSKGMQFEIDRTLNHHNGGLYGD-GRGWIAWPSPEYEQVIRPTD 150

Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           WN   ++V GN +   LNG  ++D  D    +  G IALQ+H GG   + +++I ++ +
Sbjct: 151 WNEFLLKVEGNHMVAILNGIAIIDFTDPTPKSFDGYIALQLHSGGEGNMRFKDIYLRDM 209
>gi|149177059|ref|ZP_01855667.1| hypothetical oxidoreductase [Planctomyces maris DSM 8797]
 gi|148844124|gb|EDL58479.1| hypothetical oxidoreductase [Planctomyces maris DSM 8797]
          Length = 542

 Score =  105 bits (262), Expect = 7e-21,   Method: Composition-based stats.
 Identities = 74/225 (32%), Positives = 114/225 (50%), Gaps = 25/225 (11%)

Query: 74  SEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY 133
           S  P N LT  EK AGW+LLF+G+   GW+  NG+ +  P E  DGA+     G    GY
Sbjct: 337 SSAPDNTLTSAEKEAGWKLLFNGKDYSGWKCNNGKPIAAPIE--DGALVPYKSG----GY 390

Query: 134 -IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLE 192
            IVYD+ + +F+ + D K+ +  NSG+ + V +    K P  TG E Q++   G    + 
Sbjct: 391 LIVYDKPFADFKFKCDVKMPEECNSGIFFRVGD---LKNPVQTGFEAQVLTGDGTG--MH 445

Query: 193 DWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQ 252
           D+   G  Y +  P  +       GEW   +I     HV   +NG+   +  A  D+W +
Sbjct: 446 DF---GAIYDLVAP--SVNRASKPGEWTNLEITCQGPHVSVAVNGKVVAKLNA--DEWTE 498

Query: 253 -----RKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
                  +S K+++A +    RKG +  QDHG+  W++N+K+ EL
Sbjct: 499 PGKRLDGSSHKFKDAVK-DFPRKGYLGFQDHGHKVWYKNVKLLEL 542
>gi|149178817|ref|ZP_01857398.1| hypothetical protein PM8797T_06527 [Planctomyces maris DSM 8797]
 gi|148842358|gb|EDL56740.1| hypothetical protein PM8797T_06527 [Planctomyces maris DSM 8797]
          Length = 1742

 Score =  104 bits (259), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 69/187 (36%), Positives = 109/187 (58%), Gaps = 10/187 (5%)

Query: 290  RELPRKTEEEEL---FNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYND 346
            +++P K   + L   FNG+DL GW    ++ W V++  +V  S   KQ  +L +    +D
Sbjct: 1555 QQVPMKATPDNLKLFFNGQDLAGW-TGNSQLWSVENGEIVGRSPGIKQNEFLVSDLQVSD 1613

Query: 347  FELTADFKQEAD-GNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIP 405
            FEL    K   D GNSGI  RS +E G+ V G+Q + A KG+  G +YE +GRG L +  
Sbjct: 1614 FELKLKVKLTPDIGNSGIQFRSSLEPGSHVKGYQAD-AGKGW-WGKLYEEHGRGLLFK-- 1669

Query: 406  DDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKV 465
            +  E ++++ EWN  RI  VG+Q+ T++NG    ++ D + GA  G IA QIH GG ++V
Sbjct: 1670 ESGEAYVRKGEWNEYRIVAVGSQIRTFINGNLCTNLNDPQ-GAKTGIIAFQIHSGGPMEV 1728

Query: 466  LWRNIRV 472
             ++++ +
Sbjct: 1729 RFKDLEL 1735
>gi|88713768|ref|ZP_01107849.1| hypothetical protein FB2170_00670 [Flavobacteriales bacterium
           HTCC2170]
 gi|88707895|gb|EAR00134.1| hypothetical protein FB2170_00670 [Flavobacteriales bacterium
           HTCC2170]
          Length = 461

 Score =  104 bits (259), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 67/208 (32%), Positives = 103/208 (49%), Gaps = 7/208 (3%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDG---AIQADGEGSDENGYI 134
           N +  +E   GWQ+L+DG+T  GWR           WE+ +G    + + GE S   G I
Sbjct: 238 NKVAVDEIKNGWQMLWDGKTTNGWRGARLDEFPENGWEITNGILTVLPSGGEESAAGGDI 297

Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLE-- 192
           V   +Y +FEL+ D+KI++G NSG+ Y+V            G EYQ++DD    +  +  
Sbjct: 298 VTKEVYGDFELKVDFKITEGANSGIKYYVDTDLNKGPGSSIGLEYQILDDARHPDAKKGN 357

Query: 193 -DWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
            +  R        +       V P GEWNT+ I+  +  VE+ +NG   +++E  SD + 
Sbjct: 358 HEGSRTVASLYDLIQAATDKPVNPIGEWNTAHIISKDNQVEHRLNGMTVLKYERKSDSYK 417

Query: 252 QRKNSGKWENAPEYGLARKGLICLQDHG 279
           +  +  K+   P +G   KG I LQDHG
Sbjct: 418 KLVSESKYVKWPNFGEVEKGHILLQDHG 445

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 63/188 (33%), Positives = 102/188 (54%), Gaps = 16/188 (8%)

Query: 300 ELFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
           E+F+G+ L GW   G E  Y V++  +V  +  D    ++ + K Y DF L  ++  ++ 
Sbjct: 35  EIFDGETLNGWTQKGGEANYTVREGSIVGSTIHDTPNSFMTSDKMYGDFILELEYLVDST 94

Query: 359 GNSGIFIRSFVEEG---AKVNGWQVEVAPKGFD-TGGIYESYGRGWL---IQIPDDRENF 411
            NSGI IRS         +V+G+Q+E+ P     + GIY+   RGWL   I  PD ++ F
Sbjct: 95  MNSGIQIRSNSYPHYMHGRVHGYQIEIDPSDRAWSAGIYDEGRRGWLNNLIDNPDAQKGF 154

Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----GIKVL 466
            K+ +WN  RI  +G+ + TW+NG     + D+K  +  G I LQ+H  G     G +++
Sbjct: 155 -KQNDWNHYRIEAIGDTLKTWINGIPAAHLIDDKTAS--GFIGLQVHSIGKDKKEGTEII 211

Query: 467 WRNIRVKT 474
           W+NI++ T
Sbjct: 212 WKNIKILT 219
>gi|116621745|ref|YP_823901.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116224907|gb|ABJ83616.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 214

 Score =  100 bits (250), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 70/219 (31%), Positives = 109/219 (49%), Gaps = 14/219 (6%)

Query: 81  LTEEEKAAG-WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRI 139
           +T    AAG W+ LFDG+T  GW +  G+     W V DG ++   +          D +
Sbjct: 1   MTVAAGAAGEWRTLFDGKTSAGWLEITGKPFPATWTVEDGCLKTSPKPGGMQDIRTVD-V 59

Query: 140 YENFELQWDWKISKGGNSGLLYHVVERPQY-----KVPYVTGPEYQLIDDKGFAEPLEDW 194
           + NFEL++DWK+   GNSG+ Y V +  ++     +     G EYQL DD       +  
Sbjct: 60  FRNFELEFDWKMLADGNSGVKYLVQKVDEWTNKDGRQARARGLEYQLADDHNPDAASDPA 119

Query: 195 QRCGVDYAMYLPDFATMNVRPA-GEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQR 253
           +  G  Y++  P      + P  GE+N S++V + GHVE+++NG K +EF    D   Q+
Sbjct: 120 RVAGSLYSVIAP---VPKITPKIGEFNHSRLVVNGGHVEHWLNGTKVVEFST-GDAAVQK 175

Query: 254 KNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           +   +     +  L  +G I LQ+H    WFR I++R L
Sbjct: 176 QL--RTLRGKDGELLEEGPISLQNHSSEVWFRGIRVRTL 212
>gi|32472633|ref|NP_865627.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
 gi|32443870|emb|CAD73311.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
          Length = 429

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 75/191 (39%), Positives = 103/191 (53%), Gaps = 18/191 (9%)

Query: 301 LFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYG-YLATCKYYNDFELTADFKQEAD 358
           LFNG DL+GW    G+ ++ V+D ++V E  P+     +L + K + DF L   +K    
Sbjct: 33  LFNGSDLSGWVKRGGSAKYRVEDGVIVGECAPNTPGNTFLCSDKEFGDFVLKLRYKFLES 92

Query: 359 GNSGIFIRSFV-EEG--AKVNGWQVEVAPKGFDTGGIYESYGRG------WLIQI-PDDR 408
           GNSG+  RS   EEG   +V G+Q E+ P G  TG IY+   RG      WL    P +R
Sbjct: 93  GNSGVQFRSASREEGDRQRVFGYQAEMRPGGDMTGRIYDEGRRGHKHGIIWLDAFTPQER 152

Query: 409 ENFLKER----EWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIK 464
            +  +E     EWN + I+ VG  + TWLNG  +VDI D    + +G I LQIH G    
Sbjct: 153 LDAAQESCRPGEWNDLEIQCVGPSIKTWLNGNLVVDIFDSF--SMKGFIGLQIHSGETGS 210

Query: 465 VLWRNIRVKTL 475
           V W++IRVK L
Sbjct: 211 VAWKDIRVKDL 221

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 106/447 (23%), Positives = 188/447 (42%), Gaps = 71/447 (15%)

Query: 61  LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
           L   +  G L + +   +   TEE    G+  LF+G  L GW    G A    + V DG 
Sbjct: 5   LAAAITIGLLASITHNVEAADTEE----GFVSLFNGSDLSGWVKRGGSA---KYRVEDGV 57

Query: 121 IQAD-GEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEY 179
           I  +    +  N ++  D+ + +F L+  +K  + GNSG+ +    R +     V G + 
Sbjct: 58  IVGECAPNTPGNTFLCSDKEFGDFVLKLRYKFLESGNSGVQFRSASREEGDRQRVFGYQA 117

Query: 180 QLIDDKGFAEPLEDWQRCGVDYAM-----YLP----DFATMNVRPAGEWNTSKIVFDNGH 230
           ++         + D  R G  + +     + P    D A  + RP GEWN  +I      
Sbjct: 118 EMRPGGDMTGRIYDEGRRGHKHGIIWLDAFTPQERLDAAQESCRP-GEWNDLEIQCVGPS 176

Query: 231 VEYYMNGQKTIE-FEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDH----GYPAWFR 285
           ++ ++NG   ++ F+++S                      KG I LQ H    G  AW +
Sbjct: 177 IKTWLNGNLVVDIFDSFS---------------------MKGFIGLQIHSGETGSVAW-K 214

Query: 286 NIKIRELPRKTEEEELFNGKD----LTGWDVYGTEQW-YVQDSLLV-CESGPDKQYGYLA 339
           +I++++L     +       D    L G      E+W + +D +L    S    + G + 
Sbjct: 215 DIRVKDLGESQWQSFFVKNDDGEYGLEGARFVLPEEWSFTKDGVLHGVHSKSQGKDGLVI 274

Query: 340 TCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKV-NGWQVEVAPKGFD-----TGGIY 393
           +   +++F     ++    GNS ++ R+   +   V  G+Q E+A  G D     T GI 
Sbjct: 275 SDDNFDNFIARVTYRMRG-GNSALYFRAEETDAPWVLRGFQNEIANNGKDCALWHTAGII 333

Query: 394 ESY---GRGWLIQIPDDRENFL-----KEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEK 445
           +     GRGW++      + F+     K+ +WNT      G+++   LNG    DI DE+
Sbjct: 334 DGNTIPGRGWIVT----NDEFVGKVRNKDDQWNTTCTAAYGDRLVQTLNGFCTSDIIDEE 389

Query: 446 IGAGQGRIALQIHDGGGIKVLWRNIRV 472
                G++ LQ+H G   ++ +++  V
Sbjct: 390 C-EKTGKLGLQMHGGTDCEMYFKDFEV 415
>gi|32475761|ref|NP_868755.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
 gi|32446304|emb|CAD76132.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
          Length = 446

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 119/452 (26%), Positives = 190/452 (42%), Gaps = 76/452 (16%)

Query: 61  LCVMVLFGALTACSEKPQNVLTEEEKAAG-WQLLFDGQTLEGWRDYNGQALTGPWEVVDG 119
            CV +  G++    + P+   TE +  A   Q LFDG++L GW +       G  EVVDG
Sbjct: 24  FCVSICCGSVATGDDMPKPAKTESQAPANEMQSLFDGKSLTGWTN---PYEWGKTEVVDG 80

Query: 120 AIQADGEGSDENGYIVYDRIYENFELQWDWKISKG-GNSGLLYHVVERPQYKVPYVTGPE 178
            I      +D+  ++V ++I++++E + + K+ +G  NSG +      P     Y    E
Sbjct: 81  EIHLT---ADKKFFLVTEKIFQDYEFEGEVKLPEGKSNSGFMARGQVSPNKVFGYQA--E 135

Query: 179 YQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
               D +      ++ +R  ++     P+      R    WN  +I     H+++++N  
Sbjct: 136 ADPTDRRWSGGLYDEGRRQWLNPLWEQPEAQAAFDR--DRWNRYRIRCVGNHLQFFINDV 193

Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAW---FRNIKIRELPRK 295
            T        D+F   N               G I LQ HG       FRN+K+R L   
Sbjct: 194 PTT-------DYFDPVN-------------LSGRIGLQHHGEKGQTYRFRNLKVRNLGSH 233

Query: 296 TEEEELFNGKDLTGWDVYGTEQWYVQDSLLV--CESGPDKQYGYLATCKYYND--FELTA 351
            E + LF+GK L GW+  G   W V D +L     S  ++  G L +     D  + +  
Sbjct: 234 -EWKPLFDGKSLDGWETVGGGTWTVVDGILQGRASSEANEPNGMLYSKHPMTDGTYRIEY 292

Query: 352 DFKQEADGNSGIFIRSFVEEGAK-VNGWQVEVAPKGFDTGGIYESYGRGWLIQ------- 403
            FK+   G+SG F+RS + E    V G Q E+     + GG+Y++ G GWL++       
Sbjct: 293 RFKK---GDSGFFVRSEITENKPFVKGVQCEIDNSD-EVGGLYQTGGAGWLVRPLHYLET 348

Query: 404 -IPDDRENFLK----------------------EREWNTMRIRVVGNQVTTWLNGEQMVD 440
             P DR   +                       E  WN M + V G ++   LN    VD
Sbjct: 349 GFPKDRHAVVNRHWKQAREGLQLDKKPVVSDDDETPWNMMTVSVHGKRIVVHLNDCLAVD 408

Query: 441 IQDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
              E + A  G IALQ+H    ++V +R + +
Sbjct: 409 HVVEDL-ADSGVIALQLHGNQDLEVDFRKVEM 439

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 63/185 (34%), Positives = 91/185 (49%), Gaps = 13/185 (7%)

Query: 297 EEEELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK- 354
           E + LF+GK LTGW + Y   +  V D  +   +  DK++ +L T K + D+E   + K 
Sbjct: 53  EMQSLFDGKSLTGWTNPYEWGKTEVVDGEIHLTA--DKKF-FLVTEKIFQDYEFEGEVKL 109

Query: 355 QEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQI---PDDREN 410
            E   NSG   R  V    KV G+Q E  P     +GG+Y+   R WL  +   P+ +  
Sbjct: 110 PEGKSNSGFMARGQVSPN-KVFGYQAEADPTDRRWSGGLYDEGRRQWLNPLWEQPEAQAA 168

Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNI 470
           F ++R WN  RIR VGN +  ++N     D  D       GRI LQ H   G    +RN+
Sbjct: 169 FDRDR-WNRYRIRCVGNHLQFFINDVPTTDYFDPV--NLSGRIGLQHHGEKGQTYRFRNL 225

Query: 471 RVKTL 475
           +V+ L
Sbjct: 226 KVRNL 230
>gi|87311549|ref|ZP_01093668.1| hypothetical protein DSM3645_02096 [Blastopirellula marina DSM
           3645]
 gi|87285805|gb|EAQ77720.1| hypothetical protein DSM3645_02096 [Blastopirellula marina DSM
           3645]
          Length = 220

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 75/194 (38%), Positives = 103/194 (53%), Gaps = 24/194 (12%)

Query: 302 FNGKDLTGW-DVYGTEQWYVQDSLLV---CESGPDKQYGYLATCKYYNDFELTADFKQEA 357
           FNGKDL+GW    GT  + V+D  ++    E G    +  L T K Y DFEL  + K + 
Sbjct: 31  FNGKDLSGWTQKNGTATYVVEDGGVIRGKTEVGSPNSF--LCTDKDYGDFELEFEVKCDD 88

Query: 358 DGNSGIFIRSFVEEG------AKVNGWQVEVAPKGFDTGGIY-ESYGRGWLIQIPDDR-- 408
             NSG+ IRS   E        +VNG QVE+     + G +Y E+ GRGWL   P DR  
Sbjct: 89  GLNSGVQIRSQTAEAKGDQKFGRVNGPQVEIEKSVGEAGYVYGEATGRGWLT--PADRLK 146

Query: 409 -ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQ--GRIALQIH----DGG 461
             +  K  EWN+ R+   G ++ T++NGE + D+ DE+I      G I LQ+H    D G
Sbjct: 147 PHDHFKNGEWNSYRVVAKGPRIQTFINGEPIEDLTDEEIYKTHPTGFIGLQVHGIGKDQG 206

Query: 462 GIKVLWRNIRVKTL 475
             +V W+NIR+K L
Sbjct: 207 PYEVRWKNIRIKPL 220
>gi|87308076|ref|ZP_01090218.1| hypothetical protein DSM3645_20802 [Blastopirellula marina DSM
           3645]
 gi|87289158|gb|EAQ81050.1| hypothetical protein DSM3645_20802 [Blastopirellula marina DSM
           3645]
          Length = 229

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 72/201 (35%), Positives = 102/201 (50%), Gaps = 28/201 (13%)

Query: 301 LFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
           LFNGK+L G+  +G +  Y ++   +V  S  +    +L T K Y DF L  DFK +   
Sbjct: 30  LFNGKNLEGFTQHGGKAVYTIEGDEIVGTSTLNTPNTFLCTNKEYGDFILEVDFKVDPKL 89

Query: 360 NSGIFIRSFVEEGA----------------KVNGWQVEVAPKGFD-TGGIYESYGRGWLI 402
           NSGI IRS V   A                +V+G+QVE+ P     +GGIY+   RGWL 
Sbjct: 90  NSGIQIRSQVFPEATEVDFGDGKMKKMAKDRVHGYQVEIDPSARAWSGGIYDEARRGWLN 149

Query: 403 QIPDDRE--NFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDG 460
            + ++ E     K+ +WN  RI   G+ + TW+NG    D++D      +G IALQ+H  
Sbjct: 150 DLKNNPEAGKAFKQDDWNHYRIECRGDSIKTWINGVPAADLKDGL--TSKGLIALQVHGI 207

Query: 461 GGIK------VLWRNIRVKTL 475
           GG K      V W+NI +K L
Sbjct: 208 GGDKEKAGTQVRWKNIMIKEL 228
>gi|32470813|ref|NP_863806.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
 gi|32442958|emb|CAD71479.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
          Length = 272

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 70/222 (31%), Positives = 111/222 (50%), Gaps = 32/222 (14%)

Query: 83  EEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAI-QADGEGSDENGYIVYDR--I 139
           + + AA +  LFDG++ +GW        +G W + DGA  +A G GS     + Y R  +
Sbjct: 47  QSDAAADFVELFDGKSFDGWEH------SGNWRIEDGAFFRAAGGGS-----LTYKRTLV 95

Query: 140 YENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGV 199
            ++FEL+++WK+S G NSG+ Y    RP          EYQ++D+ G   P  +  R   
Sbjct: 96  PDDFELRFEWKVSDGCNSGVYY----RPGQV-------EYQVLDNVG--SPYGENPRQSA 142

Query: 200 DYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKW 259
               +    +    RP GEWN+ ++V     ++++ NGQK ++F+     W +       
Sbjct: 143 ASLFFCMAPSKDATRPVGEWNSGRVVCKGTVIQHWFNGQKVLDFDYTDPKWAEMVRLLTI 202

Query: 260 ENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEEL 301
                 G  R G + LQDHG P W+RN++ RE+P   +EE L
Sbjct: 203 RGGDLTG--RGGELWLQDHGQPVWYRNLRWREIP---DEESL 239
>gi|149195996|ref|ZP_01873052.1| hypothetical protein LNTAR_22659 [Lentisphaera araneosa HTCC2155]
 gi|149140843|gb|EDM29240.1| hypothetical protein LNTAR_22659 [Lentisphaera araneosa HTCC2155]
          Length = 231

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 71/188 (37%), Positives = 104/188 (55%), Gaps = 13/188 (6%)

Query: 301 LFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
           LFNGKDL+GW V  GT  + V++  ++ ++    +  +L + K ++DFELT + K   D 
Sbjct: 43  LFNGKDLSGWTVRNGTGTYRVENGAIIGKTTDGSKNTFLCSDKLFSDFELTLEVKLINDQ 102

Query: 360 -NSGIFIRSFVEEGAK-VNGWQVEVAP---KGFDTGGIYESYGRGWLIQIPDDR-ENFLK 413
            NSGI IRS      K VNG QVEV     KG ++G IY     GW+      +    +K
Sbjct: 103 LNSGIQIRSNDNNAKKRVNGPQVEVEATKGKGAESGYIYGEACGGWMTPKAKLKPHTLMK 162

Query: 414 EREWNTMRIRVVGNQVTTWLNGEQMVDIQDE-KIGAG-QGRIALQIHD----GGGIKVLW 467
             EWNT+ I   G ++ TW+NG Q+ D+ D+ K+ +  +G I LQ+H      G  +V W
Sbjct: 163 NGEWNTVLIIAKGAKIQTWINGTQVSDLTDDAKLKSHPEGFIGLQVHSIPKGKGPYEVTW 222

Query: 468 RNIRVKTL 475
           +NI +K L
Sbjct: 223 KNIMIKDL 230
>gi|87308201|ref|ZP_01090343.1| probable secreted glycosyl hydrolase [Blastopirellula marina DSM
           3645]
 gi|87289283|gb|EAQ81175.1| probable secreted glycosyl hydrolase [Blastopirellula marina DSM
           3645]
          Length = 401

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 106/410 (25%), Positives = 174/410 (42%), Gaps = 64/410 (15%)

Query: 61  LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
           L  +++     A +E+  N L+E E A GW LLFDG+T  GW+   G+     W V  G 
Sbjct: 11  LLALIVAPLTAANAEQAPNTLSEAEIADGWLLLFDGETTFGWKS-EGKI---DWTVDQGV 66

Query: 121 IQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQ 180
           I+A      + G +     + N+EL  D++     NS +      +P+        P Y 
Sbjct: 67  IRAT---KGDVGQLRTTTQFANYELHVDFRAPAETNSAIFLRTSPKPE-------SPTYG 116

Query: 181 LIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRP---AGEWNTSKIVFDNGHVEYYMNG 237
             +      P  +    G            M V P   A +W++  +  D G +   ++G
Sbjct: 117 CYELN--IAPESNSYPTG-------SLVGRMKVTPPCVADQWHSFDVTADQGTIVVKLDG 167

Query: 238 QKTIEFEAWSDDWFQRKNSGKWENAPEY-GLARKGLICLQDHGYPAWFRNIKIRELPRKT 296
            + +++E                  P+Y GL   G I LQ +     FRN+K++ L  ++
Sbjct: 168 AEVLKYED-----------------PQYVGL---GFIGLQHNQGEIEFRNVKLKPLGMQS 207

Query: 297 EEEELFNGKDLTGWDVY---GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADF 353
               +FNGKDLTGW  Y    +E     +  +  ++GP    G L T K Y DF L  D 
Sbjct: 208 ----IFNGKDLTGWKTYPAMESEFTVTDEGTIHAKNGP----GQLETEKSYGDFVLRLDA 259

Query: 354 KQEADG-NSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGG--IYESYGRGWLIQIPDDREN 410
              A   NSG+F R     G K+ G++ ++   G++     +   +G G + +    R  
Sbjct: 260 ITHAKNLNSGVFFRCI--PGDKMMGYESQIH-NGYEAEDRTLPIDHGTGAIFRRMKARRV 316

Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDG 460
              +++W T  +   G  ++ W+NG Q+ D  D++      R  L+I  G
Sbjct: 317 VSDDQKWFTKTLIAQGPHISVWVNGYQVTDWTDQRKPDENPRRGLRIEPG 366
>gi|149179157|ref|ZP_01857726.1| hypothetical protein PM8797T_17619 [Planctomyces maris DSM 8797]
 gi|148842017|gb|EDL56411.1| hypothetical protein PM8797T_17619 [Planctomyces maris DSM 8797]
          Length = 227

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 70/196 (35%), Positives = 98/196 (50%), Gaps = 24/196 (12%)

Query: 301 LFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
           LF+GK L  W  + GT  + V+D  +   +       +L T K Y +FEL  + K   + 
Sbjct: 34  LFDGKTLNNWVQHNGTATYMVKDGTIEGTTSEGSPNSFLCTKKNYGNFELEFEVKVHNNL 93

Query: 360 NSGIFIRSFVEEG-AKVNGWQVEVAPKGFDTGG----------IYESYGRGWLIQIPDDR 408
           NSG+ IRS  E G  +VNG QVE+   G D G            Y   G GW+   P+D+
Sbjct: 94  NSGVQIRSQQENGDGRVNGPQVEIEASG-DNGAEAGYIYGEAIKYAGKGIGWMT--PEDK 150

Query: 409 EN---FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQ--GRIALQIH----D 459
                 LK+ EWN  R+   G ++ TW+NGEQ+ D+ DE++      G I LQ+H     
Sbjct: 151 RTPHKNLKDGEWNQFRVVANGPRIQTWVNGEQVSDLVDERVYKSHPTGFIGLQVHGIKKG 210

Query: 460 GGGIKVLWRNIRVKTL 475
            G   V W+NIR+K L
Sbjct: 211 TGPYSVAWKNIRIKEL 226
>gi|87308956|ref|ZP_01091094.1| hypothetical protein DSM3645_19403 [Blastopirellula marina DSM 3645]
 gi|87288299|gb|EAQ80195.1| hypothetical protein DSM3645_19403 [Blastopirellula marina DSM 3645]
          Length = 1348

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 10/179 (5%)

Query: 300  ELFNGKDLTGWDVYGTEQ-WYVQDSLLVCESGPD-KQYGYLATCKYYNDFELTADFKQ-E 356
            ELFNG+DL GW+  G    W V++  +V ++        +L +     +F L  + +  +
Sbjct: 1173 ELFNGQDLNGWN--GDRNLWSVENGEIVGKTLTGIPANSFLISDLAAANFRLKLEIRLVK 1230

Query: 357  ADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
             +GNSGI  RS    G  V G+Q +     +  G +YE +GRG L+ +    E+ LK  +
Sbjct: 1231 NEGNSGIQFRSNALPGGSVQGYQADAGAGWW--GKLYEEHGRG-LLWVKSGEEH-LKPGD 1286

Query: 417  WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
            WN   I   GNQV T+LNG+  VD+ D+K G  QG  ALQ+H GG  +V +R+++++ L
Sbjct: 1287 WNQYEIVAQGNQVKTFLNGQPCVDLHDDK-GVKQGVFALQLHSGGPTEVRFRHLQLEIL 1344
>gi|32471625|ref|NP_864618.1| hypothetical protein RB1854 [Rhodopirellula baltica SH 1]
 gi|32396996|emb|CAD72299.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 703

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 58/184 (31%), Positives = 103/184 (55%), Gaps = 9/184 (4%)

Query: 299 EELFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
           + LF+GK L GW+   GT ++ V++  +V  +       +L + + Y++FELT +   + 
Sbjct: 521 KSLFDGKTLDGWNRKNGTAKYRVENGTIVGTTSEGSPNSFLCSDENYDNFELTFEVNVDE 580

Query: 358 DGNSGIFIRSFV-EEGAKVNGWQVEVAPKGFDTGGIY-ESYGRGWLIQIPDDRENFLKER 415
             NSG+ IRS   E+G +V G QVE+     + G IY E+ GRGW+ +    ++ + K  
Sbjct: 581 GLNSGVQIRSQSREKGGRVYGPQVEIESAPGEAGYIYSEATGRGWITKEQPIKDAY-KNG 639

Query: 416 EWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH----DGGGIKVLWRNIR 471
           ++N   +R  GN++  W+  +++ DIQD +  +  G + LQ+H      G  +V WR+I+
Sbjct: 640 KFNRYLVRAHGNRIQVWIGDQKISDIQDPE-SSTDGFLGLQVHGIKAGTGPYEVSWRDIK 698

Query: 472 VKTL 475
           ++ L
Sbjct: 699 IRNL 702

 Score = 60.1 bits (144), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 5/103 (4%)

Query: 86  KAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFEL 145
           KA GW+ LFDG+TL+GW   NG   T  + V +G I         N ++  D  Y+NFEL
Sbjct: 516 KADGWKSLFDGKTLDGWNRKNG---TAKYRVENGTIVGTTSEGSPNSFLCSDENYDNFEL 572

Query: 146 QWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA 188
            ++  + +G NSG+   +  + + K   V GP+ ++    G A
Sbjct: 573 TFEVNVDEGLNSGV--QIRSQSREKGGRVYGPQVEIESAPGEA 613
>gi|149199908|ref|ZP_01876936.1| hypothetical protein LNTAR_09721 [Lentisphaera araneosa HTCC2155]
 gi|149136977|gb|EDM25402.1| hypothetical protein LNTAR_09721 [Lentisphaera araneosa HTCC2155]
          Length = 223

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 69/195 (35%), Positives = 97/195 (49%), Gaps = 19/195 (9%)

Query: 300 ELFNGKDLTGWDVYGTE-QWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQ-EA 357
           ELFNGK+L GW     E  + V+D  ++  +       +L +   Y DFEL  + K  + 
Sbjct: 25  ELFNGKNLDGWTEKTKEGSFRVEDGAIIGTAKDGMGTTFLCSNNNYGDFELEFETKLIDN 84

Query: 358 DGNSGIFIRSFVEE------GAKVNGWQVEVAPKGFD---TGGIYESYGRGWLIQIPDDR 408
             NSG+ IRS ++E         V G QVEV  + F+   +G IY    + WL    D +
Sbjct: 85  KLNSGVQIRSRLQEPDGKQKHPAVYGPQVEVTGRNFEKNQSGYIYGQAWKTWLTPKEDKK 144

Query: 409 -ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQ---DEKIGAGQGRIALQIH----DG 460
              F K+ EWN  R+   GNQ+TTWLNG ++V        +     G IALQ+H      
Sbjct: 145 AHQFFKDGEWNHFRVLAKGNQITTWLNGNKIVTTTVPAKRQQSNPSGFIALQVHGIKKGT 204

Query: 461 GGIKVLWRNIRVKTL 475
           G  +V W+NI+VK L
Sbjct: 205 GPFQVAWKNIKVKEL 219
>gi|149197913|ref|ZP_01874962.1| hypothetical protein LNTAR_05481 [Lentisphaera araneosa HTCC2155]
 gi|149139134|gb|EDM27538.1| hypothetical protein LNTAR_05481 [Lentisphaera araneosa HTCC2155]
          Length = 442

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 96/364 (26%), Positives = 156/364 (42%), Gaps = 80/364 (21%)

Query: 132 GYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYK-----VPYVTGPEYQLIDDK- 185
           G I   + Y+++ L++++K++   N+G+       PQ K     +P  +  E Q++D+  
Sbjct: 64  GNIYTAKEYQDYVLRFEFKLAPHANNGIGLRA-PHPQDKSVKRHIPAYSTVEIQILDNTH 122

Query: 186 -GFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFE 244
             +A+ L+D Q  G  Y +       +  +P GEWN  +I     H++  +NG+K ++ +
Sbjct: 123 PKYAK-LKDHQFHGSAYGIAAAKRGFL--KPLGEWNYQEIHLKASHLQVILNGEKILDCD 179

Query: 245 AWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEE------ 298
             S D      +GK+    +      G I +  HG    FRNI + EL     +      
Sbjct: 180 IASGD------AGKYAKGRD---RTSGHIAIAGHGPGVTFRNISVAELDNALSQAQEDNV 230

Query: 299 -----EELFNGKDLTGW-----------------------------DVYGTEQWYVQDSL 324
                 +LFNGKDLT W                             D    + W V D  
Sbjct: 231 APEGFTQLFNGKDLTNWKGLLDRPFDRPHKRKTLKADKLKELQAKADESMKKHWSVTDKG 290

Query: 325 LVCESGPDKQYGY-LATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGW----- 378
            +   G  K+ G+ LAT K Y DFE    +K   +G+SGI++R       +V  W     
Sbjct: 291 ELFFDG--KKGGHSLATLKQYKDFEFHVSWKINQNGDSGIYLRGL----PQVQIWDPSDQ 344

Query: 379 --QVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGE 436
             Q   A KG  +G ++ +   G    +  D+       EWN   IR++G++V+ W NG+
Sbjct: 345 KVQKLGAHKG--SGALWNNPKEGKWPLVKADKPT----GEWNHFFIRMIGDRVSIWTNGK 398

Query: 437 QMVD 440
           Q VD
Sbjct: 399 QTVD 402
>gi|32471039|ref|NP_864032.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32396741|emb|CAD71706.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 889

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 68/191 (35%), Positives = 91/191 (47%), Gaps = 20/191 (10%)

Query: 301 LFNGKDLTGWDVYGTE-QWYVQDSLLVCESGPDKQYGYLATCKY-YNDFELTADFKQEAD 358
           LFNGKDL+GW   G    + V+D +LV +  P     YL+T +  ++DF  T D K E  
Sbjct: 65  LFNGKDLSGWTAKGGSCTFEVKDGILVGQVVPGSNSTYLSTERDDFDDFIFTCDMKWEES 124

Query: 359 GNSGIFIRSFVEEGAKVNGWQVEVAPK----GFD-----TGGIYESYGRG-----WLIQI 404
            NSG+  R+  + G   NG +    P+    GF      +GGIY     G     WL + 
Sbjct: 125 CNSGVMFRAQSKPGK--NGTETVFGPQAEMEGFTQDRHWSGGIYGQSCGGYFYPLWLKEH 182

Query: 405 PDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIK 464
            + R     E  WN + I   GN V TW+NG       D+     +G   LQ+H G    
Sbjct: 183 KEARAA-TTEDIWNRVTISAQGNVVKTWINGVPAAHWIDDG-SYPKGFFGLQVHKGAKGT 240

Query: 465 VLWRNIRVKTL 475
           VLW+NIRVK L
Sbjct: 241 VLWKNIRVKEL 251
>gi|87308660|ref|ZP_01090800.1| probable protein kinase yloP-putative serine/threonine protein
           kinase [Blastopirellula marina DSM 3645]
 gi|87288752|gb|EAQ80646.1| probable protein kinase yloP-putative serine/threonine protein
           kinase [Blastopirellula marina DSM 3645]
          Length = 1534

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 59/191 (30%), Positives = 92/191 (48%), Gaps = 29/191 (15%)

Query: 299 EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
           E +FNG DLTGW   G   W V++  +V E  PD++ G L   K Y+ +EL  ++  EAD
Sbjct: 806 ESIFNGHDLTGWSERGAPGWRVENQQIVSEVSPDRERGSLLLDKLYDAYELEFEYALEAD 865

Query: 359 GNSGIFIRSFVEEGAKVNGW-QVEV----------APKGFDTGGIYESYGRGWLIQIPDD 407
            +SG+F+ +   E A    W Q+++           P    TG +Y       L+     
Sbjct: 866 ADSGLFLNAGFGESALEAQWLQIQLLDDQMGTYDDIPAERRTGSVYGVAAASALVDS--- 922

Query: 408 RENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQ---GRIALQIHDGGGIK 464
                ++  W  MR+R  GN VT ++N    + +   ++G GQ   G I LQ + G G  
Sbjct: 923 -----QKTPWRKMRVRFDGNSVTVYINN---IMVTQHELG-GQYPTGHIGLQRYKGSG-- 971

Query: 465 VLWRNIRVKTL 475
             +RN+R++ L
Sbjct: 972 -KFRNLRIRNL 981
>gi|32472822|ref|NP_865816.1| hypothetical protein RB3944 [Rhodopirellula baltica SH 1]
 gi|32444059|emb|CAD73501.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 513

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 100/427 (23%), Positives = 171/427 (40%), Gaps = 76/427 (17%)

Query: 61  LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP------- 113
           LCV +   A T    +        +  +GW  LF+G+ L GW   +GQ    P       
Sbjct: 83  LCVALCLAAFTTSHAQDAT-----DTQSGWATLFNGKDLSGW---HGQPHFDPYKLAEMS 134

Query: 114 ------------------WEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGG 155
                             W V +G +  DG G+    Y+  D  Y ++EL+ +++     
Sbjct: 135 DEERASKIEEWTADAKSHWSVENGELVNDGHGA----YLTTDEGYSDYELKLEYRTVAKA 190

Query: 156 NSGLLYHVVERPQYKVPYVTGP-EYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVR 214
           +SG+  ++   PQ ++   T   ++ L  + G      +         + L D      +
Sbjct: 191 DSGI--YLKGTPQVQIWDTTDEGKFNLGANLGSGALWNNSPGAAGKDPLVLAD------K 242

Query: 215 PAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLIC 274
           P GEWNT  ++        ++NG++T++  A  +++++R             L   G I 
Sbjct: 243 PFGEWNTVHVIQVGSRTSVWLNGKQTVD-HAIMENYWRRGEP----------LPASGPIQ 291

Query: 275 LQDHGYPAWFRNIKIRELPRKTEEEEL-----------FNGKDLTGWDVYGTEQWYVQDS 323
           LQ HG    +RNI++R+L      + L           F+GK L GW     +    +D 
Sbjct: 292 LQTHGGEIRWRNIQVRQLDTDEANDILASKHNDNFTSVFDGKTLEGWIGAVADYEVTEDG 351

Query: 324 LLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVA 383
            + C+ G   + G L T K Y DF +   F+    GN+G+ IR+  +EG        E+ 
Sbjct: 352 SIQCQKG---RGGNLLTEKEYGDFSVRLRFRLPERGNNGLAIRA-PKEGNPAYAAMTELQ 407

Query: 384 PKGFD---TGGIYESYGRGWLIQIPDDRENFLKE-REWNTMRIRVVGNQVTTWLNGEQMV 439
               D      + E    G    +   +  +L+   EWN  ++ V+G  +   LNG  ++
Sbjct: 408 VLDNDHPAYAKLDERQYHGSAYGMAAAKRGYLRPVGEWNFQQVTVIGPTIRVELNGNVIL 467

Query: 440 DIQDEKI 446
           D    KI
Sbjct: 468 DTDVSKI 474
>gi|149173679|ref|ZP_01852308.1| hypothetical protein PM8797T_04560 [Planctomyces maris DSM 8797]
 gi|148847209|gb|EDL61543.1| hypothetical protein PM8797T_04560 [Planctomyces maris DSM 8797]
          Length = 165

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 53/167 (31%), Positives = 88/167 (52%), Gaps = 12/167 (7%)

Query: 129 DENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKG-- 186
           D  G +  D+ Y+NF L++D+K+  G N+G+ +HV   P+   P   G E Q++DD    
Sbjct: 7   DSGGNLFTDKEYKNFVLRFDFKLEPGANNGIGFHVPLNPKTS-PAYAGKEIQILDDTADK 65

Query: 187 FAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAW 246
           +A+ L+ +Q  G  Y          +++P GEWNT +++ D   V+  +NG   ++F   
Sbjct: 66  YAK-LQKYQYHGSLYGTAPAKRG--HLKPVGEWNTQELLVDGNKVKVTLNGTTIVDF--- 119

Query: 247 SDDWFQRKNSGKWENAPEYGLARK-GLICLQDHGYPAWFRNIKIREL 292
             D    K +G  +     GL R+ G +CL  HG    F+N++I+EL
Sbjct: 120 --DMTDAKKNGTIDGKDHPGLKRESGHLCLCGHGAKIEFKNLRIKEL 164
>gi|149174041|ref|ZP_01852669.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
 gi|148847021|gb|EDL61356.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
          Length = 219

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 66/193 (34%), Positives = 90/193 (46%), Gaps = 22/193 (11%)

Query: 294 RKTEEEE----LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFEL 349
           + TE EE    LFNGKDLTGW V     + V+D ++        +  +L T K Y +FE 
Sbjct: 36  KNTEAEEGWIELFNGKDLTGWTVAEGGPFEVKDGVIEVTG----KRSHLFTDKEYKNFEF 91

Query: 350 TADFKQEADGNSGIFIRS-FVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDR 408
            AD K     NSGIF  + F EEG    G++ +V     D       Y R  L + P   
Sbjct: 92  KADVKTTPGSNSGIFFHTKFQEEGWPTQGYESQVNVSHKDPVKTGSLYNRVKLFKTP--- 148

Query: 409 ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAG------QGRIALQIHDGGG 462
               K+ EW T  I V G  V   +N + ++D  + +   G      +G  ALQ HD   
Sbjct: 149 ---AKDNEWWTQHIIVNGRHVIVKINDQTVIDYTEPEGATGSPSLGEKGSFALQAHDPKS 205

Query: 463 IKVLWRNIRVKTL 475
           + V ++NIRVK L
Sbjct: 206 V-VYYKNIRVKPL 217

 Score = 61.2 bits (147), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 68/251 (27%), Positives = 102/251 (40%), Gaps = 66/251 (26%)

Query: 58  MNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVV 117
           M  L V  L G  +A S + +N   EE    GW  LF+G+ L GW    G    GP+EV 
Sbjct: 17  MAALLVSSLIGNNSAYSGE-KNTEAEE----GWIELFNGKDLTGWTVAEG----GPFEVK 67

Query: 118 DGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQ--------- 168
           DG I+  G+ S    ++  D+ Y+NFE + D K + G NSG+ +H   + +         
Sbjct: 68  DGVIEVTGKRS----HLFTDKEYKNFEFKADVKTTPGSNSGIFFHTKFQEEGWPTQGYES 123

Query: 169 -----YKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSK 223
                +K P  TG  Y  +  K F  P +D                        EW T  
Sbjct: 124 QVNVSHKDPVKTGSLYNRV--KLFKTPAKD-----------------------NEWWTQH 158

Query: 224 IVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPA- 282
           I+ +  HV   +N Q  I++                  +P  G   KG   LQ H   + 
Sbjct: 159 IIVNGRHVIVKINDQTVIDY----------TEPEGATGSPSLG--EKGSFALQAHDPKSV 206

Query: 283 -WFRNIKIREL 292
            +++NI+++ L
Sbjct: 207 VYYKNIRVKPL 217
>gi|87311427|ref|ZP_01093547.1| hypothetical protein DSM3645_25327 [Blastopirellula marina DSM
           3645]
 gi|87285839|gb|EAQ77753.1| hypothetical protein DSM3645_25327 [Blastopirellula marina DSM
           3645]
          Length = 217

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 65/191 (34%), Positives = 96/191 (50%), Gaps = 23/191 (12%)

Query: 301 LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGN 360
           LFNGKDL GW V   + + V+D  +VC     K  G L   K +++F L  +FK E   N
Sbjct: 32  LFNGKDLDGW-VGAVKGYDVEDGAIVCNP---KVGGNLYYGKEFDNFVLRFEFKLEPGAN 87

Query: 361 SGIFIRSFVEEGAKVNGWQVEV---APKGFDTGGIYESYGRGWLIQIPDDRENFLKE-RE 416
           +G+ IRS +E  +  NG ++++     + + T   Y+++G  + + +P  R  FLK   E
Sbjct: 88  NGLAIRSPIEGNSAYNGIELQILDTEDERYKTIKPYQAHGSVYGV-VPAKR-GFLKPIGE 145

Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHD------------GGGIK 464
           WN   +   GNQV   LNGE +VD  D K  +  G +    H             G G K
Sbjct: 146 WNVQEVIADGNQVKVTLNGEVIVD-ADIKEASKDGTMDGNKHPGLLNEKGHIGFLGHGTK 204

Query: 465 VLWRNIRVKTL 475
           V +RNIR+K +
Sbjct: 205 VSFRNIRIKPI 215

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 63/237 (26%), Positives = 111/237 (46%), Gaps = 29/237 (12%)

Query: 61  LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
           L V+ L GAL   +       T   +  G+  LF+G+ L+GW      A+ G ++V DGA
Sbjct: 7   LLVLTLIGALAGAA-------TLHAEDEGFTTLFNGKDLDGWVG----AVKG-YDVEDGA 54

Query: 121 IQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQ 180
           I  + +     G + Y + ++NF L++++K+  G N+GL    +  P        G E Q
Sbjct: 55  IVCNPK---VGGNLYYGKEFDNFVLRFEFKLEPGANNGL---AIRSPIEGNSAYNGIELQ 108

Query: 181 LID--DKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
           ++D  D+ + + ++ +Q  G  Y +       +  +P GEWN  +++ D   V+  +NG+
Sbjct: 109 ILDTEDERY-KTIKPYQAHGSVYGVVPAKRGFL--KPIGEWNVQEVIADGNQVKVTLNGE 165

Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLA-RKGLICLQDHGYPAWFRNIKIRELPR 294
             ++      D  +    G  +     GL   KG I    HG    FRNI+I+ + +
Sbjct: 166 VIVD-----ADIKEASKDGTMDGNKHPGLLNEKGHIGFLGHGTKVSFRNIRIKPIAK 217
>gi|88712821|ref|ZP_01106906.1| hypothetical protein FB2170_09291 [Flavobacteriales bacterium
           HTCC2170]
 gi|88708719|gb|EAR00954.1| hypothetical protein FB2170_09291 [Flavobacteriales bacterium
           HTCC2170]
          Length = 222

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 70/237 (29%), Positives = 118/237 (49%), Gaps = 32/237 (13%)

Query: 61  LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
           L ++VLF   T+C    Q  L ++    G+  LF+G+ L+GW   N       +E +DG 
Sbjct: 13  LLIIVLF---TSCG--AQKGLDDD----GFVSLFNGENLDGWIGNNNS-----YEAIDGM 58

Query: 121 I--QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPE 178
           I    +GEGS  N Y V    Y +F  +++++++ G N+GL  H     +  V Y+ G E
Sbjct: 59  IVVNPNGEGSGGNLYTVDQ--YSDFIFRFEFQLTPGANNGLGIH--SPLEGDVAYL-GKE 113

Query: 179 YQLIDDKG--FAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMN 236
            Q++D+    +A+ L+ +Q  G  Y +       +N  P GEWNT +++ +   +E  +N
Sbjct: 114 LQILDNTADKYAD-LKPYQYHGSVYGIIPAKRGFLN--PVGEWNTQEVIVNGTKIEIRLN 170

Query: 237 GQKTIEFEAWSDDWFQRKNSGKWENAPEYGLAR-KGLICLQDHGYPAWFRNIKIREL 292
           G   ++      D+ +   +G  +     GL R +G I    HG    FRNIKI+++
Sbjct: 171 GTTIVD-----GDFIEASKNGTMDKKEHPGLKRTEGHIGFLGHGDVVRFRNIKIKKI 222
>gi|149179308|ref|ZP_01857869.1| hypothetical protein PM8797T_11551 [Planctomyces maris DSM 8797]
 gi|148841849|gb|EDL56251.1| hypothetical protein PM8797T_11551 [Planctomyces maris DSM 8797]
          Length = 853

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 58/186 (31%), Positives = 104/186 (55%), Gaps = 14/186 (7%)

Query: 299 EELFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDK--QYGYLATCKYYNDFELTADFKQ 355
           + LFNGK   GW  +G ++ + ++D  ++  S  +K  +  +L + K Y+DFEL  +FK 
Sbjct: 673 QSLFNGKTFDGW--HGNKKIFRIEDGEIIAGSLTEKVERNEFLRSNKVYDDFELKLEFKL 730

Query: 356 EADG-NSGIFIRSF-VEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIP--DDRENF 411
             D  N+G+ IR+  + +  +V+G+Q ++   G+  G +Y+   R  ++  P  + R+  
Sbjct: 731 LGDKTNAGVQIRTAEIPDHHEVSGYQADLG-TGY-WGCLYDESRRKKILAGPPAELRDLP 788

Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQ--DEKIGAGQGRIALQIHDGGGIKVLWRN 469
           ++  +WN+ RIR  G ++  W+N  Q VD    D +I   +G IALQIH     +  +RN
Sbjct: 789 VRMNDWNSYRIRCEGPRIRIWINDVQTVDFTEADPQIPL-KGIIALQIHGNLVNEAHYRN 847

Query: 470 IRVKTL 475
           +R++ L
Sbjct: 848 VRLREL 853

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 56/212 (26%), Positives = 99/212 (46%), Gaps = 29/212 (13%)

Query: 85  EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFE 144
           EK  G+Q LF+G+T +GW            E++ G++    E  + N ++  +++Y++FE
Sbjct: 667 EKELGFQSLFNGKTFDGWHGNKKIFRIEDGEIIAGSLT---EKVERNEFLRSNKVYDDFE 723

Query: 145 LQWDWK-ISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAM 203
           L+ ++K +    N+G+     E P +    V+G  YQ     G+   L D  R     A 
Sbjct: 724 LKLEFKLLGDKTNAGVQIRTAEIPDHH--EVSG--YQADLGTGYWGCLYDESRRKKILAG 779

Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
              +   + VR   +WN+ +I  +   +  ++N  +T++F                E  P
Sbjct: 780 PPAELRDLPVR-MNDWNSYRIRCEGPRIRIWINDVQTVDFT---------------EADP 823

Query: 264 EYGLARKGLICLQDHG---YPAWFRNIKIREL 292
           +  L  KG+I LQ HG     A +RN+++REL
Sbjct: 824 QIPL--KGIIALQIHGNLVNEAHYRNVRLREL 853
>gi|32476109|ref|NP_869103.1| hypothetical protein RB9849 [Rhodopirellula baltica SH 1]
 gi|32446653|emb|CAD76489.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 281

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 66/201 (32%), Positives = 92/201 (45%), Gaps = 31/201 (15%)

Query: 301 LFNGKDLTGWDVYGTEQ-WYVQDSLL---VCESGPDKQYGYLATCK-YYNDFELTADFKQ 355
           LF+GK L GW   G E  W V D  +     +  P KQ  +L   +     FELT  FK 
Sbjct: 81  LFDGKTLDGWR--GREDLWSVDDGAIHGQTTDEAPIKQNTFLILDRPVKGSFELTLQFKM 138

Query: 356 EADGNSGIFIRSFV--EEGAKVNGWQVEVAPKGFDTGGIYESYGRGWL------------ 401
              GNSGI  RS V  EE   V G+Q ++       G +YE  GRG L            
Sbjct: 139 -IGGNSGIQYRSKVLDEEKFIVGGYQADIDATNRFAGILYEEKGRGILATRGQQTTIWAT 197

Query: 402 -------IQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIG--AGQGR 452
                      ++  N +   EWN  RI V  N +  ++N   M+ + D++ G  A  G 
Sbjct: 198 GEKTTEQFATAEELANSIHLGEWNDYRILVRDNHLEQFINETLMIRLVDQQPGKKADSGV 257

Query: 453 IALQIHDGGGIKVLWRNIRVK 473
           IALQ+H G  +KV ++NI+++
Sbjct: 258 IALQLHQGPAMKVWFKNIQIR 278
>gi|32474668|ref|NP_867662.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
 gi|32445207|emb|CAD75209.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
          Length = 445

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 99/382 (25%), Positives = 157/382 (41%), Gaps = 54/382 (14%)

Query: 84  EEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENF 143
           E    GW  LFDG TL GW    G A    + V D  I AD     EN  +     + ++
Sbjct: 83  ERTREGWIRLFDGHTLFGWV-IGGNA---NFRVEDETIVAD---QGENCLLTTSTQWSDY 135

Query: 144 ELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAM 203
           EL+  ++  +  NSG+       PQ     VT   Y++      A P   +   GV   +
Sbjct: 136 ELELQFQCDEETNSGVFVRTTLDPQD----VTTDCYEV----NIAPPSNPFPTGGV---V 184

Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
                 T +  P  +W+T  I+ +   +   ++G  T E +                   
Sbjct: 185 QRTKGQTFDTDPE-KWHTMNILCNGSTLRVTIDGTVTCELD------------------- 224

Query: 264 EYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEELFNGKDLTGWDVY-GTEQWYV-- 320
           +      G I LQ       F++I++R L  K     L +G  L GW V  G E  Y   
Sbjct: 225 DATRPTTGYIGLQHRDGRVAFKDIQLRPLGLKNL---LADG--LEGWTVREGMEGEYRID 279

Query: 321 QDSLLVCESGPDKQYGYLATCKYYNDFELTADFK-QEADGNSGIFIRSFVEEGAKVNGWQ 379
            D  LV + G  +    L T   + DF + AD+K  +   NSG+F R+    G ++ G++
Sbjct: 280 DDGHLVVDGGKQQ----LETKAVFGDFVMLADYKMDDPKSNSGLFFRAI--PGDEMMGYE 333

Query: 380 VEVAPKGFDTGGIYES-YGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQM 438
            +V+ +  D   +  +  G G + +  D R    + + WN++ +   GN   TW+NG Q+
Sbjct: 334 CQVSNELIDGNPLQPADCGAGGIFRRQDARVVAGEPKRWNSILLVAEGNHFATWVNGLQV 393

Query: 439 VDIQDEKIGAGQGRIALQIHDG 460
            DI D +      R  L++  G
Sbjct: 394 TDIVDTRKADENPRRGLRLEPG 415
>gi|149200234|ref|ZP_01877256.1| hypothetical protein LNTAR_02824 [Lentisphaera araneosa HTCC2155]
 gi|149136676|gb|EDM25107.1| hypothetical protein LNTAR_02824 [Lentisphaera araneosa HTCC2155]
          Length = 229

 Score = 72.8 bits (177), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 57/209 (27%), Positives = 104/209 (49%), Gaps = 20/209 (9%)

Query: 285 RNIKIRELPRKTEEEE--LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDK--QYGYLAT 340
           +N+     P+   +E   L     L  W V  +  W + D++++ ++G +K  +  +L T
Sbjct: 23  KNVNEASEPKVQSKESINLIADNSLKAWKV-TSSLWSISDNVIIGDTGKEKPQKPEWLYT 81

Query: 341 CKYYNDFELTADFKQEAD--GNSGIF--IRSFVEEGAK-------VNGWQVEVAPKGFDT 389
            + + DF  T++FK       NSGI+  ++ F+ +  K        +G++ +++   F  
Sbjct: 82  KQKFGDFLFTSEFKLTGSIAPNSGIYYRVKPFIFDRIKKKGAFEVASGYEYDLSYNKF-L 140

Query: 390 GGIYESYGRGWLIQIPDDR--ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIG 447
           G + + Y R  L   PD++     +K+ +WN   IR   N++  WLNG +++D  D    
Sbjct: 141 GSLGDWYARPSLRIFPDNKITAQLIKKNDWNRATIRAKSNRLEYWLNGVKIMDFIDHDPK 200

Query: 448 AGQ-GRIALQIHDGGGIKVLWRNIRVKTL 475
           A Q G I LQIHDG  +K+ +R + +  L
Sbjct: 201 ASQSGVIGLQIHDGALMKIEFRKMHILPL 229
>gi|149177017|ref|ZP_01855625.1| hypothetical protein PM8797T_11786 [Planctomyces maris DSM 8797]
 gi|148844082|gb|EDL58437.1| hypothetical protein PM8797T_11786 [Planctomyces maris DSM 8797]
          Length = 235

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 65/208 (31%), Positives = 93/208 (44%), Gaps = 48/208 (23%)

Query: 301 LFNGKDLTGW-------------------------DVYGTEQWYVQDSLLVCESGPDKQY 335
           LFNGKDLTGW                         D    E W V D ++V     D + 
Sbjct: 42  LFNGKDLTGWKGLVGSPKTRAKMSPEELAEAQKKADDEMKEHWNVVDGVIVF----DGKG 97

Query: 336 GYLATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEV-APKGFDTGGIYE 394
             L T K Y DF++  D+K + DG+SGI++R       +V  W   V A  G  +GG+Y 
Sbjct: 98  KSLCTAKDYGDFDMLVDWKIKKDGDSGIYLRG----SPQVQIWDPAVKAANGVGSGGLYN 153

Query: 395 SYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD-------IQDEKIG 447
           +        +  D        EWNT RI++VG +V+ WLNG+ + D        + +K  
Sbjct: 154 NKKNPDKPLVTADN----PVGEWNTFRIKMVGEKVSVWLNGKLVTDNVTLENYWERDKPI 209

Query: 448 AGQGRIALQIHDGGGIKVLWRNIRVKTL 475
              G+I LQ H   G  + +RN+ +K L
Sbjct: 210 YETGQIELQNH---GNTLYFRNVFIKEL 234
>gi|126648469|ref|ZP_01720956.1| hypothetical protein ALPR1_08673 [Algoriphagus sp. PR1]
 gi|126575353|gb|EAZ79685.1| hypothetical protein ALPR1_08673 [Algoriphagus sp. PR1]
          Length = 222

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 60/210 (28%), Positives = 103/210 (49%), Gaps = 20/210 (9%)

Query: 86  KAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFEL 145
           K  G++ LF+G+ L+GW   N ++    +   +G I  D +G    G +  ++ Y NF L
Sbjct: 29  KEDGFKRLFNGENLDGWVG-NKES----YRAENGMIVIDPQGGG-GGNLYTEKEYGNFIL 82

Query: 146 QWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD--KGFAEPLEDWQRCGVDYAM 203
            ++++++ G N+GL  H    P        G E Q++D+  K +AE LE +Q  G  Y +
Sbjct: 83  HFEFQLTPGANNGLGIHA---PLEGDAAYVGKELQILDNRAKKYAE-LEVYQYHGSVYGV 138

Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
                  +N  P GEWN   ++ ++  ++  +NG+  ++      D+ +    G  ++  
Sbjct: 139 IPARRGFLN--PVGEWNKQTVIVNHPKIQVILNGETILQ-----GDYLEASKEGTLDHKE 191

Query: 264 EYGLAR-KGLICLQDHGYPAWFRNIKIREL 292
             GL R  G I    HG    FRNI+I+EL
Sbjct: 192 HPGLERSSGHIGFLGHGDVVHFRNIRIKEL 221
>gi|150007795|ref|YP_001302538.1| hypothetical protein BDI_1153 [Parabacteroides distasonis ATCC 8503]
 gi|149936219|gb|ABR42916.1| conserved hypothetical protein [Parabacteroides distasonis ATCC 8503]
          Length = 1155

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 109/450 (24%), Positives = 184/450 (40%), Gaps = 102/450 (22%)

Query: 89   GWQLLFDGQTLEGWR----DYNGQALTGPWEVVDGAIQADG----EGSDENGYIVYD--- 137
            G+  +F+G+ L GW+    +   +A   P ++     QAD     +   ENG +V+D   
Sbjct: 744  GFVSIFNGKDLTGWKGLVENPIARAKMKPAQLAKAQEQADENMRRDWKVENGLLVFDGTG 803

Query: 138  -------RIYENFELQWDWKISKGG---NSGLLYHVVERPQYKVPYVTG-PEYQLIDDKG 186
                   + Y +FE+  DW +   G   ++G+             Y+ G P+ Q+     
Sbjct: 804  YDNLCTEKQYGDFEMYVDWMLDPKGPEADAGI-------------YLRGTPQVQI----- 845

Query: 187  FAEPLEDWQRCGVDYAMYLPDFATMN-----VRPA-------GEWNTSKIVFDNGHVEYY 234
                   W    V+    +      N      +P+       GEWN+  I      V   
Sbjct: 846  -------WDTSRVNVGAQVGSGGLYNNQVNESKPSKVTDNKLGEWNSFYIKMVGDRVTVV 898

Query: 235  MNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPR 294
            +NG+K ++     ++++ RK        P + + +   I +Q HG   ++RNI ++EL +
Sbjct: 899  LNGEKVVD-NVILENYWDRK-------LPIFPVEQ---IEMQAHGSKVYYRNIYVKELEK 947

Query: 295  K-----TEEEE------LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYG-YLATCK 342
            +     + EEE      LF+G ++  W    T  + ++D  +     P   +G  L T K
Sbjct: 948  QEPFKLSPEEEKEGFKVLFDGTNMHEW-TGNTVDYILEDGCI--SMVPSSSFGGNLYTKK 1004

Query: 343  YYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEV--APKGFDTGGIYESYGRGW 400
             Y +F    DF+     N+G+ IR+ +E  A   G +V+V         G I      G 
Sbjct: 1005 EYGNFIYRFDFQLTPGANNGVGIRTPMEGDAAYVGMEVQVLDCEHPIYQGNITPLQHHGS 1064

Query: 401  LIQIPDDREN----FLKEREWNTMRIRVVGNQVTTWLNGEQMVD--IQDE-KIGAGQGRI 453
            +  I   RE+    F    EWNT  I   G+ +   +NG  ++D  I+D  K G   G+ 
Sbjct: 1065 VYGIIPAREDHPKAFKPVGEWNTEEIMADGDHIRVTVNGVVILDGNIRDAVKNGTPDGKE 1124

Query: 454  ALQIHD--------GGGIKVLWRNIRVKTL 475
               + +        G G  V +RNIR+K L
Sbjct: 1125 HPGLFNKKGHIGFLGHGSPVKFRNIRIKEL 1154

 Score = 65.5 bits (158), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 64/224 (28%), Positives = 98/224 (43%), Gaps = 57/224 (25%)

Query: 292 LPRKTEEEELFNGKDLTGW-------------------------DVYGTEQWYVQDSLLV 326
           +P +     +FNGKDLTGW                         D      W V++ LLV
Sbjct: 739 MPDEVGFVSIFNGKDLTGWKGLVENPIARAKMKPAQLAKAQEQADENMRRDWKVENGLLV 798

Query: 327 CESGPDKQYGYLATCKYYNDFELTADFKQEADG---NSGIFIRSFVEEGAKVNGWQVEVA 383
            +      Y  L T K Y DFE+  D+  +  G   ++GI++R       +V  W     
Sbjct: 799 FDG---TGYDNLCTEKQYGDFEMYVDWMLDPKGPEADAGIYLRG----TPQVQIWDTSRV 851

Query: 384 PKG--FDTGGIYESYGRGWLIQIPDDRENFLKER---EWNTMRIRVVGNQVTTWLNGEQM 438
             G    +GG+Y +       Q+ + + + + +    EWN+  I++VG++VT  LNGE++
Sbjct: 852 NVGAQVGSGGLYNN-------QVNESKPSKVTDNKLGEWNSFYIKMVGDRVTVVLNGEKV 904

Query: 439 VD------IQDEKIGA-GQGRIALQIHDGGGIKVLWRNIRVKTL 475
           VD        D K+      +I +Q H   G KV +RNI VK L
Sbjct: 905 VDNVILENYWDRKLPIFPVEQIEMQAH---GSKVYYRNIYVKEL 945

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 61/220 (27%), Positives = 99/220 (45%), Gaps = 26/220 (11%)

Query: 81   LTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIY 140
            L+ EE+  G+++LFDG  +  W        T  + + DG I      S   G +   + Y
Sbjct: 953  LSPEEEKEGFKVLFDGTNMHEW-----TGNTVDYILEDGCISMV-PSSSFGGNLYTKKEY 1006

Query: 141  ENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLID-----DKGFAEPLEDWQ 195
             NF  ++D++++ G N+G+    +  P        G E Q++D      +G   PL   Q
Sbjct: 1007 GNFIYRFDFQLTPGANNGV---GIRTPMEGDAAYVGMEVQVLDCEHPIYQGNITPL---Q 1060

Query: 196  RCGVDYAMYLP--DFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQR 253
              G  Y + +P  +      +P GEWNT +I+ D  H+   +NG   ++      +    
Sbjct: 1061 HHGSVYGI-IPAREDHPKAFKPVGEWNTEEIMADGDHIRVTVNGVVILD-----GNIRDA 1114

Query: 254  KNSGKWENAPEYGL-ARKGLICLQDHGYPAWFRNIKIREL 292
              +G  +     GL  +KG I    HG P  FRNI+I+EL
Sbjct: 1115 VKNGTPDGKEHPGLFNKKGHIGFLGHGSPVKFRNIRIKEL 1154
>gi|154490163|ref|ZP_02030424.1| hypothetical protein PARMER_00395 [Parabacteroides merdae ATCC
           43184]
 gi|154089055|gb|EDN88099.1| hypothetical protein PARMER_00395 [Parabacteroides merdae ATCC
           43184]
          Length = 1150

 Score = 69.3 bits (168), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 70/237 (29%), Positives = 101/237 (42%), Gaps = 54/237 (22%)

Query: 277 DHGYPAWFRNIKIRELPRKTEEEELFNGKDLTGW-------------------------D 311
           D GY        + E+P+      LFNGKDLTGW                         D
Sbjct: 721 DAGYQREAIKKHLAEMPQGEGFVSLFNGKDLTGWKGLVQNPIARAKMKPGQLAKEQAKAD 780

Query: 312 VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG---NSGIFIRSF 368
               + W V+D +L+     D     L T K Y DFE+  D+  +  G   ++GI++R  
Sbjct: 781 EVMRKGWSVEDGMLIFNGKGDN----LCTEKQYGDFEMYVDWMLDPAGPEADAGIYLRG- 835

Query: 369 VEEGAKVNGWQVEVAPKG--FDTGGIYES-YGRGWLIQIPDDRENFLKEREWNTMRIRVV 425
                +V  W       G    +GG+Y +        ++ D+     K  EWN+  I++V
Sbjct: 836 ---TPQVQIWDTSRVNVGAQVGSGGLYNNQMNESKPTKVADN-----KLGEWNSFYIKMV 887

Query: 426 GNQVTTWLNGEQMVD------IQDEKIGA-GQGRIALQIHDGGGIKVLWRNIRVKTL 475
           G++VT  LNGE++VD        D K+      +I LQ H   G KV +RNI VK L
Sbjct: 888 GDRVTVVLNGEKVVDDVILENYWDRKLPIFPVEQIELQAH---GSKVYYRNIYVKEL 941

 Score = 68.9 bits (167), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 106/453 (23%), Positives = 182/453 (40%), Gaps = 94/453 (20%)

Query: 81   LTEEEKAAGWQLLFDGQTLEGWRDY---------------------NGQALTGPWEVVDG 119
            L E  +  G+  LF+G+ L GW+                         + +   W V DG
Sbjct: 733  LAEMPQGEGFVSLFNGKDLTGWKGLVQNPIARAKMKPGQLAKEQAKADEVMRKGWSVEDG 792

Query: 120  AIQADGEGSDENGYIVYDRIYENFELQWDWKISKGG---NSGLLYHVVERPQYKVPYVTG 176
             +  +G+G +    +  ++ Y +FE+  DW +   G   ++G+             Y+ G
Sbjct: 793  MLIFNGKGDN----LCTEKQYGDFEMYVDWMLDPAGPEADAGI-------------YLRG 835

Query: 177  -PEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYM 235
             P+ Q+ D        +       +  M       +     GEWN+  I      V   +
Sbjct: 836  TPQVQIWDTSRVNVGAQVGSGGLYNNQMNESKPTKVADNKLGEWNSFYIKMVGDRVTVVL 895

Query: 236  NGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRK 295
            NG+K ++ +   ++++ RK        P + + +   I LQ HG   ++RNI ++EL RK
Sbjct: 896  NGEKVVD-DVILENYWDRK-------LPIFPVEQ---IELQAHGSKVYYRNIYVKELERK 944

Query: 296  ------TEEEE-----LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQY-GYLATCKY 343
                   EEE+     LF+G ++  W    T  + ++D  +     P K Y G L T   
Sbjct: 945  EPFKLSAEEEKEGFKVLFDGTNMHEW-TGNTVDYTLEDGCI--SMIPSKSYGGNLYTKDE 1001

Query: 344  YNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVA----PKGFDTGGIYESYGRG 399
            Y +F    +F+     N+G+ IR+ +E  A   G ++++     P   D   + + +G  
Sbjct: 1002 YGNFVYRFEFQLTPGANNGVGIRTPMEGDAAYVGMEIQILDCEHPIYKDITPL-QHHGSV 1060

Query: 400  WLIQIPDDREN---FLKEREWNTMRIRVVGNQVTTWLNGEQMVD----------IQDEKI 446
            + I IP   E+   F    EWN   I   G+ +   +NG  +++            D K 
Sbjct: 1061 YGI-IPAKAEHHSAFKPAGEWNYEEIVANGDNIKVTVNGVVIMEGNIREATKNGTADHKE 1119

Query: 447  GAG----QGRIALQIHDGGGIKVLWRNIRVKTL 475
              G    +G I    H   G  V +RNIR+K L
Sbjct: 1120 HPGLFNKKGHIGFLGH---GSPVKFRNIRIKEL 1149
>gi|88713612|ref|ZP_01107694.1| probable large multifunctional protein-putative glycosyl hydrolase
           [Flavobacteriales bacterium HTCC2170]
 gi|88708122|gb|EAR00360.1| probable large multifunctional protein-putative glycosyl hydrolase
           [Flavobacteriales bacterium HTCC2170]
          Length = 310

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 15/151 (9%)

Query: 293 PRKTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTAD 352
           P   E + LFNGK+L GW   G  QW V+D +L  E    K    L + + +NDF+L   
Sbjct: 135 PVWGESKALFNGKNLDGWQAMGVNQWMVKDGILTSE----KSGANLVSSEKFNDFKLHIV 190

Query: 353 FKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFL 412
           F+     NSG+++R   E     N   + + P     GGIY       ++  P       
Sbjct: 191 FRYPEGSNSGVYLRGRYEVQIADN---IGLEPSSILFGGIYGFLTPNEMVAKPAG----- 242

Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMVDIQD 443
              EW    I ++G +VT   NG++++  Q+
Sbjct: 243 ---EWQEYDITLIGRRVTIIANGKEIITNQN 270
>gi|116626562|ref|YP_828718.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116229724|gb|ABJ88433.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 309

 Score = 66.2 bits (160), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 59/196 (30%), Positives = 92/196 (46%), Gaps = 29/196 (14%)

Query: 291 ELPRK-TEEEELFNGKDLTGW---DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYND 346
           E P+  T  E +FNGKDLTGW   D   T  W V+   LV ES    +   L T + ++D
Sbjct: 126 EAPKAWTAPEPIFNGKDLTGWEPTDPAATNHWVVKGGELVNES----KGANLKTTRKFDD 181

Query: 347 FELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPD 406
           F+L  ++    DGNSGI++R   E   +V   +V+   K    G +Y       ++++P 
Sbjct: 182 FKLHIEYNCPDDGNSGIYLRGRYE--VQVEYEKVDANDKFHSIGAVYSMLAP--VVELPR 237

Query: 407 DRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQD---------EKIGAGQGRIALQI 457
                 K   W T  I +VG ++T   +G + +D Q+         +   A  G   +Q 
Sbjct: 238 ------KPGTWETFDITLVGRRLTVVRDGVKTIDNQEIAGTTGGALDSNEAEPGPFYIQG 291

Query: 458 HDGGGIKVLWRNIRVK 473
              GG+K  +RNI ++
Sbjct: 292 DHTGGMK--YRNITIQ 305
>gi|116619831|ref|YP_821987.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116222993|gb|ABJ81702.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 212

 Score = 65.5 bits (158), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 66/210 (31%), Positives = 100/210 (47%), Gaps = 22/210 (10%)

Query: 88  AGWQLLFDGQTLEGWRDYNGQALTGPWEVV-DGAIQADGEGSDENGYIVYDRIYENFELQ 146
           AG+  LFDG+TL GW+   G    GP  VV +G I    +G    G +  ++ + NF  +
Sbjct: 21  AGFTPLFDGKTLNGWKLVGGH---GPGYVVQEGKIVCPADGG---GNLFTEKEFGNFAFR 74

Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD--KGFAEPLEDWQRCGVDYAMY 204
           +++K++ G N+G+    +  P        G E Q++DD  K +   +   Q  G  Y + 
Sbjct: 75  FEFKLTPGANNGI---GIRAPYEGDAAYQGMEIQILDDGDKVYQGKIRPEQYHGSVYDV- 130

Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPE 264
           +P   T   +P GEWN  +IV D   ++  +NG   I  +A   D    K      + P 
Sbjct: 131 IPA-RTGYRKPVGEWNEEEIVADGRRIKVTLNG--VIILDA---DLSIVKEPQVLAHHP- 183

Query: 265 YGLARK-GLICLQDHGYPAWFRNIKIRELP 293
            GLAR  G I    HG    FRNI+++ LP
Sbjct: 184 -GLARTAGHIGFLGHGSLVEFRNIRVKPLP 212

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 58/189 (30%), Positives = 85/189 (44%), Gaps = 17/189 (8%)

Query: 301 LFNGKDLTGWDVYGTEQ--WYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
           LF+GK L GW + G     + VQ+  +VC   P    G L T K + +F    +FK    
Sbjct: 26  LFDGKTLNGWKLVGGHGPGYVVQEGKIVC---PADGGGNLFTEKEFGNFAFRFEFKLTPG 82

Query: 359 GNSGIFIRSFVEEGAKVNGWQVEVAPKGFDT--GGIYESYGRGWLIQIPDDRENFLKE-R 415
            N+GI IR+  E  A   G ++++   G     G I      G +  +   R  + K   
Sbjct: 83  ANNGIGIRAPYEGDAAYQGMEIQILDDGDKVYQGKIRPEQYHGSVYDVIPARTGYRKPVG 142

Query: 416 EWNTMRIRVVGNQVTTWLNGEQMVD-----IQDEKIGA---GQGRIALQI-HDGGGIKVL 466
           EWN   I   G ++   LNG  ++D     +++ ++ A   G  R A  I   G G  V 
Sbjct: 143 EWNEEEIVADGRRIKVTLNGVIILDADLSIVKEPQVLAHHPGLARTAGHIGFLGHGSLVE 202

Query: 467 WRNIRVKTL 475
           +RNIRVK L
Sbjct: 203 FRNIRVKPL 211
>gi|87310208|ref|ZP_01092340.1| hypothetical protein DSM3645_14090 [Blastopirellula marina DSM
           3645]
 gi|87287198|gb|EAQ79100.1| hypothetical protein DSM3645_14090 [Blastopirellula marina DSM
           3645]
          Length = 505

 Score = 65.5 bits (158), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 62/210 (29%), Positives = 95/210 (45%), Gaps = 51/210 (24%)

Query: 299 EELFNGKDLTGW-------------------------DVYGTEQWYVQDSLLVCESGPDK 333
           ++LFNGKDL+GW                         D    E W ++D +L      D 
Sbjct: 33  QKLFNGKDLSGWKGLVASPPKRAEMSAEALAAEQEKADASMREHWTIEDGVLTY----DG 88

Query: 334 QYGYLATCKYYNDFELTADFKQEADGNSGIFIRSFVE-EGAKVNGWQVEVAPKGFDTGGI 392
           +   L T K Y DFE+  D+K + DG+SGI++R   + +    N W+V        +GG+
Sbjct: 89  KGQSLCTDKDYADFEMYVDWKIKDDGDSGIYVRGSPQIQIWDPNHWKV-------GSGGL 141

Query: 393 YESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD-------IQDEK 445
           Y +        I  D        EWNTM +R++G++VT  LNG+ + D        + +K
Sbjct: 142 YNNKKNPSAPTIIADN----PIGEWNTMYVRMIGDRVTVKLNGKLVTDNVVLENYWERDK 197

Query: 446 IGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
                G+I LQ H   G  + +RNI ++ L
Sbjct: 198 PIYPTGQIELQHH---GNTLWFRNIFIREL 224

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 63/235 (26%), Positives = 97/235 (41%), Gaps = 54/235 (22%)

Query: 89  GWQLLFDGQTLEGWR----------DYNGQALTGP-----------WEVVDGAIQADGEG 127
           G+Q LF+G+ L GW+          + + +AL              W + DG +  DG+G
Sbjct: 31  GFQKLFNGKDLSGWKGLVASPPKRAEMSAEALAAEQEKADASMREHWTIEDGVLTYDGKG 90

Query: 128 SDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF 187
                 +  D+ Y +FE+  DWKI   G+SG+  +V   PQ ++     P +  +   G 
Sbjct: 91  QS----LCTDKDYADFEMYVDWKIKDDGDSGI--YVRGSPQIQI---WDPNHWKVGSGGL 141

Query: 188 AEPLEDWQRCGVDYAMYLPDFATMNV-RPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAW 246
               ++            P   T+    P GEWNT  +      V   +NG+   +    
Sbjct: 142 YNNKKN------------PSAPTIIADNPIGEWNTMYVRMIGDRVTVKLNGKLVTDNVVL 189

Query: 247 SDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEEL 301
            + W + K        P Y     G I LQ HG   WFRNI IREL +  E +++
Sbjct: 190 ENYWERDK--------PIY---PTGQIELQHHGNTLWFRNIFIRELAKPDELKKI 233
>gi|87309043|ref|ZP_01091181.1| hypothetical protein DSM3645_19838 [Blastopirellula marina DSM
           3645]
 gi|87288386|gb|EAQ80282.1| hypothetical protein DSM3645_19838 [Blastopirellula marina DSM
           3645]
          Length = 208

 Score = 65.5 bits (158), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 59/188 (31%), Positives = 99/188 (52%), Gaps = 18/188 (9%)

Query: 301 LFNGKDLTGWDVYGTEQWY-VQDSLLVC----ESGPDKQYGYLATCKYYNDFELTADFKQ 355
           +F+G+ L GW+  G  +W+ V D  +V     ++ P+ ++  L T + Y DFELT + K 
Sbjct: 26  IFDGETLEGWE--GKSEWFHVADGAVVAGSLEKAIPNNEF--LCTKEEYGDFELTLEAKL 81

Query: 356 EADG-NSGIFIRS-FVEEGAKVNGWQVEVA--PKGFDTGGIY-ESYGRGWLIQIP-DDRE 409
              G N+G+  RS  +    +V G+Q ++   P     G +Y ES  + ++ + P ++  
Sbjct: 82  VGQGTNAGVQFRSQRIPNHHEVIGYQCDMGSTPVRLIWGSLYDESRRKIFVAEGPAEEVA 141

Query: 410 NFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQ--DEKIGAGQGRIALQIHDGGGIKVLW 467
             +K  EWN ++IR  G ++  W+N  Q VD    D+ I A  G I LQIH G   +  +
Sbjct: 142 KTVKRGEWNRLKIRCQGAKIQIWVNDLQTVDYTEADDAI-ARTGIIGLQIHSGPAAEASY 200

Query: 468 RNIRVKTL 475
           R +++K L
Sbjct: 201 RKLQLKKL 208
>gi|150007144|ref|YP_001301887.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
           ATCC 8503]
 gi|149935568|gb|ABR42265.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
           ATCC 8503]
          Length = 222

 Score = 65.5 bits (158), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 76/250 (30%), Positives = 106/250 (42%), Gaps = 41/250 (16%)

Query: 53  FMMKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG 112
           F++      CV   F +  A + K             W+ LF G+ LE   +YN +    
Sbjct: 4   FLLAMATLWCVASTFSSFAADNNK-------------WKPLF-GKNLEN-ANYNPEV--- 45

Query: 113 PWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVP 172
            W   DG +   G   DE+  I     YENFEL  D+K   G NSG++ +  +   + +P
Sbjct: 46  -WSETDGVL---GAVKDES--IWTKDEYENFELDLDFKTDVGTNSGVVVYCTDTKDW-IP 98

Query: 173 YVTGPEYQLIDDK----GFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDN 228
                E Q+ DD     G  +P   +++CG  Y  +L       V+  GEWN  +I    
Sbjct: 99  --NSVEIQIADDHCEKWGNGKP---YEKCGAIYG-HLGAVQDKVVKKPGEWNHMRIKCAG 152

Query: 229 GHVEYYMNGQKTIEFE--AWSDDWFQRKNSG--KWENAPEYGLARKGLICLQ-DHGYP-A 282
            H+   +NG+K  E +   W+        S    W   P   L  KG I LQ  HG    
Sbjct: 153 QHIMVILNGKKVTEMDMSKWTSGTKNPDGSDIPSWLPKPFAELPTKGFIGLQGKHGDSLI 212

Query: 283 WFRNIKIREL 292
           WFRNIKIR L
Sbjct: 213 WFRNIKIRSL 222
>gi|146280883|ref|YP_001171036.1| hypothetical protein PST_0488 [Pseudomonas stutzeri A1501]
 gi|145569088|gb|ABP78194.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
          Length = 194

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 45/155 (29%), Positives = 73/155 (47%), Gaps = 8/155 (5%)

Query: 140 YENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGV 199
           Y NF L++++ +  GGNSG+ Y V E  +  + + +GPE QL+DD    +  E   R G 
Sbjct: 45  YANFILRFEYALPVGGNSGVFYRVDEAAE--LAWHSGPEMQLLDDAVHPDGAEPTTRNGA 102

Query: 200 DYAMYLPDFATMNVRP--AGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSG 257
            Y +     AT    P   G +    +V  +  VE++++ +  + +     D   +  + 
Sbjct: 103 LYGLR----ATQQETPIEPGSFMEGALVVRDADVEHWLDNRMVLSYRLDDPDLRMQIRTS 158

Query: 258 KWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           K+ + P Y  A  G I LQ HG    FR + I  L
Sbjct: 159 KFADKPLYAQATAGHIVLQHHGEAVRFRRLSIEPL 193
>gi|32473471|ref|NP_866465.1| hypothetical protein-transmembrane region and signal peptide
           prediction [Rhodopirellula baltica SH 1]
 gi|32398151|emb|CAD78246.1| hypothetical protein-transmembrane region and signal peptide
           prediction [Rhodopirellula baltica SH 1]
          Length = 263

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 64/219 (29%), Positives = 107/219 (48%), Gaps = 38/219 (17%)

Query: 293 PRKTEEEELFNGKDLTGWDVYGTEQ-WYVQDSLLVCESGPDKQYGYLATCKYYN----DF 347
           P ++    +FNGKDLTGW   G ++ W V+D ++  E+ P+ +        + +    +F
Sbjct: 46  PAESGLTSIFNGKDLTGWS--GDDRLWSVRDGVIHGETTPENKANGNTFLIWEDGNTKNF 103

Query: 348 ELTADFKQEADGNSGIFIRS--FVEEGAK----VNGWQVEVAPK-GFD--TGGIYESYGR 398
           E+   F+  A  NSGI  RS    ++ A+    V G+Q E+  +  F    G IY+  GR
Sbjct: 104 EVRLSFRCNATNNSGIQYRSKHITDKSARNEWVVRGYQHELRNEMNFPNIAGFIYDEGGR 163

Query: 399 GWLIQIPDDR-----------ENFLKERE---------WNTMRIRVVGNQVTTWLNGEQM 438
              I +  ++           ENF+ E E         WN + I   GN +  +LNG+ +
Sbjct: 164 RGRICMVGEKAAWKDGKKQVLENFMDEAEFQKLFQLDGWNEVVIIGKGNHIQHFLNGKLI 223

Query: 439 VDIQDEK--IGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           +D  DE+  +    G++ALQ+H G  +   +++IR K+L
Sbjct: 224 LDFTDEQPELKLLDGKLALQLHAGKPMWAEFKDIRFKSL 262
>gi|149177681|ref|ZP_01856282.1| sulfatase [Planctomyces maris DSM 8797]
 gi|148843499|gb|EDL57861.1| sulfatase [Planctomyces maris DSM 8797]
          Length = 664

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 60/189 (31%), Positives = 87/189 (46%), Gaps = 22/189 (11%)

Query: 301 LFNGKDLTGWDV--YGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
           LFNG DL+GW +     E ++V+   LVC   P    G+L T K Y+DF L  DFK    
Sbjct: 480 LFNGHDLSGWTLKRANREGYHVEAGKLVC---PADGGGFLFTEKEYSDFSLKFDFKLTKA 536

Query: 359 GNSGIFIR-SFVEEGAKVNGWQVEVAP-KGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
            N+GI IR   V++     G +++V   KG+        Y       IP  R       E
Sbjct: 537 ANNGIAIRCPLVDQKPAYEGMEIQVLDNKGYPKKLKPTQYHGSVYDVIPAKRGALKPVGE 596

Query: 417 WNTMRIRVVGNQVTTWLNG-----EQMVDIQDEKIGAG-------QGRIALQIHDGGGIK 464
           WN   I   G+++T  +N        + +I+DE++ A        +G I L  H   G  
Sbjct: 597 WNHEEIICRGSKITVIVNDIPVLQTDLSEIKDEQVLAKHPGLKNQRGHIGLLGH---GSH 653

Query: 465 VLWRNIRVK 473
           V ++NIR+K
Sbjct: 654 VEYQNIRIK 662

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 55/212 (25%), Positives = 100/212 (47%), Gaps = 23/212 (10%)

Query: 85  EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFE 144
           E+ A ++ LF+G  L GW     +A    + V  G +    +G    G++  ++ Y +F 
Sbjct: 472 EQTADYKPLFNGHDLSGWT--LKRANREGYHVEAGKLVCPADGG---GFLFTEKEYSDFS 526

Query: 145 LQWDWKISKGGNSGLLYH---VVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDY 201
           L++D+K++K  N+G+      V ++P Y+     G E Q++D+KG+ + L+  Q  G  Y
Sbjct: 527 LKFDFKLTKAANNGIAIRCPLVDQKPAYE-----GMEIQVLDNKGYPKKLKPTQYHGSVY 581

Query: 202 AMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWEN 261
            +       +  +P GEWN  +I+     +   +N     +      D  + K+      
Sbjct: 582 DVIPAKRGAL--KPVGEWNHEEIICRGSKITVIVN-----DIPVLQTDLSEIKDEQVLAK 634

Query: 262 APEYGLA-RKGLICLQDHGYPAWFRNIKIREL 292
            P  GL  ++G I L  HG    ++NI+I+E 
Sbjct: 635 HP--GLKNQRGHIGLLGHGSHVEYQNIRIKEF 664
>gi|148253179|ref|YP_001237764.1| putative exported protein of unknown function [Bradyrhizobium sp.
           BTAi1]
 gi|146405352|gb|ABQ33858.1| putative exported protein of unknown function [Bradyrhizobium sp.
           BTAi1]
          Length = 193

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 51/165 (30%), Positives = 78/165 (47%), Gaps = 15/165 (9%)

Query: 314 GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGA 373
           G   W  +D  L  +    K   YL T   Y DF++ A+F  +   NSGIFIR   ++  
Sbjct: 41  GKANWAAKDGALSADKLDGKDPSYLVTKTSYKDFQIKAEFWVDDAANSGIFIR--CDQSD 98

Query: 374 KVNG---WQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVT 430
           K++    ++V +  K  D       YG G ++ +           +WNT  I   G ++T
Sbjct: 99  KIDSNICYEVNIFDKRPDP-----KYGTGAIVDVAKVDPMPKAGGKWNTYEITAKGTRLT 153

Query: 431 TWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
             LNGE+  D+ D K  AG   IALQ   G G+ V ++ +++K L
Sbjct: 154 VILNGEKTADVDDSKHAAGP--IALQY--GSGV-VKFKKVQIKPL 193
>gi|117164776|emb|CAJ88325.1| putative secreted glycosyl hydrolase [Streptomyces ambofaciens ATCC
           23877]
          Length = 592

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 57/182 (31%), Positives = 83/182 (45%), Gaps = 17/182 (9%)

Query: 301 LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGN 360
           LF+G   TGW   G   + + D  L    G    +   A  ++  D+ L  D+K   D N
Sbjct: 284 LFDGSGTTGWQQAGPGGFTLADGTLTSHGGLGMLW--YAAEEFTGDYSLKLDWKAAGDDN 341

Query: 361 SGIFIRSFVEEG----AKVNGWQVEV-APKGFD--TGGIYESYGRGWLIQIPDDRENFLK 413
           SG+F+  F   G    A  NG+++++ A    D  TG +Y          +P        
Sbjct: 342 SGVFV-GFPASGDPWSAVNNGYEIQIDATDAADRTTGAVYGFRS----ADLPARDAALNP 396

Query: 414 EREWNTMRIRVVGNQVTTWLNGEQMVDI--QDEKIGAGQGRIALQIHDGGGIKVLWRNIR 471
             EWNT  +RV G ++  +LNG ++ D    D      QG I LQ H G G +V +R+IR
Sbjct: 397 PGEWNTYELRVTGERLEIFLNGSKINDFTNTDPARSLRQGHIGLQNH-GDGDEVAFRDIR 455

Query: 472 VK 473
           VK
Sbjct: 456 VK 457
>gi|116625196|ref|YP_827352.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116228358|gb|ABJ87067.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 323

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 49/152 (32%), Positives = 72/152 (47%), Gaps = 19/152 (12%)

Query: 296 TEEEELFNGKDLTGWDVY---GTEQWYVQDSLLVCESGPDKQYG-YLATCKYYNDFELTA 351
           T+ E LFNGKDLTGW+ +       W  QD +LV     D  +G  L T + +NDF+L  
Sbjct: 146 TDPEPLFNGKDLTGWEAFPAGAVNHWVAQDGVLV-----DTDHGASLKTTRTFNDFKLHI 200

Query: 352 DFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENF 411
           +F     GNSGI++R    +  +V   +  V  K  D G +Y     G+   +    E  
Sbjct: 201 EFNCPDGGNSGIYLRG--RDEIQVAYEKPGVEDKFHDMGAVY-----GF---VAPTAEVP 250

Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQD 443
                W +  + ++G  VT   NG + VD Q+
Sbjct: 251 RTPGTWESFDVTLIGRYVTIVRNGVKTVDNQE 282
>gi|146342960|ref|YP_001208008.1| hypothetical protein BRADO6143 [Bradyrhizobium sp. ORS278]
 gi|146195766|emb|CAL79793.1| hypothetical protein [Bradyrhizobium sp. ORS278]
          Length = 205

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 53/178 (29%), Positives = 82/178 (46%), Gaps = 15/178 (8%)

Query: 301 LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGN 360
           L +G     +   G   W  +D  L+ +    K   YL T   Y DF++ A+F  +   N
Sbjct: 40  LVDGDKKVEFTEVGKANWEAKDGALMADKLDGKDPSYLVTKTSYKDFQIKAEFWVDDAAN 99

Query: 361 SGIFIRSFVEEGAKVNG---WQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREW 417
           SGIFIR   ++  K++    ++V +  K  D       YG G ++ +           +W
Sbjct: 100 SGIFIR--CDQADKIDSNICYEVNIFDKRPDP-----KYGTGAIVDVSKVDPMPKAGGKW 152

Query: 418 NTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           NT  I   G ++T   NGE+  D+ D K  AG   IALQ   G GI V ++ +++K L
Sbjct: 153 NTYEITAKGTRLTVIFNGEKTADVDDSKHAAGP--IALQY--GSGI-VKFKKVQIKPL 205
>gi|88713788|ref|ZP_01107869.1| hypothetical protein FB2170_00770 [Flavobacteriales bacterium
           HTCC2170]
 gi|88707915|gb|EAR00154.1| hypothetical protein FB2170_00770 [Flavobacteriales bacterium
           HTCC2170]
          Length = 199

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 27/196 (13%)

Query: 283 WFRNIKIRELPRKTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCK 342
           +F N K  EL +K  +E   +G D T W+       ++ + ++    G     G++ T K
Sbjct: 23  FFNNAKPAELFKKNSKECFISG-DAT-WN-------FIDNEIVGIAKGAS---GFIMTKK 70

Query: 343 YYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQ---VEVAPKGFDTGGIYESYGRG 399
            Y +F L  +FK ++  NSGIFIR   +E + V+ ++    ++ P   +  G   +  + 
Sbjct: 71  SYKNFILELEFKPDSTVNSGIFIRCKNKELSMVDCYENNIWDLHPNQENRTGAVVNRSKP 130

Query: 400 WLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHD 459
            +     D+        WNT +I++  N + TW+NGE + D+ D  +   +G IALQ  +
Sbjct: 131 LIYVNTLDK--------WNTYKIKIEKNHLQTWVNGELITDLHDNDL--SEGMIALQAAE 180

Query: 460 GGGIKVLWRNIRVKTL 475
            G I+  +RNI+ + L
Sbjct: 181 TGEIR--FRNIKFQNL 194
>gi|126648719|ref|ZP_01721203.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
 gi|126575170|gb|EAZ79520.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
          Length = 207

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 56/191 (29%), Positives = 91/191 (47%), Gaps = 26/191 (13%)

Query: 300 ELFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATC--KYYNDFELTADFKQE 356
           ELFNG++  GW +    + + ++D +L   +GP     Y        +N+FEL    K  
Sbjct: 26  ELFNGQNFEGWKISENPDSFSIEDGMLKV-NGPRGHMFYEGEVGDHDFNNFELEVTLKTL 84

Query: 357 ADGNSGIFIRS-FVEEGAKVNGWQVEVAPKGFD---TGGIYESYGRGWLIQIPDDRENFL 412
            + NSGIFI + + E G    G +++V     D   TG +Y            D R+ F+
Sbjct: 85  PEANSGIFIHTKYQERGWPNIGHEIQVNQSHGDWRKTGSVY---------SFKDVRDTFV 135

Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAG--------QGRIALQIHDGGGIK 464
           ++ EW    I V G++VT  +NGE + +  + K   G         G IALQ HD   + 
Sbjct: 136 EDGEWYKETIIVQGDKVTVKVNGEVINEYDETKDREGDLGTKKLDHGTIALQAHDPNSV- 194

Query: 465 VLWRNIRVKTL 475
           V ++++++K L
Sbjct: 195 VYYKSVKIKIL 205
>gi|32471313|ref|NP_864306.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
 gi|32443154|emb|CAD71985.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
          Length = 504

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 34/212 (16%)

Query: 293 PRKTEE--EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKY--YNDFE 348
           PR T    E L  G     W+    + W +++ +L+  +    +     T K     DFE
Sbjct: 110 PRDTPAGAESLLEGNLADSWE-GSLDDWTLENGVLIGTTDGSVKVNRFITSKIAPVEDFE 168

Query: 349 LTADFKQEADGNSGIFIRSFVEEGAKVN---GWQVEV---APKGFDTGGIYESYGR---- 398
           L  D    A GNSGI  RS V E    N   G+Q +V    PK    G +YE  GR    
Sbjct: 169 LEVDVWVSARGNSGIQYRSEVREDLGPNVMVGYQCDVVAATPKY--NGMLYEERGRRILC 226

Query: 399 -------------GWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD--IQD 443
                        GW+++   +   F  E  W+  ++RVVGN    W++G++  +    D
Sbjct: 227 HTAEKVVTDADGQGWVVE-SSEPPKFAPE-AWHRYKVRVVGNHHQHWIDGQKTAEHFDLD 284

Query: 444 EKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
               +  GRI +Q+H G  +++ +RN  VK L
Sbjct: 285 PNGRSLSGRIGVQVHVGPPMEIRYRNFFVKRL 316
>gi|116626769|ref|YP_828925.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116229931|gb|ABJ88640.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 252

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 63/205 (30%), Positives = 91/205 (44%), Gaps = 30/205 (14%)

Query: 299 EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYN----DFELTADFK 354
           + +F+GK L GWD      W V+   LV ++  +KQ        +      DFEL   FK
Sbjct: 50  QPIFDGKSLAGWD-GDPGFWRVEGGALVGQTSTEKQPAQNTFLIWRGGSPADFELKLQFK 108

Query: 355 QEADGNSGIFIRSFVEEGAK--VNGWQVEVAPKGFDTGGIYESYGRGWLIQ------IP- 405
                NSGI  RS      K  + G+Q ++      TG IYE  GRG+L        IP 
Sbjct: 109 LTG-FNSGIQFRSIELPDIKWAMKGYQADMDGVQQYTGQIYEERGRGFLAMRGQFSYIPQ 167

Query: 406 -------------DDRENFLKEREWNTMRIRVVGNQVTTWLNGE-QMVDIQDEKIGAG-Q 450
                        ++ +  +K  +WN + +   GN +   LNG    + I D+K G    
Sbjct: 168 GGKPGLVGSVGDSNELKALIKGEDWNDLHLIARGNTIVQLLNGRITSMLIDDDKEGRKMD 227

Query: 451 GRIALQIHDGGGIKVLWRNIRVKTL 475
           G I +Q+H G  +K+  RNIR+K L
Sbjct: 228 GLIGIQVHKGPPMKIEVRNIRLKKL 252
>gi|109897209|ref|YP_660464.1| protein of unknown function DUF1080 [Pseudoalteromonas atlantica
           T6c]
 gi|109699490|gb|ABG39410.1| protein of unknown function DUF1080 [Pseudoalteromonas atlantica
           T6c]
          Length = 259

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 65/272 (23%), Positives = 119/272 (43%), Gaps = 47/272 (17%)

Query: 55  MKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGW-----RDYNGQA 109
           M  +  + +  L G L++C+        +    +GW  LF+G+ L+GW         G  
Sbjct: 1   MTLVKLILIGTLLGILSSCTH-------QRPVKSGWHTLFNGENLDGWTVKIHHHEVGDN 53

Query: 110 LTGPWEVVDGAIQADGEGSD----ENGYIVYDRIYENFELQWDWKISKG----------G 155
           +   + V +G ++   +  D    + G++ +++ + NF L+ D+                
Sbjct: 54  VDDTFRVENGLLRVSYDQYDTFDKQFGHLYFNQPFSNFHLKLDYHFYGDFLSDAPHYAER 113

Query: 156 NSGLLYHVVERPQYKVPYVTGP---EYQLID--DKGFAEPLEDWQRCGVDY----AMYLP 206
           NSGL+Y+  + P   +     P   E Q +   D G A P  +    G D     A+Y  
Sbjct: 114 NSGLMYYS-QAPDTILKEQDWPISVEMQFLAGLDDGKARPTGNMCSPGTDIEYQGAVY-T 171

Query: 207 DFATMNVRPA---GEWNTSKIVFDNGHVEYYMNGQKTIEFE--AWSDDWFQRKNSGKWE- 260
           D   M+  P    GEW +++++ +NG+V + +NG   +++        + +  +   W  
Sbjct: 172 DHCLMSSSPTIPVGEWVSAELIVNNGNVTHIINGDIVLQYTRPTMGGKFVKGYDPAIWTP 231

Query: 261 NAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
           +AP +     G I LQ  G+P  F+NIKI+ L
Sbjct: 232 SAPLH----SGYIALQSEGHPIEFKNIKIKAL 259
>gi|149196057|ref|ZP_01873113.1| hypothetical protein LNTAR_22964 [Lentisphaera araneosa HTCC2155]
 gi|149140904|gb|EDM29301.1| hypothetical protein LNTAR_22964 [Lentisphaera araneosa HTCC2155]
          Length = 1233

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 54/200 (27%), Positives = 98/200 (49%), Gaps = 25/200 (12%)

Query: 299 EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYG--YLATCKYYNDFELTADFKQE 356
           ++LFNG +L GW       W V+D ++  +S  +   G  +L       DFE+T   + +
Sbjct: 24  QDLFNGNNLAGWK-GDLSYWSVKDGVIFGQSTKEHPTGGTFLVWDGEVADFEITLQARVK 82

Query: 357 ADGNSGIFIRSFVEEGAK--VNGWQVEVAPKGFDTGGIY-ESYGRGWLIQ----IPDDRE 409
            + NSG+  RS +    +  VNG+Q ++       G +Y +  GRG + Q    +  D++
Sbjct: 83  GN-NSGLQYRSKIANAQRFTVNGYQADIIDANHLFGMMYHQGEGRGIVAQRFQQVAVDKQ 141

Query: 410 ---NFLKE----------REWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAG-QGRIAL 455
                +KE           +WN  R+  VGN++   +NG   VD+ D+   A  +G +AL
Sbjct: 142 GKKTIVKEFGDKNQKWDASQWNEYRVIAVGNRLIHQVNGVTTVDVTDDHPNAARKGILAL 201

Query: 456 QIHDGGGIKVLWRNIRVKTL 475
           Q+H G  +   +++I+++ +
Sbjct: 202 QLHGGAPMTAEFKDIKLRKV 221
>gi|87308845|ref|ZP_01090984.1| hypothetical protein DSM3645_11417 [Blastopirellula marina DSM
           3645]
 gi|87288556|gb|EAQ80451.1| hypothetical protein DSM3645_11417 [Blastopirellula marina DSM
           3645]
          Length = 1672

 Score = 62.4 bits (150), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 61/202 (30%), Positives = 92/202 (45%), Gaps = 30/202 (14%)

Query: 299 EELFNGKDLTGWDVYGTEQ-WYVQDSLLVCES---GPDKQYGYLATCK-YYNDFELTADF 353
           + +F+GK L GW   G EQ W VQD  +  ++    P K   +L   +   +DFEL   +
Sbjct: 30  KSIFDGKTLNGWR--GKEQFWSVQDGAITGQTTSENPTKGNTFLIWDQGKVDDFELKLKY 87

Query: 354 KQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQ------IPDD 407
           K    GNSGI  RS       V G+Q ++  K   +G  YE  GRG +        + D 
Sbjct: 88  KI-VGGNSGIQYRSTDLGDFVVKGYQADIDSKDTYSGINYEERGRGIIANRGVKATVYDG 146

Query: 408 RENFLKER--------------EWNTMRIRVVGNQVTTWLNGEQMVDIQDE--KIGAGQG 451
            +    ER              +WN   I   GN +T ++NG +  ++ DE  K     G
Sbjct: 147 NQGNKDERFAESADIQAKINKEDWNEYHIIAKGNHLTHFINGVKTSEVIDEGKKDNRESG 206

Query: 452 RIALQIHDGGGIKVLWRNIRVK 473
            +ALQ+H G  + V +++I +K
Sbjct: 207 ILALQLHAGPPMNVQFKDIELK 228
>gi|87310061|ref|ZP_01092194.1| serine/threonine protein kinase [Blastopirellula marina DSM 3645]
 gi|87287307|gb|EAQ79208.1| serine/threonine protein kinase [Blastopirellula marina DSM 3645]
          Length = 806

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 39/126 (30%), Positives = 65/126 (51%), Gaps = 17/126 (13%)

Query: 79  NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP--WEVVDGAIQADGEGSDENGYIVY 136
           NV    + A  W  +F+G+ + GW      ++ GP  W V  G++   G    E G+I+ 
Sbjct: 605 NVPPTTDVAQDWVSVFNGRDMRGW------SIQGPTLWRVASGSLIGVGTSPQEAGWIML 658

Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP----LE 192
           ++ Y+ +EL++++K+ +GGNSG+  +   RP   +      E Q+IDD   A P    + 
Sbjct: 659 EKKYDAYELEFEYKLEEGGNSGVFLN--SRPGEPLVGSKFLEIQIIDD---AAPKFRNIP 713

Query: 193 DWQRCG 198
           D QR G
Sbjct: 714 DIQRTG 719
>gi|116619940|ref|YP_822096.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116223102|gb|ABJ81811.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 256

 Score = 60.5 bits (145), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 63/227 (27%), Positives = 94/227 (41%), Gaps = 46/227 (20%)

Query: 288 KIRELPRKTEEEE-----LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCK 342
           K  + P   E +E     +F+GK L GW+    + W V++  LV E  P    G +    
Sbjct: 33  KQSDRPEAIEGDEPGFKPIFDGKSLAGWE-GDPKYWRVENGALVGEITP----GTVIKSN 87

Query: 343 YY--------NDFELTADFKQEADGNSGIFIRSFV-------EEGAKVNGWQVEVAPKGF 387
            +         DFEL AD++    GNSGI  RS V            + G+Q ++  +  
Sbjct: 88  TFIIWRGGEPADFELKADYRITTAGNSGINYRSVVVPDKVTPSNQFAMRGYQHDIDGQNR 147

Query: 388 DTGGIYESYGR------GWLIQIPDDRENFLKER-------------EWNTMRIRVVGNQ 428
            TG  YE  GR      G    +   R+  +                +WN+  I   GN 
Sbjct: 148 YTGQNYEEKGRLFLALRGQTTHVVGGRKPIVLSSTGDTKALAEFITSDWNSCHIIARGNV 207

Query: 429 VTTWLNGEQMVDIQDEKIG--AGQGRIALQIHDGGGIKVLWRNIRVK 473
           +T  LNG  M  + D+       +G I +Q+H G  +KV +RN R+K
Sbjct: 208 LTHILNGHLMCCVIDDDPPNRMAKGLIGVQVHVGPPMKVEYRNFRLK 254
>gi|32474367|ref|NP_867361.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
 gi|32444905|emb|CAD74907.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
          Length = 230

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 54/188 (28%), Positives = 83/188 (44%), Gaps = 16/188 (8%)

Query: 301 LFNGKDLTGW--DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
           LF+G+ L GW       + W V+D +LVC+ G      Y+       +F   AD K    
Sbjct: 44  LFDGESLDGWKKSTENPDSWQVEDGMLVCK-GERCHLFYVGELAPLKNFHFKADVKVMPG 102

Query: 359 GNSGIFIRS-FVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREW 417
            N+GI+  + + E G    G++ +V     D       YG    I       N +++ EW
Sbjct: 103 SNAGIYFHTKYQESGWPKYGYECQVNVSHKDPKKTSSLYGVE-NIDAETLAANGIRDNEW 161

Query: 418 NTMRIRVVGNQVTTWLNGEQMVDI---QDEKIGA-------GQGRIALQIHDGGGIKVLW 467
            T  I V G  +   +NG+ +VD     D++  +       G+G  ALQ HD   I   +
Sbjct: 162 YTQEIIVRGKHIELKVNGKTLVDFTEPSDQEAFSDRFERRLGEGTFALQAHDPQSI-AYF 220

Query: 468 RNIRVKTL 475
           +N+RVK L
Sbjct: 221 KNLRVKPL 228
>gi|149177640|ref|ZP_01856241.1| hypothetical protein PM8797T_27292 [Planctomyces maris DSM 8797]
 gi|148843458|gb|EDL57820.1| hypothetical protein PM8797T_27292 [Planctomyces maris DSM 8797]
          Length = 240

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 62/216 (28%), Positives = 102/216 (47%), Gaps = 45/216 (20%)

Query: 291 ELPRKTEEEELFNGKDLTGWDVYGTEQ--WYVQDSLLVCESGPDKQYGYLATCKYYNDFE 348
           ELP   + + LFNGKDLTGW    T++  WYV+D  LVC   P    G + + + Y +F 
Sbjct: 24  ELP---QYKPLFNGKDLTGWVNVNTDKDTWYVKDGTLVCTGHP---IGVMRSDRQYENFL 77

Query: 349 LTADFKQ-EADGNSGIFIRS--FVEEGAKV-NGWQVE----------------VAPKGFD 388
           L  +++  EA GNSG+F  S   V EG ++  G +++                + P  + 
Sbjct: 78  LHIEWRHMEAGGNSGVFAWSEGTVPEGRRLPKGMEIQMLELDWVNQHKLKDGSLPPIAYV 137

Query: 389 TGGIYESYGRGWLIQIPDD----RENFLKER-----EWNTMRIRVVGNQVTTWLNGEQMV 439
            G   E +G   LI  PD+    R   ++ R     +WN   +  V   V   +NG+ + 
Sbjct: 138 HG---ELFGANGLITTPDNPRGTRSKSIENRCKGKGQWNVYDVVCVDGVVKLSVNGKFVN 194

Query: 440 DIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
            +++  I   +G + L+     G ++ +RNI++  L
Sbjct: 195 GVRNASI--KKGYLCLE---SEGAEIQFRNIQIMEL 225
>gi|145589295|ref|YP_001155892.1| protein of unknown function DUF1080 [Polynucleobacter sp.
           QLW-P1DMWA-1]
 gi|145047701|gb|ABP34328.1| protein of unknown function DUF1080 [Polynucleobacter sp.
           QLW-P1DMWA-1]
          Length = 197

 Score = 59.7 bits (143), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 53/180 (29%), Positives = 90/180 (50%), Gaps = 20/180 (11%)

Query: 300 ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
           +L +G  LT W++ GT  W + + ++      +K  G+L + K Y +F +  +F  E++ 
Sbjct: 34  DLIDGVSLTDWNIIGTANWVIGNGIV----EGNKPNGFLVSTKPYKNFIIRTEFWAESNT 89

Query: 360 NSGIFIRSFVEEGAKVNG---WQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
           NSGIFIR   ++  KV     +++ +    +DT    ++Y  G ++ +            
Sbjct: 90  NSGIFIR--CQDPKKVTQSTCYEINI----WDTRP-EQAYATGAIVDVAKVDPVPKAGGR 142

Query: 417 WNTMRIRVVGNQVTTWLNGEQMV-DIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           WNTM I   G+     LNG   V D QD +    +G IALQ    GGI + ++ +++KTL
Sbjct: 143 WNTMEITANGSHFKVVLNGVTTVADGQDSRY--VEGPIALQ--SAGGI-IKFKKVQIKTL 197
>gi|21224873|ref|NP_630652.1| glycosyl hydrolase (secreted protein) [Streptomyces coelicolor A3(2)]
 gi|3218378|emb|CAA19630.1| putative glycosyl hydrolase (putative secreted protein) [Streptomyces
            coelicolor A3(2)]
          Length = 1238

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 59/213 (27%), Positives = 97/213 (45%), Gaps = 41/213 (19%)

Query: 89   GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYD-RIYENFELQW 147
            G++ +F+GQTL+GW+    QA  G + V +G ++++G      G + Y  +  +++ L+ 
Sbjct: 1058 GYRDIFNGQTLDGWK----QAGPGKFNVKNGVLESEGG----MGLLWYQAKELKSYSLKL 1109

Query: 148  DWKISKGGNSGLLYHVVERPQYKVPYVT---GPEYQLIDDKGFAEPLEDWQRCGVDYAMY 204
            DWK+    NSG+    V  P    P+     G E Q ID     +     +  G  Y   
Sbjct: 1110 DWKMRGDDNSGVF---VGFPASDDPWSAVNKGYEIQ-IDATDAVD-----RTTGAIYTFK 1160

Query: 205  LPDFATMN--VRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENA 262
              +    +  +RP G+WN+ +I      ++ ++NG K  +F                   
Sbjct: 1161 AANIKARDQVLRPPGQWNSYEIKVQGERLQVFLNGVKINDFT---------------NKD 1205

Query: 263  PEYGLARKGLICLQDHGY--PAWFRNIKIRELP 293
            PE  L   G I LQ+HG      FRNI+++ELP
Sbjct: 1206 PERSLT-DGYIGLQNHGADDQVSFRNIQLKELP 1237

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 54/186 (29%), Positives = 88/186 (47%), Gaps = 20/186 (10%)

Query: 300  ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYL-ATCKYYNDFELTADFKQEAD 358
            ++FNG+ L GW   G  ++ V++ +L  E G     G L    K    + L  D+K   D
Sbjct: 1061 DIFNGQTLDGWKQAGPGKFNVKNGVLESEGG----MGLLWYQAKELKSYSLKLDWKMRGD 1116

Query: 359  GNSGIFIRSFVEE---GAKVNGWQVEV-APKGFD--TGGIYESYGRGWLIQIPDDRENFL 412
             NSG+F+     +    A   G+++++ A    D  TG IY              R+  L
Sbjct: 1117 DNSGVFVGFPASDDPWSAVNKGYEIQIDATDAVDRTTGAIYTFKAANI-----KARDQVL 1171

Query: 413  KER-EWNTMRIRVVGNQVTTWLNGEQMVDI--QDEKIGAGQGRIALQIHDGGGIKVLWRN 469
            +   +WN+  I+V G ++  +LNG ++ D   +D +     G I LQ H G   +V +RN
Sbjct: 1172 RPPGQWNSYEIKVQGERLQVFLNGVKINDFTNKDPERSLTDGYIGLQNH-GADDQVSFRN 1230

Query: 470  IRVKTL 475
            I++K L
Sbjct: 1231 IQLKEL 1236
>gi|116624007|ref|YP_826163.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116227169|gb|ABJ85878.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 229

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 65/215 (30%), Positives = 94/215 (43%), Gaps = 51/215 (23%)

Query: 301 LFNGKDLTGWD-------------------VYGTEQWYVQDSL------LVCESGPDKQY 335
           LFNGKDL+GW                        +QW  +  L         E   D + 
Sbjct: 26  LFNGKDLSGWRGRQGTYSPHAEALLSKEELAAKQQQWNAERDLHWSVDVAKGEIVSDGKS 85

Query: 336 GYLATCKYYNDFELTADFKQ-EADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYE 394
            +LAT + Y DF+L  D+   + +G+SGI++RS+     +V  W V+  P+    G    
Sbjct: 86  VHLATARDYRDFDLYVDWLMVKHNGDSGIYLRSY----PQVQIWDVD-NPREVKNGA--- 137

Query: 395 SYGRGWLIQIPDDREN---FLKERE----WNTMRIRVVGNQVTTWLNGEQMVDIQ--DEK 445
             G G L    DD       +K       WNT RI++ G++V+ WLNG+  VD Q  D  
Sbjct: 138 PRGSGALWNNNDDNPGKWPLVKADNPVGAWNTFRIKMAGSRVSVWLNGKLTVDNQVLDNF 197

Query: 446 IGAG-----QGRIALQIHDGGGIKVLWRNIRVKTL 475
                    +G I LQ H   G ++ +RNI +K L
Sbjct: 198 YNRSLPLVPKGAIELQTH---GSEIRFRNIYIKEL 229
>gi|87310236|ref|ZP_01092367.1| hypothetical protein DSM3645_27448 [Blastopirellula marina DSM
           3645]
 gi|87286985|gb|EAQ78888.1| hypothetical protein DSM3645_27448 [Blastopirellula marina DSM
           3645]
          Length = 253

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 63/217 (29%), Positives = 93/217 (42%), Gaps = 59/217 (27%)

Query: 301 LFNGKDLTGWDVYG-------------------------TEQWYVQDSLLVCE-SGPDKQ 334
           LFNG DL+GW  YG                         ++ W V+D  LV +  GP   
Sbjct: 40  LFNGHDLSGW--YGLNPHLAAKLDGEEKDKNLRTQRAEFSKYWRVEDGSLVNDGKGP--- 94

Query: 335 YGYLATCKYYNDFELTADFKQEADGNSGIFIR-----SFVEEGAKVNGWQVEVAPKGFDT 389
             Y  T   + D EL  ++K  A  +SGI++R        +   K N    +  P    +
Sbjct: 95  --YATTVDEFGDMELQLEYKTVAGADSGIYLRGAPQVQIWDSNQKFNSKAPDRKPH-LGS 151

Query: 390 GGIYESY----GRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD--IQD 443
           GG+Y +     GR  L ++ D         EWN +RIR +G +   WLNG+ +VD  + +
Sbjct: 152 GGLYNNTPGAPGRDPL-ELADHPFG-----EWNQLRIRQIGARTWVWLNGKLVVDDAVME 205

Query: 444 EKIGAGQ-----GRIALQIHDGGGIKVLWRNIRVKTL 475
              G  +     G I LQ H G   ++ WRNI V+ +
Sbjct: 206 NFWGRSKPLPSTGPIMLQTHGG---EISWRNIFVREI 239
>gi|149198757|ref|ZP_01875800.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
 gi|149138193|gb|EDM26603.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
           HTCC2155]
          Length = 234

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 51/174 (29%), Positives = 80/174 (45%), Gaps = 31/174 (17%)

Query: 311 DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQ---EADGNSGIFIRS 367
           +VY    + ++D ++   S   K+  +  T K Y DF L  + K    E   NSG+  R+
Sbjct: 37  NVYNDGHYELKDGVVHMTS---KKNFFFPTKKRYADFILEYEVKMPDVEEYSNSGLIFRA 93

Query: 368 FVEEGAK---VNGWQVEVAPKGFD-TGGIYESYGRGWLI----------------QI--- 404
            ++EG K   V G+Q EV P     +GG+Y+   RGWL                 Q+   
Sbjct: 94  QIKEGKKGKTVIGYQAEVDPSERAWSGGLYDQGRRGWLYPKHATRSKYDEDFKGSQLEPW 153

Query: 405 PDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH 458
            ++++   K  EWN  R+   G+ +  +LNG  M  + D K    +G IA+Q H
Sbjct: 154 TEEKKKVYKHLEWNKYRVECRGSDIKIFLNGTLMTHVIDTK--DAEGHIAIQHH 205
>gi|116621620|ref|YP_823776.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
 gi|116224782|gb|ABJ83491.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
          Length = 228

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 54/211 (25%), Positives = 91/211 (43%), Gaps = 50/211 (23%)

Query: 301 LFNGKDLTGWDVYGTEQWYVQ-DSLLVCE-----------SGP-----------DKQYGY 337
           LFNGK+L GW++ G  QW V  D  +V +            GP           D Q   
Sbjct: 24  LFNGKNLDGWEIIGDGQWTVMADGTVVGQRIGELRKMLVPGGPLTTPKDFKSWVDTQSWL 83

Query: 338 LATCKYYNDFELTADFKQEADGNSGIFIRS-------------FVEEGAKVNGWQVEVA- 383
             T   + +F+L  ++  +  GNSG+ IR              + +  +K+ G+++++  
Sbjct: 84  YTTRNDFGEFDLHLEYWTKTSGNSGVSIRDTSRAKWGVTTPPDYTKTPSKI-GYEIQINN 142

Query: 384 --PKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDI 441
             P    +G IY           P D +   K+ EWN M I+   +++T  +NG  + + 
Sbjct: 143 RFPDPHPSGSIYG------FQDAPKDSQ---KDDEWNAMDIKSRNDKITVSINGRVVAEH 193

Query: 442 QDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
             +   +  G I LQ+HD   I   +RN+R+
Sbjct: 194 AGDPARSKTGPIGLQLHDQFSIS-QFRNVRI 223
>gi|94969351|ref|YP_591399.1| protein of unknown function DUF1080 [Acidobacteria bacterium
           Ellin345]
 gi|94551401|gb|ABF41325.1| protein of unknown function DUF1080 [Acidobacteria bacterium
           Ellin345]
          Length = 308

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 47/152 (30%), Positives = 72/152 (47%), Gaps = 20/152 (13%)

Query: 293 PRKTEEEELFNGKDLTGW---DVYGTEQWYVQDSLLVC-ESGPDKQYGYLATCKYYNDFE 348
           P+  +  +LFNGKDLTGW   D   T  W V D  L   E GP+     + + + + DF+
Sbjct: 130 PKWGKPIDLFNGKDLTGWTMSDPKATNPWKVIDGALTSPEHGPE-----IISNQKFKDFK 184

Query: 349 LTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDR 408
           +  +F      NSG+++R   E  A++        P+   TG IY     G+L+  P   
Sbjct: 185 IHVEFNIHGTANSGVYLRGRYE--AQIETDSANEGPE-HHTGSIY-----GFLVGDPKPP 236

Query: 409 ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD 440
               +   W T  I ++G  VT  LNG+ ++D
Sbjct: 237 R---QSDVWQTYDITLLGRWVTVVLNGKTIID 265
>gi|21225488|ref|NP_631267.1| secreted glycosyl hydrolase [Streptomyces coelicolor A3(2)]
 gi|8546922|emb|CAB94634.1| putative secreted glycosyl hydrolase. [Streptomyces coelicolor
           A3(2)]
          Length = 579

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 54/188 (28%), Positives = 86/188 (45%), Gaps = 17/188 (9%)

Query: 295 KTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK 354
           +T    LF+G   +GW   G   + + D  L    G    Y Y A  ++  D+ L  D++
Sbjct: 265 ETGYRSLFDGTGTSGWKQAGPGGFTLADGTLTSHGGLG-MYWYQAE-EFTGDYSLKLDWR 322

Query: 355 QEADGNSGIFIRSFVEE---GAKVNGWQVEVAPKGF---DTGGIYESYGRGWLIQIPDDR 408
              D NSG+F+     +    A  NG+++++         TG +Y     G+       R
Sbjct: 323 ASGDDNSGVFVGFPASDDPWSAVDNGYEIQIDATDTPDRTTGSVY-----GFQSADVAAR 377

Query: 409 ENFLKER-EWNTMRIRVVGNQVTTWLNGEQMVDI--QDEKIGAGQGRIALQIHDGGGIKV 465
           +  L    EWNT  +RV G ++  +LNG ++ D    D      QG I +Q H G G +V
Sbjct: 378 DAALNPPGEWNTYEVRVTGERLELFLNGRKINDFTNTDPARSLRQGHIGIQNH-GDGDEV 436

Query: 466 LWRNIRVK 473
            +R++RVK
Sbjct: 437 SFRDVRVK 444
>gi|32471058|ref|NP_864051.1| conserved hypothetical protein-putative rhamnosidase
           [Rhodopirellula baltica SH 1]
 gi|32396760|emb|CAD71725.1| conserved hypothetical protein-putative rhamnosidase
           [Rhodopirellula baltica SH 1]
          Length = 824

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 57/201 (28%), Positives = 88/201 (43%), Gaps = 31/201 (15%)

Query: 301 LFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK-QEAD 358
           LFNGKDL+GW + Y   +  V D  +  ++  D ++ +L T + Y++F L+ D    E  
Sbjct: 53  LFNGKDLSGWRNPYSHGEASVVDGEIHLKA--DNKF-FLVTEQKYSNFRLSVDIHLPEGP 109

Query: 359 GNSGIFIRSFVEEGA--KVNGWQVEV-APKGFDTGGIYESYGRGWLIQIPDDR------- 408
            NSG+  R  V+E A  KV G+Q E    +   +GG+++   R W+      R       
Sbjct: 110 SNSGVMFRCHVDEDAEKKVYGYQAECDGSERRWSGGLFDEARRRWIWPSTKGRSTTQFRA 169

Query: 409 --------------ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIA 454
                          + L    WN   I  + + +T  LNG Q V  +D       G I 
Sbjct: 170 HEEESQKFFAEPRVRDALNRNGWNRYTITCIDDVITIELNGVQTVRFRDAM--DSSGFIG 227

Query: 455 LQIHDGGGIKVLWRNIRVKTL 475
           +Q H   G    +RN+ +K L
Sbjct: 228 IQHHGEKGQTYRFRNLFIKEL 248
>gi|32470990|ref|NP_863983.1| conserved hypothetical protein-putative secreted protein
           [Rhodopirellula baltica SH 1]
 gi|32396692|emb|CAD71657.1| conserved hypothetical protein-putative secreted protein
           [Rhodopirellula baltica SH 1]
          Length = 264

 Score = 55.8 bits (133), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 8/112 (7%)

Query: 58  MNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVV 117
           M    V+ L  A T+ S +      + +++A W  LFDG++LEGW           WEVV
Sbjct: 57  MGLALVLCLSAAGTSVSAEDN---AKADESAKWTTLFDGESLEGWEKVGRD--DSKWEVV 111

Query: 118 DGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQY 169
           DG I+  G  S     +     Y+NF  + + KI+ GGNSG+ +    +P +
Sbjct: 112 DGVIKGTGGVS---MLVNTSGPYKNFRYRAEVKINDGGNSGVYFRTTRKPGF 160
>gi|149196925|ref|ZP_01873978.1| hypothetical protein LNTAR_10981 [Lentisphaera araneosa HTCC2155]
 gi|149140035|gb|EDM28435.1| hypothetical protein LNTAR_10981 [Lentisphaera araneosa HTCC2155]
          Length = 185

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 52/164 (31%), Positives = 78/164 (47%), Gaps = 16/164 (9%)

Query: 318 WYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA-DGNSGIFIRSFVEEGAKVN 376
           W  +D+L++ E+  DK+   L T K + D+EL   FK ++ D +SG+F+R         N
Sbjct: 32  WTTEDNLIIGEN-VDKKNSVLWTEKKFKDYELVVKFKTDSKDYDSGVFLRG--------N 82

Query: 377 GWQVEV----APKGFDTGGIYE-SYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTT 431
             QV++    + K   T  IY  S  +G      D      K  +WN M++ V G  +  
Sbjct: 83  SHQVQIGVSRSLKKDLTACIYAPSDKKGKYPASSDKVAELHKLGQWNEMKMIVQGKNMKV 142

Query: 432 WLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
           +LNG Q VD    KI    G I LQ+H G   K+ +  +  K L
Sbjct: 143 YLNGTQTVDYDGVKIKE-SGPIGLQLHAGVHQKMEFEIVSFKEL 185
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.320    0.139    0.451 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,052,168,933
Number of Sequences: 5470121
Number of extensions: 96143281
Number of successful extensions: 171161
Number of sequences better than 1.0e-05: 122
Number of HSP's better than  0.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 87
Number of HSP's that attempted gapping in prelim test: 170471
Number of HSP's gapped (non-prelim): 373
length of query: 475
length of database: 1,894,087,724
effective HSP length: 137
effective length of query: 338
effective length of database: 1,144,681,147
effective search space: 386902227686
effective search space used: 386902227686
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 132 (55.5 bits)