BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF0813
(475 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|150008515|ref|YP_001303258.1| putative secreted glycosyl... 693 0.0
gi|154493862|ref|ZP_02033182.1| hypothetical protein PARMER... 652 0.0
gi|150009061|ref|YP_001303804.1| hypothetical protein BDI_2... 651 0.0
gi|88711878|ref|ZP_01105966.1| probable secreted glycosyl h... 252 3e-65
gi|88804756|ref|ZP_01120276.1| probable secreted glycosyl h... 249 3e-64
gi|126648755|ref|ZP_01721239.1| probable secreted glycosyl ... 241 8e-62
gi|126648754|ref|ZP_01721238.1| probable secreted glycosyl ... 228 6e-58
gi|150008483|ref|YP_001303226.1| hypothetical protein BDI_1... 223 3e-56
gi|88804436|ref|ZP_01119956.1| probable secreted glycosyl h... 182 3e-44
gi|88712822|ref|ZP_01106907.1| hypothetical protein FB2170_... 177 2e-42
gi|150008445|ref|YP_001303188.1| hypothetical protein BDI_1... 170 2e-40
gi|154494235|ref|ZP_02033555.1| hypothetical protein PARMER... 159 5e-37
gi|149197990|ref|ZP_01875038.1| probable secreted glycosyl ... 150 2e-34
gi|149173077|ref|ZP_01851708.1| protein up-regulated by thy... 150 3e-34
gi|83815548|ref|YP_446631.1| probable secreted glycosyl hyd... 147 1e-33
gi|88712915|ref|ZP_01107000.1| hypothetical protein FB2170_... 144 1e-32
gi|126646853|ref|ZP_01719363.1| probable secreted glycosyl ... 139 3e-31
gi|32473821|ref|NP_866815.1| probable secreted glycosyl hyd... 134 1e-29
gi|120435377|ref|YP_861063.1| conserved hypothetical protei... 134 2e-29
gi|149197739|ref|ZP_01874789.1| probable secreted glycosyl ... 132 4e-29
gi|149278985|ref|ZP_01885119.1| hypothetical protein PBAL39... 132 7e-29
gi|86143589|ref|ZP_01061974.1| hypothetical protein MED217_... 130 2e-28
gi|86143701|ref|ZP_01062077.1| probable secreted glycosyl h... 129 6e-28
gi|154493991|ref|ZP_02033311.1| hypothetical protein PARMER... 127 1e-27
gi|150009463|ref|YP_001304206.1| hypothetical protein BDI_2... 126 3e-27
gi|150007837|ref|YP_001302580.1| hypothetical protein BDI_1... 125 8e-27
gi|149196977|ref|ZP_01874030.1| probable secreted glycosyl ... 124 1e-26
gi|88803973|ref|ZP_01119493.1| hypothetical protein RB2501_... 123 3e-26
gi|88804319|ref|ZP_01119839.1| probable secreted glycosyl h... 121 8e-26
gi|150009244|ref|YP_001303987.1| hypothetical protein BDI_2... 120 2e-25
gi|154492730|ref|ZP_02032356.1| hypothetical protein PARMER... 120 2e-25
gi|149276529|ref|ZP_01882673.1| hypothetical protein PBAL39... 119 3e-25
gi|29347567|ref|NP_811070.1| hypothetical protein BT_2157 [... 118 8e-25
gi|156110542|gb|EDO12287.1| hypothetical protein BACOVA_021... 118 1e-24
gi|86131181|ref|ZP_01049780.1| hypothetical protein MED134_... 117 1e-24
gi|149177844|ref|ZP_01856443.1| probable secreted glycosyl ... 117 1e-24
gi|29349855|ref|NP_813358.1| hypothetical protein BT_4447 [... 117 1e-24
gi|126645050|ref|ZP_01717594.1| hypothetical protein ALPR1_... 117 2e-24
gi|153808737|ref|ZP_01961405.1| hypothetical protein BACCAC... 117 2e-24
gi|88712851|ref|ZP_01106936.1| probable secreted glycosyl h... 116 3e-24
gi|156112295|gb|EDO14040.1| hypothetical protein BACOVA_002... 116 3e-24
gi|156109541|gb|EDO11286.1| hypothetical protein BACOVA_031... 116 3e-24
gi|29348878|ref|NP_812381.1| hypothetical protein BT_3469 [... 113 2e-23
gi|118073263|ref|ZP_01541446.1| protein of unknown function... 112 5e-23
gi|149175127|ref|ZP_01853750.1| probable secreted glycosyl ... 111 1e-22
gi|149174297|ref|ZP_01852924.1| hypothetical protein PM8797... 110 2e-22
gi|116624768|ref|YP_826924.1| protein of unknown function D... 109 3e-22
gi|156861848|gb|EDO55279.1| hypothetical protein BACUNI_009... 106 3e-21
gi|116625197|ref|YP_827353.1| protein of unknown function D... 105 7e-21
gi|149177059|ref|ZP_01855667.1| hypothetical oxidoreductase... 105 7e-21
gi|149178817|ref|ZP_01857398.1| hypothetical protein PM8797... 104 1e-20
gi|88713768|ref|ZP_01107849.1| hypothetical protein FB2170_... 104 2e-20
gi|116621745|ref|YP_823901.1| protein of unknown function D... 100 2e-19
gi|32472633|ref|NP_865627.1| probable secreted glycosyl hyd... 100 4e-19
gi|32475761|ref|NP_868755.1| probable secreted glycosyl hyd... 99 6e-19
gi|87311549|ref|ZP_01093668.1| hypothetical protein DSM3645... 99 6e-19
gi|87308076|ref|ZP_01090218.1| hypothetical protein DSM3645... 98 1e-18
gi|32470813|ref|NP_863806.1| probable secreted glycosyl hyd... 98 2e-18
gi|149195996|ref|ZP_01873052.1| hypothetical protein LNTAR_... 97 2e-18
gi|87308201|ref|ZP_01090343.1| probable secreted glycosyl h... 97 3e-18
gi|149179157|ref|ZP_01857726.1| hypothetical protein PM8797... 94 1e-17
gi|87308956|ref|ZP_01091094.1| hypothetical protein DSM3645... 91 2e-16
gi|32471625|ref|NP_864618.1| hypothetical protein RB1854 [R... 90 4e-16
gi|149199908|ref|ZP_01876936.1| hypothetical protein LNTAR_... 88 1e-15
gi|149197913|ref|ZP_01874962.1| hypothetical protein LNTAR_... 88 1e-15
gi|32471039|ref|NP_864032.1| N-acetyl-galactosamine-6-sulfa... 87 3e-15
gi|87308660|ref|ZP_01090800.1| probable protein kinase yloP... 85 1e-14
gi|32472822|ref|NP_865816.1| hypothetical protein RB3944 [R... 84 2e-14
gi|149173679|ref|ZP_01852308.1| hypothetical protein PM8797... 82 6e-14
gi|149174041|ref|ZP_01852669.1| probable secreted glycosyl ... 80 3e-13
gi|87311427|ref|ZP_01093547.1| hypothetical protein DSM3645... 77 3e-12
gi|88712821|ref|ZP_01106906.1| hypothetical protein FB2170_... 75 7e-12
gi|149179308|ref|ZP_01857869.1| hypothetical protein PM8797... 75 1e-11
gi|32476109|ref|NP_869103.1| hypothetical protein RB9849 [R... 75 1e-11
gi|32474668|ref|NP_867662.1| probable secreted glycosyl hyd... 74 2e-11
gi|149200234|ref|ZP_01877256.1| hypothetical protein LNTAR_... 73 4e-11
gi|149177017|ref|ZP_01855625.1| hypothetical protein PM8797... 73 5e-11
gi|126648469|ref|ZP_01720956.1| hypothetical protein ALPR1_... 72 1e-10
gi|150007795|ref|YP_001302538.1| hypothetical protein BDI_1... 70 3e-10
gi|154490163|ref|ZP_02030424.1| hypothetical protein PARMER... 69 6e-10
gi|88713612|ref|ZP_01107694.1| probable large multifunction... 66 4e-09
gi|116626562|ref|YP_828718.1| protein of unknown function D... 66 5e-09
gi|116619831|ref|YP_821987.1| protein of unknown function D... 65 7e-09
gi|87310208|ref|ZP_01092340.1| hypothetical protein DSM3645... 65 9e-09
gi|87309043|ref|ZP_01091181.1| hypothetical protein DSM3645... 65 9e-09
gi|150007144|ref|YP_001301887.1| putative secreted glycosyl... 65 9e-09
gi|146280883|ref|YP_001171036.1| hypothetical protein PST_0... 65 1e-08
gi|32473471|ref|NP_866465.1| hypothetical protein-transmemb... 65 1e-08
gi|149177681|ref|ZP_01856282.1| sulfatase [Planctomyces mar... 65 1e-08
gi|148253179|ref|YP_001237764.1| putative exported protein ... 64 2e-08
gi|117164776|emb|CAJ88325.1| putative secreted glycosyl hyd... 64 2e-08
gi|116625196|ref|YP_827352.1| protein of unknown function D... 64 2e-08
gi|146342960|ref|YP_001208008.1| hypothetical protein BRADO... 64 2e-08
gi|88713788|ref|ZP_01107869.1| hypothetical protein FB2170_... 64 3e-08
gi|126648719|ref|ZP_01721203.1| probable secreted glycosyl ... 63 3e-08
gi|32471313|ref|NP_864306.1| probable secreted glycosyl hyd... 63 4e-08
gi|116626769|ref|YP_828925.1| protein of unknown function D... 63 4e-08
gi|109897209|ref|YP_660464.1| protein of unknown function D... 63 5e-08
gi|149196057|ref|ZP_01873113.1| hypothetical protein LNTAR_... 62 6e-08
gi|87308845|ref|ZP_01090984.1| hypothetical protein DSM3645... 62 7e-08
gi|87310061|ref|ZP_01092194.1| serine/threonine protein kin... 61 1e-07
gi|116619940|ref|YP_822096.1| protein of unknown function D... 60 3e-07
gi|32474367|ref|NP_867361.1| probable secreted glycosyl hyd... 60 4e-07
gi|149177640|ref|ZP_01856241.1| hypothetical protein PM8797... 60 4e-07
gi|145589295|ref|YP_001155892.1| protein of unknown functio... 60 5e-07
gi|21224873|ref|NP_630652.1| glycosyl hydrolase (secreted p... 59 5e-07
gi|116624007|ref|YP_826163.1| protein of unknown function D... 59 9e-07
gi|87310236|ref|ZP_01092367.1| hypothetical protein DSM3645... 59 9e-07
gi|149198757|ref|ZP_01875800.1| probable secreted glycosyl ... 58 1e-06
gi|116621620|ref|YP_823776.1| protein of unknown function D... 58 1e-06
gi|94969351|ref|YP_591399.1| protein of unknown function DU... 58 2e-06
gi|21225488|ref|NP_631267.1| secreted glycosyl hydrolase [S... 57 2e-06
gi|32471058|ref|NP_864051.1| conserved hypothetical protein... 57 3e-06
gi|32470990|ref|NP_863983.1| conserved hypothetical protein... 56 7e-06
gi|149196925|ref|ZP_01873978.1| hypothetical protein LNTAR_... 55 1e-05
>gi|150008515|ref|YP_001303258.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
ATCC 8503]
gi|149936939|gb|ABR43636.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
ATCC 8503]
Length = 422
Score = 693 bits (1789), Expect = 0.0, Method: Composition-based stats.
Identities = 330/408 (80%), Positives = 363/408 (88%), Gaps = 1/408 (0%)
Query: 69 ALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGS 128
L AC N LTE+EKA GW+LLFDG+TL+GWRDYNG ALTGPWEVV+G IQADG+GS
Sbjct: 15 GLIACDNTKHNTLTEQEKAEGWELLFDGETLDGWRDYNGTALTGPWEVVNGTIQADGQGS 74
Query: 129 DENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA 188
D +GYIV D+ YENFEL WDWKISKGGNSGLLYHVVERPQ+ VPYVTGPEYQLIDD FA
Sbjct: 75 DASGYIVTDKAYENFELSWDWKISKGGNSGLLYHVVERPQFPVPYVTGPEYQLIDDINFA 134
Query: 189 EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
EPLEDWQRCGVDYAMYLPDF T+ V PAGEWN SKI+FDNGHV Y+MNG KT+EF+AWSD
Sbjct: 135 EPLEDWQRCGVDYAMYLPDFNTIKVHPAGEWNNSKIIFDNGHVTYFMNGHKTVEFDAWSD 194
Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEELFNGKDLT 308
DWF RKNSGKW NAPEYGLA KGLICLQDHGYPAWFRNIKI+ELPRKT E LFNG+D+T
Sbjct: 195 DWFSRKNSGKWANAPEYGLAHKGLICLQDHGYPAWFRNIKIKELPRKTREARLFNGEDIT 254
Query: 309 GWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGNSGIFIRSF 368
WD YGTE WYV+D LLVCESGPDKQYGYLAT +YY+DF+LT +FKQEADGNSG+FIRSF
Sbjct: 255 NWDKYGTELWYVKDGLLVCESGPDKQYGYLATREYYDDFDLTVEFKQEADGNSGVFIRSF 314
Query: 369 VEE-GAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGN 427
VEE KVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPD++EN LK+ EWNTMRIRV G+
Sbjct: 315 VEEKDVKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDEKENILKQGEWNTMRIRVQGD 374
Query: 428 QVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
V TWLNGE+MV+I+DEKIGAGQGRIALQIHDGGGIKVLWRN+ ++TL
Sbjct: 375 NVQTWLNGEEMVNIRDEKIGAGQGRIALQIHDGGGIKVLWRNLHLQTL 422
>gi|154493862|ref|ZP_02033182.1| hypothetical protein PARMER_03206 [Parabacteroides merdae ATCC
43184]
gi|154086122|gb|EDN85167.1| hypothetical protein PARMER_03206 [Parabacteroides merdae ATCC
43184]
Length = 429
Score = 652 bits (1683), Expect = 0.0, Method: Composition-based stats.
Identities = 308/416 (74%), Positives = 356/416 (85%), Gaps = 3/416 (0%)
Query: 60 CLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDG 119
C + LF ++C+ N LT EE A GWQLLFDG+TL GW+DYNG LT PW VVDG
Sbjct: 17 CAISLALF---SSCTSVEPNTLTPEEIADGWQLLFDGKTLNGWKDYNGTTLTQPWHVVDG 73
Query: 120 AIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEY 179
IQA G+GSD +GYIV D+ YENFEL WDWK+SKGGNSG+LYHVVERPQ+ VPYVTGPEY
Sbjct: 74 CIQAKGDGSDASGYIVTDKQYENFELSWDWKLSKGGNSGMLYHVVERPQFAVPYVTGPEY 133
Query: 180 QLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQK 239
QLID+ F EPLE+WQ+ GVDYAM+LPD A M V P GEWN SKIVFDNGHVE+++NG K
Sbjct: 134 QLIDEPNFPEPLEEWQKLGVDYAMHLPDKAKMKVNPQGEWNNSKIVFDNGHVEHWLNGVK 193
Query: 240 TIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEE 299
+EFEAW+DDW+ +KNSGKW NAPEYGLA+KG++CLQDHGYPA FRNIKI+ELPRKT+E
Sbjct: 194 ILEFEAWTDDWYAKKNSGKWANAPEYGLAKKGVLCLQDHGYPASFRNIKIKELPRKTKEV 253
Query: 300 ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
LFNG DL GW+ YGTE+WYV+D LL+CESGPDK+YGYLAT YY+DF+LT +FKQEADG
Sbjct: 254 TLFNGTDLKGWEAYGTEKWYVEDGLLICESGPDKKYGYLATRDYYDDFDLTVEFKQEADG 313
Query: 360 NSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNT 419
NSG+FIRSF+EE KVNGWQVEVAPKG DTGGIYESYGRGWLIQIPD++EN LKE +WNT
Sbjct: 314 NSGVFIRSFIEEDVKVNGWQVEVAPKGHDTGGIYESYGRGWLIQIPDEKENILKEGDWNT 373
Query: 420 MRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
MRI+V G+ V TWLNG++MV+I DEKIGAGQGRIALQIHDGGGIKVLWRN++VKTL
Sbjct: 374 MRIKVQGDNVQTWLNGQEMVNINDEKIGAGQGRIALQIHDGGGIKVLWRNLKVKTL 429
>gi|150009061|ref|YP_001303804.1| hypothetical protein BDI_2462 [Parabacteroides distasonis ATCC
8503]
gi|149937485|gb|ABR44182.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 430
Score = 651 bits (1679), Expect = 0.0, Method: Composition-based stats.
Identities = 308/418 (73%), Positives = 360/418 (86%), Gaps = 4/418 (0%)
Query: 59 NCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVD 118
+C + LF ++C+ + QN LT EE A GW LLFDG+TL+GW+DYNG LT PW VVD
Sbjct: 16 SCAVSLALF---SSCASQEQNTLTPEEIADGWVLLFDGKTLDGWKDYNGTTLTQPWHVVD 72
Query: 119 GAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPE 178
G IQA G+GSD +GYIV D+ YENFEL WDWK+SKGGNSG+LYHVVERPQY VPYVTGPE
Sbjct: 73 GCIQAKGDGSDASGYIVTDKEYENFELSWDWKLSKGGNSGMLYHVVERPQYAVPYVTGPE 132
Query: 179 YQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
YQLID+ F EPLE+WQ+ GVDYAM+LPD + M V P GEWN SKIVFDNGHVE+++NGQ
Sbjct: 133 YQLIDEPNFPEPLEEWQKLGVDYAMHLPDKSKMKVNPQGEWNNSKIVFDNGHVEHWLNGQ 192
Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEE 298
K +EFEAW+DDW +KNSGKW NAPEYGLA+KG++CLQDHGYPA FRN+KI+ELPRKT +
Sbjct: 193 KIVEFEAWTDDWHAKKNSGKWANAPEYGLAKKGVLCLQDHGYPASFRNLKIKELPRKTGK 252
Query: 299 E-ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
E LFNG DLTGW+ YGTE+WYV+D LLVCESGPDKQYGYLAT YY+DF+LT +FKQEA
Sbjct: 253 EVNLFNGVDLTGWEPYGTEKWYVKDGLLVCESGPDKQYGYLATRDYYDDFDLTVEFKQEA 312
Query: 358 DGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREW 417
DGNSG+FIRSFVEEG KVNGWQVEVAPKG DTGGIYESYGRGWL+QIPD++EN LKE +W
Sbjct: 313 DGNSGVFIRSFVEEGVKVNGWQVEVAPKGHDTGGIYESYGRGWLVQIPDEKENILKEGDW 372
Query: 418 NTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
NTMRI+V G+ V TWLNG++MV++ DEKIGAG+GRIALQIHDGGGIKVLWRN+++ TL
Sbjct: 373 NTMRIKVQGDNVQTWLNGQEMVNLNDEKIGAGKGRIALQIHDGGGIKVLWRNLKLTTL 430
>gi|88711878|ref|ZP_01105966.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
HTCC2170]
gi|88710819|gb|EAR03051.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
HTCC2170]
Length = 200
Score = 252 bits (644), Expect = 3e-65, Method: Composition-based stats.
Identities = 119/176 (67%), Positives = 145/176 (82%), Gaps = 1/176 (0%)
Query: 297 EEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQE 356
+E+ LFNG+DLTGW +YGTE+W+V+D LLVCESGPD QYGYLAT ++Y DF L +FKQE
Sbjct: 22 QEKSLFNGEDLTGWTIYGTEKWFVEDGLLVCESGPDAQYGYLATKEHYKDFTLILEFKQE 81
Query: 357 ADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
A+GNSG+FIRS V+ G KV+GWQVEVAP G TGG+YESYGRGWLI+ ++ LK E
Sbjct: 82 ANGNSGVFIRSTVD-GTKVSGWQVEVAPPGHSTGGVYESYGRGWLIKPDPAKDKALKMGE 140
Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
WN M+IRV G+++T+W+NGE+MV I D KIG G+G IALQIHDGGGIKV WRNIRV
Sbjct: 141 WNEMKIRVYGSKLTSWVNGEEMVTINDAKIGTGEGSIALQIHDGGGIKVKWRNIRV 196
>gi|88804756|ref|ZP_01120276.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
HTCC2501]
gi|88785635|gb|EAR16804.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
HTCC2501]
Length = 196
Score = 249 bits (636), Expect = 3e-64, Method: Composition-based stats.
Identities = 122/180 (67%), Positives = 146/180 (81%), Gaps = 2/180 (1%)
Query: 293 PRKTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTAD 352
P +E ELFNG+DL+GW VYGTE+WYV+D LLVCESGPDK YGYLAT K+Y DF LT +
Sbjct: 15 PLAAQETELFNGEDLSGWTVYGTEKWYVEDGLLVCESGPDKGYGYLATDKHYKDFVLTLE 74
Query: 353 FKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFL 412
F QE+DGNSG+FIRS V +G KV+GWQVEVAP G DTGG+YESYGRGWLI+ P+ + +
Sbjct: 75 FLQESDGNSGVFIRSTV-DGTKVSGWQVEVAPPGHDTGGVYESYGRGWLIK-PEAGKPDV 132
Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
K EWNTM+I V G+ +T+WLNG +M+ + DEKIG G+G IALQIHDGGGIKV WRNI V
Sbjct: 133 KMGEWNTMKIMVSGDTITSWLNGTEMITLTDEKIGQGEGSIALQIHDGGGIKVKWRNIVV 192
>gi|126648755|ref|ZP_01721239.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
gi|126575206|gb|EAZ79556.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
Length = 198
Score = 241 bits (615), Expect = 8e-62, Method: Composition-based stats.
Identities = 116/179 (64%), Positives = 147/179 (82%), Gaps = 1/179 (0%)
Query: 297 EEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQE 356
++E+LFNG+DLTGW +YGTE+WYV+D LL+ ESGPDK YGYL T ++Y+DFE+T +FKQE
Sbjct: 21 KKEKLFNGEDLTGWTIYGTEKWYVEDGLLISESGPDKGYGYLGTNEHYDDFEITLEFKQE 80
Query: 357 ADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
A+GNSG+FIRS V+ G KV+GWQVEVAP G DTGGIYESYGRGWLI+ +++ +LK +
Sbjct: 81 ANGNSGVFIRSTVD-GTKVSGWQVEVAPPGHDTGGIYESYGRGWLIKPDPEKDKYLKFGK 139
Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
WN MRI V G+ VT++LNG +MV+ D KIG G+G I LQIHDGGGIKV W+NI +K L
Sbjct: 140 WNKMRIVVKGDNVTSYLNGHEMVNFTDAKIGEGKGGICLQIHDGGGIKVYWKNIVLKKL 198
>gi|126648754|ref|ZP_01721238.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
gi|126575205|gb|EAZ79555.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
Length = 254
Score = 228 bits (582), Expect = 6e-58, Method: Composition-based stats.
Identities = 122/217 (56%), Positives = 153/217 (70%), Gaps = 2/217 (0%)
Query: 77 PQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVY 136
P N LTEEEKA GW LLFDG GWR +NG W + DGA++A G+G D G IV+
Sbjct: 39 PDNTLTEEEKATGWMLLFDGSDPSGWRAFNGDTFPEGWTIEDGALKALGKGGDIGGDIVF 98
Query: 137 D-RIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQ 195
+E FE++WDWKIS+GGNSG+LYHVVE P+Y PY TGPEYQ+ID GF EPLE WQ
Sbjct: 99 GPMEFEEFEMEWDWKISEGGNSGVLYHVVEDPKYHAPYETGPEYQVIDQLGFPEPLEKWQ 158
Query: 196 RCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKN 255
G DYAM PD+ V+PAGEWN SKI+F Y++NG+KT+EF +S++W +N
Sbjct: 159 SIGADYAMTEPDYEGA-VKPAGEWNHSKIIFSEEGSSYWLNGKKTVEFVPYSEEWTAARN 217
Query: 256 SGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
SGKW + P+Y +A+ GLI LQDHG WF+NIKI++L
Sbjct: 218 SGKWNDFPDYAIAKTGLISLQDHGAVTWFKNIKIKKL 254
>gi|150008483|ref|YP_001303226.1| hypothetical protein BDI_1869 [Parabacteroides distasonis ATCC
8503]
gi|149936907|gb|ABR43604.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 258
Score = 223 bits (567), Expect = 3e-56, Method: Composition-based stats.
Identities = 114/244 (46%), Positives = 159/244 (65%), Gaps = 18/244 (7%)
Query: 64 MVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQA 123
MV G+ ++ +E NVLTEEEKA G+ LLF+G+ GW+ +NG + G W+V DG I
Sbjct: 17 MVACGSRSSSTEVKDNVLTEEEKAEGYTLLFNGKDFTGWKMFNGGDVKG-WQVEDGVIVG 75
Query: 124 DGEGSDE--------NGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVT 175
G G D + IV + Y NF+++WDWKI GNSG LYHV E P+YK P+ T
Sbjct: 76 YGNGGDVIADTTIKVSTDIVTVKNYHNFQIKWDWKIGAQGNSGFLYHVQEGPKYKAPFET 135
Query: 176 GPEYQLIDDKGF-------AEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDN 228
GPEYQLIDD + E LEDWQ+ G +YAMY+P+ T V P GEWN+S +++ +
Sbjct: 136 GPEYQLIDDDNYPWVSETGKEGLEDWQKTGCNYAMYVPE--TKQVNPPGEWNSSMVLYKD 193
Query: 229 GHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIK 288
G+VE+++NG+K F+ S+DW R+ SGKWE P+YG++ G +C QDHG +F+N+K
Sbjct: 194 GYVEHWLNGEKLFSFQEGSEDWKMRRYSGKWEAFPDYGISTTGKLCFQDHGSKVYFKNVK 253
Query: 289 IREL 292
I++L
Sbjct: 254 IKDL 257
>gi|88804436|ref|ZP_01119956.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
HTCC2501]
gi|88785315|gb|EAR16484.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
HTCC2501]
Length = 264
Score = 182 bits (463), Expect = 3e-44, Method: Composition-based stats.
Identities = 100/234 (42%), Positives = 132/234 (56%), Gaps = 31/234 (13%)
Query: 85 EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDG--------AIQADGEGSDENGYIVY 136
E WQ LFDG +LEGWR YN + + W + D ++ D EG + Y
Sbjct: 36 EDTGEWQYLFDGTSLEGWRGYNAETMPPGWVIEDSVLTFKTELGLEQDYEGGKDILYAAE 95
Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAE------- 189
+ ++NFEL +WK+ +GGNSG+ YHV E Y P V PEYQLIDD+ +A
Sbjct: 96 E--FDNFELYLEWKLPEGGNSGIFYHVKE--GYDGPPVVAPEYQLIDDENYARIHDLTEY 151
Query: 190 -----------PLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
L+ Q+ G DYAM+ PD + P GEWN+SKIVF VE+++NG+
Sbjct: 152 NLSLGYTENPNELKPLQQTGADYAMHPPD-PGKTLHPVGEWNSSKIVFTPERVEHWLNGE 210
Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+ F W D W ++KNS KW+N+P YG +KG I LQDH P WFRNIKIR+L
Sbjct: 211 MILSFVPWDDAWEEKKNSDKWKNSPAYGTFKKGYIALQDHASPIWFRNIKIRKL 264
>gi|88712822|ref|ZP_01106907.1| hypothetical protein FB2170_09296 [Flavobacteriales bacterium
HTCC2170]
gi|88708720|gb|EAR00955.1| hypothetical protein FB2170_09296 [Flavobacteriales bacterium
HTCC2170]
Length = 263
Score = 177 bits (448), Expect = 2e-42, Method: Composition-based stats.
Identities = 99/241 (41%), Positives = 135/241 (56%), Gaps = 26/241 (10%)
Query: 76 KPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY-- 133
K + + + KA W LFDG + EGWR YNG+AL W DGA+ D E E Y
Sbjct: 25 KSETGIEAQVKANDWITLFDGVSTEGWRAYNGKALPPGWVAKDGALTFDTELGLEQDYKG 84
Query: 134 ---IVYD-RIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA- 188
I+Y ++NFE +WK+ +GGNSG+ YH+ E Y P PEYQLIDD+ +A
Sbjct: 85 GKDIIYGAEEFDNFEFYVEWKLPEGGNSGIFYHLKE--GYNSPPEVSPEYQLIDDENYAR 142
Query: 189 -----------------EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHV 231
E L+ Q+ DYAM+ + + P GEWN+SKIVF V
Sbjct: 143 IHDLTEYNLSLGYTEKPEELKPLQQTASDYAMHAANPEGKILHPVGEWNSSKIVFTPEKV 202
Query: 232 EYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRE 291
E+++NG+ + F WS+DW ++KNS KW+N+ +YG + G I QDH P WFRNIKI++
Sbjct: 203 EHWLNGKMVLSFVPWSEDWHEKKNSDKWKNSEDYGKFKTGFIGFQDHSSPIWFRNIKIKK 262
Query: 292 L 292
L
Sbjct: 263 L 263
>gi|150008445|ref|YP_001303188.1| hypothetical protein BDI_1827 [Parabacteroides distasonis ATCC
8503]
gi|149936869|gb|ABR43566.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 245
Score = 170 bits (431), Expect = 2e-40, Method: Composition-based stats.
Identities = 99/247 (40%), Positives = 138/247 (55%), Gaps = 11/247 (4%)
Query: 54 MMKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP 113
M K ++ C + N LT++EK AGW+LLF+G+ GWR NG A+
Sbjct: 1 MKKLFKSFLILSAIAMAIPCFAQTPNTLTKKEKKAGWELLFNGKDFSGWRQCNGTAMPAN 60
Query: 114 WEVVDGAIQA-DGEGSD----ENGYIVY-DRIYENFELQWDWKISKGGNSGLLYHVVERP 167
W + D A++ GEG NG I+Y ++ ++NFEL DWK SK GNSG+ Y+V E P
Sbjct: 61 WVIEDNAMKVFTGEGKKPGQGANGDILYQNKKFKNFELSVDWKASKMGNSGIFYYVREVP 120
Query: 168 QYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFD 227
+ Y PE Q++D+ + G Y M D T+N PAGEWNT I
Sbjct: 121 GKPI-YYAAPEVQVLDNVDATDNKLANHLAGSLYDMLPADPKTVN--PAGEWNTIVIRVK 177
Query: 228 NGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEY--GLARKGLICLQDHGYPAWFR 285
+G V + NG+K +E+ WS +W + K++N P + G++++G I LQDHGYP WFR
Sbjct: 178 DGKVTHTQNGKKVVEYTLWSKEWDDLVANSKFKNFPGFTEGISKEGYIGLQDHGYPIWFR 237
Query: 286 NIKIREL 292
NIKIREL
Sbjct: 238 NIKIREL 244
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 61/215 (28%), Positives = 96/215 (44%), Gaps = 36/215 (16%)
Query: 294 RKTEEEELFNGKDLTGW-DVYGTE---QWYVQDSLLVCESGPDKQYG------YLATCKY 343
+K E LFNGKD +GW GT W ++D+ + +G K+ G L K
Sbjct: 33 KKAGWELLFNGKDFSGWRQCNGTAMPANWVIEDNAMKVFTGEGKKPGQGANGDILYQNKK 92
Query: 344 YNDFELTADFKQEADGNSGIF--IRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWL 401
+ +FEL+ D+K GNSGIF +R + +V+V T ++ G L
Sbjct: 93 FKNFELSVDWKASKMGNSGIFYYVREVPGKPIYYAAPEVQVLDNVDATDNKLANHLAGSL 152
Query: 402 IQ-IPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD-----------IQDEKIG-- 447
+P D + EWNT+ IRV +VT NG+++V+ + + K
Sbjct: 153 YDMLPADPKTVNPAGEWNTIVIRVKDGKVTHTQNGKKVVEYTLWSKEWDDLVANSKFKNF 212
Query: 448 -------AGQGRIALQIHDGGGIKVLWRNIRVKTL 475
+ +G I LQ H G + +RNI+++ L
Sbjct: 213 PGFTEGISKEGYIGLQDH---GYPIWFRNIKIREL 244
>gi|154494235|ref|ZP_02033555.1| hypothetical protein PARMER_03585 [Parabacteroides merdae ATCC
43184]
gi|154086097|gb|EDN85142.1| hypothetical protein PARMER_03585 [Parabacteroides merdae ATCC
43184]
Length = 271
Score = 159 bits (401), Expect = 5e-37, Method: Composition-based stats.
Identities = 97/261 (37%), Positives = 145/261 (55%), Gaps = 13/261 (4%)
Query: 42 FVEFFENFKIVFMMKCM--NCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTL 99
F+ N K+ MK + + + ++ + A+ + ++K N LTE+EK GW LLF+G+
Sbjct: 13 FIFSMSNLKMKCNMKNLFKSFVVLLAVMAAVPSFAQKANNTLTEKEKKQGWTLLFNGKDF 72
Query: 100 EGWRDYNGQALTGPWEVVDGAIQ---ADGE--GSDENGYIVY-DRIYENFELQWDWKISK 153
GWR N + W + D A++ A G+ G G I+Y ++ ++NFEL DWK SK
Sbjct: 73 TGWRQCNSTGMASNWVIEDEAMKVFTAPGKKPGHGAGGDILYKEKKFKNFELSVDWKTSK 132
Query: 154 GGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNV 213
GNSG+ Y+V E P + Y PE Q++D+ + G Y M D T V
Sbjct: 133 MGNSGIFYYVREVPGKPI-YYAAPEVQVLDNVDATDNKLANHLAGSLYDMLPADPKT--V 189
Query: 214 RPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEY--GLARKG 271
+PAGEWNT I +G V + NG+K +++ WS +W + K+++ + G++ +G
Sbjct: 190 KPAGEWNTIVIKVKDGKVTHTQNGKKVVQYTLWSKEWDDMVANSKFKDFQGFQEGISHEG 249
Query: 272 LICLQDHGYPAWFRNIKIREL 292
I LQDHGYP WFRNIKIREL
Sbjct: 250 YIGLQDHGYPIWFRNIKIREL 270
Score = 59.3 bits (142), Expect = 6e-07, Method: Composition-based stats.
Identities = 62/223 (27%), Positives = 97/223 (43%), Gaps = 36/223 (16%)
Query: 286 NIKIRELPRKTEEEELFNGKDLTGWDVYGT----EQWYVQDSLLVCESGPDKQYGY---- 337
N + E +K LFNGKD TGW + W ++D + + P K+ G+
Sbjct: 51 NNTLTEKEKKQGWTLLFNGKDFTGWRQCNSTGMASNWVIEDEAMKVFTAPGKKPGHGAGG 110
Query: 338 --LATCKYYNDFELTADFKQEADGNSGIF--IRSFVEEGAKVNGWQVEVAPKGFDTGGIY 393
L K + +FEL+ D+K GNSGIF +R + +V+V T
Sbjct: 111 DILYKEKKFKNFELSVDWKTSKMGNSGIFYYVREVPGKPIYYAAPEVQVLDNVDATDNKL 170
Query: 394 ESYGRGWLIQ-IPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMV------------- 439
++ G L +P D + EWNT+ I+V +VT NG+++V
Sbjct: 171 ANHLAGSLYDMLPADPKTVKPAGEWNTIVIKVKDGKVTHTQNGKKVVQYTLWSKEWDDMV 230
Query: 440 ------DIQDEKIG-AGQGRIALQIHDGGGIKVLWRNIRVKTL 475
D Q + G + +G I LQ H G + +RNI+++ L
Sbjct: 231 ANSKFKDFQGFQEGISHEGYIGLQDH---GYPIWFRNIKIREL 270
>gi|149197990|ref|ZP_01875038.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
gi|149138902|gb|EDM27307.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
Length = 218
Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats.
Identities = 80/203 (39%), Positives = 116/203 (57%), Gaps = 7/203 (3%)
Query: 90 WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDW 149
WQ LF+GQ L+GWR+YN Q + G W V D AI +G +IVY++ +++FEL+ W
Sbjct: 23 WQSLFNGQDLQGWRNYNSQGINGKWIVEDSAIHLTEKGGQ---HIVYNQPFKDFELKLQW 79
Query: 150 KISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFA 209
KIS+ GNSG+ ER QY P+++G E Q++DD+ + G Y +
Sbjct: 80 KISERGNSGIFIRSSERYQY--PWMSGVEMQILDDEKHPNAKNPLTKAGSCYDLIAAPEG 137
Query: 210 TMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLAR 269
+N A WN I+ H ++++NG KT +F+ S +W K++ P + +
Sbjct: 138 AVN--KAMAWNDVHIIVKGSHYQFFLNGVKTADFDVKSAEWRALIAGSKFKKYPGFSENK 195
Query: 270 KGLICLQDHGYPAWFRNIKIREL 292
+G ICLQDHG P WFRNIKIREL
Sbjct: 196 QGFICLQDHGDPVWFRNIKIREL 218
>gi|149173077|ref|ZP_01851708.1| protein up-regulated by thyroid hormone-putative PQQ-dependent
glucose dehydrogenase [Planctomyces maris DSM 8797]
gi|148847883|gb|EDL62215.1| protein up-regulated by thyroid hormone-putative PQQ-dependent
glucose dehydrogenase [Planctomyces maris DSM 8797]
Length = 659
Score = 150 bits (378), Expect = 3e-34, Method: Composition-based stats.
Identities = 93/278 (33%), Positives = 146/278 (52%), Gaps = 28/278 (10%)
Query: 60 CLCVMVLFGA-LTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVD 118
CL +++ A ++A S+ N L++ E+ +GW+LLFDG+T +GWR+Y + ++ W + D
Sbjct: 24 CLSLLMTNTAVISADSDTSLNKLSKAEQKSGWKLLFDGKTTDGWRNYKKEGVSDGWTIKD 83
Query: 119 GAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPE 178
G + +G+ G I+ D + FEL +++IS GNSGL++HV E + K P++TGPE
Sbjct: 84 GVLSRSAKGA---GDIITDDQFGFFELSLEYRISPEGNSGLMFHVTE--EEKTPWMTGPE 138
Query: 179 YQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNV-----------------RPAGEWNT 221
Q+ D+ +P Q+ G Y +Y P + RPAG+WN
Sbjct: 139 VQIQDNVDGHDP----QKAGWLYQLYKPATPKWMIEAEKAGKKVTPAVVDATRPAGDWNH 194
Query: 222 SKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYP 281
+ + MNG K +F+ S DW +R + K+ P +G KG ICLQDH
Sbjct: 195 LFLRVGPDRSQIIMNGVKYFQFDKGSADWNKRVAASKFSKYPSFGKPTKGHICLQDHNDL 254
Query: 282 AWFRNIKIRELPRKTEEEELFNGK-DLTGWDVYGTEQW 318
FRNIKIRE+P ++ +G+ L G + +W
Sbjct: 255 VSFRNIKIREIPADGSVQDPSDGELALKGVPAFPNLEW 292
>gi|83815548|ref|YP_446631.1| probable secreted glycosyl hydrolase [Salinibacter ruber DSM 13855]
gi|83756942|gb|ABC45055.1| probable secreted glycosyl hydrolase [Salinibacter ruber DSM 13855]
Length = 222
Score = 147 bits (371), Expect = 1e-33, Method: Composition-based stats.
Identities = 93/224 (41%), Positives = 129/224 (57%), Gaps = 16/224 (7%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL--TGPWEVVDGAIQADGEGSDENGY--- 133
N LT E+A GW LLFDG+T GWR YN + TG W + DG + +G G +G
Sbjct: 5 NTLTPAERADGWTLLFDGETAAGWRGYNDEDFPDTG-WTIEDGVLTIEGAGGGVSGSGGD 63
Query: 134 IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLID-----DKGFA 188
I+ Y +F L+ +WKIS+GGNSG+ Y +E+P + Y + PE Q++D D G
Sbjct: 64 IITTETYGDFVLKLEWKISEGGNSGIFYRAIEQPDQPI-YWSAPEMQILDNANHPDAGRG 122
Query: 189 EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
E ++ G Y + D T + GEW IV + GHVE+++NGQK +E+E W+
Sbjct: 123 E--NGNRKAGSLYDLIPADPQTFSGH--GEWQDVMIVVEGGHVEHWLNGQKVLEYETWTP 178
Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
W++ K+ PE+G AR+G I LQDHG A FRNIKI+EL
Sbjct: 179 GWYRMIRDSKFRTHPEFGDAREGHIGLQDHGTTAHFRNIKIKEL 222
>gi|88712915|ref|ZP_01107000.1| hypothetical protein FB2170_09761 [Flavobacteriales bacterium
HTCC2170]
gi|88708813|gb|EAR01048.1| hypothetical protein FB2170_09761 [Flavobacteriales bacterium
HTCC2170]
Length = 266
Score = 144 bits (364), Expect = 1e-32, Method: Composition-based stats.
Identities = 87/228 (38%), Positives = 121/228 (53%), Gaps = 14/228 (6%)
Query: 75 EKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY- 133
E N LT EKA GW +LFDG T EGWR Y + WE+VDG + G G E G
Sbjct: 43 EAVMNGLTAAEKADGWVMLFDGTTSEGWRGYKKEHFPAAWEIVDGTMHMMGSGRGEAGAK 102
Query: 134 ----IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAE 189
I++D+ ++NF L +WKIS+GGNSG+ Y E+ Y + T PE Q++D++
Sbjct: 103 DGGDIIFDKQFQNFTLSLEWKISEGGNSGIFYLGEEKLDYI--WKTAPEMQILDNE--RH 158
Query: 190 PLEDWQRCGVDYAMYLPDFA---TMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAW 246
P + G A L D N +PAGEWN ++ G V + NG+ +E+ W
Sbjct: 159 PDAKLGKDGNRQAGSLYDLVPAKPQNAKPAGEWNKIEVTVYKGTVIHSQNGENVVEYHLW 218
Query: 247 SDDWFQRKNSGKWE--NAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+ +W + K+ NA + KG I LQDHG WFRN+K++EL
Sbjct: 219 TPEWNEMVAGSKFPGLNAEWADVPSKGYIGLQDHGDDVWFRNVKLKEL 266
>gi|126646853|ref|ZP_01719363.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
gi|126576901|gb|EAZ81149.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
Length = 246
Score = 139 bits (351), Expect = 3e-31, Method: Composition-based stats.
Identities = 77/224 (34%), Positives = 129/224 (57%), Gaps = 10/224 (4%)
Query: 74 SEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSD---- 129
+E+P V ++E+ + LF+G T GW Y G + W++ DG + D + D
Sbjct: 28 TEEPTEVTQKQEE---FTPLFEGNTFAGWHKYGGGEVGKAWKIEDGTVYLDAKNKDGWQT 84
Query: 130 -ENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA 188
+ G IV D +ENF L++DWKI++ GNSG+++ V E P+Y + TG E Q++D++G
Sbjct: 85 GDGGDIVTDEEFENFHLKYDWKIAENGNSGVIFFVQEAPEYPYSWHTGMEMQVLDNEGHP 144
Query: 189 EPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
+ R G Y + + T V+P GEWN ++I+ D G+++ +NG +E E W+
Sbjct: 145 DAKIISHRAGDLYDLIVSSEET--VKPWGEWNHAEIIADQGNLKLRLNGVTVVETELWTP 202
Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+W K+++ P +G +KG I LQDHG +F+N++I++L
Sbjct: 203 EWEALIADSKFKDMPGFGTFKKGKIALQDHGDLVYFKNVEIKKL 246
>gi|32473821|ref|NP_866815.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
gi|32444357|emb|CAD74355.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
Length = 254
Score = 134 bits (338), Expect = 1e-29, Method: Composition-based stats.
Identities = 78/211 (36%), Positives = 116/211 (54%), Gaps = 20/211 (9%)
Query: 90 WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAI-----QADGEGSDENGYIVYDRIYENFE 144
W+ LFDG L+ WR+YN ++T W++ A+ + G+ + I ++ ++ FE
Sbjct: 56 WETLFDGSNLDAWREYNRDSVTSGWKIEGNALTCISHKDQGDAARGENLITKEK-FDAFE 114
Query: 145 LQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMY 204
L+ D+K++ NSG+++HVVE K PY TGPE Q+ D KG +P Q+CG Y +Y
Sbjct: 115 LELDFKVTPAANSGVMFHVVETK--KPPYYTGPEIQIQDHKGGHDP----QKCGWLYQLY 168
Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQR---KNSGKWEN 261
+ T + +PAGEWN +++ + +NG EF SDDW +R GKWE
Sbjct: 169 PSE--TDSTKPAGEWNHLRVLITPAKCQIEVNGVLYSEFVKGSDDWNERVAKSKFGKWEG 226
Query: 262 APEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+G G ICLQDH +RNI+IR L
Sbjct: 227 ---FGEPTNGHICLQDHNDEVSYRNIRIRRL 254
>gi|120435377|ref|YP_861063.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
gi|117577527|emb|CAL65996.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
Length = 251
Score = 134 bits (336), Expect = 2e-29, Method: Composition-based stats.
Identities = 86/248 (34%), Positives = 129/248 (52%), Gaps = 33/248 (13%)
Query: 69 ALTACSEKPQNVLTEEEKAAG--------------WQLLFDGQTLEGWRDYNGQALTGPW 114
ALTAC K +N +EE K A WQ LF+G+ L+GW+ +N +++ W
Sbjct: 13 ALTAC--KNENKESEEIKVAENTEMKSSEAKEDQEWQELFNGENLDGWKAFNKDSISDQW 70
Query: 115 EVVDGAI--QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVP 172
+ +GAI + E + ++ +ENFEL +WKIS+ GNSG+++ V E +Y P
Sbjct: 71 KAENGAISFKPSAENRSKTENLITKEEFENFELSLEWKISEAGNSGIMWAVQEGEKYNEP 130
Query: 173 YVTGPEYQLIDDKGFAEPLEDWQR-CGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHV 231
Y+TGPE Q++D++ + R G Y M P +PAGEWN I H+
Sbjct: 131 YLTGPEIQVLDNQRHPDAKNGLNRTAGALYDMIPPSEDV--TKPAGEWNKEVI-----HI 183
Query: 232 EY-------YMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWF 284
Y +NG +EF ++W + K+ +G ++KG I LQDHG P W+
Sbjct: 184 NYKENKGWVKLNGTTIVEFPVHGEEWKNMVSKSKFSEWEGFGASQKGHIALQDHGDPVWY 243
Query: 285 RNIKIREL 292
RNIKI++L
Sbjct: 244 RNIKIKQL 251
>gi|149197739|ref|ZP_01874789.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
gi|149139309|gb|EDM27712.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
Length = 233
Score = 132 bits (333), Expect = 4e-29, Method: Composition-based stats.
Identities = 74/225 (32%), Positives = 124/225 (55%), Gaps = 9/225 (4%)
Query: 70 LTACSEKP--QNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEG 127
+T+C+ P N L++ E GWQLLF+GQ + WR++ Q + W V G ++ G G
Sbjct: 15 ITSCTSTPIPDNSLSKAEAKEGWQLLFNGQDMSQWRNFKKQDINPKWVVEGGTMKLSGGG 74
Query: 128 SDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF 187
G I+ + YENF+ + +WKIS+ GNSG+ ++ + K Y PE Q++D++
Sbjct: 75 G---GDIMTKKQYENFDFRMEWKISEAGNSGIF--ILADEKGKRIYSHAPEIQILDNEKH 129
Query: 188 AEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWS 247
+ + R G Y M + + AGEWN +I+ + H++ + NG +T++ S
Sbjct: 130 NDRKKPNHRSGSLYDMITSPAESH--KKAGEWNQVRILLNKSHLQVWQNGIQTVDIVMHS 187
Query: 248 DDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
D+W + K++N +G+ +KG + LQDH WF+N+K+ EL
Sbjct: 188 DEWKELVGKSKFKNWKGFGMNKKGHLGLQDHNDVVWFKNLKVLEL 232
>gi|149278985|ref|ZP_01885119.1| hypothetical protein PBAL39_03869 [Pedobacter sp. BAL39]
gi|149230264|gb|EDM35649.1| hypothetical protein PBAL39_03869 [Pedobacter sp. BAL39]
Length = 230
Score = 132 bits (331), Expect = 7e-29, Method: Composition-based stats.
Identities = 75/216 (34%), Positives = 115/216 (53%), Gaps = 5/216 (2%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG-PWEVVDGAIQADGEGSDENGYIVYD 137
N + + G++ L DG+T GW Y GQA G W+V DGA D + G ++ D
Sbjct: 18 NTIQAQSVKKGFKALSDGKTTAGWHTY-GQATAGEKWKVEDGAFHLDPSVQNGGGDLITD 76
Query: 138 RIYENFELQWDWKISKGGNSGLLYHVVE-RPQYKVPYVTGPEYQLIDDKGFAEPLEDWQR 196
+ Y NF L +DWK++ NSG++++V E + +Y Y TG E Q+ID+ G + R
Sbjct: 77 KEYTNFHLIYDWKVAPNANSGVIFYVKEDKEKYHATYSTGLEMQVIDNDGHPDAKNVKHR 136
Query: 197 CGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNS 256
Y + ++ V+P GEWNT +I+ NG +E +NG ++ W+DD+
Sbjct: 137 AADLYDIIAS--SSEPVKPVGEWNTGEIISKNGKLELKLNGVTVVKTTLWNDDFKALLAK 194
Query: 257 GKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
K+ ++ + G I LQDHG W+RNI I+EL
Sbjct: 195 SKFATWKDFAAFKTGKIALQDHGDEVWYRNIMIKEL 230
>gi|86143589|ref|ZP_01061974.1| hypothetical protein MED217_13359 [Flavobacterium sp. MED217]
gi|85830036|gb|EAQ48497.1| hypothetical protein MED217_13359 [Leeuwenhoekiella blandensis
MED217]
Length = 458
Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats.
Identities = 88/224 (39%), Positives = 125/224 (55%), Gaps = 11/224 (4%)
Query: 78 QNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP---WEVVDG---AIQADGEGSDEN 131
+N LT+ E A+GWQLL+DG+T EGWR G+ P W + DG + GE S
Sbjct: 237 KNNLTQAEVASGWQLLWDGKTTEGWR--GGKLDHFPEKGWVIEDGELIVLSTGGEESAAG 294
Query: 132 GYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP- 190
G IV Y++FEL+ D+KI++G NSG+ Y+V G EYQ++DD +
Sbjct: 295 GDIVTTEQYQDFELKIDFKITEGANSGIKYYVDTELNKGEGSSIGLEYQILDDAHHPDAK 354
Query: 191 LEDWQRCGVDYAMYLPDFATMN--VRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
L + + ++Y A N V P GEWNT++IV N HVE+Y+N K +E++ SD
Sbjct: 355 LGNHEGSRTLASLYDLIQADPNKPVNPIGEWNTARIVSKNKHVEHYLNDVKVLEYDRGSD 414
Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+ Q K+++ P +G KG I LQDHG F+NIKI+ L
Sbjct: 415 AFLQLVEESKYKDWPGFGTFEKGNILLQDHGDRVAFKNIKIKVL 458
Score = 92.8 bits (229), Expect = 4e-17, Method: Composition-based stats.
Identities = 58/188 (30%), Positives = 99/188 (52%), Gaps = 14/188 (7%)
Query: 299 EELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
+ LFNG L GW + G + ++ +V + + +L T + Y DF + D+K +
Sbjct: 34 QSLFNGTSLDGWKQLNGKAAYRIEGDEIVGTTVANTPNSFLTTTQDYGDFIMELDYKVDP 93
Query: 358 DGNSGIFIRSFVE---EGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQIPDD--RENF 411
NSGI IRS +V+G+Q+E+ P + GIY+ RGWL + ++ +
Sbjct: 94 SMNSGIQIRSLSTPKFRNGRVHGYQIEIDPSERAWSAGIYDEARRGWLYSLENNPKAQQA 153
Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH-----DGGGIKVL 466
K+ EWN R+ +G+ + TW+NG + + D++ + G IALQ+H D G ++
Sbjct: 154 FKQNEWNHYRVEALGDTLKTWINGVEAAHLVDDQTAS--GFIALQVHAIGAEDEPGKEIR 211
Query: 467 WRNIRVKT 474
W+NI++ T
Sbjct: 212 WKNIKIIT 219
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 82/353 (23%), Positives = 146/353 (41%), Gaps = 74/353 (20%)
Query: 51 IVFMMKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL 110
IVF ++ L ++V T C + + T +E WQ LF+G +L+GW+ NG+A
Sbjct: 4 IVFKYLVLSALGLLVF----TGCKNESE---TADEP---WQSLFNGTSLDGWKQLNGKA- 52
Query: 111 TGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYK 170
+ + I + N ++ + Y +F ++ D+K+ NSG+ + P+++
Sbjct: 53 --AYRIEGDEIVGTTVANTPNSFLTTTQDYGDFIMELDYKVDPSMNSGIQIRSLSTPKFR 110
Query: 171 VPYVTGPEYQL-IDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNG 229
V G + ++ ++ ++ + D R G Y++ A + EWN ++
Sbjct: 111 NGRVHGYQIEIDPSERAWSAGIYDEARRGWLYSLENNPKAQQAFK-QNEWNHYRVEALGD 169
Query: 230 HVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDH--------GYP 281
++ ++NG +E DD G I LQ H G
Sbjct: 170 TLKTWING---VEAAHLVDDQ-----------------TASGFIALQVHAIGAEDEPGKE 209
Query: 282 AWFRNIK-IRELPRKTEEEE----------------------LFNGKDLTGW-----DVY 313
++NIK I E P++ ++ L++GK GW D +
Sbjct: 210 IRWKNIKIITENPQQYSKQMSLPAFNTKNNLTQAEVASGWQLLWDGKTTEGWRGGKLDHF 269
Query: 314 GTEQWYVQD-SLLVCESGPDKQY--GYLATCKYYNDFELTADFKQEADGNSGI 363
+ W ++D L+V +G ++ G + T + Y DFEL DFK NSGI
Sbjct: 270 PEKGWVIEDGELIVLSTGGEESAAGGDIVTTEQYQDFELKIDFKITEGANSGI 322
>gi|86143701|ref|ZP_01062077.1| probable secreted glycosyl hydrolase [Flavobacterium sp. MED217]
gi|85829744|gb|EAQ48206.1| probable secreted glycosyl hydrolase [Leeuwenhoekiella blandensis
MED217]
Length = 248
Score = 129 bits (323), Expect = 6e-28, Method: Composition-based stats.
Identities = 88/253 (34%), Positives = 128/253 (50%), Gaps = 23/253 (9%)
Query: 58 MNCLCVMVLFGA--LTACSEKPQNVLTEEEKAAG----------WQLLFDGQTLEGWRDY 105
M L V V F T+C EK + TEE A W++LFDG ++ W Y
Sbjct: 1 MKQLIVSVAFAMALFTSCKEKAEEANTEEVAVATTETTQPEDNEWEVLFDGTNIDKWHAY 60
Query: 106 NGQALTGPWEVVDGAI---QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYH 162
NG T WE+VD + A+ EN +V + Y +F L+ DW IS+GGNSG+++
Sbjct: 61 NGGDPT-QWEIVDDVLVFTPAENRNGSEN--LVTNEAYTSFVLKMDWMISEGGNSGIMWA 117
Query: 163 VVERPQYKVPYVTGPEYQLIDDKGFAEP-LEDWQRCGVDYAMYLPDFATMNVRPAGEWNT 221
V E P+Y PY TGPE Q++DD+ + R G Y M +N PAGEWN+
Sbjct: 118 VEEDPKYHEPYATGPEIQILDDERHPDTNAGPSHRSGALYDMIGAPEGVVN--PAGEWNS 175
Query: 222 SKIVFDNGHVE--YYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHG 279
+I D + +NG++ + F + W ++ K+ + + GLI LQDHG
Sbjct: 176 YEITIDYNTNQGIIVLNGEEVVTFPVKGEAWEDLVSNSKFASWEAFAKTDTGLIALQDHG 235
Query: 280 YPAWFRNIKIREL 292
+ F+NIKI++L
Sbjct: 236 HQVSFKNIKIKKL 248
>gi|154493991|ref|ZP_02033311.1| hypothetical protein PARMER_03336 [Parabacteroides merdae ATCC
43184]
gi|154086251|gb|EDN85296.1| hypothetical protein PARMER_03336 [Parabacteroides merdae ATCC
43184]
Length = 284
Score = 127 bits (320), Expect = 1e-27, Method: Composition-based stats.
Identities = 77/228 (33%), Positives = 119/228 (52%), Gaps = 19/228 (8%)
Query: 82 TEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY-----IVY 136
T + A G LFDG++ GWR Y + WEVVDG I G G+ E G +V+
Sbjct: 58 TFPKDADGKVTLFDGKSFNGWRGYGRTDVPAAWEVVDGTIHIKGSGAGEAGAKDGGDLVF 117
Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEPLEDW 194
++N+E +++WK+ KG NSG+LY +++ + + Y++ PEYQ++D+ A+ +D
Sbjct: 118 AHKFKNYEFEFEWKVGKGSNSGVLY-MIQEVEGQPSYISAPEYQVLDNANHPDAKLGKDG 176
Query: 195 QRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQ-- 252
R +P N +P GEWN KI+ G V +Y N + +E+ W+ W +
Sbjct: 177 NRQSASLYDMIPA-KPQNSKPFGEWNKGKIMCYKGTVVHYQNDEPVVEYHLWTQQWKEML 235
Query: 253 ---RKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
+ + KW A E G ++G I QDHG W+RNI I+EL
Sbjct: 236 DNSKFSKDKWPLAYELLLNCGGENKEGFIGFQDHGDDVWYRNITIKEL 283
>gi|150009463|ref|YP_001304206.1| hypothetical protein BDI_2876 [Parabacteroides distasonis ATCC
8503]
gi|149937887|gb|ABR44584.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 270
Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats.
Identities = 81/225 (36%), Positives = 116/225 (51%), Gaps = 28/225 (12%)
Query: 90 WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDEN---GYIVYDRIYENFELQ 146
W +FDG+TL GWR Y Q + W V DG+I G + + G ++YD+ ++NF +
Sbjct: 52 WITMFDGKTLNGWRGYCRQDVPLGWVVEDGSITYKGSDNKADTGFGDLIYDKKFKNFVFE 111
Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRC------GVD 200
+WKI K GNSG+ Y E + Y + PEYQL+D++ + W+ C G
Sbjct: 112 IEWKIDKAGNSGIFYTAQEIEGTPI-YYSSPEYQLLDNENMPDA---WEGCDGNRQAGAV 167
Query: 201 YAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDW---FQRKNSG 257
Y M +PD V+P G WN ++IV N V +YMN K +EF+ + W
Sbjct: 168 YDMIMPD--PQPVKPYGNWNKTRIVVYNQRVIHYMNDVKILEFQFGTPVWRALVDHSKFS 225
Query: 258 KWENAPE-----YGL-----ARKGLICLQDHGYPAWFRNIKIREL 292
K+ +PE Y L + G I +QDHGY FRNI+I+EL
Sbjct: 226 KFSTSPEKCPEAYDLMLQCGKQPGYIGMQDHGYGVCFRNIRIKEL 270
>gi|150007837|ref|YP_001302580.1| hypothetical protein BDI_1195 [Parabacteroides distasonis ATCC
8503]
gi|149936261|gb|ABR42958.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 269
Score = 125 bits (313), Expect = 8e-27, Method: Composition-based stats.
Identities = 88/228 (38%), Positives = 124/228 (54%), Gaps = 16/228 (7%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG-PWEVVDGAI---QADGEGSDENGYI 134
N LT++EKA GW LLFDG+T +GWR + A W V DG + ++DG S G I
Sbjct: 43 NQLTDQEKAEGWALLFDGKTTKGWRGAHKDAFPDHGWMVKDGELIVQKSDGSESTNGGDI 102
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEPLE 192
V + Y FE D+KI++G NSG+ Y V E+ + K G E+QL+DD A+
Sbjct: 103 VTEGEYSAFEFSVDFKITEGANSGIKYFVTEQEKQK-GSAYGLEFQLLDDAKHPDAKLYT 161
Query: 193 DWQRCGVDYAMY-LPDFATMNVRPAGEWNTSKI-VFDNGHVEYYMNGQKTIEFEAWSDDW 250
+ ++Y L ++ GEWNT+ + VF N HVE+++NG K +E+E S ++
Sbjct: 162 TFPGSRTLGSLYDLKKSENIHFNGVGEWNTAVVKVFPNNHVEHWLNGVKVLEYERGSKEF 221
Query: 251 FQRKNSGKWENAPEY------GLARKGLICLQDHGYPAWFRNIKIREL 292
K+ + P Y G A KG I LQDHG FRNIK++EL
Sbjct: 222 RDLVKGSKYAD-PSYNAGGAFGEAPKGHILLQDHGDEVAFRNIKVKEL 268
>gi|149196977|ref|ZP_01874030.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
gi|149140087|gb|EDM28487.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
Length = 223
Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats.
Identities = 79/236 (33%), Positives = 120/236 (50%), Gaps = 14/236 (5%)
Query: 58 MNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVV 117
M ++ LF +E N L++E+K+ GWQLLFDG+T GW +Y L W
Sbjct: 1 MRSFLILSLFSFSIFAAE--MNTLSDEQKSEGWQLLFDGKTTNGWVNYQSDKLNPLWVAE 58
Query: 118 DGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGP 177
DG ++ +G G++ +++FEL+ WKIS GGNSG+ V + P
Sbjct: 59 DGCLKLVKKGG---GHMHSTSSFKDFELKLQWKISAGGNSGVFLRVTPKSASG-----SP 110
Query: 178 EYQLIDDKGFAEPLEDWQRCGVDYAMYL-PDFATMNVRPAGEWNTSKIVFDNGHVEYYMN 236
E Q++D++ + G Y + P A V+ GEWN I+ H ++++N
Sbjct: 111 EMQVLDNEKNGNGKDPKTSAGALYGIIAAPKGA---VKAQGEWNQVHIIAKGKHYQFFLN 167
Query: 237 GQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
G KT +F+ S+++ + K+ GK YG +G I LQDHG FRNI I+EL
Sbjct: 168 GVKTADFDIDSEEFQKLKSQGKMAKKKTYGSNTEGHIGLQDHGKEVCFRNIMIKEL 223
>gi|88803973|ref|ZP_01119493.1| hypothetical protein RB2501_03965 [Robiginitalea biformata
HTCC2501]
gi|88784852|gb|EAR16021.1| hypothetical protein RB2501_03965 [Robiginitalea biformata
HTCC2501]
Length = 454
Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats.
Identities = 83/223 (37%), Positives = 125/223 (56%), Gaps = 9/223 (4%)
Query: 78 QNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG-PWEVVDGAIQA---DGEGSDENGY 133
+N LT+ EK GWQLL+DG++ EGW + WE+ DG + G S+ G
Sbjct: 232 KNNLTQAEKEDGWQLLWDGESTEGWHGARLEDFPDYGWEIEDGVLTVLASGGGESEAGGD 291
Query: 134 IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPL-- 191
IV D +Y +F+L+ D++I++G NSG+ Y+V G EYQ++DD+ +
Sbjct: 292 IVTDSLYGDFDLRVDFRITEGANSGIKYYVDTELNKGEGSAIGLEYQILDDERHPDAKLG 351
Query: 192 --EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDD 249
E + Y + D A V P G+WNT++I+ +GHVE+++NG K +E+E SD
Sbjct: 352 NHEGSRTMASLYDLIRADPAK-PVNPIGQWNTARILSRDGHVEHWLNGVKVLEYERGSDA 410
Query: 250 WFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+ Q + K++ P +G A +G I LQDHG FRNIKI+ L
Sbjct: 411 YRQLVSESKYKIWPGFGEAGRGHILLQDHGNRVSFRNIKIKTL 453
Score = 109 bits (272), Expect = 5e-22, Method: Composition-based stats.
Identities = 68/192 (35%), Positives = 112/192 (58%), Gaps = 16/192 (8%)
Query: 296 TEEEELFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK 354
T +ELFNG+DL+GW G E Y V+D +V + D ++AT + Y DF L ++
Sbjct: 25 TPWQELFNGEDLSGWTQLGGEAEYAVRDGAIVGTTVHDTPNSFMATEQLYEDFILELEYL 84
Query: 355 QEADGNSGIFIRSFVEE---GAKVNGWQVEVAP--KGFDTGGIYESYGRGWLIQIPD--D 407
++ NSGI +RS ++ +V+G+Q+E+ P +G+ + GIY+ RGWL+ + D D
Sbjct: 85 VDSTMNSGIQVRSNSQDYYMDGRVHGYQIEIDPSDRGW-SAGIYDEARRGWLVPVTDNPD 143
Query: 408 RENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----G 462
+ ++ +WN RI +G+ + TW+NG + D+K +G IALQ+H G G
Sbjct: 144 AQAAFRQGDWNHYRIEAIGDTLKTWINGVPAAHLIDDK--TSEGFIALQVHSIGDDAQAG 201
Query: 463 IKVLWRNIRVKT 474
+++WR+IR+ T
Sbjct: 202 TEIIWRDIRILT 213
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 115/461 (24%), Positives = 177/461 (38%), Gaps = 99/461 (21%)
Query: 85 EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFE 144
E WQ LF+G+ L GW G+A + V DGAI N ++ +++YE+F
Sbjct: 22 EDDTPWQELFNGEDLSGWTQLGGEA---EYAVRDGAIVGTTVHDTPNSFMATEQLYEDFI 78
Query: 145 LQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQL-IDDKGFAEPLEDWQRCGVDYAM 203
L+ ++ + NSG+ + Y V G + ++ D+G++ + D R G +
Sbjct: 79 LELEYLVDSTMNSGIQVRSNSQDYYMDGRVHGYQIEIDPSDRGWSAGIYDEARRGWLVPV 138
Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
A R G+WN +I ++ ++NG DD K S
Sbjct: 139 TDNPDAQAAFR-QGDWNHYRIEAIGDTLKTWINGVPAAHL---IDD----KTS------- 183
Query: 264 EYGLARKGLICLQDH--------GYPAWFRNIKI------RELPRKTEEEELFNGKDLTG 309
+G I LQ H G +R+I+I R R T E + +LT
Sbjct: 184 ------EGFIALQVHSIGDDAQAGTEIIWRDIRILTDSLARAYSRDTPLEPVVTKNNLTQ 237
Query: 310 ----------WDVYGTEQWY-------------VQDSLLVC---ESGPDKQYGYLATCKY 343
WD TE W+ ++D +L G + G + T
Sbjct: 238 AEKEDGWQLLWDGESTEGWHGARLEDFPDYGWEIEDGVLTVLASGGGESEAGGDIVTDSL 297
Query: 344 YNDFELTADFKQEADGNSGI--FIRSFVEEG-AKVNGWQVEV--------APKGFDTGGI 392
Y DF+L DF+ NSGI ++ + + +G G + ++ A G G
Sbjct: 298 YGDFDLRVDFRITEGANSGIKYYVDTELNKGEGSAIGLEYQILDDERHPDAKLGNHEGSR 357
Query: 393 YESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNG-------------EQMV 439
+ + P N + +WNT RI V WLNG Q+V
Sbjct: 358 TMASLYDLIRADPAKPVNPIG--QWNTARILSRDGHVEHWLNGVKVLEYERGSDAYRQLV 415
Query: 440 DIQDEKI-----GAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
KI AG+G I LQ H G +V +RNI++KTL
Sbjct: 416 SESKYKIWPGFGEAGRGHILLQDH---GNRVSFRNIKIKTL 453
>gi|88804319|ref|ZP_01119839.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
HTCC2501]
gi|88785198|gb|EAR16367.1| probable secreted glycosyl hydrolase [Robiginitalea biformata
HTCC2501]
Length = 250
Score = 121 bits (304), Expect = 8e-26, Method: Composition-based stats.
Identities = 84/246 (34%), Positives = 122/246 (49%), Gaps = 26/246 (10%)
Query: 67 FGALTACSEKPQN----------VLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEV 116
F A+ AC E + T E +A+ W +LFDG + +GW+ YN + + W +
Sbjct: 11 FFAVIACKENKETPSEEVAESAETATPENEASDWIVLFDGTSFDGWKGYNQEGVPDTWSI 70
Query: 117 VDGAI-----QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKV 171
+GA+ EG+ N +V + +ENF L +W+IS+GGNSG+ + V E Q+
Sbjct: 71 EEGAMVFTPPAERPEGASYN--LVTESKFENFVLSLEWQISEGGNSGVFWGVEELEQFGQ 128
Query: 172 PYVTGPEYQLIDDKGFAEPLE-DWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFD--- 227
PY TGPE Q++D++ + + G Y M P RP GEWNT +I D
Sbjct: 129 PYQTGPEIQVLDNEKHPDAKAGTTHQAGALYDMIAPSEDV--TRPVGEWNTMEITIDYAG 186
Query: 228 -NGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRN 286
G V MNG + + F +D W K++ +G +G I LQDHG FRN
Sbjct: 187 ETGKV--VMNGTELLTFPLGNDAWDAMVADSKFDGWEGFGQYHEGKIGLQDHGDRVAFRN 244
Query: 287 IKIREL 292
IKI+ L
Sbjct: 245 IKIKPL 250
>gi|150009244|ref|YP_001303987.1| hypothetical protein BDI_2646 [Parabacteroides distasonis ATCC
8503]
gi|149937668|gb|ABR44365.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 284
Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats.
Identities = 73/232 (31%), Positives = 123/232 (53%), Gaps = 20/232 (8%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY----- 133
++ T + G ++FDG+T GWR Y+ + G W + DGAI+ +G G+ E G
Sbjct: 54 DITTFPKDKEGRYVIFDGKTFNGWRGYDRADVPGAWTIEDGAIKINGSGAGEAGASNGGD 113
Query: 134 IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEPL 191
+++ NFEL+++WK+ KG NSG ++ +++ + + Y++ PEYQ++D++ A+
Sbjct: 114 LIFAHKLGNFELEFEWKVGKGSNSG-VFIMIQEVEGQPSYISAPEYQVLDNENHPDAKLG 172
Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNG-QKTIEFEAWSDDW 250
+D R +P N +P GEWN KI+ G V +Y+N + +E+ W+ W
Sbjct: 173 KDGNRKSSSLYDMIPA-KPQNAKPFGEWNKGKIMCYKGTVVHYLNSDEPVVEYHLWTPQW 231
Query: 251 FQ-----RKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
+ + + KW A E G ++G I QDHG WFRNI ++ L
Sbjct: 232 KEMLDNSKFSKDKWPLAYELLLNCGGANKEGFIGFQDHGDDVWFRNITVKVL 283
>gi|154492730|ref|ZP_02032356.1| hypothetical protein PARMER_02367 [Parabacteroides merdae ATCC
43184]
gi|154087035|gb|EDN86080.1| hypothetical protein PARMER_02367 [Parabacteroides merdae ATCC
43184]
Length = 459
Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats.
Identities = 88/228 (38%), Positives = 117/228 (51%), Gaps = 16/228 (7%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDGAIQADGEGSDE---NGYI 134
N LTE EKAAGW+LLFDG+T GWR + W++ +G + G E G I
Sbjct: 233 NTLTEAEKAAGWKLLFDGKTSNGWRGAGQETFPENGWKIENGELTVMKNGGPEGKRGGDI 292
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
+ + FEL +++K+++G NSG+ Y + E + K +V GPEYQ++DDK +
Sbjct: 293 LTVDEFGAFELSFEFKLTEGANSGMKYLIQESKKNK-GFVIGPEYQVLDDKQHPDAKLYT 351
Query: 195 QRCGVDYAMYLPDF---ATMNVRPAGEWNTSKI-VFDNGHVEYYMNGQKTIEFEAWSDDW 250
G L D G+WN I VF N HVE++MNG KT+E++ W+ D
Sbjct: 352 TYPGSRTVSSLYDIIPAKNKRFNGVGQWNKGVIKVFPNKHVEHWMNGFKTVEYD-WASDA 410
Query: 251 FQRKNSGKWENAPEY------GLARKGLICLQDHGYPAWFRNIKIREL 292
F G EY G A KG I LQDH FRNIKIREL
Sbjct: 411 FLEVVKGSKFAKKEYAEFGPFGTAEKGHILLQDHWDEVSFRNIKIREL 458
Score = 92.8 bits (229), Expect = 4e-17, Method: Composition-based stats.
Identities = 61/189 (32%), Positives = 101/189 (53%), Gaps = 15/189 (7%)
Query: 299 EELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
++LFNGKD TG+ + G + V++ +V ++ + ++AT + Y DF L + K
Sbjct: 26 QQLFNGKDFTGFKQLNGKAPYRVENGCMVGQTVDKEPNSFMATEQTYGDFILEFEVKCHP 85
Query: 358 DGNSGIFIRSFVE---EGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQIPDDREN--F 411
D NSG+ RS + +V+G+Q E+ P +GG+Y+ RGWL + ++
Sbjct: 86 DLNSGVQFRSESKPDYNNGRVHGYQCEIDPSDRAWSGGLYDEARRGWLAPLTNNEAGRAA 145
Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH------DGGGIKV 465
K+ +WN RI +GN + WLNG ++ D+ +G IA Q+H + G ++
Sbjct: 146 YKKDDWNKYRIEAIGNSIRIWLNGVNTSNVVDDM--TPEGFIAFQVHGIFGKTENVGKEI 203
Query: 466 LWRNIRVKT 474
WRNIR+KT
Sbjct: 204 WWRNIRIKT 212
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 95/406 (23%), Positives = 160/406 (39%), Gaps = 80/406 (19%)
Query: 89 GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWD 148
GWQ LF+G+ G++ NG+A P+ V +G + + N ++ ++ Y +F L+++
Sbjct: 24 GWQQLFNGKDFTGFKQLNGKA---PYRVENGCMVGQTVDKEPNSFMATEQTYGDFILEFE 80
Query: 149 WKISKGGNSGLLYHVVERPQYKVPYVTGPEYQL-IDDKGFAEPLEDWQRCGVDYAMYLPD 207
K NSG+ + +P Y V G + ++ D+ ++ L D R G A +
Sbjct: 81 VKCHPDLNSGVQFRSESKPDYNNGRVHGYQCEIDPSDRAWSGGLYDEARRGW-LAPLTNN 139
Query: 208 FATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGL 267
A +WN +I + ++NG T DD +
Sbjct: 140 EAGRAAYKKDDWNKYRIEAIGNSIRIWLNGVNTSNV---VDD-----------------M 179
Query: 268 ARKGLICLQDHGY---------PAWFRNIKIRE------------------LPRKTEEEE 300
+G I Q HG W+RNI+I+ +P E E
Sbjct: 180 TPEGFIAFQVHGIFGKTENVGKEIWWRNIRIKTENLEAERMQGPLAPEVNCIPNTLTEAE 239
Query: 301 -------LFNGKDLTGWDVYGTEQ-----WYVQDSLLVC--ESGPD-KQYGYLATCKYYN 345
LF+GK GW G E W +++ L GP+ K+ G + T +
Sbjct: 240 KAAGWKLLFDGKTSNGWRGAGQETFPENGWKIENGELTVMKNGGPEGKRGGDILTVDEFG 299
Query: 346 DFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFD-----TGGIYESYGRGW 400
FEL+ +FK NSG ++ ++E K G+ + + D +Y +Y
Sbjct: 300 AFELSFEFKLTEGANSG--MKYLIQESKKNKGFVIGPEYQVLDDKQHPDAKLYTTYPGSR 357
Query: 401 LIQ-----IPDDRENFLKEREWNTMRIRVVGNQ-VTTWLNGEQMVD 440
+ IP + F +WN I+V N+ V W+NG + V+
Sbjct: 358 TVSSLYDIIPAKNKRFNGVGQWNKGVIKVFPNKHVEHWMNGFKTVE 403
>gi|149276529|ref|ZP_01882673.1| hypothetical protein PBAL39_02377 [Pedobacter sp. BAL39]
gi|149233049|gb|EDM38424.1| hypothetical protein PBAL39_02377 [Pedobacter sp. BAL39]
Length = 452
Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats.
Identities = 80/224 (35%), Positives = 121/224 (54%), Gaps = 13/224 (5%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRD-YNGQALTGPWEVVDG---AIQADGEGSDENGYI 134
N L+ +EKA G+ LL+DG+T EGWR Y W + DG +++DG S G I
Sbjct: 231 NDLSAQEKAEGYSLLWDGRTTEGWRGAYKSTFPESGWLIKDGELSVVKSDGSESTHGGDI 290
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
V ++ Y FEL++D+K++ G NSG+ Y V K + GPEYQ++DD+ P
Sbjct: 291 VTEKQYGAFELKFDFKLTPGANSGVKYFVTLTEGNKGSAI-GPEYQVLDDE--RHPDAKL 347
Query: 195 QRCGVDYAMYLPDFATMN-----VRPAGEWNTSKI-VFDNGHVEYYMNGQKTIEFEAWSD 248
+ G L D T R GEWN I VF + +EY++NG K +E+E +
Sbjct: 348 GKNGNRTLGSLYDLMTSKKIPNAQRKIGEWNRGLIRVFPDNKIEYWLNGYKILEYERGTP 407
Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
++ K+++ +G+A KG + LQDHG +FR++KI+ L
Sbjct: 408 EFTALVAGSKYKDWNNFGMAEKGHVLLQDHGDQVFFRSLKIKTL 451
Score = 101 bits (252), Expect = 9e-20, Method: Composition-based stats.
Identities = 70/196 (35%), Positives = 105/196 (53%), Gaps = 16/196 (8%)
Query: 292 LPRKTEEEELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELT 350
L + + ++LFNGKDL+GW + G ++ V++ ++ + + +L T Y DF L
Sbjct: 20 LLKAQQWQQLFNGKDLSGWKQLNGKAKYEVRNGEIIGTTVSAEPNSFLCTDVDYGDFILE 79
Query: 351 ADFKQEADGNSGIFIRSFVE---EGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQI-- 404
+ + NSGI IRS + E +V+G+QVEV P +GGIY+ RGWL +
Sbjct: 80 VELMADPSMNSGIQIRSESKSDYENGRVHGYQVEVDPSDRQFSGGIYDEARRGWLYPMDI 139
Query: 405 -PDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-- 461
P + F K WN RI +GN + TW+NG ++ D AG IALQ+H G
Sbjct: 140 NPKGKLAF-KNGSWNKYRIECIGNSIRTWVNGVPAANVVDNMTPAG--FIALQVHSIGKD 196
Query: 462 ---GIKVLWRNIRVKT 474
G ++ WRNIR++T
Sbjct: 197 EIAGKQIRWRNIRIQT 212
>gi|29347567|ref|NP_811070.1| hypothetical protein BT_2157 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29339468|gb|AAO77264.1| probable secreted glycosyl hydrolase [Bacteroides thetaiotaomicron
VPI-5482]
Length = 290
Score = 118 bits (296), Expect = 8e-25, Method: Composition-based stats.
Identities = 74/231 (32%), Positives = 114/231 (49%), Gaps = 26/231 (11%)
Query: 87 AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYE 141
A G+ +FDG+T GWR Y + W + DG I+ +G G E G +++ ++
Sbjct: 60 ADGYITIFDGKTFNGWRGYGKDRVPSKWTIEDGCIKFNGSGGGEAQDGDGGDLIFAHKFK 119
Query: 142 NFELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPL 191
NFEL+ +WK+SKGGNSG+ Y E + Y++ PEYQ++D+ A+
Sbjct: 120 NFELEMEWKVSKGGNSGIFYLAQEVTSKDKDGNDVLEPIYISAPEYQVLDNDNHPDAKLG 179
Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
+D R +P N +P GEWN +KI+ G V + N + +E+ W+ W
Sbjct: 180 KDNNRQSASLYDMIPA-VPQNAKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWT 238
Query: 252 -----QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
+ + KW A E G +G I +QDHG WFRNI+++ L
Sbjct: 239 DLLQASKFSQDKWPLAFELLNNCGGENHEGFIGMQDHGDDVWFRNIRVKVL 289
>gi|156110542|gb|EDO12287.1| hypothetical protein BACOVA_02177 [Bacteroides ovatus ATCC 8483]
Length = 451
Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats.
Identities = 75/222 (33%), Positives = 117/222 (52%), Gaps = 11/222 (4%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP-WEVVDG---AIQADGEGSDENGYI 134
N ++ E GW LL+DG+T +GWR W++ DG +++ G S G I
Sbjct: 232 NTISPNEAKEGWTLLWDGKTTDGWRGAKLSTFPAKGWKIEDGILKVMKSGGAESANGGDI 291
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP---L 191
V R Y+NF L+ D+KI++G NSG+ Y V G E+Q++DD + +
Sbjct: 292 VTTRKYKNFILKVDFKITEGANSGIKYFVNPDMNKGAGSAIGCEFQILDDDKHPDAKLGV 351
Query: 192 EDWQRCGVDYAMY-LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDW 250
+ ++ G Y + P N + E+NT+ I+ HVE+++NG K IE++ +D W
Sbjct: 352 KGNRKLGSLYDLIPAPKNKPFNKK---EFNTATIIVKGNHVEHWLNGVKLIEYDRNNDMW 408
Query: 251 FQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
K++N P +G +G I LQDHG WF+N+KI+EL
Sbjct: 409 NALVAYSKYKNWPNFGNPEEGNILLQDHGDEVWFKNVKIKEL 450
Score = 97.1 bits (240), Expect = 2e-18, Method: Composition-based stats.
Identities = 72/189 (38%), Positives = 99/189 (52%), Gaps = 16/189 (8%)
Query: 299 EELFNGKDLTGWD-VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
E LFNGK+L GW + G ++ + D +V S +LAT K Y DF L DFK +
Sbjct: 25 EPLFNGKNLKGWKKLNGKAEYKIVDGAIVGVSKMGTPNTFLATTKNYGDFILEFDFKVDD 84
Query: 358 DGNSGIFIRSFVEEGAK---VNGWQVEVAP-KGFDTGGIYESYGRGWLIQI---PDDREN 410
NSG+ +RS ++ K V+G+Q E+ P K +GGIY+ R WL + P +
Sbjct: 85 GLNSGVQLRSESKKDYKKGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTLNPSAKTA 144
Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----GIKV 465
F K WN RI VGN + TW+NG +I D+ G IALQ+H G G V
Sbjct: 145 F-KNNAWNKARIEAVGNSIRTWINGVPCANIWDDMTPV--GFIALQVHAIGNAADEGKTV 201
Query: 466 LWRNIRVKT 474
W++IR+ T
Sbjct: 202 SWKDIRICT 210
Score = 79.7 bits (195), Expect = 4e-13, Method: Composition-based stats.
Identities = 110/457 (24%), Positives = 186/457 (40%), Gaps = 95/457 (20%)
Query: 87 AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQ 146
A W+ LF+G+ L+GW+ NG+A +++VDGAI + N ++ + Y +F L+
Sbjct: 21 AQNWEPLFNGKNLKGWKKLNGKA---EYKIVDGAIVGVSKMGTPNTFLATTKNYGDFILE 77
Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDK-GFAEPLEDWQRCGVDYAMYL 205
+D+K+ G NSG+ + YK V G ++++ K ++ + D R Y + L
Sbjct: 78 FDFKVDDGLNSGVQLRSESKKDYKKGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTL 137
Query: 206 PDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEY 265
A + WN ++I + ++NG W D
Sbjct: 138 NPSAKTAFK-NNAWNKARIEAVGNSIRTWINGVPCANI--WDD----------------- 177
Query: 266 GLARKGLICLQ--------DHGYPAWFRNIKI-------RELPRKTEEEE---------- 300
+ G I LQ D G +++I+I + P E
Sbjct: 178 -MTPVGFIALQVHAIGNAADEGKTVSWKDIRICTTDVERYQTPEAQAAPEVNLIANTISP 236
Query: 301 ---------LFNGKDLTGW-----DVYGTEQWYVQDSLL-VCESGPDKQY--GYLATCKY 343
L++GK GW + + W ++D +L V +SG + G + T +
Sbjct: 237 NEAKEGWTLLWDGKTTDGWRGAKLSTFPAKGWKIEDGILKVMKSGGAESANGGDIVTTRK 296
Query: 344 YNDFELTADFKQEADGNSGI--FIRSFVEEGAKVN---GWQVEVAPKGFDTG-GIYESYG 397
Y +F L DFK NSGI F+ + +GA +Q+ K D G+ +
Sbjct: 297 YKNFILKVDFKITEGANSGIKYFVNPDMNKGAGSAIGCEFQILDDDKHPDAKLGVKGNRK 356
Query: 398 RGWLIQ-IPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMV------DIQDEKIGAG- 449
G L IP + ++E+NT I V GN V WLNG +++ D+ + +
Sbjct: 357 LGSLYDLIPAPKNKPFNKKEFNTATIIVKGNHVEHWLNGVKLIEYDRNNDMWNALVAYSK 416
Query: 450 -----------QGRIALQIHDGGGIKVLWRNIRVKTL 475
+G I LQ H G +V ++N+++K L
Sbjct: 417 YKNWPNFGNPEEGNILLQDH---GDEVWFKNVKIKEL 450
>gi|86131181|ref|ZP_01049780.1| hypothetical protein MED134_09676 [Cellulophaga sp. MED134]
gi|85818592|gb|EAQ39752.1| hypothetical protein MED134_09676 [Dokdonia donghaensis MED134]
Length = 244
Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats.
Identities = 76/224 (33%), Positives = 119/224 (53%), Gaps = 9/224 (4%)
Query: 74 SEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY 133
+++ + V T + + +LFDG + + W+ Y + W + DGA+ S+E G
Sbjct: 25 TQEVEEVETVTQASTTEIVLFDGSSFDAWKGYGTDGMHENWTIEDGAMAF--TPSEEGGK 82
Query: 134 -IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF--AEP 190
I+ Y+NFEL +WK+S+GGNSG+ + V E P++K Y TGPE Q++DD+ A+
Sbjct: 83 NIITKNTYKNFELNLEWKVSEGGNSGIFWGVKESPEFKEAYETGPEIQVLDDERHPDAKV 142
Query: 191 LEDWQRCGVDYAMYLPDFATMNVRPAGEWN--TSKIVFDNGHVEYYMNGQKTIEFEAWSD 248
+ G Y M P +N PAGEWN T I D+ + +NG++ F +
Sbjct: 143 ANGTHKAGSLYDMIKPADGMIN--PAGEWNKVTLYINHDSNLGKVSLNGKEAYTFPVNGE 200
Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+W K+ + +G ++G I LQDHG W+RNI I+EL
Sbjct: 201 EWDAMVAKTKFADWKGFGKYQEGHIGLQDHGDKVWYRNITIKEL 244
>gi|149177844|ref|ZP_01856443.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
gi|148843334|gb|EDL57698.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
Length = 432
Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats.
Identities = 111/412 (26%), Positives = 175/412 (42%), Gaps = 56/412 (13%)
Query: 77 PQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVY 136
P+ LTEEE AAGW LFDG +L GW+ N W V +G I+AD E G ++
Sbjct: 59 PETGLTEEEIAAGWIALFDGHSLSGWKPNNDVN----WHVDEGVIKAD---KGEPGLLLT 111
Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQR 196
+ ++EL++++K++ NSG+ P P E L D K R
Sbjct: 112 TSPFADYELKFEFKLTPETNSGIFLRTTFNPTD--PSKDCYELNLCDQKTEFPTGSLVAR 169
Query: 197 CGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNS 256
+D + + + EW T ++ + +++ +NG++ + F
Sbjct: 170 SKIDKPLPV----------SSEWQTCEVNLEGSNIKAIINGKEVLNF------------- 206
Query: 257 GKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEELFNGKDLTGWDVYGTE 316
N L + G I LQ + FR I ++ L T +FNG DL GW+V
Sbjct: 207 ----NDTSKNLRKTGFIGLQKNEGAIEFRKIYLKPLRMST----IFNGVDLAGWNVVPGS 258
Query: 317 QWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG-NSGIFIRSFV-EEGAK 374
Q + +KQ GYL T + + DF A K D NSG F R+ E
Sbjct: 259 QSTFEVVDGTIHVTAEKQ-GYLETEEIWGDFLFQATAKSNGDSLNSGYFFRAIKGSEKGM 317
Query: 375 VNGWQVEVAPKGFDTGGIY--ESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTW 432
NG++V++ G G E+ G G + + + R + EW T + G + W
Sbjct: 318 ANGYEVQIH-NGIKEGDRTKPENAGTGAIFRRTEARRVVANDHEWFTTTLSASGPHIAVW 376
Query: 433 LNGEQMVDIQDEK---------IGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
+NG Q+ D D + + G I+LQ HD + +++++V TL
Sbjct: 377 INGYQVTDWTDTRKPDENPRKGLRLEAGHISLQGHD-PTTDLNFKDLKVSTL 427
>gi|29349855|ref|NP_813358.1| hypothetical protein BT_4447 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341766|gb|AAO79552.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 240
Score = 117 bits (294), Expect = 1e-24, Method: Composition-based stats.
Identities = 75/231 (32%), Positives = 115/231 (49%), Gaps = 26/231 (11%)
Query: 87 AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYE 141
A G+ +FDG+T GWR Y + W + DG I+ +G GS E G +++ ++
Sbjct: 10 ADGYITIFDGKTFNGWRGYGKDRVPSKWTIEDGCIKFNGSGSGEAQNGDGGDLIFAHKFK 69
Query: 142 NFELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPL 191
NFEL+ +WK+SKGGNSG+ Y E + Y++ PEYQ++D+ A+
Sbjct: 70 NFELEMEWKVSKGGNSGIFYLAQEVTSKDKDGNDVLEPIYISAPEYQVLDNDNHPDAKLG 129
Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
+D R +P N +P GEWN +KI+ G V + N + +E+ W+ W
Sbjct: 130 KDNNRQSASLYDMIPA-VPQNAKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWT 188
Query: 252 -----QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
+ + KW A E G +G I +QDHG WFRNI+++ L
Sbjct: 189 DLLQASKFSQDKWPLAFELLNNCGGENHEGFIGMQDHGDDVWFRNIRVKVL 239
>gi|126645050|ref|ZP_01717594.1| hypothetical protein ALPR1_10430 [Algoriphagus sp. PR1]
gi|126578461|gb|EAZ82625.1| hypothetical protein ALPR1_10430 [Algoriphagus sp. PR1]
Length = 461
Score = 117 bits (293), Expect = 2e-24, Method: Composition-based stats.
Identities = 78/224 (34%), Positives = 121/224 (54%), Gaps = 13/224 (5%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRD-YNGQALTGPWEVVDGAI---QADGEGSDENGYI 134
N LT+ EK GW+LLF+GQ EGW+ Y + W V DG + ++DG S G I
Sbjct: 240 NELTDYEKNTGWKLLFNGQNSEGWKGAYKDEFPDFGWSVNDGILTIAESDGGESTNAGDI 299
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
V + F+L +++++++G NSGL Y V K + G EYQ++DD+ P
Sbjct: 300 VTKEEFSAFDLGFEFRLTEGANSGLKYFVTLSEGNKGSAI-GLEYQILDDE--KHPDAKM 356
Query: 195 QRCGVDYAMYLPDFATMN-----VRPAGEWNTSKIVFD-NGHVEYYMNGQKTIEFEAWSD 248
+ G L D T + P GEWN ++V + N HV +Y+NG K +E++ S+
Sbjct: 357 GKEGNRTLSSLYDLITAQKQGRFINPIGEWNKGRVVVEPNNHVTHYLNGLKVLEYDRGSE 416
Query: 249 DWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
++ + + K++ P +G A +G I LQDHG F+NIK++ L
Sbjct: 417 EFRELVANSKYKIWPNFGEAEQGHILLQDHGNRVSFKNIKLKSL 460
Score = 88.6 bits (218), Expect = 9e-16, Method: Composition-based stats.
Identities = 66/191 (34%), Positives = 101/191 (52%), Gaps = 19/191 (9%)
Query: 300 ELFNGKDLTGWD-VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQE-A 357
+LFNGKDL+GW V GT + V D ++V + +L T + Y D+ L D K E
Sbjct: 33 DLFNGKDLSGWKAVAGTANFEVVDGVIVGSAVAGSPNTFLITEETYGDYILELDLKVENL 92
Query: 358 DGNSGIFIRSFVEEGAK-----VNGWQVEVAPKGFD-TGGIYESYGRGWLIQI---PDDR 408
NSGI R + A+ V G+QVE P + GIY+ RGWL + P +
Sbjct: 93 TSNSGIMARGQFDPAARDGNGLVYGYQVEADPSERAWSAGIYDEARRGWLYPLDLNPAAK 152
Query: 409 ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH-----DGGGI 463
F K E+N RI V+G+++ TWLNG+++ + D+ +G + LQ+H + G
Sbjct: 153 TAF-KMGEFNHYRIEVIGDEIKTWLNGQEVAYVVDDM--DSKGFVGLQVHSIRNPEDEGN 209
Query: 464 KVLWRNIRVKT 474
K ++N+++KT
Sbjct: 210 KTYFKNVKIKT 220
Score = 65.1 bits (157), Expect = 9e-09, Method: Composition-based stats.
Identities = 108/450 (24%), Positives = 178/450 (39%), Gaps = 82/450 (18%)
Query: 89 GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWD 148
GW LF+G+ L GW+ G T +EVVDG I N +++ + Y ++ L+ D
Sbjct: 30 GWVDLFNGKDLSGWKAVAG---TANFEVVDGVIVGSAVAGSPNTFLITEETYGDYILELD 86
Query: 149 WKISK-GGNSGLLYHVVERPQYKVPYVTGPEYQLIDD---KGFAEPLEDWQRCGVDYAMY 204
K+ NSG++ P + YQ+ D + ++ + D R G Y +
Sbjct: 87 LKVENLTSNSGIMARGQFDPAARDGNGLVYGYQVEADPSERAWSAGIYDEARRGWLYPLD 146
Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPE 264
L A + GE+N +I ++ ++NGQ+ A+ D K
Sbjct: 147 LNPAAKTAFK-MGEFNHYRIEVIGDEIKTWLNGQEV----AYVVDDMDSKGF-------- 193
Query: 265 YGLARKGLICLQDHGYPAWFRNIKIR------------------------ELPRKTEEEE 300
GL + +D G +F+N+KI+ + + T +
Sbjct: 194 VGLQVHSIRNPEDEGNKTYFKNVKIKTTNLDPKPFSSSIYVVNNRLNELTDYEKNTGWKL 253
Query: 301 LFNGKDLTGW-----DVYGTEQWYVQDSLLV---CESGPDKQYGYLATCKYYNDFELTAD 352
LFNG++ GW D + W V D +L + G G + T + ++ F+L +
Sbjct: 254 LFNGQNSEGWKGAYKDEFPDFGWSVNDGILTIAESDGGESTNAGDIVTKEEFSAFDLGFE 313
Query: 353 FKQEADGNSGIFIRSFVEEGAKVN--GWQVEVAPKGFDTGGIYESYGRGWLIQIPD---- 406
F+ NSG+ + EG K + G + ++ G L + D
Sbjct: 314 FRLTEGANSGLKYFVTLSEGNKGSAIGLEYQILDDEKHPDAKMGKEGNRTLSSLYDLITA 373
Query: 407 -DRENFLKE-REWNTMRIRV-VGNQVTTWLNG-------------EQMVDIQDEKI---- 446
+ F+ EWN R+ V N VT +LNG ++V KI
Sbjct: 374 QKQGRFINPIGEWNKGRVVVEPNNHVTHYLNGLKVLEYDRGSEEFRELVANSKYKIWPNF 433
Query: 447 -GAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
A QG I LQ H G +V ++NI++K+L
Sbjct: 434 GEAEQGHILLQDH---GNRVSFKNIKLKSL 460
>gi|153808737|ref|ZP_01961405.1| hypothetical protein BACCAC_03036 [Bacteroides caccae ATCC 43185]
gi|149128563|gb|EDM19781.1| hypothetical protein BACCAC_03036 [Bacteroides caccae ATCC 43185]
Length = 287
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 73/229 (31%), Positives = 114/229 (49%), Gaps = 26/229 (11%)
Query: 89 GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYENF 143
G+ +FDG+T +GWR Y + W + DG I+ +G G E G +++ ++NF
Sbjct: 59 GYITIFDGKTFDGWRGYGKDKVPAKWTIEDGCIKFNGTGGGEAQDADGGDLIFAHKFKNF 118
Query: 144 ELQWDWKISKGGNSGLLYHVVE--------RPQYKVPYVTGPEYQLIDDKGF--AEPLED 193
EL+ +WK++KG NSG+LY E + Y++ PEYQ++D+ A+ +D
Sbjct: 119 ELELEWKVAKGSNSGILYLAQEITSKDKDGNDVLEPIYISAPEYQILDNANHPDAKLGKD 178
Query: 194 WQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF-- 251
R +P N +P GEWN +KI+ G V + N + +E+ W+ W
Sbjct: 179 NNRQSASLYDMIPA-VPQNSKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWTDM 237
Query: 252 ---QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
+ + KW A E G +G I LQDHG WFRNI+++ L
Sbjct: 238 LQASKFSEEKWPLAFELLNNCGGDNHEGFIGLQDHGDDVWFRNIRVKVL 286
>gi|88712851|ref|ZP_01106936.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
HTCC2170]
gi|88708749|gb|EAR00984.1| probable secreted glycosyl hydrolase [Flavobacteriales bacterium
HTCC2170]
Length = 232
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 75/214 (35%), Positives = 112/214 (52%), Gaps = 14/214 (6%)
Query: 89 GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQAD--GEGSDENGYIVYDRIYENFELQ 146
G+ LF+G++L+GW Y + W +G + D + S EN +V D+ Y N+EL
Sbjct: 24 GFTDLFNGKSLDGWHSYGKDEINDGWYADNGELIFDFQRDKSGENSNLVTDKQYTNYELS 83
Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD--KGFAEPLEDWQRCGVDYAMY 204
+WKI GNSG+ + V+E +++ PY+TGPE Q++DD + + E D R G Y +
Sbjct: 84 IEWKIYPHGNSGIFWGVIESEEFEQPYMTGPEIQILDDGWEAYIEERGDINRAGSLYGLI 143
Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYM--NGQKTIEFEAWSDDW---FQRKNSGKW 259
P N PA EWN I D+ E ++ NG + + F +W + KW
Sbjct: 144 PPSSIVSN--PAEEWNHYLIHIDHKENEGFVVFNGTEVVRFPVHGPEWKAMIAKSGFAKW 201
Query: 260 ENAPEYGLARKGLICLQDHGYPAWFRNIKIRELP 293
+ +G A+ G I LQ+ G FRNIKI+ELP
Sbjct: 202 SS---FGTAKTGHISLQEWGGKVAFRNIKIKELP 232
>gi|156112295|gb|EDO14040.1| hypothetical protein BACOVA_00235 [Bacteroides ovatus ATCC 8483]
Length = 244
Score = 116 bits (290), Expect = 3e-24, Method: Composition-based stats.
Identities = 76/221 (34%), Positives = 116/221 (52%), Gaps = 9/221 (4%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDGAI---QADGEGSDENGYI 134
N LT +EK GW+LL+DG+T GWR T W+++ + ++ GE S G I
Sbjct: 25 NTLTNQEKNEGWKLLWDGKTTNGWRGARISTFPTKGWKIIGNDLVVEKSKGEESGNGGDI 84
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDW 194
V + Y++FEL D+KI++G NSG+ Y V G E+Q++DD+ P
Sbjct: 85 VTIKTYKSFELVADFKITEGANSGIKYFVDPDLNKGKGSAIGCEFQILDDE--KHPDAKA 142
Query: 195 QRCGVDYAMYLPDF---ATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
R G L D + + E+NT++I+ HVE+++NG K +E+E + W
Sbjct: 143 GRKGNRTVGSLYDLIPAGSNKLFKKNEFNTARIIVKGNHVEHWLNGIKVVEYERNNQMWK 202
Query: 252 QRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
K+ + P +G ++G I LQDHG F+NIKI+EL
Sbjct: 203 ALVAGSKYADWPNFGEGKEGHILLQDHGDEVHFKNIKIKEL 243
Score = 55.8 bits (133), Expect = 7e-06, Method: Composition-based stats.
Identities = 63/216 (29%), Positives = 92/216 (42%), Gaps = 52/216 (24%)
Query: 301 LFNGKDLTGW-----DVYGTEQWYVQDSLLVCESGPDKQYGY---LATCKYYNDFELTAD 352
L++GK GW + T+ W + + LV E ++ G + T K Y FEL AD
Sbjct: 39 LWDGKTTNGWRGARISTFPTKGWKIIGNDLVVEKSKGEESGNGGDIVTIKTYKSFELVAD 98
Query: 353 FKQEADGNSGI--FIRSFVEEG-AKVNGWQVEV-----------APKGFDT-GGIYESYG 397
FK NSGI F+ + +G G + ++ KG T G +Y+
Sbjct: 99 FKITEGANSGIKYFVDPDLNKGKGSAIGCEFQILDDEKHPDAKAGRKGNRTVGSLYD--- 155
Query: 398 RGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDE------------- 444
IP K+ E+NT RI V GN V WLNG ++V+ +
Sbjct: 156 -----LIPAGSNKLFKKNEFNTARIIVKGNHVEHWLNGIKVVEYERNNQMWKALVAGSKY 210
Query: 445 ----KIGAG-QGRIALQIHDGGGIKVLWRNIRVKTL 475
G G +G I LQ H G +V ++NI++K L
Sbjct: 211 ADWPNFGEGKEGHILLQDH---GDEVHFKNIKIKEL 243
>gi|156109541|gb|EDO11286.1| hypothetical protein BACOVA_03189 [Bacteroides ovatus ATCC 8483]
Length = 288
Score = 116 bits (290), Expect = 3e-24, Method: Composition-based stats.
Identities = 73/229 (31%), Positives = 113/229 (49%), Gaps = 26/229 (11%)
Query: 89 GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDE-----NGYIVYDRIYENF 143
G+ +FDG+T GWR Y + W + DG I+ +G G E G +++ ++NF
Sbjct: 60 GYITIFDGETFNGWRGYGKDRVPTKWTIEDGCIKFNGSGGGEAQDGDGGDLIFAHKFKNF 119
Query: 144 ELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPLED 193
EL+ +WK++KG NSG+LY E + Y++ PEYQ++D+ A+ +D
Sbjct: 120 ELELEWKVAKGSNSGILYLAQEVTSKDKDGNDVLEPIYISAPEYQILDNANHPDAKLGKD 179
Query: 194 WQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF-- 251
R +P N +P GEWN +KI+ G V + N + +E+ W+ W
Sbjct: 180 NNRQSASLYDMIPA-VPQNSKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWTDM 238
Query: 252 ---QRKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
+ + KW A E G +G I LQDHG WFRNI+++ L
Sbjct: 239 LQASKFSEDKWPLAFELLNNCGGENHEGFIGLQDHGDDVWFRNIRVKVL 287
>gi|29348878|ref|NP_812381.1| hypothetical protein BT_3469 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340784|gb|AAO78575.1| probable secreted glycosyl hydrolase [Bacteroides thetaiotaomicron
VPI-5482]
Length = 449
Score = 113 bits (283), Expect = 2e-23, Method: Composition-based stats.
Identities = 75/222 (33%), Positives = 118/222 (53%), Gaps = 11/222 (4%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP-WEVVDG---AIQADGEGSDENGYI 134
N ++ E GW LL+DG+T GWR A W++ DG +++ G S G I
Sbjct: 230 NTISPREAKEGWALLWDGKTNNGWRGAKLNAFPEKGWKMEDGILKVMKSGGAESANGGDI 289
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP---L 191
V R Y+NF L D+KI++G NSG+ Y V G E+Q++DD + +
Sbjct: 290 VTTRKYKNFILTVDFKITEGANSGVKYFVNPDLNKGEGSAIGCEFQILDDDKHPDAKLGV 349
Query: 192 EDWQRCGVDYAMY-LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDW 250
+ ++ G Y + P+ N + ++NT+ I+ + HVE+++NG K IE+ +D W
Sbjct: 350 KGNRKLGSLYDLIPAPEKKPFNKK---DFNTATIIVQDNHVEHWLNGVKLIEYTRNTDMW 406
Query: 251 FQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
K++N P +G + +G I LQDHG WF+N+KI+EL
Sbjct: 407 NALVAYSKYKNWPNFGNSAEGNILLQDHGDEVWFKNVKIKEL 448
Score = 95.9 bits (237), Expect = 5e-18, Method: Composition-based stats.
Identities = 70/189 (37%), Positives = 100/189 (52%), Gaps = 16/189 (8%)
Query: 299 EELFNGKDLTGWD-VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
E LFNGK+L GW + G ++ + D +V S +LAT K Y DF L DFK +
Sbjct: 23 EPLFNGKNLKGWKKLNGKAEYKIVDGAIVGISKMGTPNTFLATTKNYGDFILEFDFKIDD 82
Query: 358 DGNSGIFIRSFVE---EGAKVNGWQVEVAP-KGFDTGGIYESYGRGWLIQI---PDDREN 410
NSG+ +RS + + +V+G+Q E+ P K +GGIY+ R WL + P +
Sbjct: 83 GLNSGVQLRSESKKDYQNGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTLNPAAKTA 142
Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----GIKV 465
F K WN RI +GN + TW+NG +I D+ + G IALQ+H G G V
Sbjct: 143 F-KNNAWNKARIEAIGNSIRTWINGVPCANIWDDMTPS--GFIALQVHAIGNASEEGKTV 199
Query: 466 LWRNIRVKT 474
W++IR+ T
Sbjct: 200 SWKDIRICT 208
Score = 79.0 bits (193), Expect = 7e-13, Method: Composition-based stats.
Identities = 114/441 (25%), Positives = 195/441 (44%), Gaps = 63/441 (14%)
Query: 87 AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFELQ 146
A W+ LF+G+ L+GW+ NG+A +++VDGAI + N ++ + Y +F L+
Sbjct: 19 AQTWEPLFNGKNLKGWKKLNGKA---EYKIVDGAIVGISKMGTPNTFLATTKNYGDFILE 75
Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDK-GFAEPLEDWQRCGVDYAMYL 205
+D+KI G NSG+ + Y+ V G ++++ K ++ + D R Y + L
Sbjct: 76 FDFKIDDGLNSGVQLRSESKKDYQNGRVHGYQFEIDPSKRAWSGGIYDEARRNWLYPLTL 135
Query: 206 PDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSD---DWFQRKNSGKWENA 262
A + WN ++I + ++NG W D F NA
Sbjct: 136 NPAAKTAFK-NNAWNKARIEAIGNSIRTWINGVPCANI--WDDMTPSGFIALQVHAIGNA 192
Query: 263 PEYG--LARKGL-ICLQD-------HGYPAWFRNIKIREL-PRKTEE--EELFNGKDLTG 309
E G ++ K + IC D A RN+ + PR+ +E L++GK G
Sbjct: 193 SEEGKTVSWKDIRICTTDVERYQTPETEEAPERNMIANTISPREAKEGWALLWDGKTNNG 252
Query: 310 W-----DVYGTEQWYVQDSLL-VCESGPDKQY--GYLATCKYYNDFELTADFKQEADGNS 361
W + + + W ++D +L V +SG + G + T + Y +F LT DFK NS
Sbjct: 253 WRGAKLNAFPEKGWKMEDGILKVMKSGGAESANGGDIVTTRKYKNFILTVDFKITEGANS 312
Query: 362 GIFIRSFVE------EGAKVN-GWQVEVAPKGFDTG-GIYESYGRGWLIQ-IPDDRENFL 412
G ++ FV EG+ + +Q+ K D G+ + G L IP +
Sbjct: 313 G--VKYFVNPDLNKGEGSAIGCEFQILDDDKHPDAKLGVKGNRKLGSLYDLIPAPEKKPF 370
Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMV------DIQDEKIG------------AGQGRIA 454
++++NT I V N V WLNG +++ D+ + + + +G I
Sbjct: 371 NKKDFNTATIIVQDNHVEHWLNGVKLIEYTRNTDMWNALVAYSKYKNWPNFGNSAEGNIL 430
Query: 455 LQIHDGGGIKVLWRNIRVKTL 475
LQ H G +V ++N+++K L
Sbjct: 431 LQDH---GDEVWFKNVKIKEL 448
>gi|118073263|ref|ZP_01541446.1| protein of unknown function DUF1080 [Shewanella woodyi ATCC 51908]
gi|118022335|gb|EAV36156.1| protein of unknown function DUF1080 [Shewanella woodyi ATCC 51908]
Length = 239
Score = 112 bits (280), Expect = 5e-23, Method: Composition-based stats.
Identities = 73/228 (32%), Positives = 114/228 (50%), Gaps = 7/228 (3%)
Query: 65 VLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQAD 124
+L + N L+++EK AGWQLLF+G+ + WR++ Q + W + +I
Sbjct: 19 LLLSTVATAGVAADNQLSKKEKEAGWQLLFNGKDMSQWRNFKQQGVNPKWVIDQDSIHLS 78
Query: 125 GEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD 184
G G + ++ + Y+NFEL DWKIS+ GNSG+ + + + K+ Y E Q++D+
Sbjct: 79 GGGGGD---LLTKQAYKNFELTLDWKISRAGNSGI-FVLADELGSKI-YSHAIEVQILDN 133
Query: 185 KGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFE 244
+ A+ D G Y + A+ V AGEWN +I N + + NG T +
Sbjct: 134 QRHADNKIDSHLSGSIYDIQASPPASHRV--AGEWNKVRIRMHNNSLSVWQNGILTADLI 191
Query: 245 AWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
S+ W K+ + +G I LQDH P WF+NIK+REL
Sbjct: 192 VGSEKWNSLVAESKFRTWTGFAQTSQGHIGLQDHSDPVWFKNIKLREL 239
>gi|149175127|ref|ZP_01853750.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
gi|148846105|gb|EDL60445.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
Length = 261
Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats.
Identities = 78/246 (31%), Positives = 125/246 (50%), Gaps = 46/246 (18%)
Query: 57 CMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEV 116
C+ CL ++ T C+ +P N L+E+E+ G++LLF+G+ L GW+ +G W+V
Sbjct: 9 CLGCLVLLQFHQ--TGCTSEP-NQLSEQEQLQGFKLLFNGKDLSGWQH------SGNWKV 59
Query: 117 VDGAIQADGEGSDENGYIVYD--RIYENFELQWDWKISKGGNSGLLYHVVERP-QYKVPY 173
DG I G+G G +VY+ + +NFEL+++WK+ +G NSG+ Y RP QY
Sbjct: 60 EDGIISRAGKG----GSLVYETEHVPDNFELKFEWKVGEGSNSGVYY----RPGQY---- 107
Query: 174 VTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEY 233
EYQ++D+ + Y P RP G+WNT +IV +++
Sbjct: 108 ----EYQILDNNKHVDGKNPRTSAASIYFCLPPSHDA--TRPVGDWNTGRIVCQGTVIQH 161
Query: 234 YMNGQKTIEFE------AWSDDWFQRKNSGKWENAPEYGLARKGL-ICLQDHGYPAWFRN 286
++NG+K I+ + AW + + LA +G + LQDHG P W+R
Sbjct: 162 WLNGEKVIDLDYTDPRYAWHVELLANRGG---------DLADRGAKLSLQDHGDPVWYRG 212
Query: 287 IKIREL 292
IK+R +
Sbjct: 213 IKMRSI 218
>gi|149174297|ref|ZP_01852924.1| hypothetical protein PM8797T_03084 [Planctomyces maris DSM 8797]
gi|148846842|gb|EDL61178.1| hypothetical protein PM8797T_03084 [Planctomyces maris DSM 8797]
Length = 1079
Score = 110 bits (276), Expect = 2e-22, Method: Composition-based stats.
Identities = 73/200 (36%), Positives = 104/200 (52%), Gaps = 32/200 (16%)
Query: 301 LFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
LFNGK+L GW +G + Y + D+ +V S P +L T K Y+DFEL DFK +
Sbjct: 30 LFNGKNLDGWVQHGGKAKYDIVDNTIVGTSVPKTPNSFLCTKKMYDDFELQVDFKVDPLL 89
Query: 360 NSGIFIRSFVEE--------------------GAKVNGWQVEVAPKGFD-TGGIYESYGR 398
NSGI IRS V + +V+G+QVE+ P +GGIY+ R
Sbjct: 90 NSGIQIRSNVYDEDKVLETKGADGKDKKIKIAAGRVHGYQVEIDPSDRAWSGGIYDEGRR 149
Query: 399 GWLIQIPDDR--ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQ 456
GWL + D++ + K+ EWN RI G+ + TW+NG D++D+ +G IALQ
Sbjct: 150 GWLNNLADNKAAQKAFKQNEWNHYRIVCRGDSIKTWINGVPAADLKDDL--TSKGFIALQ 207
Query: 457 IHDGG------GIKVLWRNI 470
+H G G +V WRN+
Sbjct: 208 VHGVGNHPEKVGKQVSWRNV 227
>gi|116624768|ref|YP_826924.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116227930|gb|ABJ86639.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 243
Score = 109 bits (273), Expect = 3e-22, Method: Composition-based stats.
Identities = 74/247 (29%), Positives = 128/247 (51%), Gaps = 39/247 (15%)
Query: 81 LTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDGAIQADGEGSDENGYIVYDRI 139
+T +EKAAGW+LLFDG++ + W D ++ + + + DG I++ + + + R
Sbjct: 1 MTAQEKAAGWKLLFDGKSYKNWEDPTKKSPPSNAFTIEDGCIKSLPKANIDEDLFTKQR- 59
Query: 140 YENFELQWDWKISKGGNSGLLYHVV---------------------------ERPQYKVP 172
+++FEL++DWKIS GGNSG+ Y + +RP
Sbjct: 60 FQDFELEFDWKISPGGNSGIKYRIQDRVMLADEKKGQRFEDRVNASMKDRRKDRPAKGQE 119
Query: 173 YVTGPEYQLIDDKGFAEPLEDW-QRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHV 231
YV G EYQ++D++ + + G Y M P A +P GE+N S+++ HV
Sbjct: 120 YVIGFEYQVLDNEKNPDARRGTNHQAGALYDMISP--AKDATKPVGEFNHSRLLVKGDHV 177
Query: 232 EYYMNGQKTIEFEAWSDDWFQRKNSGKW-ENAPEYGL-----ARKGLICLQDHGYPAWFR 285
E+++NG+K ++ + D + ++ +W ++P Y L ++ I +Q+H AWF+
Sbjct: 178 EHWLNGEKVVD-GSLKDPGVAKGSAARWGTSSPVYDLLVNQPRKECQISVQNHNSDAWFK 236
Query: 286 NIKIREL 292
NIKIR+L
Sbjct: 237 NIKIRKL 243
>gi|156861848|gb|EDO55279.1| hypothetical protein BACUNI_00951 [Bacteroides uniformis ATCC 8492]
Length = 291
Score = 106 bits (265), Expect = 3e-21, Method: Composition-based stats.
Identities = 70/231 (30%), Positives = 115/231 (49%), Gaps = 26/231 (11%)
Query: 87 AAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQ-----ADGEGSDENGYIVYDRIYE 141
A G+ +FDG++L+GWR Y + W + DG ++ + E G +++ ++
Sbjct: 61 ADGYITIFDGKSLDGWRGYGKDKVPSRWIIEDGCLKFCGTGTGEGQTGEGGDLIFAHKFK 120
Query: 142 NFELQWDWKISKGGNSGLLYHVVERPQ--------YKVPYVTGPEYQLIDDKGF--AEPL 191
NFEL+ +WKISKGGNSG+ Y E + Y++ PE+Q++D+ A+
Sbjct: 121 NFELELEWKISKGGNSGIFYLAQEVTSKDKDGNEVLEPIYISAPEFQVLDNANHPDAKLG 180
Query: 192 EDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
+D R +P N +P GEWN +KI+ G V + N + +E+ W+ W
Sbjct: 181 KDNNRQAASLYDMIPA-VPQNAKPFGEWNKAKIMVYKGTVVHGQNDENVLEYHLWTKQWT 239
Query: 252 Q-----RKNSGKWENAPEY-----GLARKGLICLQDHGYPAWFRNIKIREL 292
+ + + KW A E G +G I +QDHG W+RNI+++ L
Sbjct: 240 EMLQASKFSEEKWPLAFELLNNCGGENHEGFIGVQDHGDDVWYRNIRVKVL 290
>gi|116625197|ref|YP_827353.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116228359|gb|ABJ87068.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 212
Score = 105 bits (262), Expect = 7e-21, Method: Composition-based stats.
Identities = 65/179 (36%), Positives = 98/179 (54%), Gaps = 5/179 (2%)
Query: 300 ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
+LFNGKDL+GW G E+W V+D + + G K+YGYL T K Y DF L+ FK E DG
Sbjct: 33 QLFNGKDLSGWVNVGHEKWTVEDGTIHGQ-GVTKEYGYLRTEKQYKDFWLSIRFKCEDDG 91
Query: 360 NSGIFIRSFVEEGAK--VNGWQVEVAPK-GFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
NSG++ + + G G Q E+ GG+Y GRGW+ + E ++ +
Sbjct: 92 NSGVYFHTDFKPGTVDVSKGMQFEIDRTLNHHNGGLYGD-GRGWIAWPSPEYEQVIRPTD 150
Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
WN ++V GN + LNG ++D D + G IALQ+H GG + +++I ++ +
Sbjct: 151 WNEFLLKVEGNHMVAILNGIAIIDFTDPTPKSFDGYIALQLHSGGEGNMRFKDIYLRDM 209
>gi|149177059|ref|ZP_01855667.1| hypothetical oxidoreductase [Planctomyces maris DSM 8797]
gi|148844124|gb|EDL58479.1| hypothetical oxidoreductase [Planctomyces maris DSM 8797]
Length = 542
Score = 105 bits (262), Expect = 7e-21, Method: Composition-based stats.
Identities = 74/225 (32%), Positives = 114/225 (50%), Gaps = 25/225 (11%)
Query: 74 SEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGY 133
S P N LT EK AGW+LLF+G+ GW+ NG+ + P E DGA+ G GY
Sbjct: 337 SSAPDNTLTSAEKEAGWKLLFNGKDYSGWKCNNGKPIAAPIE--DGALVPYKSG----GY 390
Query: 134 -IVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLE 192
IVYD+ + +F+ + D K+ + NSG+ + V + K P TG E Q++ G +
Sbjct: 391 LIVYDKPFADFKFKCDVKMPEECNSGIFFRVGD---LKNPVQTGFEAQVLTGDGTG--MH 445
Query: 193 DWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQ 252
D+ G Y + P + GEW +I HV +NG+ + A D+W +
Sbjct: 446 DF---GAIYDLVAP--SVNRASKPGEWTNLEITCQGPHVSVAVNGKVVAKLNA--DEWTE 498
Query: 253 -----RKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+S K+++A + RKG + QDHG+ W++N+K+ EL
Sbjct: 499 PGKRLDGSSHKFKDAVK-DFPRKGYLGFQDHGHKVWYKNVKLLEL 542
>gi|149178817|ref|ZP_01857398.1| hypothetical protein PM8797T_06527 [Planctomyces maris DSM 8797]
gi|148842358|gb|EDL56740.1| hypothetical protein PM8797T_06527 [Planctomyces maris DSM 8797]
Length = 1742
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 69/187 (36%), Positives = 109/187 (58%), Gaps = 10/187 (5%)
Query: 290 RELPRKTEEEEL---FNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYND 346
+++P K + L FNG+DL GW ++ W V++ +V S KQ +L + +D
Sbjct: 1555 QQVPMKATPDNLKLFFNGQDLAGW-TGNSQLWSVENGEIVGRSPGIKQNEFLVSDLQVSD 1613
Query: 347 FELTADFKQEAD-GNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIP 405
FEL K D GNSGI RS +E G+ V G+Q + A KG+ G +YE +GRG L +
Sbjct: 1614 FELKLKVKLTPDIGNSGIQFRSSLEPGSHVKGYQAD-AGKGW-WGKLYEEHGRGLLFK-- 1669
Query: 406 DDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKV 465
+ E ++++ EWN RI VG+Q+ T++NG ++ D + GA G IA QIH GG ++V
Sbjct: 1670 ESGEAYVRKGEWNEYRIVAVGSQIRTFINGNLCTNLNDPQ-GAKTGIIAFQIHSGGPMEV 1728
Query: 466 LWRNIRV 472
++++ +
Sbjct: 1729 RFKDLEL 1735
>gi|88713768|ref|ZP_01107849.1| hypothetical protein FB2170_00670 [Flavobacteriales bacterium
HTCC2170]
gi|88707895|gb|EAR00134.1| hypothetical protein FB2170_00670 [Flavobacteriales bacterium
HTCC2170]
Length = 461
Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats.
Identities = 67/208 (32%), Positives = 103/208 (49%), Gaps = 7/208 (3%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQAL-TGPWEVVDG---AIQADGEGSDENGYI 134
N + +E GWQ+L+DG+T GWR WE+ +G + + GE S G I
Sbjct: 238 NKVAVDEIKNGWQMLWDGKTTNGWRGARLDEFPENGWEITNGILTVLPSGGEESAAGGDI 297
Query: 135 VYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLE-- 192
V +Y +FEL+ D+KI++G NSG+ Y+V G EYQ++DD + +
Sbjct: 298 VTKEVYGDFELKVDFKITEGANSGIKYYVDTDLNKGPGSSIGLEYQILDDARHPDAKKGN 357
Query: 193 -DWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWF 251
+ R + V P GEWNT+ I+ + VE+ +NG +++E SD +
Sbjct: 358 HEGSRTVASLYDLIQAATDKPVNPIGEWNTAHIISKDNQVEHRLNGMTVLKYERKSDSYK 417
Query: 252 QRKNSGKWENAPEYGLARKGLICLQDHG 279
+ + K+ P +G KG I LQDHG
Sbjct: 418 KLVSESKYVKWPNFGEVEKGHILLQDHG 445
Score = 92.4 bits (228), Expect = 6e-17, Method: Composition-based stats.
Identities = 63/188 (33%), Positives = 102/188 (54%), Gaps = 16/188 (8%)
Query: 300 ELFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
E+F+G+ L GW G E Y V++ +V + D ++ + K Y DF L ++ ++
Sbjct: 35 EIFDGETLNGWTQKGGEANYTVREGSIVGSTIHDTPNSFMTSDKMYGDFILELEYLVDST 94
Query: 359 GNSGIFIRSFVEEG---AKVNGWQVEVAPKGFD-TGGIYESYGRGWL---IQIPDDRENF 411
NSGI IRS +V+G+Q+E+ P + GIY+ RGWL I PD ++ F
Sbjct: 95 MNSGIQIRSNSYPHYMHGRVHGYQIEIDPSDRAWSAGIYDEGRRGWLNNLIDNPDAQKGF 154
Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGG-----GIKVL 466
K+ +WN RI +G+ + TW+NG + D+K + G I LQ+H G G +++
Sbjct: 155 -KQNDWNHYRIEAIGDTLKTWINGIPAAHLIDDKTAS--GFIGLQVHSIGKDKKEGTEII 211
Query: 467 WRNIRVKT 474
W+NI++ T
Sbjct: 212 WKNIKILT 219
>gi|116621745|ref|YP_823901.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116224907|gb|ABJ83616.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 214
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 70/219 (31%), Positives = 109/219 (49%), Gaps = 14/219 (6%)
Query: 81 LTEEEKAAG-WQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRI 139
+T AAG W+ LFDG+T GW + G+ W V DG ++ + D +
Sbjct: 1 MTVAAGAAGEWRTLFDGKTSAGWLEITGKPFPATWTVEDGCLKTSPKPGGMQDIRTVD-V 59
Query: 140 YENFELQWDWKISKGGNSGLLYHVVERPQY-----KVPYVTGPEYQLIDDKGFAEPLEDW 194
+ NFEL++DWK+ GNSG+ Y V + ++ + G EYQL DD +
Sbjct: 60 FRNFELEFDWKMLADGNSGVKYLVQKVDEWTNKDGRQARARGLEYQLADDHNPDAASDPA 119
Query: 195 QRCGVDYAMYLPDFATMNVRPA-GEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQR 253
+ G Y++ P + P GE+N S++V + GHVE+++NG K +EF D Q+
Sbjct: 120 RVAGSLYSVIAP---VPKITPKIGEFNHSRLVVNGGHVEHWLNGTKVVEFST-GDAAVQK 175
Query: 254 KNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+ + + L +G I LQ+H WFR I++R L
Sbjct: 176 QL--RTLRGKDGELLEEGPISLQNHSSEVWFRGIRVRTL 212
>gi|32472633|ref|NP_865627.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
gi|32443870|emb|CAD73311.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
Length = 429
Score = 99.8 bits (247), Expect = 4e-19, Method: Composition-based stats.
Identities = 75/191 (39%), Positives = 103/191 (53%), Gaps = 18/191 (9%)
Query: 301 LFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYG-YLATCKYYNDFELTADFKQEAD 358
LFNG DL+GW G+ ++ V+D ++V E P+ +L + K + DF L +K
Sbjct: 33 LFNGSDLSGWVKRGGSAKYRVEDGVIVGECAPNTPGNTFLCSDKEFGDFVLKLRYKFLES 92
Query: 359 GNSGIFIRSFV-EEG--AKVNGWQVEVAPKGFDTGGIYESYGRG------WLIQI-PDDR 408
GNSG+ RS EEG +V G+Q E+ P G TG IY+ RG WL P +R
Sbjct: 93 GNSGVQFRSASREEGDRQRVFGYQAEMRPGGDMTGRIYDEGRRGHKHGIIWLDAFTPQER 152
Query: 409 ENFLKER----EWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIK 464
+ +E EWN + I+ VG + TWLNG +VDI D + +G I LQIH G
Sbjct: 153 LDAAQESCRPGEWNDLEIQCVGPSIKTWLNGNLVVDIFDSF--SMKGFIGLQIHSGETGS 210
Query: 465 VLWRNIRVKTL 475
V W++IRVK L
Sbjct: 211 VAWKDIRVKDL 221
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 106/447 (23%), Positives = 188/447 (42%), Gaps = 71/447 (15%)
Query: 61 LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
L + G L + + + TEE G+ LF+G L GW G A + V DG
Sbjct: 5 LAAAITIGLLASITHNVEAADTEE----GFVSLFNGSDLSGWVKRGGSA---KYRVEDGV 57
Query: 121 IQAD-GEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEY 179
I + + N ++ D+ + +F L+ +K + GNSG+ + R + V G +
Sbjct: 58 IVGECAPNTPGNTFLCSDKEFGDFVLKLRYKFLESGNSGVQFRSASREEGDRQRVFGYQA 117
Query: 180 QLIDDKGFAEPLEDWQRCGVDYAM-----YLP----DFATMNVRPAGEWNTSKIVFDNGH 230
++ + D R G + + + P D A + RP GEWN +I
Sbjct: 118 EMRPGGDMTGRIYDEGRRGHKHGIIWLDAFTPQERLDAAQESCRP-GEWNDLEIQCVGPS 176
Query: 231 VEYYMNGQKTIE-FEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDH----GYPAWFR 285
++ ++NG ++ F+++S KG I LQ H G AW +
Sbjct: 177 IKTWLNGNLVVDIFDSFS---------------------MKGFIGLQIHSGETGSVAW-K 214
Query: 286 NIKIRELPRKTEEEELFNGKD----LTGWDVYGTEQW-YVQDSLLV-CESGPDKQYGYLA 339
+I++++L + D L G E+W + +D +L S + G +
Sbjct: 215 DIRVKDLGESQWQSFFVKNDDGEYGLEGARFVLPEEWSFTKDGVLHGVHSKSQGKDGLVI 274
Query: 340 TCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKV-NGWQVEVAPKGFD-----TGGIY 393
+ +++F ++ GNS ++ R+ + V G+Q E+A G D T GI
Sbjct: 275 SDDNFDNFIARVTYRMRG-GNSALYFRAEETDAPWVLRGFQNEIANNGKDCALWHTAGII 333
Query: 394 ESY---GRGWLIQIPDDRENFL-----KEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEK 445
+ GRGW++ + F+ K+ +WNT G+++ LNG DI DE+
Sbjct: 334 DGNTIPGRGWIVT----NDEFVGKVRNKDDQWNTTCTAAYGDRLVQTLNGFCTSDIIDEE 389
Query: 446 IGAGQGRIALQIHDGGGIKVLWRNIRV 472
G++ LQ+H G ++ +++ V
Sbjct: 390 C-EKTGKLGLQMHGGTDCEMYFKDFEV 415
>gi|32475761|ref|NP_868755.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
gi|32446304|emb|CAD76132.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
Length = 446
Score = 99.0 bits (245), Expect = 6e-19, Method: Composition-based stats.
Identities = 119/452 (26%), Positives = 190/452 (42%), Gaps = 76/452 (16%)
Query: 61 LCVMVLFGALTACSEKPQNVLTEEEKAAG-WQLLFDGQTLEGWRDYNGQALTGPWEVVDG 119
CV + G++ + P+ TE + A Q LFDG++L GW + G EVVDG
Sbjct: 24 FCVSICCGSVATGDDMPKPAKTESQAPANEMQSLFDGKSLTGWTN---PYEWGKTEVVDG 80
Query: 120 AIQADGEGSDENGYIVYDRIYENFELQWDWKISKG-GNSGLLYHVVERPQYKVPYVTGPE 178
I +D+ ++V ++I++++E + + K+ +G NSG + P Y E
Sbjct: 81 EIHLT---ADKKFFLVTEKIFQDYEFEGEVKLPEGKSNSGFMARGQVSPNKVFGYQA--E 135
Query: 179 YQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
D + ++ +R ++ P+ R WN +I H+++++N
Sbjct: 136 ADPTDRRWSGGLYDEGRRQWLNPLWEQPEAQAAFDR--DRWNRYRIRCVGNHLQFFINDV 193
Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAW---FRNIKIRELPRK 295
T D+F N G I LQ HG FRN+K+R L
Sbjct: 194 PTT-------DYFDPVN-------------LSGRIGLQHHGEKGQTYRFRNLKVRNLGSH 233
Query: 296 TEEEELFNGKDLTGWDVYGTEQWYVQDSLLV--CESGPDKQYGYLATCKYYND--FELTA 351
E + LF+GK L GW+ G W V D +L S ++ G L + D + +
Sbjct: 234 -EWKPLFDGKSLDGWETVGGGTWTVVDGILQGRASSEANEPNGMLYSKHPMTDGTYRIEY 292
Query: 352 DFKQEADGNSGIFIRSFVEEGAK-VNGWQVEVAPKGFDTGGIYESYGRGWLIQ------- 403
FK+ G+SG F+RS + E V G Q E+ + GG+Y++ G GWL++
Sbjct: 293 RFKK---GDSGFFVRSEITENKPFVKGVQCEIDNSD-EVGGLYQTGGAGWLVRPLHYLET 348
Query: 404 -IPDDRENFLK----------------------EREWNTMRIRVVGNQVTTWLNGEQMVD 440
P DR + E WN M + V G ++ LN VD
Sbjct: 349 GFPKDRHAVVNRHWKQAREGLQLDKKPVVSDDDETPWNMMTVSVHGKRIVVHLNDCLAVD 408
Query: 441 IQDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
E + A G IALQ+H ++V +R + +
Sbjct: 409 HVVEDL-ADSGVIALQLHGNQDLEVDFRKVEM 439
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 63/185 (34%), Positives = 91/185 (49%), Gaps = 13/185 (7%)
Query: 297 EEEELFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK- 354
E + LF+GK LTGW + Y + V D + + DK++ +L T K + D+E + K
Sbjct: 53 EMQSLFDGKSLTGWTNPYEWGKTEVVDGEIHLTA--DKKF-FLVTEKIFQDYEFEGEVKL 109
Query: 355 QEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFD-TGGIYESYGRGWLIQI---PDDREN 410
E NSG R V KV G+Q E P +GG+Y+ R WL + P+ +
Sbjct: 110 PEGKSNSGFMARGQVSPN-KVFGYQAEADPTDRRWSGGLYDEGRRQWLNPLWEQPEAQAA 168
Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNI 470
F ++R WN RIR VGN + ++N D D GRI LQ H G +RN+
Sbjct: 169 FDRDR-WNRYRIRCVGNHLQFFINDVPTTDYFDPV--NLSGRIGLQHHGEKGQTYRFRNL 225
Query: 471 RVKTL 475
+V+ L
Sbjct: 226 KVRNL 230
>gi|87311549|ref|ZP_01093668.1| hypothetical protein DSM3645_02096 [Blastopirellula marina DSM
3645]
gi|87285805|gb|EAQ77720.1| hypothetical protein DSM3645_02096 [Blastopirellula marina DSM
3645]
Length = 220
Score = 99.0 bits (245), Expect = 6e-19, Method: Composition-based stats.
Identities = 75/194 (38%), Positives = 103/194 (53%), Gaps = 24/194 (12%)
Query: 302 FNGKDLTGW-DVYGTEQWYVQDSLLV---CESGPDKQYGYLATCKYYNDFELTADFKQEA 357
FNGKDL+GW GT + V+D ++ E G + L T K Y DFEL + K +
Sbjct: 31 FNGKDLSGWTQKNGTATYVVEDGGVIRGKTEVGSPNSF--LCTDKDYGDFELEFEVKCDD 88
Query: 358 DGNSGIFIRSFVEEG------AKVNGWQVEVAPKGFDTGGIY-ESYGRGWLIQIPDDR-- 408
NSG+ IRS E +VNG QVE+ + G +Y E+ GRGWL P DR
Sbjct: 89 GLNSGVQIRSQTAEAKGDQKFGRVNGPQVEIEKSVGEAGYVYGEATGRGWLT--PADRLK 146
Query: 409 -ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQ--GRIALQIH----DGG 461
+ K EWN+ R+ G ++ T++NGE + D+ DE+I G I LQ+H D G
Sbjct: 147 PHDHFKNGEWNSYRVVAKGPRIQTFINGEPIEDLTDEEIYKTHPTGFIGLQVHGIGKDQG 206
Query: 462 GIKVLWRNIRVKTL 475
+V W+NIR+K L
Sbjct: 207 PYEVRWKNIRIKPL 220
>gi|87308076|ref|ZP_01090218.1| hypothetical protein DSM3645_20802 [Blastopirellula marina DSM
3645]
gi|87289158|gb|EAQ81050.1| hypothetical protein DSM3645_20802 [Blastopirellula marina DSM
3645]
Length = 229
Score = 97.8 bits (242), Expect = 1e-18, Method: Composition-based stats.
Identities = 72/201 (35%), Positives = 102/201 (50%), Gaps = 28/201 (13%)
Query: 301 LFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
LFNGK+L G+ +G + Y ++ +V S + +L T K Y DF L DFK +
Sbjct: 30 LFNGKNLEGFTQHGGKAVYTIEGDEIVGTSTLNTPNTFLCTNKEYGDFILEVDFKVDPKL 89
Query: 360 NSGIFIRSFVEEGA----------------KVNGWQVEVAPKGFD-TGGIYESYGRGWLI 402
NSGI IRS V A +V+G+QVE+ P +GGIY+ RGWL
Sbjct: 90 NSGIQIRSQVFPEATEVDFGDGKMKKMAKDRVHGYQVEIDPSARAWSGGIYDEARRGWLN 149
Query: 403 QIPDDRE--NFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDG 460
+ ++ E K+ +WN RI G+ + TW+NG D++D +G IALQ+H
Sbjct: 150 DLKNNPEAGKAFKQDDWNHYRIECRGDSIKTWINGVPAADLKDGL--TSKGLIALQVHGI 207
Query: 461 GGIK------VLWRNIRVKTL 475
GG K V W+NI +K L
Sbjct: 208 GGDKEKAGTQVRWKNIMIKEL 228
>gi|32470813|ref|NP_863806.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
gi|32442958|emb|CAD71479.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
Length = 272
Score = 97.8 bits (242), Expect = 2e-18, Method: Composition-based stats.
Identities = 70/222 (31%), Positives = 111/222 (50%), Gaps = 32/222 (14%)
Query: 83 EEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAI-QADGEGSDENGYIVYDR--I 139
+ + AA + LFDG++ +GW +G W + DGA +A G GS + Y R +
Sbjct: 47 QSDAAADFVELFDGKSFDGWEH------SGNWRIEDGAFFRAAGGGS-----LTYKRTLV 95
Query: 140 YENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGV 199
++FEL+++WK+S G NSG+ Y RP EYQ++D+ G P + R
Sbjct: 96 PDDFELRFEWKVSDGCNSGVYY----RPGQV-------EYQVLDNVG--SPYGENPRQSA 142
Query: 200 DYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKW 259
+ + RP GEWN+ ++V ++++ NGQK ++F+ W +
Sbjct: 143 ASLFFCMAPSKDATRPVGEWNSGRVVCKGTVIQHWFNGQKVLDFDYTDPKWAEMVRLLTI 202
Query: 260 ENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEEL 301
G R G + LQDHG P W+RN++ RE+P +EE L
Sbjct: 203 RGGDLTG--RGGELWLQDHGQPVWYRNLRWREIP---DEESL 239
>gi|149195996|ref|ZP_01873052.1| hypothetical protein LNTAR_22659 [Lentisphaera araneosa HTCC2155]
gi|149140843|gb|EDM29240.1| hypothetical protein LNTAR_22659 [Lentisphaera araneosa HTCC2155]
Length = 231
Score = 97.4 bits (241), Expect = 2e-18, Method: Composition-based stats.
Identities = 71/188 (37%), Positives = 104/188 (55%), Gaps = 13/188 (6%)
Query: 301 LFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
LFNGKDL+GW V GT + V++ ++ ++ + +L + K ++DFELT + K D
Sbjct: 43 LFNGKDLSGWTVRNGTGTYRVENGAIIGKTTDGSKNTFLCSDKLFSDFELTLEVKLINDQ 102
Query: 360 -NSGIFIRSFVEEGAK-VNGWQVEVAP---KGFDTGGIYESYGRGWLIQIPDDR-ENFLK 413
NSGI IRS K VNG QVEV KG ++G IY GW+ + +K
Sbjct: 103 LNSGIQIRSNDNNAKKRVNGPQVEVEATKGKGAESGYIYGEACGGWMTPKAKLKPHTLMK 162
Query: 414 EREWNTMRIRVVGNQVTTWLNGEQMVDIQDE-KIGAG-QGRIALQIHD----GGGIKVLW 467
EWNT+ I G ++ TW+NG Q+ D+ D+ K+ + +G I LQ+H G +V W
Sbjct: 163 NGEWNTVLIIAKGAKIQTWINGTQVSDLTDDAKLKSHPEGFIGLQVHSIPKGKGPYEVTW 222
Query: 468 RNIRVKTL 475
+NI +K L
Sbjct: 223 KNIMIKDL 230
>gi|87308201|ref|ZP_01090343.1| probable secreted glycosyl hydrolase [Blastopirellula marina DSM
3645]
gi|87289283|gb|EAQ81175.1| probable secreted glycosyl hydrolase [Blastopirellula marina DSM
3645]
Length = 401
Score = 96.7 bits (239), Expect = 3e-18, Method: Composition-based stats.
Identities = 106/410 (25%), Positives = 174/410 (42%), Gaps = 64/410 (15%)
Query: 61 LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
L +++ A +E+ N L+E E A GW LLFDG+T GW+ G+ W V G
Sbjct: 11 LLALIVAPLTAANAEQAPNTLSEAEIADGWLLLFDGETTFGWKS-EGKI---DWTVDQGV 66
Query: 121 IQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQ 180
I+A + G + + N+EL D++ NS + +P+ P Y
Sbjct: 67 IRAT---KGDVGQLRTTTQFANYELHVDFRAPAETNSAIFLRTSPKPE-------SPTYG 116
Query: 181 LIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRP---AGEWNTSKIVFDNGHVEYYMNG 237
+ P + G M V P A +W++ + D G + ++G
Sbjct: 117 CYELN--IAPESNSYPTG-------SLVGRMKVTPPCVADQWHSFDVTADQGTIVVKLDG 167
Query: 238 QKTIEFEAWSDDWFQRKNSGKWENAPEY-GLARKGLICLQDHGYPAWFRNIKIRELPRKT 296
+ +++E P+Y GL G I LQ + FRN+K++ L ++
Sbjct: 168 AEVLKYED-----------------PQYVGL---GFIGLQHNQGEIEFRNVKLKPLGMQS 207
Query: 297 EEEELFNGKDLTGWDVY---GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADF 353
+FNGKDLTGW Y +E + + ++GP G L T K Y DF L D
Sbjct: 208 ----IFNGKDLTGWKTYPAMESEFTVTDEGTIHAKNGP----GQLETEKSYGDFVLRLDA 259
Query: 354 KQEADG-NSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGG--IYESYGRGWLIQIPDDREN 410
A NSG+F R G K+ G++ ++ G++ + +G G + + R
Sbjct: 260 ITHAKNLNSGVFFRCI--PGDKMMGYESQIH-NGYEAEDRTLPIDHGTGAIFRRMKARRV 316
Query: 411 FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDG 460
+++W T + G ++ W+NG Q+ D D++ R L+I G
Sbjct: 317 VSDDQKWFTKTLIAQGPHISVWVNGYQVTDWTDQRKPDENPRRGLRIEPG 366
>gi|149179157|ref|ZP_01857726.1| hypothetical protein PM8797T_17619 [Planctomyces maris DSM 8797]
gi|148842017|gb|EDL56411.1| hypothetical protein PM8797T_17619 [Planctomyces maris DSM 8797]
Length = 227
Score = 94.4 bits (233), Expect = 1e-17, Method: Composition-based stats.
Identities = 70/196 (35%), Positives = 98/196 (50%), Gaps = 24/196 (12%)
Query: 301 LFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
LF+GK L W + GT + V+D + + +L T K Y +FEL + K +
Sbjct: 34 LFDGKTLNNWVQHNGTATYMVKDGTIEGTTSEGSPNSFLCTKKNYGNFELEFEVKVHNNL 93
Query: 360 NSGIFIRSFVEEG-AKVNGWQVEVAPKGFDTGG----------IYESYGRGWLIQIPDDR 408
NSG+ IRS E G +VNG QVE+ G D G Y G GW+ P+D+
Sbjct: 94 NSGVQIRSQQENGDGRVNGPQVEIEASG-DNGAEAGYIYGEAIKYAGKGIGWMT--PEDK 150
Query: 409 EN---FLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQ--GRIALQIH----D 459
LK+ EWN R+ G ++ TW+NGEQ+ D+ DE++ G I LQ+H
Sbjct: 151 RTPHKNLKDGEWNQFRVVANGPRIQTWVNGEQVSDLVDERVYKSHPTGFIGLQVHGIKKG 210
Query: 460 GGGIKVLWRNIRVKTL 475
G V W+NIR+K L
Sbjct: 211 TGPYSVAWKNIRIKEL 226
>gi|87308956|ref|ZP_01091094.1| hypothetical protein DSM3645_19403 [Blastopirellula marina DSM 3645]
gi|87288299|gb|EAQ80195.1| hypothetical protein DSM3645_19403 [Blastopirellula marina DSM 3645]
Length = 1348
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 10/179 (5%)
Query: 300 ELFNGKDLTGWDVYGTEQ-WYVQDSLLVCESGPD-KQYGYLATCKYYNDFELTADFKQ-E 356
ELFNG+DL GW+ G W V++ +V ++ +L + +F L + + +
Sbjct: 1173 ELFNGQDLNGWN--GDRNLWSVENGEIVGKTLTGIPANSFLISDLAAANFRLKLEIRLVK 1230
Query: 357 ADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
+GNSGI RS G V G+Q + + G +YE +GRG L+ + E+ LK +
Sbjct: 1231 NEGNSGIQFRSNALPGGSVQGYQADAGAGWW--GKLYEEHGRG-LLWVKSGEEH-LKPGD 1286
Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
WN I GNQV T+LNG+ VD+ D+K G QG ALQ+H GG +V +R+++++ L
Sbjct: 1287 WNQYEIVAQGNQVKTFLNGQPCVDLHDDK-GVKQGVFALQLHSGGPTEVRFRHLQLEIL 1344
>gi|32471625|ref|NP_864618.1| hypothetical protein RB1854 [Rhodopirellula baltica SH 1]
gi|32396996|emb|CAD72299.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 703
Score = 89.7 bits (221), Expect = 4e-16, Method: Composition-based stats.
Identities = 58/184 (31%), Positives = 103/184 (55%), Gaps = 9/184 (4%)
Query: 299 EELFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA 357
+ LF+GK L GW+ GT ++ V++ +V + +L + + Y++FELT + +
Sbjct: 521 KSLFDGKTLDGWNRKNGTAKYRVENGTIVGTTSEGSPNSFLCSDENYDNFELTFEVNVDE 580
Query: 358 DGNSGIFIRSFV-EEGAKVNGWQVEVAPKGFDTGGIY-ESYGRGWLIQIPDDRENFLKER 415
NSG+ IRS E+G +V G QVE+ + G IY E+ GRGW+ + ++ + K
Sbjct: 581 GLNSGVQIRSQSREKGGRVYGPQVEIESAPGEAGYIYSEATGRGWITKEQPIKDAY-KNG 639
Query: 416 EWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH----DGGGIKVLWRNIR 471
++N +R GN++ W+ +++ DIQD + + G + LQ+H G +V WR+I+
Sbjct: 640 KFNRYLVRAHGNRIQVWIGDQKISDIQDPE-SSTDGFLGLQVHGIKAGTGPYEVSWRDIK 698
Query: 472 VKTL 475
++ L
Sbjct: 699 IRNL 702
Score = 60.1 bits (144), Expect = 4e-07, Method: Composition-based stats.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 86 KAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFEL 145
KA GW+ LFDG+TL+GW NG T + V +G I N ++ D Y+NFEL
Sbjct: 516 KADGWKSLFDGKTLDGWNRKNG---TAKYRVENGTIVGTTSEGSPNSFLCSDENYDNFEL 572
Query: 146 QWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFA 188
++ + +G NSG+ + + + K V GP+ ++ G A
Sbjct: 573 TFEVNVDEGLNSGV--QIRSQSREKGGRVYGPQVEIESAPGEA 613
>gi|149199908|ref|ZP_01876936.1| hypothetical protein LNTAR_09721 [Lentisphaera araneosa HTCC2155]
gi|149136977|gb|EDM25402.1| hypothetical protein LNTAR_09721 [Lentisphaera araneosa HTCC2155]
Length = 223
Score = 87.8 bits (216), Expect = 1e-15, Method: Composition-based stats.
Identities = 69/195 (35%), Positives = 97/195 (49%), Gaps = 19/195 (9%)
Query: 300 ELFNGKDLTGWDVYGTE-QWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQ-EA 357
ELFNGK+L GW E + V+D ++ + +L + Y DFEL + K +
Sbjct: 25 ELFNGKNLDGWTEKTKEGSFRVEDGAIIGTAKDGMGTTFLCSNNNYGDFELEFETKLIDN 84
Query: 358 DGNSGIFIRSFVEE------GAKVNGWQVEVAPKGFD---TGGIYESYGRGWLIQIPDDR 408
NSG+ IRS ++E V G QVEV + F+ +G IY + WL D +
Sbjct: 85 KLNSGVQIRSRLQEPDGKQKHPAVYGPQVEVTGRNFEKNQSGYIYGQAWKTWLTPKEDKK 144
Query: 409 -ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQ---DEKIGAGQGRIALQIH----DG 460
F K+ EWN R+ GNQ+TTWLNG ++V + G IALQ+H
Sbjct: 145 AHQFFKDGEWNHFRVLAKGNQITTWLNGNKIVTTTVPAKRQQSNPSGFIALQVHGIKKGT 204
Query: 461 GGIKVLWRNIRVKTL 475
G +V W+NI+VK L
Sbjct: 205 GPFQVAWKNIKVKEL 219
>gi|149197913|ref|ZP_01874962.1| hypothetical protein LNTAR_05481 [Lentisphaera araneosa HTCC2155]
gi|149139134|gb|EDM27538.1| hypothetical protein LNTAR_05481 [Lentisphaera araneosa HTCC2155]
Length = 442
Score = 87.8 bits (216), Expect = 1e-15, Method: Composition-based stats.
Identities = 96/364 (26%), Positives = 156/364 (42%), Gaps = 80/364 (21%)
Query: 132 GYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYK-----VPYVTGPEYQLIDDK- 185
G I + Y+++ L++++K++ N+G+ PQ K +P + E Q++D+
Sbjct: 64 GNIYTAKEYQDYVLRFEFKLAPHANNGIGLRA-PHPQDKSVKRHIPAYSTVEIQILDNTH 122
Query: 186 -GFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFE 244
+A+ L+D Q G Y + + +P GEWN +I H++ +NG+K ++ +
Sbjct: 123 PKYAK-LKDHQFHGSAYGIAAAKRGFL--KPLGEWNYQEIHLKASHLQVILNGEKILDCD 179
Query: 245 AWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEE------ 298
S D +GK+ + G I + HG FRNI + EL +
Sbjct: 180 IASGD------AGKYAKGRD---RTSGHIAIAGHGPGVTFRNISVAELDNALSQAQEDNV 230
Query: 299 -----EELFNGKDLTGW-----------------------------DVYGTEQWYVQDSL 324
+LFNGKDLT W D + W V D
Sbjct: 231 APEGFTQLFNGKDLTNWKGLLDRPFDRPHKRKTLKADKLKELQAKADESMKKHWSVTDKG 290
Query: 325 LVCESGPDKQYGY-LATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGW----- 378
+ G K+ G+ LAT K Y DFE +K +G+SGI++R +V W
Sbjct: 291 ELFFDG--KKGGHSLATLKQYKDFEFHVSWKINQNGDSGIYLRGL----PQVQIWDPSDQ 344
Query: 379 --QVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGE 436
Q A KG +G ++ + G + D+ EWN IR++G++V+ W NG+
Sbjct: 345 KVQKLGAHKG--SGALWNNPKEGKWPLVKADKPT----GEWNHFFIRMIGDRVSIWTNGK 398
Query: 437 QMVD 440
Q VD
Sbjct: 399 QTVD 402
>gi|32471039|ref|NP_864032.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32396741|emb|CAD71706.1| N-acetyl-galactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 889
Score = 86.7 bits (213), Expect = 3e-15, Method: Composition-based stats.
Identities = 68/191 (35%), Positives = 91/191 (47%), Gaps = 20/191 (10%)
Query: 301 LFNGKDLTGWDVYGTE-QWYVQDSLLVCESGPDKQYGYLATCKY-YNDFELTADFKQEAD 358
LFNGKDL+GW G + V+D +LV + P YL+T + ++DF T D K E
Sbjct: 65 LFNGKDLSGWTAKGGSCTFEVKDGILVGQVVPGSNSTYLSTERDDFDDFIFTCDMKWEES 124
Query: 359 GNSGIFIRSFVEEGAKVNGWQVEVAPK----GFD-----TGGIYESYGRG-----WLIQI 404
NSG+ R+ + G NG + P+ GF +GGIY G WL +
Sbjct: 125 CNSGVMFRAQSKPGK--NGTETVFGPQAEMEGFTQDRHWSGGIYGQSCGGYFYPLWLKEH 182
Query: 405 PDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIK 464
+ R E WN + I GN V TW+NG D+ +G LQ+H G
Sbjct: 183 KEARAA-TTEDIWNRVTISAQGNVVKTWINGVPAAHWIDDG-SYPKGFFGLQVHKGAKGT 240
Query: 465 VLWRNIRVKTL 475
VLW+NIRVK L
Sbjct: 241 VLWKNIRVKEL 251
>gi|87308660|ref|ZP_01090800.1| probable protein kinase yloP-putative serine/threonine protein
kinase [Blastopirellula marina DSM 3645]
gi|87288752|gb|EAQ80646.1| probable protein kinase yloP-putative serine/threonine protein
kinase [Blastopirellula marina DSM 3645]
Length = 1534
Score = 84.7 bits (208), Expect = 1e-14, Method: Composition-based stats.
Identities = 59/191 (30%), Positives = 92/191 (48%), Gaps = 29/191 (15%)
Query: 299 EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
E +FNG DLTGW G W V++ +V E PD++ G L K Y+ +EL ++ EAD
Sbjct: 806 ESIFNGHDLTGWSERGAPGWRVENQQIVSEVSPDRERGSLLLDKLYDAYELEFEYALEAD 865
Query: 359 GNSGIFIRSFVEEGAKVNGW-QVEV----------APKGFDTGGIYESYGRGWLIQIPDD 407
+SG+F+ + E A W Q+++ P TG +Y L+
Sbjct: 866 ADSGLFLNAGFGESALEAQWLQIQLLDDQMGTYDDIPAERRTGSVYGVAAASALVDS--- 922
Query: 408 RENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQ---GRIALQIHDGGGIK 464
++ W MR+R GN VT ++N + + ++G GQ G I LQ + G G
Sbjct: 923 -----QKTPWRKMRVRFDGNSVTVYINN---IMVTQHELG-GQYPTGHIGLQRYKGSG-- 971
Query: 465 VLWRNIRVKTL 475
+RN+R++ L
Sbjct: 972 -KFRNLRIRNL 981
>gi|32472822|ref|NP_865816.1| hypothetical protein RB3944 [Rhodopirellula baltica SH 1]
gi|32444059|emb|CAD73501.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 513
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 100/427 (23%), Positives = 171/427 (40%), Gaps = 76/427 (17%)
Query: 61 LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP------- 113
LCV + A T + + +GW LF+G+ L GW +GQ P
Sbjct: 83 LCVALCLAAFTTSHAQDAT-----DTQSGWATLFNGKDLSGW---HGQPHFDPYKLAEMS 134
Query: 114 ------------------WEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGG 155
W V +G + DG G+ Y+ D Y ++EL+ +++
Sbjct: 135 DEERASKIEEWTADAKSHWSVENGELVNDGHGA----YLTTDEGYSDYELKLEYRTVAKA 190
Query: 156 NSGLLYHVVERPQYKVPYVTGP-EYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVR 214
+SG+ ++ PQ ++ T ++ L + G + + L D +
Sbjct: 191 DSGI--YLKGTPQVQIWDTTDEGKFNLGANLGSGALWNNSPGAAGKDPLVLAD------K 242
Query: 215 PAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLIC 274
P GEWNT ++ ++NG++T++ A +++++R L G I
Sbjct: 243 PFGEWNTVHVIQVGSRTSVWLNGKQTVD-HAIMENYWRRGEP----------LPASGPIQ 291
Query: 275 LQDHGYPAWFRNIKIRELPRKTEEEEL-----------FNGKDLTGWDVYGTEQWYVQDS 323
LQ HG +RNI++R+L + L F+GK L GW + +D
Sbjct: 292 LQTHGGEIRWRNIQVRQLDTDEANDILASKHNDNFTSVFDGKTLEGWIGAVADYEVTEDG 351
Query: 324 LLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVA 383
+ C+ G + G L T K Y DF + F+ GN+G+ IR+ +EG E+
Sbjct: 352 SIQCQKG---RGGNLLTEKEYGDFSVRLRFRLPERGNNGLAIRA-PKEGNPAYAAMTELQ 407
Query: 384 PKGFD---TGGIYESYGRGWLIQIPDDRENFLKE-REWNTMRIRVVGNQVTTWLNGEQMV 439
D + E G + + +L+ EWN ++ V+G + LNG ++
Sbjct: 408 VLDNDHPAYAKLDERQYHGSAYGMAAAKRGYLRPVGEWNFQQVTVIGPTIRVELNGNVIL 467
Query: 440 DIQDEKI 446
D KI
Sbjct: 468 DTDVSKI 474
>gi|149173679|ref|ZP_01852308.1| hypothetical protein PM8797T_04560 [Planctomyces maris DSM 8797]
gi|148847209|gb|EDL61543.1| hypothetical protein PM8797T_04560 [Planctomyces maris DSM 8797]
Length = 165
Score = 82.4 bits (202), Expect = 6e-14, Method: Composition-based stats.
Identities = 53/167 (31%), Positives = 88/167 (52%), Gaps = 12/167 (7%)
Query: 129 DENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKG-- 186
D G + D+ Y+NF L++D+K+ G N+G+ +HV P+ P G E Q++DD
Sbjct: 7 DSGGNLFTDKEYKNFVLRFDFKLEPGANNGIGFHVPLNPKTS-PAYAGKEIQILDDTADK 65
Query: 187 FAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAW 246
+A+ L+ +Q G Y +++P GEWNT +++ D V+ +NG ++F
Sbjct: 66 YAK-LQKYQYHGSLYGTAPAKRG--HLKPVGEWNTQELLVDGNKVKVTLNGTTIVDF--- 119
Query: 247 SDDWFQRKNSGKWENAPEYGLARK-GLICLQDHGYPAWFRNIKIREL 292
D K +G + GL R+ G +CL HG F+N++I+EL
Sbjct: 120 --DMTDAKKNGTIDGKDHPGLKRESGHLCLCGHGAKIEFKNLRIKEL 164
>gi|149174041|ref|ZP_01852669.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
gi|148847021|gb|EDL61356.1| probable secreted glycosyl hydrolase [Planctomyces maris DSM 8797]
Length = 219
Score = 80.1 bits (196), Expect = 3e-13, Method: Composition-based stats.
Identities = 66/193 (34%), Positives = 90/193 (46%), Gaps = 22/193 (11%)
Query: 294 RKTEEEE----LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFEL 349
+ TE EE LFNGKDLTGW V + V+D ++ + +L T K Y +FE
Sbjct: 36 KNTEAEEGWIELFNGKDLTGWTVAEGGPFEVKDGVIEVTG----KRSHLFTDKEYKNFEF 91
Query: 350 TADFKQEADGNSGIFIRS-FVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDR 408
AD K NSGIF + F EEG G++ +V D Y R L + P
Sbjct: 92 KADVKTTPGSNSGIFFHTKFQEEGWPTQGYESQVNVSHKDPVKTGSLYNRVKLFKTP--- 148
Query: 409 ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAG------QGRIALQIHDGGG 462
K+ EW T I V G V +N + ++D + + G +G ALQ HD
Sbjct: 149 ---AKDNEWWTQHIIVNGRHVIVKINDQTVIDYTEPEGATGSPSLGEKGSFALQAHDPKS 205
Query: 463 IKVLWRNIRVKTL 475
+ V ++NIRVK L
Sbjct: 206 V-VYYKNIRVKPL 217
Score = 61.2 bits (147), Expect = 2e-07, Method: Composition-based stats.
Identities = 68/251 (27%), Positives = 102/251 (40%), Gaps = 66/251 (26%)
Query: 58 MNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVV 117
M L V L G +A S + +N EE GW LF+G+ L GW G GP+EV
Sbjct: 17 MAALLVSSLIGNNSAYSGE-KNTEAEE----GWIELFNGKDLTGWTVAEG----GPFEVK 67
Query: 118 DGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQ--------- 168
DG I+ G+ S ++ D+ Y+NFE + D K + G NSG+ +H + +
Sbjct: 68 DGVIEVTGKRS----HLFTDKEYKNFEFKADVKTTPGSNSGIFFHTKFQEEGWPTQGYES 123
Query: 169 -----YKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSK 223
+K P TG Y + K F P +D EW T
Sbjct: 124 QVNVSHKDPVKTGSLYNRV--KLFKTPAKD-----------------------NEWWTQH 158
Query: 224 IVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPA- 282
I+ + HV +N Q I++ +P G KG LQ H +
Sbjct: 159 IIVNGRHVIVKINDQTVIDY----------TEPEGATGSPSLG--EKGSFALQAHDPKSV 206
Query: 283 -WFRNIKIREL 292
+++NI+++ L
Sbjct: 207 VYYKNIRVKPL 217
>gi|87311427|ref|ZP_01093547.1| hypothetical protein DSM3645_25327 [Blastopirellula marina DSM
3645]
gi|87285839|gb|EAQ77753.1| hypothetical protein DSM3645_25327 [Blastopirellula marina DSM
3645]
Length = 217
Score = 77.0 bits (188), Expect = 3e-12, Method: Composition-based stats.
Identities = 65/191 (34%), Positives = 96/191 (50%), Gaps = 23/191 (12%)
Query: 301 LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGN 360
LFNGKDL GW V + + V+D +VC K G L K +++F L +FK E N
Sbjct: 32 LFNGKDLDGW-VGAVKGYDVEDGAIVCNP---KVGGNLYYGKEFDNFVLRFEFKLEPGAN 87
Query: 361 SGIFIRSFVEEGAKVNGWQVEV---APKGFDTGGIYESYGRGWLIQIPDDRENFLKE-RE 416
+G+ IRS +E + NG ++++ + + T Y+++G + + +P R FLK E
Sbjct: 88 NGLAIRSPIEGNSAYNGIELQILDTEDERYKTIKPYQAHGSVYGV-VPAKR-GFLKPIGE 145
Query: 417 WNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHD------------GGGIK 464
WN + GNQV LNGE +VD D K + G + H G G K
Sbjct: 146 WNVQEVIADGNQVKVTLNGEVIVD-ADIKEASKDGTMDGNKHPGLLNEKGHIGFLGHGTK 204
Query: 465 VLWRNIRVKTL 475
V +RNIR+K +
Sbjct: 205 VSFRNIRIKPI 215
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 63/237 (26%), Positives = 111/237 (46%), Gaps = 29/237 (12%)
Query: 61 LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
L V+ L GAL + T + G+ LF+G+ L+GW A+ G ++V DGA
Sbjct: 7 LLVLTLIGALAGAA-------TLHAEDEGFTTLFNGKDLDGWVG----AVKG-YDVEDGA 54
Query: 121 IQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQ 180
I + + G + Y + ++NF L++++K+ G N+GL + P G E Q
Sbjct: 55 IVCNPK---VGGNLYYGKEFDNFVLRFEFKLEPGANNGL---AIRSPIEGNSAYNGIELQ 108
Query: 181 LID--DKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQ 238
++D D+ + + ++ +Q G Y + + +P GEWN +++ D V+ +NG+
Sbjct: 109 ILDTEDERY-KTIKPYQAHGSVYGVVPAKRGFL--KPIGEWNVQEVIADGNQVKVTLNGE 165
Query: 239 KTIEFEAWSDDWFQRKNSGKWENAPEYGLA-RKGLICLQDHGYPAWFRNIKIRELPR 294
++ D + G + GL KG I HG FRNI+I+ + +
Sbjct: 166 VIVD-----ADIKEASKDGTMDGNKHPGLLNEKGHIGFLGHGTKVSFRNIRIKPIAK 217
>gi|88712821|ref|ZP_01106906.1| hypothetical protein FB2170_09291 [Flavobacteriales bacterium
HTCC2170]
gi|88708719|gb|EAR00954.1| hypothetical protein FB2170_09291 [Flavobacteriales bacterium
HTCC2170]
Length = 222
Score = 75.5 bits (184), Expect = 7e-12, Method: Composition-based stats.
Identities = 70/237 (29%), Positives = 118/237 (49%), Gaps = 32/237 (13%)
Query: 61 LCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGA 120
L ++VLF T+C Q L ++ G+ LF+G+ L+GW N +E +DG
Sbjct: 13 LLIIVLF---TSCG--AQKGLDDD----GFVSLFNGENLDGWIGNNNS-----YEAIDGM 58
Query: 121 I--QADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPE 178
I +GEGS N Y V Y +F +++++++ G N+GL H + V Y+ G E
Sbjct: 59 IVVNPNGEGSGGNLYTVDQ--YSDFIFRFEFQLTPGANNGLGIH--SPLEGDVAYL-GKE 113
Query: 179 YQLIDDKG--FAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMN 236
Q++D+ +A+ L+ +Q G Y + +N P GEWNT +++ + +E +N
Sbjct: 114 LQILDNTADKYAD-LKPYQYHGSVYGIIPAKRGFLN--PVGEWNTQEVIVNGTKIEIRLN 170
Query: 237 GQKTIEFEAWSDDWFQRKNSGKWENAPEYGLAR-KGLICLQDHGYPAWFRNIKIREL 292
G ++ D+ + +G + GL R +G I HG FRNIKI+++
Sbjct: 171 GTTIVD-----GDFIEASKNGTMDKKEHPGLKRTEGHIGFLGHGDVVRFRNIKIKKI 222
>gi|149179308|ref|ZP_01857869.1| hypothetical protein PM8797T_11551 [Planctomyces maris DSM 8797]
gi|148841849|gb|EDL56251.1| hypothetical protein PM8797T_11551 [Planctomyces maris DSM 8797]
Length = 853
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 58/186 (31%), Positives = 104/186 (55%), Gaps = 14/186 (7%)
Query: 299 EELFNGKDLTGWDVYGTEQWY-VQDSLLVCESGPDK--QYGYLATCKYYNDFELTADFKQ 355
+ LFNGK GW +G ++ + ++D ++ S +K + +L + K Y+DFEL +FK
Sbjct: 673 QSLFNGKTFDGW--HGNKKIFRIEDGEIIAGSLTEKVERNEFLRSNKVYDDFELKLEFKL 730
Query: 356 EADG-NSGIFIRSF-VEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIP--DDRENF 411
D N+G+ IR+ + + +V+G+Q ++ G+ G +Y+ R ++ P + R+
Sbjct: 731 LGDKTNAGVQIRTAEIPDHHEVSGYQADLG-TGY-WGCLYDESRRKKILAGPPAELRDLP 788
Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQ--DEKIGAGQGRIALQIHDGGGIKVLWRN 469
++ +WN+ RIR G ++ W+N Q VD D +I +G IALQIH + +RN
Sbjct: 789 VRMNDWNSYRIRCEGPRIRIWINDVQTVDFTEADPQIPL-KGIIALQIHGNLVNEAHYRN 847
Query: 470 IRVKTL 475
+R++ L
Sbjct: 848 VRLREL 853
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 56/212 (26%), Positives = 99/212 (46%), Gaps = 29/212 (13%)
Query: 85 EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFE 144
EK G+Q LF+G+T +GW E++ G++ E + N ++ +++Y++FE
Sbjct: 667 EKELGFQSLFNGKTFDGWHGNKKIFRIEDGEIIAGSLT---EKVERNEFLRSNKVYDDFE 723
Query: 145 LQWDWK-ISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAM 203
L+ ++K + N+G+ E P + V+G YQ G+ L D R A
Sbjct: 724 LKLEFKLLGDKTNAGVQIRTAEIPDHH--EVSG--YQADLGTGYWGCLYDESRRKKILAG 779
Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
+ + VR +WN+ +I + + ++N +T++F E P
Sbjct: 780 PPAELRDLPVR-MNDWNSYRIRCEGPRIRIWINDVQTVDFT---------------EADP 823
Query: 264 EYGLARKGLICLQDHG---YPAWFRNIKIREL 292
+ L KG+I LQ HG A +RN+++REL
Sbjct: 824 QIPL--KGIIALQIHGNLVNEAHYRNVRLREL 853
>gi|32476109|ref|NP_869103.1| hypothetical protein RB9849 [Rhodopirellula baltica SH 1]
gi|32446653|emb|CAD76489.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 281
Score = 74.7 bits (182), Expect = 1e-11, Method: Composition-based stats.
Identities = 66/201 (32%), Positives = 92/201 (45%), Gaps = 31/201 (15%)
Query: 301 LFNGKDLTGWDVYGTEQ-WYVQDSLL---VCESGPDKQYGYLATCK-YYNDFELTADFKQ 355
LF+GK L GW G E W V D + + P KQ +L + FELT FK
Sbjct: 81 LFDGKTLDGWR--GREDLWSVDDGAIHGQTTDEAPIKQNTFLILDRPVKGSFELTLQFKM 138
Query: 356 EADGNSGIFIRSFV--EEGAKVNGWQVEVAPKGFDTGGIYESYGRGWL------------ 401
GNSGI RS V EE V G+Q ++ G +YE GRG L
Sbjct: 139 -IGGNSGIQYRSKVLDEEKFIVGGYQADIDATNRFAGILYEEKGRGILATRGQQTTIWAT 197
Query: 402 -------IQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIG--AGQGR 452
++ N + EWN RI V N + ++N M+ + D++ G A G
Sbjct: 198 GEKTTEQFATAEELANSIHLGEWNDYRILVRDNHLEQFINETLMIRLVDQQPGKKADSGV 257
Query: 453 IALQIHDGGGIKVLWRNIRVK 473
IALQ+H G +KV ++NI+++
Sbjct: 258 IALQLHQGPAMKVWFKNIQIR 278
>gi|32474668|ref|NP_867662.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
gi|32445207|emb|CAD75209.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
Length = 445
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 99/382 (25%), Positives = 157/382 (41%), Gaps = 54/382 (14%)
Query: 84 EEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENF 143
E GW LFDG TL GW G A + V D I AD EN + + ++
Sbjct: 83 ERTREGWIRLFDGHTLFGWV-IGGNA---NFRVEDETIVAD---QGENCLLTTSTQWSDY 135
Query: 144 ELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDYAM 203
EL+ ++ + NSG+ PQ VT Y++ A P + GV +
Sbjct: 136 ELELQFQCDEETNSGVFVRTTLDPQD----VTTDCYEV----NIAPPSNPFPTGGV---V 184
Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
T + P +W+T I+ + + ++G T E +
Sbjct: 185 QRTKGQTFDTDPE-KWHTMNILCNGSTLRVTIDGTVTCELD------------------- 224
Query: 264 EYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEELFNGKDLTGWDVY-GTEQWYV-- 320
+ G I LQ F++I++R L K L +G L GW V G E Y
Sbjct: 225 DATRPTTGYIGLQHRDGRVAFKDIQLRPLGLKNL---LADG--LEGWTVREGMEGEYRID 279
Query: 321 QDSLLVCESGPDKQYGYLATCKYYNDFELTADFK-QEADGNSGIFIRSFVEEGAKVNGWQ 379
D LV + G + L T + DF + AD+K + NSG+F R+ G ++ G++
Sbjct: 280 DDGHLVVDGGKQQ----LETKAVFGDFVMLADYKMDDPKSNSGLFFRAI--PGDEMMGYE 333
Query: 380 VEVAPKGFDTGGIYES-YGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQM 438
+V+ + D + + G G + + D R + + WN++ + GN TW+NG Q+
Sbjct: 334 CQVSNELIDGNPLQPADCGAGGIFRRQDARVVAGEPKRWNSILLVAEGNHFATWVNGLQV 393
Query: 439 VDIQDEKIGAGQGRIALQIHDG 460
DI D + R L++ G
Sbjct: 394 TDIVDTRKADENPRRGLRLEPG 415
>gi|149200234|ref|ZP_01877256.1| hypothetical protein LNTAR_02824 [Lentisphaera araneosa HTCC2155]
gi|149136676|gb|EDM25107.1| hypothetical protein LNTAR_02824 [Lentisphaera araneosa HTCC2155]
Length = 229
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 57/209 (27%), Positives = 104/209 (49%), Gaps = 20/209 (9%)
Query: 285 RNIKIRELPRKTEEEE--LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDK--QYGYLAT 340
+N+ P+ +E L L W V + W + D++++ ++G +K + +L T
Sbjct: 23 KNVNEASEPKVQSKESINLIADNSLKAWKV-TSSLWSISDNVIIGDTGKEKPQKPEWLYT 81
Query: 341 CKYYNDFELTADFKQEAD--GNSGIF--IRSFVEEGAK-------VNGWQVEVAPKGFDT 389
+ + DF T++FK NSGI+ ++ F+ + K +G++ +++ F
Sbjct: 82 KQKFGDFLFTSEFKLTGSIAPNSGIYYRVKPFIFDRIKKKGAFEVASGYEYDLSYNKF-L 140
Query: 390 GGIYESYGRGWLIQIPDDR--ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIG 447
G + + Y R L PD++ +K+ +WN IR N++ WLNG +++D D
Sbjct: 141 GSLGDWYARPSLRIFPDNKITAQLIKKNDWNRATIRAKSNRLEYWLNGVKIMDFIDHDPK 200
Query: 448 AGQ-GRIALQIHDGGGIKVLWRNIRVKTL 475
A Q G I LQIHDG +K+ +R + + L
Sbjct: 201 ASQSGVIGLQIHDGALMKIEFRKMHILPL 229
>gi|149177017|ref|ZP_01855625.1| hypothetical protein PM8797T_11786 [Planctomyces maris DSM 8797]
gi|148844082|gb|EDL58437.1| hypothetical protein PM8797T_11786 [Planctomyces maris DSM 8797]
Length = 235
Score = 72.8 bits (177), Expect = 5e-11, Method: Composition-based stats.
Identities = 65/208 (31%), Positives = 93/208 (44%), Gaps = 48/208 (23%)
Query: 301 LFNGKDLTGW-------------------------DVYGTEQWYVQDSLLVCESGPDKQY 335
LFNGKDLTGW D E W V D ++V D +
Sbjct: 42 LFNGKDLTGWKGLVGSPKTRAKMSPEELAEAQKKADDEMKEHWNVVDGVIVF----DGKG 97
Query: 336 GYLATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEV-APKGFDTGGIYE 394
L T K Y DF++ D+K + DG+SGI++R +V W V A G +GG+Y
Sbjct: 98 KSLCTAKDYGDFDMLVDWKIKKDGDSGIYLRG----SPQVQIWDPAVKAANGVGSGGLYN 153
Query: 395 SYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD-------IQDEKIG 447
+ + D EWNT RI++VG +V+ WLNG+ + D + +K
Sbjct: 154 NKKNPDKPLVTADN----PVGEWNTFRIKMVGEKVSVWLNGKLVTDNVTLENYWERDKPI 209
Query: 448 AGQGRIALQIHDGGGIKVLWRNIRVKTL 475
G+I LQ H G + +RN+ +K L
Sbjct: 210 YETGQIELQNH---GNTLYFRNVFIKEL 234
>gi|126648469|ref|ZP_01720956.1| hypothetical protein ALPR1_08673 [Algoriphagus sp. PR1]
gi|126575353|gb|EAZ79685.1| hypothetical protein ALPR1_08673 [Algoriphagus sp. PR1]
Length = 222
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 60/210 (28%), Positives = 103/210 (49%), Gaps = 20/210 (9%)
Query: 86 KAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFEL 145
K G++ LF+G+ L+GW N ++ + +G I D +G G + ++ Y NF L
Sbjct: 29 KEDGFKRLFNGENLDGWVG-NKES----YRAENGMIVIDPQGGG-GGNLYTEKEYGNFIL 82
Query: 146 QWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD--KGFAEPLEDWQRCGVDYAM 203
++++++ G N+GL H P G E Q++D+ K +AE LE +Q G Y +
Sbjct: 83 HFEFQLTPGANNGLGIHA---PLEGDAAYVGKELQILDNRAKKYAE-LEVYQYHGSVYGV 138
Query: 204 YLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAP 263
+N P GEWN ++ ++ ++ +NG+ ++ D+ + G ++
Sbjct: 139 IPARRGFLN--PVGEWNKQTVIVNHPKIQVILNGETILQ-----GDYLEASKEGTLDHKE 191
Query: 264 EYGLAR-KGLICLQDHGYPAWFRNIKIREL 292
GL R G I HG FRNI+I+EL
Sbjct: 192 HPGLERSSGHIGFLGHGDVVHFRNIRIKEL 221
>gi|150007795|ref|YP_001302538.1| hypothetical protein BDI_1153 [Parabacteroides distasonis ATCC 8503]
gi|149936219|gb|ABR42916.1| conserved hypothetical protein [Parabacteroides distasonis ATCC 8503]
Length = 1155
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 109/450 (24%), Positives = 184/450 (40%), Gaps = 102/450 (22%)
Query: 89 GWQLLFDGQTLEGWR----DYNGQALTGPWEVVDGAIQADG----EGSDENGYIVYD--- 137
G+ +F+G+ L GW+ + +A P ++ QAD + ENG +V+D
Sbjct: 744 GFVSIFNGKDLTGWKGLVENPIARAKMKPAQLAKAQEQADENMRRDWKVENGLLVFDGTG 803
Query: 138 -------RIYENFELQWDWKISKGG---NSGLLYHVVERPQYKVPYVTG-PEYQLIDDKG 186
+ Y +FE+ DW + G ++G+ Y+ G P+ Q+
Sbjct: 804 YDNLCTEKQYGDFEMYVDWMLDPKGPEADAGI-------------YLRGTPQVQI----- 845
Query: 187 FAEPLEDWQRCGVDYAMYLPDFATMN-----VRPA-------GEWNTSKIVFDNGHVEYY 234
W V+ + N +P+ GEWN+ I V
Sbjct: 846 -------WDTSRVNVGAQVGSGGLYNNQVNESKPSKVTDNKLGEWNSFYIKMVGDRVTVV 898
Query: 235 MNGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPR 294
+NG+K ++ ++++ RK P + + + I +Q HG ++RNI ++EL +
Sbjct: 899 LNGEKVVD-NVILENYWDRK-------LPIFPVEQ---IEMQAHGSKVYYRNIYVKELEK 947
Query: 295 K-----TEEEE------LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYG-YLATCK 342
+ + EEE LF+G ++ W T + ++D + P +G L T K
Sbjct: 948 QEPFKLSPEEEKEGFKVLFDGTNMHEW-TGNTVDYILEDGCI--SMVPSSSFGGNLYTKK 1004
Query: 343 YYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEV--APKGFDTGGIYESYGRGW 400
Y +F DF+ N+G+ IR+ +E A G +V+V G I G
Sbjct: 1005 EYGNFIYRFDFQLTPGANNGVGIRTPMEGDAAYVGMEVQVLDCEHPIYQGNITPLQHHGS 1064
Query: 401 LIQIPDDREN----FLKEREWNTMRIRVVGNQVTTWLNGEQMVD--IQDE-KIGAGQGRI 453
+ I RE+ F EWNT I G+ + +NG ++D I+D K G G+
Sbjct: 1065 VYGIIPAREDHPKAFKPVGEWNTEEIMADGDHIRVTVNGVVILDGNIRDAVKNGTPDGKE 1124
Query: 454 ALQIHD--------GGGIKVLWRNIRVKTL 475
+ + G G V +RNIR+K L
Sbjct: 1125 HPGLFNKKGHIGFLGHGSPVKFRNIRIKEL 1154
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 64/224 (28%), Positives = 98/224 (43%), Gaps = 57/224 (25%)
Query: 292 LPRKTEEEELFNGKDLTGW-------------------------DVYGTEQWYVQDSLLV 326
+P + +FNGKDLTGW D W V++ LLV
Sbjct: 739 MPDEVGFVSIFNGKDLTGWKGLVENPIARAKMKPAQLAKAQEQADENMRRDWKVENGLLV 798
Query: 327 CESGPDKQYGYLATCKYYNDFELTADFKQEADG---NSGIFIRSFVEEGAKVNGWQVEVA 383
+ Y L T K Y DFE+ D+ + G ++GI++R +V W
Sbjct: 799 FDG---TGYDNLCTEKQYGDFEMYVDWMLDPKGPEADAGIYLRG----TPQVQIWDTSRV 851
Query: 384 PKG--FDTGGIYESYGRGWLIQIPDDRENFLKER---EWNTMRIRVVGNQVTTWLNGEQM 438
G +GG+Y + Q+ + + + + + EWN+ I++VG++VT LNGE++
Sbjct: 852 NVGAQVGSGGLYNN-------QVNESKPSKVTDNKLGEWNSFYIKMVGDRVTVVLNGEKV 904
Query: 439 VD------IQDEKIGA-GQGRIALQIHDGGGIKVLWRNIRVKTL 475
VD D K+ +I +Q H G KV +RNI VK L
Sbjct: 905 VDNVILENYWDRKLPIFPVEQIEMQAH---GSKVYYRNIYVKEL 945
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 61/220 (27%), Positives = 99/220 (45%), Gaps = 26/220 (11%)
Query: 81 LTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIY 140
L+ EE+ G+++LFDG + W T + + DG I S G + + Y
Sbjct: 953 LSPEEEKEGFKVLFDGTNMHEW-----TGNTVDYILEDGCISMV-PSSSFGGNLYTKKEY 1006
Query: 141 ENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLID-----DKGFAEPLEDWQ 195
NF ++D++++ G N+G+ + P G E Q++D +G PL Q
Sbjct: 1007 GNFIYRFDFQLTPGANNGV---GIRTPMEGDAAYVGMEVQVLDCEHPIYQGNITPL---Q 1060
Query: 196 RCGVDYAMYLP--DFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQR 253
G Y + +P + +P GEWNT +I+ D H+ +NG ++ +
Sbjct: 1061 HHGSVYGI-IPAREDHPKAFKPVGEWNTEEIMADGDHIRVTVNGVVILD-----GNIRDA 1114
Query: 254 KNSGKWENAPEYGL-ARKGLICLQDHGYPAWFRNIKIREL 292
+G + GL +KG I HG P FRNI+I+EL
Sbjct: 1115 VKNGTPDGKEHPGLFNKKGHIGFLGHGSPVKFRNIRIKEL 1154
>gi|154490163|ref|ZP_02030424.1| hypothetical protein PARMER_00395 [Parabacteroides merdae ATCC
43184]
gi|154089055|gb|EDN88099.1| hypothetical protein PARMER_00395 [Parabacteroides merdae ATCC
43184]
Length = 1150
Score = 69.3 bits (168), Expect = 6e-10, Method: Composition-based stats.
Identities = 70/237 (29%), Positives = 101/237 (42%), Gaps = 54/237 (22%)
Query: 277 DHGYPAWFRNIKIRELPRKTEEEELFNGKDLTGW-------------------------D 311
D GY + E+P+ LFNGKDLTGW D
Sbjct: 721 DAGYQREAIKKHLAEMPQGEGFVSLFNGKDLTGWKGLVQNPIARAKMKPGQLAKEQAKAD 780
Query: 312 VYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG---NSGIFIRSF 368
+ W V+D +L+ D L T K Y DFE+ D+ + G ++GI++R
Sbjct: 781 EVMRKGWSVEDGMLIFNGKGDN----LCTEKQYGDFEMYVDWMLDPAGPEADAGIYLRG- 835
Query: 369 VEEGAKVNGWQVEVAPKG--FDTGGIYES-YGRGWLIQIPDDRENFLKEREWNTMRIRVV 425
+V W G +GG+Y + ++ D+ K EWN+ I++V
Sbjct: 836 ---TPQVQIWDTSRVNVGAQVGSGGLYNNQMNESKPTKVADN-----KLGEWNSFYIKMV 887
Query: 426 GNQVTTWLNGEQMVD------IQDEKIGA-GQGRIALQIHDGGGIKVLWRNIRVKTL 475
G++VT LNGE++VD D K+ +I LQ H G KV +RNI VK L
Sbjct: 888 GDRVTVVLNGEKVVDDVILENYWDRKLPIFPVEQIELQAH---GSKVYYRNIYVKEL 941
Score = 68.9 bits (167), Expect = 7e-10, Method: Composition-based stats.
Identities = 106/453 (23%), Positives = 182/453 (40%), Gaps = 94/453 (20%)
Query: 81 LTEEEKAAGWQLLFDGQTLEGWRDY---------------------NGQALTGPWEVVDG 119
L E + G+ LF+G+ L GW+ + + W V DG
Sbjct: 733 LAEMPQGEGFVSLFNGKDLTGWKGLVQNPIARAKMKPGQLAKEQAKADEVMRKGWSVEDG 792
Query: 120 AIQADGEGSDENGYIVYDRIYENFELQWDWKISKGG---NSGLLYHVVERPQYKVPYVTG 176
+ +G+G + + ++ Y +FE+ DW + G ++G+ Y+ G
Sbjct: 793 MLIFNGKGDN----LCTEKQYGDFEMYVDWMLDPAGPEADAGI-------------YLRG 835
Query: 177 -PEYQLIDDKGFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYM 235
P+ Q+ D + + M + GEWN+ I V +
Sbjct: 836 TPQVQIWDTSRVNVGAQVGSGGLYNNQMNESKPTKVADNKLGEWNSFYIKMVGDRVTVVL 895
Query: 236 NGQKTIEFEAWSDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRK 295
NG+K ++ + ++++ RK P + + + I LQ HG ++RNI ++EL RK
Sbjct: 896 NGEKVVD-DVILENYWDRK-------LPIFPVEQ---IELQAHGSKVYYRNIYVKELERK 944
Query: 296 ------TEEEE-----LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQY-GYLATCKY 343
EEE+ LF+G ++ W T + ++D + P K Y G L T
Sbjct: 945 EPFKLSAEEEKEGFKVLFDGTNMHEW-TGNTVDYTLEDGCI--SMIPSKSYGGNLYTKDE 1001
Query: 344 YNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVA----PKGFDTGGIYESYGRG 399
Y +F +F+ N+G+ IR+ +E A G ++++ P D + + +G
Sbjct: 1002 YGNFVYRFEFQLTPGANNGVGIRTPMEGDAAYVGMEIQILDCEHPIYKDITPL-QHHGSV 1060
Query: 400 WLIQIPDDREN---FLKEREWNTMRIRVVGNQVTTWLNGEQMVD----------IQDEKI 446
+ I IP E+ F EWN I G+ + +NG +++ D K
Sbjct: 1061 YGI-IPAKAEHHSAFKPAGEWNYEEIVANGDNIKVTVNGVVIMEGNIREATKNGTADHKE 1119
Query: 447 GAG----QGRIALQIHDGGGIKVLWRNIRVKTL 475
G +G I H G V +RNIR+K L
Sbjct: 1120 HPGLFNKKGHIGFLGH---GSPVKFRNIRIKEL 1149
>gi|88713612|ref|ZP_01107694.1| probable large multifunctional protein-putative glycosyl hydrolase
[Flavobacteriales bacterium HTCC2170]
gi|88708122|gb|EAR00360.1| probable large multifunctional protein-putative glycosyl hydrolase
[Flavobacteriales bacterium HTCC2170]
Length = 310
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 15/151 (9%)
Query: 293 PRKTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTAD 352
P E + LFNGK+L GW G QW V+D +L E K L + + +NDF+L
Sbjct: 135 PVWGESKALFNGKNLDGWQAMGVNQWMVKDGILTSE----KSGANLVSSEKFNDFKLHIV 190
Query: 353 FKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFL 412
F+ NSG+++R E N + + P GGIY ++ P
Sbjct: 191 FRYPEGSNSGVYLRGRYEVQIADN---IGLEPSSILFGGIYGFLTPNEMVAKPAG----- 242
Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMVDIQD 443
EW I ++G +VT NG++++ Q+
Sbjct: 243 ---EWQEYDITLIGRRVTIIANGKEIITNQN 270
>gi|116626562|ref|YP_828718.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116229724|gb|ABJ88433.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 309
Score = 66.2 bits (160), Expect = 5e-09, Method: Composition-based stats.
Identities = 59/196 (30%), Positives = 92/196 (46%), Gaps = 29/196 (14%)
Query: 291 ELPRK-TEEEELFNGKDLTGW---DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYND 346
E P+ T E +FNGKDLTGW D T W V+ LV ES + L T + ++D
Sbjct: 126 EAPKAWTAPEPIFNGKDLTGWEPTDPAATNHWVVKGGELVNES----KGANLKTTRKFDD 181
Query: 347 FELTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPD 406
F+L ++ DGNSGI++R E +V +V+ K G +Y ++++P
Sbjct: 182 FKLHIEYNCPDDGNSGIYLRGRYE--VQVEYEKVDANDKFHSIGAVYSMLAP--VVELPR 237
Query: 407 DRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQD---------EKIGAGQGRIALQI 457
K W T I +VG ++T +G + +D Q+ + A G +Q
Sbjct: 238 ------KPGTWETFDITLVGRRLTVVRDGVKTIDNQEIAGTTGGALDSNEAEPGPFYIQG 291
Query: 458 HDGGGIKVLWRNIRVK 473
GG+K +RNI ++
Sbjct: 292 DHTGGMK--YRNITIQ 305
>gi|116619831|ref|YP_821987.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116222993|gb|ABJ81702.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 212
Score = 65.5 bits (158), Expect = 7e-09, Method: Composition-based stats.
Identities = 66/210 (31%), Positives = 100/210 (47%), Gaps = 22/210 (10%)
Query: 88 AGWQLLFDGQTLEGWRDYNGQALTGPWEVV-DGAIQADGEGSDENGYIVYDRIYENFELQ 146
AG+ LFDG+TL GW+ G GP VV +G I +G G + ++ + NF +
Sbjct: 21 AGFTPLFDGKTLNGWKLVGGH---GPGYVVQEGKIVCPADGG---GNLFTEKEFGNFAFR 74
Query: 147 WDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDD--KGFAEPLEDWQRCGVDYAMY 204
+++K++ G N+G+ + P G E Q++DD K + + Q G Y +
Sbjct: 75 FEFKLTPGANNGI---GIRAPYEGDAAYQGMEIQILDDGDKVYQGKIRPEQYHGSVYDV- 130
Query: 205 LPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENAPE 264
+P T +P GEWN +IV D ++ +NG I +A D K + P
Sbjct: 131 IPA-RTGYRKPVGEWNEEEIVADGRRIKVTLNG--VIILDA---DLSIVKEPQVLAHHP- 183
Query: 265 YGLARK-GLICLQDHGYPAWFRNIKIRELP 293
GLAR G I HG FRNI+++ LP
Sbjct: 184 -GLARTAGHIGFLGHGSLVEFRNIRVKPLP 212
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 58/189 (30%), Positives = 85/189 (44%), Gaps = 17/189 (8%)
Query: 301 LFNGKDLTGWDVYGTEQ--WYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
LF+GK L GW + G + VQ+ +VC P G L T K + +F +FK
Sbjct: 26 LFDGKTLNGWKLVGGHGPGYVVQEGKIVC---PADGGGNLFTEKEFGNFAFRFEFKLTPG 82
Query: 359 GNSGIFIRSFVEEGAKVNGWQVEVAPKGFDT--GGIYESYGRGWLIQIPDDRENFLKE-R 415
N+GI IR+ E A G ++++ G G I G + + R + K
Sbjct: 83 ANNGIGIRAPYEGDAAYQGMEIQILDDGDKVYQGKIRPEQYHGSVYDVIPARTGYRKPVG 142
Query: 416 EWNTMRIRVVGNQVTTWLNGEQMVD-----IQDEKIGA---GQGRIALQI-HDGGGIKVL 466
EWN I G ++ LNG ++D +++ ++ A G R A I G G V
Sbjct: 143 EWNEEEIVADGRRIKVTLNGVIILDADLSIVKEPQVLAHHPGLARTAGHIGFLGHGSLVE 202
Query: 467 WRNIRVKTL 475
+RNIRVK L
Sbjct: 203 FRNIRVKPL 211
>gi|87310208|ref|ZP_01092340.1| hypothetical protein DSM3645_14090 [Blastopirellula marina DSM
3645]
gi|87287198|gb|EAQ79100.1| hypothetical protein DSM3645_14090 [Blastopirellula marina DSM
3645]
Length = 505
Score = 65.5 bits (158), Expect = 9e-09, Method: Composition-based stats.
Identities = 62/210 (29%), Positives = 95/210 (45%), Gaps = 51/210 (24%)
Query: 299 EELFNGKDLTGW-------------------------DVYGTEQWYVQDSLLVCESGPDK 333
++LFNGKDL+GW D E W ++D +L D
Sbjct: 33 QKLFNGKDLSGWKGLVASPPKRAEMSAEALAAEQEKADASMREHWTIEDGVLTY----DG 88
Query: 334 QYGYLATCKYYNDFELTADFKQEADGNSGIFIRSFVE-EGAKVNGWQVEVAPKGFDTGGI 392
+ L T K Y DFE+ D+K + DG+SGI++R + + N W+V +GG+
Sbjct: 89 KGQSLCTDKDYADFEMYVDWKIKDDGDSGIYVRGSPQIQIWDPNHWKV-------GSGGL 141
Query: 393 YESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD-------IQDEK 445
Y + I D EWNTM +R++G++VT LNG+ + D + +K
Sbjct: 142 YNNKKNPSAPTIIADN----PIGEWNTMYVRMIGDRVTVKLNGKLVTDNVVLENYWERDK 197
Query: 446 IGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
G+I LQ H G + +RNI ++ L
Sbjct: 198 PIYPTGQIELQHH---GNTLWFRNIFIREL 224
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 63/235 (26%), Positives = 97/235 (41%), Gaps = 54/235 (22%)
Query: 89 GWQLLFDGQTLEGWR----------DYNGQALTGP-----------WEVVDGAIQADGEG 127
G+Q LF+G+ L GW+ + + +AL W + DG + DG+G
Sbjct: 31 GFQKLFNGKDLSGWKGLVASPPKRAEMSAEALAAEQEKADASMREHWTIEDGVLTYDGKG 90
Query: 128 SDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGF 187
+ D+ Y +FE+ DWKI G+SG+ +V PQ ++ P + + G
Sbjct: 91 QS----LCTDKDYADFEMYVDWKIKDDGDSGI--YVRGSPQIQI---WDPNHWKVGSGGL 141
Query: 188 AEPLEDWQRCGVDYAMYLPDFATMNV-RPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAW 246
++ P T+ P GEWNT + V +NG+ +
Sbjct: 142 YNNKKN------------PSAPTIIADNPIGEWNTMYVRMIGDRVTVKLNGKLVTDNVVL 189
Query: 247 SDDWFQRKNSGKWENAPEYGLARKGLICLQDHGYPAWFRNIKIRELPRKTEEEEL 301
+ W + K P Y G I LQ HG WFRNI IREL + E +++
Sbjct: 190 ENYWERDK--------PIY---PTGQIELQHHGNTLWFRNIFIRELAKPDELKKI 233
>gi|87309043|ref|ZP_01091181.1| hypothetical protein DSM3645_19838 [Blastopirellula marina DSM
3645]
gi|87288386|gb|EAQ80282.1| hypothetical protein DSM3645_19838 [Blastopirellula marina DSM
3645]
Length = 208
Score = 65.5 bits (158), Expect = 9e-09, Method: Composition-based stats.
Identities = 59/188 (31%), Positives = 99/188 (52%), Gaps = 18/188 (9%)
Query: 301 LFNGKDLTGWDVYGTEQWY-VQDSLLVC----ESGPDKQYGYLATCKYYNDFELTADFKQ 355
+F+G+ L GW+ G +W+ V D +V ++ P+ ++ L T + Y DFELT + K
Sbjct: 26 IFDGETLEGWE--GKSEWFHVADGAVVAGSLEKAIPNNEF--LCTKEEYGDFELTLEAKL 81
Query: 356 EADG-NSGIFIRS-FVEEGAKVNGWQVEVA--PKGFDTGGIY-ESYGRGWLIQIP-DDRE 409
G N+G+ RS + +V G+Q ++ P G +Y ES + ++ + P ++
Sbjct: 82 VGQGTNAGVQFRSQRIPNHHEVIGYQCDMGSTPVRLIWGSLYDESRRKIFVAEGPAEEVA 141
Query: 410 NFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQ--DEKIGAGQGRIALQIHDGGGIKVLW 467
+K EWN ++IR G ++ W+N Q VD D+ I A G I LQIH G + +
Sbjct: 142 KTVKRGEWNRLKIRCQGAKIQIWVNDLQTVDYTEADDAI-ARTGIIGLQIHSGPAAEASY 200
Query: 468 RNIRVKTL 475
R +++K L
Sbjct: 201 RKLQLKKL 208
>gi|150007144|ref|YP_001301887.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
ATCC 8503]
gi|149935568|gb|ABR42265.1| putative secreted glycosylhydrolase [Parabacteroides distasonis
ATCC 8503]
Length = 222
Score = 65.5 bits (158), Expect = 9e-09, Method: Composition-based stats.
Identities = 76/250 (30%), Positives = 106/250 (42%), Gaps = 41/250 (16%)
Query: 53 FMMKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTG 112
F++ CV F + A + K W+ LF G+ LE +YN +
Sbjct: 4 FLLAMATLWCVASTFSSFAADNNK-------------WKPLF-GKNLEN-ANYNPEV--- 45
Query: 113 PWEVVDGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVP 172
W DG + G DE+ I YENFEL D+K G NSG++ + + + +P
Sbjct: 46 -WSETDGVL---GAVKDES--IWTKDEYENFELDLDFKTDVGTNSGVVVYCTDTKDW-IP 98
Query: 173 YVTGPEYQLIDDK----GFAEPLEDWQRCGVDYAMYLPDFATMNVRPAGEWNTSKIVFDN 228
E Q+ DD G +P +++CG Y +L V+ GEWN +I
Sbjct: 99 --NSVEIQIADDHCEKWGNGKP---YEKCGAIYG-HLGAVQDKVVKKPGEWNHMRIKCAG 152
Query: 229 GHVEYYMNGQKTIEFE--AWSDDWFQRKNSG--KWENAPEYGLARKGLICLQ-DHGYP-A 282
H+ +NG+K E + W+ S W P L KG I LQ HG
Sbjct: 153 QHIMVILNGKKVTEMDMSKWTSGTKNPDGSDIPSWLPKPFAELPTKGFIGLQGKHGDSLI 212
Query: 283 WFRNIKIREL 292
WFRNIKIR L
Sbjct: 213 WFRNIKIRSL 222
>gi|146280883|ref|YP_001171036.1| hypothetical protein PST_0488 [Pseudomonas stutzeri A1501]
gi|145569088|gb|ABP78194.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
Length = 194
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 45/155 (29%), Positives = 73/155 (47%), Gaps = 8/155 (5%)
Query: 140 YENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGV 199
Y NF L++++ + GGNSG+ Y V E + + + +GPE QL+DD + E R G
Sbjct: 45 YANFILRFEYALPVGGNSGVFYRVDEAAE--LAWHSGPEMQLLDDAVHPDGAEPTTRNGA 102
Query: 200 DYAMYLPDFATMNVRP--AGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSG 257
Y + AT P G + +V + VE++++ + + + D + +
Sbjct: 103 LYGLR----ATQQETPIEPGSFMEGALVVRDADVEHWLDNRMVLSYRLDDPDLRMQIRTS 158
Query: 258 KWENAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
K+ + P Y A G I LQ HG FR + I L
Sbjct: 159 KFADKPLYAQATAGHIVLQHHGEAVRFRRLSIEPL 193
>gi|32473471|ref|NP_866465.1| hypothetical protein-transmembrane region and signal peptide
prediction [Rhodopirellula baltica SH 1]
gi|32398151|emb|CAD78246.1| hypothetical protein-transmembrane region and signal peptide
prediction [Rhodopirellula baltica SH 1]
Length = 263
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 64/219 (29%), Positives = 107/219 (48%), Gaps = 38/219 (17%)
Query: 293 PRKTEEEELFNGKDLTGWDVYGTEQ-WYVQDSLLVCESGPDKQYGYLATCKYYN----DF 347
P ++ +FNGKDLTGW G ++ W V+D ++ E+ P+ + + + +F
Sbjct: 46 PAESGLTSIFNGKDLTGWS--GDDRLWSVRDGVIHGETTPENKANGNTFLIWEDGNTKNF 103
Query: 348 ELTADFKQEADGNSGIFIRS--FVEEGAK----VNGWQVEVAPK-GFD--TGGIYESYGR 398
E+ F+ A NSGI RS ++ A+ V G+Q E+ + F G IY+ GR
Sbjct: 104 EVRLSFRCNATNNSGIQYRSKHITDKSARNEWVVRGYQHELRNEMNFPNIAGFIYDEGGR 163
Query: 399 GWLIQIPDDR-----------ENFLKERE---------WNTMRIRVVGNQVTTWLNGEQM 438
I + ++ ENF+ E E WN + I GN + +LNG+ +
Sbjct: 164 RGRICMVGEKAAWKDGKKQVLENFMDEAEFQKLFQLDGWNEVVIIGKGNHIQHFLNGKLI 223
Query: 439 VDIQDEK--IGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
+D DE+ + G++ALQ+H G + +++IR K+L
Sbjct: 224 LDFTDEQPELKLLDGKLALQLHAGKPMWAEFKDIRFKSL 262
>gi|149177681|ref|ZP_01856282.1| sulfatase [Planctomyces maris DSM 8797]
gi|148843499|gb|EDL57861.1| sulfatase [Planctomyces maris DSM 8797]
Length = 664
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 60/189 (31%), Positives = 87/189 (46%), Gaps = 22/189 (11%)
Query: 301 LFNGKDLTGWDV--YGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
LFNG DL+GW + E ++V+ LVC P G+L T K Y+DF L DFK
Sbjct: 480 LFNGHDLSGWTLKRANREGYHVEAGKLVC---PADGGGFLFTEKEYSDFSLKFDFKLTKA 536
Query: 359 GNSGIFIR-SFVEEGAKVNGWQVEVAP-KGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
N+GI IR V++ G +++V KG+ Y IP R E
Sbjct: 537 ANNGIAIRCPLVDQKPAYEGMEIQVLDNKGYPKKLKPTQYHGSVYDVIPAKRGALKPVGE 596
Query: 417 WNTMRIRVVGNQVTTWLNG-----EQMVDIQDEKIGAG-------QGRIALQIHDGGGIK 464
WN I G+++T +N + +I+DE++ A +G I L H G
Sbjct: 597 WNHEEIICRGSKITVIVNDIPVLQTDLSEIKDEQVLAKHPGLKNQRGHIGLLGH---GSH 653
Query: 465 VLWRNIRVK 473
V ++NIR+K
Sbjct: 654 VEYQNIRIK 662
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 55/212 (25%), Positives = 100/212 (47%), Gaps = 23/212 (10%)
Query: 85 EKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYDRIYENFE 144
E+ A ++ LF+G L GW +A + V G + +G G++ ++ Y +F
Sbjct: 472 EQTADYKPLFNGHDLSGWT--LKRANREGYHVEAGKLVCPADGG---GFLFTEKEYSDFS 526
Query: 145 LQWDWKISKGGNSGLLYH---VVERPQYKVPYVTGPEYQLIDDKGFAEPLEDWQRCGVDY 201
L++D+K++K N+G+ V ++P Y+ G E Q++D+KG+ + L+ Q G Y
Sbjct: 527 LKFDFKLTKAANNGIAIRCPLVDQKPAYE-----GMEIQVLDNKGYPKKLKPTQYHGSVY 581
Query: 202 AMYLPDFATMNVRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWEN 261
+ + +P GEWN +I+ + +N + D + K+
Sbjct: 582 DVIPAKRGAL--KPVGEWNHEEIICRGSKITVIVN-----DIPVLQTDLSEIKDEQVLAK 634
Query: 262 APEYGLA-RKGLICLQDHGYPAWFRNIKIREL 292
P GL ++G I L HG ++NI+I+E
Sbjct: 635 HP--GLKNQRGHIGLLGHGSHVEYQNIRIKEF 664
>gi|148253179|ref|YP_001237764.1| putative exported protein of unknown function [Bradyrhizobium sp.
BTAi1]
gi|146405352|gb|ABQ33858.1| putative exported protein of unknown function [Bradyrhizobium sp.
BTAi1]
Length = 193
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 51/165 (30%), Positives = 78/165 (47%), Gaps = 15/165 (9%)
Query: 314 GTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGNSGIFIRSFVEEGA 373
G W +D L + K YL T Y DF++ A+F + NSGIFIR ++
Sbjct: 41 GKANWAAKDGALSADKLDGKDPSYLVTKTSYKDFQIKAEFWVDDAANSGIFIR--CDQSD 98
Query: 374 KVNG---WQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVT 430
K++ ++V + K D YG G ++ + +WNT I G ++T
Sbjct: 99 KIDSNICYEVNIFDKRPDP-----KYGTGAIVDVAKVDPMPKAGGKWNTYEITAKGTRLT 153
Query: 431 TWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
LNGE+ D+ D K AG IALQ G G+ V ++ +++K L
Sbjct: 154 VILNGEKTADVDDSKHAAGP--IALQY--GSGV-VKFKKVQIKPL 193
>gi|117164776|emb|CAJ88325.1| putative secreted glycosyl hydrolase [Streptomyces ambofaciens ATCC
23877]
Length = 592
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 57/182 (31%), Positives = 83/182 (45%), Gaps = 17/182 (9%)
Query: 301 LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGN 360
LF+G TGW G + + D L G + A ++ D+ L D+K D N
Sbjct: 284 LFDGSGTTGWQQAGPGGFTLADGTLTSHGGLGMLW--YAAEEFTGDYSLKLDWKAAGDDN 341
Query: 361 SGIFIRSFVEEG----AKVNGWQVEV-APKGFD--TGGIYESYGRGWLIQIPDDRENFLK 413
SG+F+ F G A NG+++++ A D TG +Y +P
Sbjct: 342 SGVFV-GFPASGDPWSAVNNGYEIQIDATDAADRTTGAVYGFRS----ADLPARDAALNP 396
Query: 414 EREWNTMRIRVVGNQVTTWLNGEQMVDI--QDEKIGAGQGRIALQIHDGGGIKVLWRNIR 471
EWNT +RV G ++ +LNG ++ D D QG I LQ H G G +V +R+IR
Sbjct: 397 PGEWNTYELRVTGERLEIFLNGSKINDFTNTDPARSLRQGHIGLQNH-GDGDEVAFRDIR 455
Query: 472 VK 473
VK
Sbjct: 456 VK 457
>gi|116625196|ref|YP_827352.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116228358|gb|ABJ87067.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 323
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 49/152 (32%), Positives = 72/152 (47%), Gaps = 19/152 (12%)
Query: 296 TEEEELFNGKDLTGWDVY---GTEQWYVQDSLLVCESGPDKQYG-YLATCKYYNDFELTA 351
T+ E LFNGKDLTGW+ + W QD +LV D +G L T + +NDF+L
Sbjct: 146 TDPEPLFNGKDLTGWEAFPAGAVNHWVAQDGVLV-----DTDHGASLKTTRTFNDFKLHI 200
Query: 352 DFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENF 411
+F GNSGI++R + +V + V K D G +Y G+ + E
Sbjct: 201 EFNCPDGGNSGIYLRG--RDEIQVAYEKPGVEDKFHDMGAVY-----GF---VAPTAEVP 250
Query: 412 LKEREWNTMRIRVVGNQVTTWLNGEQMVDIQD 443
W + + ++G VT NG + VD Q+
Sbjct: 251 RTPGTWESFDVTLIGRYVTIVRNGVKTVDNQE 282
>gi|146342960|ref|YP_001208008.1| hypothetical protein BRADO6143 [Bradyrhizobium sp. ORS278]
gi|146195766|emb|CAL79793.1| hypothetical protein [Bradyrhizobium sp. ORS278]
Length = 205
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 53/178 (29%), Positives = 82/178 (46%), Gaps = 15/178 (8%)
Query: 301 LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADGN 360
L +G + G W +D L+ + K YL T Y DF++ A+F + N
Sbjct: 40 LVDGDKKVEFTEVGKANWEAKDGALMADKLDGKDPSYLVTKTSYKDFQIKAEFWVDDAAN 99
Query: 361 SGIFIRSFVEEGAKVNG---WQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREW 417
SGIFIR ++ K++ ++V + K D YG G ++ + +W
Sbjct: 100 SGIFIR--CDQADKIDSNICYEVNIFDKRPDP-----KYGTGAIVDVSKVDPMPKAGGKW 152
Query: 418 NTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
NT I G ++T NGE+ D+ D K AG IALQ G GI V ++ +++K L
Sbjct: 153 NTYEITAKGTRLTVIFNGEKTADVDDSKHAAGP--IALQY--GSGI-VKFKKVQIKPL 205
>gi|88713788|ref|ZP_01107869.1| hypothetical protein FB2170_00770 [Flavobacteriales bacterium
HTCC2170]
gi|88707915|gb|EAR00154.1| hypothetical protein FB2170_00770 [Flavobacteriales bacterium
HTCC2170]
Length = 199
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 27/196 (13%)
Query: 283 WFRNIKIRELPRKTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCK 342
+F N K EL +K +E +G D T W+ ++ + ++ G G++ T K
Sbjct: 23 FFNNAKPAELFKKNSKECFISG-DAT-WN-------FIDNEIVGIAKGAS---GFIMTKK 70
Query: 343 YYNDFELTADFKQEADGNSGIFIRSFVEEGAKVNGWQ---VEVAPKGFDTGGIYESYGRG 399
Y +F L +FK ++ NSGIFIR +E + V+ ++ ++ P + G + +
Sbjct: 71 SYKNFILELEFKPDSTVNSGIFIRCKNKELSMVDCYENNIWDLHPNQENRTGAVVNRSKP 130
Query: 400 WLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIHD 459
+ D+ WNT +I++ N + TW+NGE + D+ D + +G IALQ +
Sbjct: 131 LIYVNTLDK--------WNTYKIKIEKNHLQTWVNGELITDLHDNDL--SEGMIALQAAE 180
Query: 460 GGGIKVLWRNIRVKTL 475
G I+ +RNI+ + L
Sbjct: 181 TGEIR--FRNIKFQNL 194
>gi|126648719|ref|ZP_01721203.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
gi|126575170|gb|EAZ79520.1| probable secreted glycosyl hydrolase [Algoriphagus sp. PR1]
Length = 207
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 56/191 (29%), Positives = 91/191 (47%), Gaps = 26/191 (13%)
Query: 300 ELFNGKDLTGWDVY-GTEQWYVQDSLLVCESGPDKQYGYLATC--KYYNDFELTADFKQE 356
ELFNG++ GW + + + ++D +L +GP Y +N+FEL K
Sbjct: 26 ELFNGQNFEGWKISENPDSFSIEDGMLKV-NGPRGHMFYEGEVGDHDFNNFELEVTLKTL 84
Query: 357 ADGNSGIFIRS-FVEEGAKVNGWQVEVAPKGFD---TGGIYESYGRGWLIQIPDDRENFL 412
+ NSGIFI + + E G G +++V D TG +Y D R+ F+
Sbjct: 85 PEANSGIFIHTKYQERGWPNIGHEIQVNQSHGDWRKTGSVY---------SFKDVRDTFV 135
Query: 413 KEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAG--------QGRIALQIHDGGGIK 464
++ EW I V G++VT +NGE + + + K G G IALQ HD +
Sbjct: 136 EDGEWYKETIIVQGDKVTVKVNGEVINEYDETKDREGDLGTKKLDHGTIALQAHDPNSV- 194
Query: 465 VLWRNIRVKTL 475
V ++++++K L
Sbjct: 195 VYYKSVKIKIL 205
>gi|32471313|ref|NP_864306.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
gi|32443154|emb|CAD71985.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
Length = 504
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 34/212 (16%)
Query: 293 PRKTEE--EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKY--YNDFE 348
PR T E L G W+ + W +++ +L+ + + T K DFE
Sbjct: 110 PRDTPAGAESLLEGNLADSWE-GSLDDWTLENGVLIGTTDGSVKVNRFITSKIAPVEDFE 168
Query: 349 LTADFKQEADGNSGIFIRSFVEEGAKVN---GWQVEV---APKGFDTGGIYESYGR---- 398
L D A GNSGI RS V E N G+Q +V PK G +YE GR
Sbjct: 169 LEVDVWVSARGNSGIQYRSEVREDLGPNVMVGYQCDVVAATPKY--NGMLYEERGRRILC 226
Query: 399 -------------GWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD--IQD 443
GW+++ + F E W+ ++RVVGN W++G++ + D
Sbjct: 227 HTAEKVVTDADGQGWVVE-SSEPPKFAPE-AWHRYKVRVVGNHHQHWIDGQKTAEHFDLD 284
Query: 444 EKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
+ GRI +Q+H G +++ +RN VK L
Sbjct: 285 PNGRSLSGRIGVQVHVGPPMEIRYRNFFVKRL 316
>gi|116626769|ref|YP_828925.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116229931|gb|ABJ88640.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 252
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 63/205 (30%), Positives = 91/205 (44%), Gaps = 30/205 (14%)
Query: 299 EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYN----DFELTADFK 354
+ +F+GK L GWD W V+ LV ++ +KQ + DFEL FK
Sbjct: 50 QPIFDGKSLAGWD-GDPGFWRVEGGALVGQTSTEKQPAQNTFLIWRGGSPADFELKLQFK 108
Query: 355 QEADGNSGIFIRSFVEEGAK--VNGWQVEVAPKGFDTGGIYESYGRGWLIQ------IP- 405
NSGI RS K + G+Q ++ TG IYE GRG+L IP
Sbjct: 109 LTG-FNSGIQFRSIELPDIKWAMKGYQADMDGVQQYTGQIYEERGRGFLAMRGQFSYIPQ 167
Query: 406 -------------DDRENFLKEREWNTMRIRVVGNQVTTWLNGE-QMVDIQDEKIGAG-Q 450
++ + +K +WN + + GN + LNG + I D+K G
Sbjct: 168 GGKPGLVGSVGDSNELKALIKGEDWNDLHLIARGNTIVQLLNGRITSMLIDDDKEGRKMD 227
Query: 451 GRIALQIHDGGGIKVLWRNIRVKTL 475
G I +Q+H G +K+ RNIR+K L
Sbjct: 228 GLIGIQVHKGPPMKIEVRNIRLKKL 252
>gi|109897209|ref|YP_660464.1| protein of unknown function DUF1080 [Pseudoalteromonas atlantica
T6c]
gi|109699490|gb|ABG39410.1| protein of unknown function DUF1080 [Pseudoalteromonas atlantica
T6c]
Length = 259
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 65/272 (23%), Positives = 119/272 (43%), Gaps = 47/272 (17%)
Query: 55 MKCMNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGW-----RDYNGQA 109
M + + + L G L++C+ + +GW LF+G+ L+GW G
Sbjct: 1 MTLVKLILIGTLLGILSSCTH-------QRPVKSGWHTLFNGENLDGWTVKIHHHEVGDN 53
Query: 110 LTGPWEVVDGAIQADGEGSD----ENGYIVYDRIYENFELQWDWKISKG----------G 155
+ + V +G ++ + D + G++ +++ + NF L+ D+
Sbjct: 54 VDDTFRVENGLLRVSYDQYDTFDKQFGHLYFNQPFSNFHLKLDYHFYGDFLSDAPHYAER 113
Query: 156 NSGLLYHVVERPQYKVPYVTGP---EYQLID--DKGFAEPLEDWQRCGVDY----AMYLP 206
NSGL+Y+ + P + P E Q + D G A P + G D A+Y
Sbjct: 114 NSGLMYYS-QAPDTILKEQDWPISVEMQFLAGLDDGKARPTGNMCSPGTDIEYQGAVY-T 171
Query: 207 DFATMNVRPA---GEWNTSKIVFDNGHVEYYMNGQKTIEFE--AWSDDWFQRKNSGKWE- 260
D M+ P GEW +++++ +NG+V + +NG +++ + + + W
Sbjct: 172 DHCLMSSSPTIPVGEWVSAELIVNNGNVTHIINGDIVLQYTRPTMGGKFVKGYDPAIWTP 231
Query: 261 NAPEYGLARKGLICLQDHGYPAWFRNIKIREL 292
+AP + G I LQ G+P F+NIKI+ L
Sbjct: 232 SAPLH----SGYIALQSEGHPIEFKNIKIKAL 259
>gi|149196057|ref|ZP_01873113.1| hypothetical protein LNTAR_22964 [Lentisphaera araneosa HTCC2155]
gi|149140904|gb|EDM29301.1| hypothetical protein LNTAR_22964 [Lentisphaera araneosa HTCC2155]
Length = 1233
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 54/200 (27%), Positives = 98/200 (49%), Gaps = 25/200 (12%)
Query: 299 EELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYG--YLATCKYYNDFELTADFKQE 356
++LFNG +L GW W V+D ++ +S + G +L DFE+T + +
Sbjct: 24 QDLFNGNNLAGWK-GDLSYWSVKDGVIFGQSTKEHPTGGTFLVWDGEVADFEITLQARVK 82
Query: 357 ADGNSGIFIRSFVEEGAK--VNGWQVEVAPKGFDTGGIY-ESYGRGWLIQ----IPDDRE 409
+ NSG+ RS + + VNG+Q ++ G +Y + GRG + Q + D++
Sbjct: 83 GN-NSGLQYRSKIANAQRFTVNGYQADIIDANHLFGMMYHQGEGRGIVAQRFQQVAVDKQ 141
Query: 410 ---NFLKE----------REWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAG-QGRIAL 455
+KE +WN R+ VGN++ +NG VD+ D+ A +G +AL
Sbjct: 142 GKKTIVKEFGDKNQKWDASQWNEYRVIAVGNRLIHQVNGVTTVDVTDDHPNAARKGILAL 201
Query: 456 QIHDGGGIKVLWRNIRVKTL 475
Q+H G + +++I+++ +
Sbjct: 202 QLHGGAPMTAEFKDIKLRKV 221
>gi|87308845|ref|ZP_01090984.1| hypothetical protein DSM3645_11417 [Blastopirellula marina DSM
3645]
gi|87288556|gb|EAQ80451.1| hypothetical protein DSM3645_11417 [Blastopirellula marina DSM
3645]
Length = 1672
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 61/202 (30%), Positives = 92/202 (45%), Gaps = 30/202 (14%)
Query: 299 EELFNGKDLTGWDVYGTEQ-WYVQDSLLVCES---GPDKQYGYLATCK-YYNDFELTADF 353
+ +F+GK L GW G EQ W VQD + ++ P K +L + +DFEL +
Sbjct: 30 KSIFDGKTLNGWR--GKEQFWSVQDGAITGQTTSENPTKGNTFLIWDQGKVDDFELKLKY 87
Query: 354 KQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQ------IPDD 407
K GNSGI RS V G+Q ++ K +G YE GRG + + D
Sbjct: 88 KI-VGGNSGIQYRSTDLGDFVVKGYQADIDSKDTYSGINYEERGRGIIANRGVKATVYDG 146
Query: 408 RENFLKER--------------EWNTMRIRVVGNQVTTWLNGEQMVDIQDE--KIGAGQG 451
+ ER +WN I GN +T ++NG + ++ DE K G
Sbjct: 147 NQGNKDERFAESADIQAKINKEDWNEYHIIAKGNHLTHFINGVKTSEVIDEGKKDNRESG 206
Query: 452 RIALQIHDGGGIKVLWRNIRVK 473
+ALQ+H G + V +++I +K
Sbjct: 207 ILALQLHAGPPMNVQFKDIELK 228
>gi|87310061|ref|ZP_01092194.1| serine/threonine protein kinase [Blastopirellula marina DSM 3645]
gi|87287307|gb|EAQ79208.1| serine/threonine protein kinase [Blastopirellula marina DSM 3645]
Length = 806
Score = 61.2 bits (147), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/126 (30%), Positives = 65/126 (51%), Gaps = 17/126 (13%)
Query: 79 NVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGP--WEVVDGAIQADGEGSDENGYIVY 136
NV + A W +F+G+ + GW ++ GP W V G++ G E G+I+
Sbjct: 605 NVPPTTDVAQDWVSVFNGRDMRGW------SIQGPTLWRVASGSLIGVGTSPQEAGWIML 658
Query: 137 DRIYENFELQWDWKISKGGNSGLLYHVVERPQYKVPYVTGPEYQLIDDKGFAEP----LE 192
++ Y+ +EL++++K+ +GGNSG+ + RP + E Q+IDD A P +
Sbjct: 659 EKKYDAYELEFEYKLEEGGNSGVFLN--SRPGEPLVGSKFLEIQIIDD---AAPKFRNIP 713
Query: 193 DWQRCG 198
D QR G
Sbjct: 714 DIQRTG 719
>gi|116619940|ref|YP_822096.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116223102|gb|ABJ81811.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 256
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 63/227 (27%), Positives = 94/227 (41%), Gaps = 46/227 (20%)
Query: 288 KIRELPRKTEEEE-----LFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCK 342
K + P E +E +F+GK L GW+ + W V++ LV E P G +
Sbjct: 33 KQSDRPEAIEGDEPGFKPIFDGKSLAGWE-GDPKYWRVENGALVGEITP----GTVIKSN 87
Query: 343 YY--------NDFELTADFKQEADGNSGIFIRSFV-------EEGAKVNGWQVEVAPKGF 387
+ DFEL AD++ GNSGI RS V + G+Q ++ +
Sbjct: 88 TFIIWRGGEPADFELKADYRITTAGNSGINYRSVVVPDKVTPSNQFAMRGYQHDIDGQNR 147
Query: 388 DTGGIYESYGR------GWLIQIPDDRENFLKER-------------EWNTMRIRVVGNQ 428
TG YE GR G + R+ + +WN+ I GN
Sbjct: 148 YTGQNYEEKGRLFLALRGQTTHVVGGRKPIVLSSTGDTKALAEFITSDWNSCHIIARGNV 207
Query: 429 VTTWLNGEQMVDIQDEKIG--AGQGRIALQIHDGGGIKVLWRNIRVK 473
+T LNG M + D+ +G I +Q+H G +KV +RN R+K
Sbjct: 208 LTHILNGHLMCCVIDDDPPNRMAKGLIGVQVHVGPPMKVEYRNFRLK 254
>gi|32474367|ref|NP_867361.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
gi|32444905|emb|CAD74907.1| probable secreted glycosyl hydrolase [Rhodopirellula baltica SH 1]
Length = 230
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 54/188 (28%), Positives = 83/188 (44%), Gaps = 16/188 (8%)
Query: 301 LFNGKDLTGW--DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEAD 358
LF+G+ L GW + W V+D +LVC+ G Y+ +F AD K
Sbjct: 44 LFDGESLDGWKKSTENPDSWQVEDGMLVCK-GERCHLFYVGELAPLKNFHFKADVKVMPG 102
Query: 359 GNSGIFIRS-FVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKEREW 417
N+GI+ + + E G G++ +V D YG I N +++ EW
Sbjct: 103 SNAGIYFHTKYQESGWPKYGYECQVNVSHKDPKKTSSLYGVE-NIDAETLAANGIRDNEW 161
Query: 418 NTMRIRVVGNQVTTWLNGEQMVDI---QDEKIGA-------GQGRIALQIHDGGGIKVLW 467
T I V G + +NG+ +VD D++ + G+G ALQ HD I +
Sbjct: 162 YTQEIIVRGKHIELKVNGKTLVDFTEPSDQEAFSDRFERRLGEGTFALQAHDPQSI-AYF 220
Query: 468 RNIRVKTL 475
+N+RVK L
Sbjct: 221 KNLRVKPL 228
>gi|149177640|ref|ZP_01856241.1| hypothetical protein PM8797T_27292 [Planctomyces maris DSM 8797]
gi|148843458|gb|EDL57820.1| hypothetical protein PM8797T_27292 [Planctomyces maris DSM 8797]
Length = 240
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 62/216 (28%), Positives = 102/216 (47%), Gaps = 45/216 (20%)
Query: 291 ELPRKTEEEELFNGKDLTGWDVYGTEQ--WYVQDSLLVCESGPDKQYGYLATCKYYNDFE 348
ELP + + LFNGKDLTGW T++ WYV+D LVC P G + + + Y +F
Sbjct: 24 ELP---QYKPLFNGKDLTGWVNVNTDKDTWYVKDGTLVCTGHP---IGVMRSDRQYENFL 77
Query: 349 LTADFKQ-EADGNSGIFIRS--FVEEGAKV-NGWQVE----------------VAPKGFD 388
L +++ EA GNSG+F S V EG ++ G +++ + P +
Sbjct: 78 LHIEWRHMEAGGNSGVFAWSEGTVPEGRRLPKGMEIQMLELDWVNQHKLKDGSLPPIAYV 137
Query: 389 TGGIYESYGRGWLIQIPDD----RENFLKER-----EWNTMRIRVVGNQVTTWLNGEQMV 439
G E +G LI PD+ R ++ R +WN + V V +NG+ +
Sbjct: 138 HG---ELFGANGLITTPDNPRGTRSKSIENRCKGKGQWNVYDVVCVDGVVKLSVNGKFVN 194
Query: 440 DIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
+++ I +G + L+ G ++ +RNI++ L
Sbjct: 195 GVRNASI--KKGYLCLE---SEGAEIQFRNIQIMEL 225
>gi|145589295|ref|YP_001155892.1| protein of unknown function DUF1080 [Polynucleobacter sp.
QLW-P1DMWA-1]
gi|145047701|gb|ABP34328.1| protein of unknown function DUF1080 [Polynucleobacter sp.
QLW-P1DMWA-1]
Length = 197
Score = 59.7 bits (143), Expect = 5e-07, Method: Composition-based stats.
Identities = 53/180 (29%), Positives = 90/180 (50%), Gaps = 20/180 (11%)
Query: 300 ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEADG 359
+L +G LT W++ GT W + + ++ +K G+L + K Y +F + +F E++
Sbjct: 34 DLIDGVSLTDWNIIGTANWVIGNGIV----EGNKPNGFLVSTKPYKNFIIRTEFWAESNT 89
Query: 360 NSGIFIRSFVEEGAKVNG---WQVEVAPKGFDTGGIYESYGRGWLIQIPDDRENFLKERE 416
NSGIFIR ++ KV +++ + +DT ++Y G ++ +
Sbjct: 90 NSGIFIR--CQDPKKVTQSTCYEINI----WDTRP-EQAYATGAIVDVAKVDPVPKAGGR 142
Query: 417 WNTMRIRVVGNQVTTWLNGEQMV-DIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
WNTM I G+ LNG V D QD + +G IALQ GGI + ++ +++KTL
Sbjct: 143 WNTMEITANGSHFKVVLNGVTTVADGQDSRY--VEGPIALQ--SAGGI-IKFKKVQIKTL 197
>gi|21224873|ref|NP_630652.1| glycosyl hydrolase (secreted protein) [Streptomyces coelicolor A3(2)]
gi|3218378|emb|CAA19630.1| putative glycosyl hydrolase (putative secreted protein) [Streptomyces
coelicolor A3(2)]
Length = 1238
Score = 59.3 bits (142), Expect = 5e-07, Method: Composition-based stats.
Identities = 59/213 (27%), Positives = 97/213 (45%), Gaps = 41/213 (19%)
Query: 89 GWQLLFDGQTLEGWRDYNGQALTGPWEVVDGAIQADGEGSDENGYIVYD-RIYENFELQW 147
G++ +F+GQTL+GW+ QA G + V +G ++++G G + Y + +++ L+
Sbjct: 1058 GYRDIFNGQTLDGWK----QAGPGKFNVKNGVLESEGG----MGLLWYQAKELKSYSLKL 1109
Query: 148 DWKISKGGNSGLLYHVVERPQYKVPYVT---GPEYQLIDDKGFAEPLEDWQRCGVDYAMY 204
DWK+ NSG+ V P P+ G E Q ID + + G Y
Sbjct: 1110 DWKMRGDDNSGVF---VGFPASDDPWSAVNKGYEIQ-IDATDAVD-----RTTGAIYTFK 1160
Query: 205 LPDFATMN--VRPAGEWNTSKIVFDNGHVEYYMNGQKTIEFEAWSDDWFQRKNSGKWENA 262
+ + +RP G+WN+ +I ++ ++NG K +F
Sbjct: 1161 AANIKARDQVLRPPGQWNSYEIKVQGERLQVFLNGVKINDFT---------------NKD 1205
Query: 263 PEYGLARKGLICLQDHGY--PAWFRNIKIRELP 293
PE L G I LQ+HG FRNI+++ELP
Sbjct: 1206 PERSLT-DGYIGLQNHGADDQVSFRNIQLKELP 1237
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 54/186 (29%), Positives = 88/186 (47%), Gaps = 20/186 (10%)
Query: 300 ELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYL-ATCKYYNDFELTADFKQEAD 358
++FNG+ L GW G ++ V++ +L E G G L K + L D+K D
Sbjct: 1061 DIFNGQTLDGWKQAGPGKFNVKNGVLESEGG----MGLLWYQAKELKSYSLKLDWKMRGD 1116
Query: 359 GNSGIFIRSFVEE---GAKVNGWQVEV-APKGFD--TGGIYESYGRGWLIQIPDDRENFL 412
NSG+F+ + A G+++++ A D TG IY R+ L
Sbjct: 1117 DNSGVFVGFPASDDPWSAVNKGYEIQIDATDAVDRTTGAIYTFKAANI-----KARDQVL 1171
Query: 413 KER-EWNTMRIRVVGNQVTTWLNGEQMVDI--QDEKIGAGQGRIALQIHDGGGIKVLWRN 469
+ +WN+ I+V G ++ +LNG ++ D +D + G I LQ H G +V +RN
Sbjct: 1172 RPPGQWNSYEIKVQGERLQVFLNGVKINDFTNKDPERSLTDGYIGLQNH-GADDQVSFRN 1230
Query: 470 IRVKTL 475
I++K L
Sbjct: 1231 IQLKEL 1236
>gi|116624007|ref|YP_826163.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116227169|gb|ABJ85878.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 229
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 65/215 (30%), Positives = 94/215 (43%), Gaps = 51/215 (23%)
Query: 301 LFNGKDLTGWD-------------------VYGTEQWYVQDSL------LVCESGPDKQY 335
LFNGKDL+GW +QW + L E D +
Sbjct: 26 LFNGKDLSGWRGRQGTYSPHAEALLSKEELAAKQQQWNAERDLHWSVDVAKGEIVSDGKS 85
Query: 336 GYLATCKYYNDFELTADFKQ-EADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYE 394
+LAT + Y DF+L D+ + +G+SGI++RS+ +V W V+ P+ G
Sbjct: 86 VHLATARDYRDFDLYVDWLMVKHNGDSGIYLRSY----PQVQIWDVD-NPREVKNGA--- 137
Query: 395 SYGRGWLIQIPDDREN---FLKERE----WNTMRIRVVGNQVTTWLNGEQMVDIQ--DEK 445
G G L DD +K WNT RI++ G++V+ WLNG+ VD Q D
Sbjct: 138 PRGSGALWNNNDDNPGKWPLVKADNPVGAWNTFRIKMAGSRVSVWLNGKLTVDNQVLDNF 197
Query: 446 IGAG-----QGRIALQIHDGGGIKVLWRNIRVKTL 475
+G I LQ H G ++ +RNI +K L
Sbjct: 198 YNRSLPLVPKGAIELQTH---GSEIRFRNIYIKEL 229
>gi|87310236|ref|ZP_01092367.1| hypothetical protein DSM3645_27448 [Blastopirellula marina DSM
3645]
gi|87286985|gb|EAQ78888.1| hypothetical protein DSM3645_27448 [Blastopirellula marina DSM
3645]
Length = 253
Score = 58.5 bits (140), Expect = 9e-07, Method: Composition-based stats.
Identities = 63/217 (29%), Positives = 93/217 (42%), Gaps = 59/217 (27%)
Query: 301 LFNGKDLTGWDVYG-------------------------TEQWYVQDSLLVCE-SGPDKQ 334
LFNG DL+GW YG ++ W V+D LV + GP
Sbjct: 40 LFNGHDLSGW--YGLNPHLAAKLDGEEKDKNLRTQRAEFSKYWRVEDGSLVNDGKGP--- 94
Query: 335 YGYLATCKYYNDFELTADFKQEADGNSGIFIR-----SFVEEGAKVNGWQVEVAPKGFDT 389
Y T + D EL ++K A +SGI++R + K N + P +
Sbjct: 95 --YATTVDEFGDMELQLEYKTVAGADSGIYLRGAPQVQIWDSNQKFNSKAPDRKPH-LGS 151
Query: 390 GGIYESY----GRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD--IQD 443
GG+Y + GR L ++ D EWN +RIR +G + WLNG+ +VD + +
Sbjct: 152 GGLYNNTPGAPGRDPL-ELADHPFG-----EWNQLRIRQIGARTWVWLNGKLVVDDAVME 205
Query: 444 EKIGAGQ-----GRIALQIHDGGGIKVLWRNIRVKTL 475
G + G I LQ H G ++ WRNI V+ +
Sbjct: 206 NFWGRSKPLPSTGPIMLQTHGG---EISWRNIFVREI 239
>gi|149198757|ref|ZP_01875800.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
gi|149138193|gb|EDM26603.1| probable secreted glycosyl hydrolase [Lentisphaera araneosa
HTCC2155]
Length = 234
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 51/174 (29%), Positives = 80/174 (45%), Gaps = 31/174 (17%)
Query: 311 DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQ---EADGNSGIFIRS 367
+VY + ++D ++ S K+ + T K Y DF L + K E NSG+ R+
Sbjct: 37 NVYNDGHYELKDGVVHMTS---KKNFFFPTKKRYADFILEYEVKMPDVEEYSNSGLIFRA 93
Query: 368 FVEEGAK---VNGWQVEVAPKGFD-TGGIYESYGRGWLI----------------QI--- 404
++EG K V G+Q EV P +GG+Y+ RGWL Q+
Sbjct: 94 QIKEGKKGKTVIGYQAEVDPSERAWSGGLYDQGRRGWLYPKHATRSKYDEDFKGSQLEPW 153
Query: 405 PDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIALQIH 458
++++ K EWN R+ G+ + +LNG M + D K +G IA+Q H
Sbjct: 154 TEEKKKVYKHLEWNKYRVECRGSDIKIFLNGTLMTHVIDTK--DAEGHIAIQHH 205
>gi|116621620|ref|YP_823776.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
gi|116224782|gb|ABJ83491.1| protein of unknown function DUF1080 [Solibacter usitatus Ellin6076]
Length = 228
Score = 57.8 bits (138), Expect = 1e-06, Method: Composition-based stats.
Identities = 54/211 (25%), Positives = 91/211 (43%), Gaps = 50/211 (23%)
Query: 301 LFNGKDLTGWDVYGTEQWYVQ-DSLLVCE-----------SGP-----------DKQYGY 337
LFNGK+L GW++ G QW V D +V + GP D Q
Sbjct: 24 LFNGKNLDGWEIIGDGQWTVMADGTVVGQRIGELRKMLVPGGPLTTPKDFKSWVDTQSWL 83
Query: 338 LATCKYYNDFELTADFKQEADGNSGIFIRS-------------FVEEGAKVNGWQVEVA- 383
T + +F+L ++ + GNSG+ IR + + +K+ G+++++
Sbjct: 84 YTTRNDFGEFDLHLEYWTKTSGNSGVSIRDTSRAKWGVTTPPDYTKTPSKI-GYEIQINN 142
Query: 384 --PKGFDTGGIYESYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDI 441
P +G IY P D + K+ EWN M I+ +++T +NG + +
Sbjct: 143 RFPDPHPSGSIYG------FQDAPKDSQ---KDDEWNAMDIKSRNDKITVSINGRVVAEH 193
Query: 442 QDEKIGAGQGRIALQIHDGGGIKVLWRNIRV 472
+ + G I LQ+HD I +RN+R+
Sbjct: 194 AGDPARSKTGPIGLQLHDQFSIS-QFRNVRI 223
>gi|94969351|ref|YP_591399.1| protein of unknown function DUF1080 [Acidobacteria bacterium
Ellin345]
gi|94551401|gb|ABF41325.1| protein of unknown function DUF1080 [Acidobacteria bacterium
Ellin345]
Length = 308
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 47/152 (30%), Positives = 72/152 (47%), Gaps = 20/152 (13%)
Query: 293 PRKTEEEELFNGKDLTGW---DVYGTEQWYVQDSLLVC-ESGPDKQYGYLATCKYYNDFE 348
P+ + +LFNGKDLTGW D T W V D L E GP+ + + + + DF+
Sbjct: 130 PKWGKPIDLFNGKDLTGWTMSDPKATNPWKVIDGALTSPEHGPE-----IISNQKFKDFK 184
Query: 349 LTADFKQEADGNSGIFIRSFVEEGAKVNGWQVEVAPKGFDTGGIYESYGRGWLIQIPDDR 408
+ +F NSG+++R E A++ P+ TG IY G+L+ P
Sbjct: 185 IHVEFNIHGTANSGVYLRGRYE--AQIETDSANEGPE-HHTGSIY-----GFLVGDPKPP 236
Query: 409 ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVD 440
+ W T I ++G VT LNG+ ++D
Sbjct: 237 R---QSDVWQTYDITLLGRWVTVVLNGKTIID 265
>gi|21225488|ref|NP_631267.1| secreted glycosyl hydrolase [Streptomyces coelicolor A3(2)]
gi|8546922|emb|CAB94634.1| putative secreted glycosyl hydrolase. [Streptomyces coelicolor
A3(2)]
Length = 579
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 54/188 (28%), Positives = 86/188 (45%), Gaps = 17/188 (9%)
Query: 295 KTEEEELFNGKDLTGWDVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK 354
+T LF+G +GW G + + D L G Y Y A ++ D+ L D++
Sbjct: 265 ETGYRSLFDGTGTSGWKQAGPGGFTLADGTLTSHGGLG-MYWYQAE-EFTGDYSLKLDWR 322
Query: 355 QEADGNSGIFIRSFVEE---GAKVNGWQVEVAPKGF---DTGGIYESYGRGWLIQIPDDR 408
D NSG+F+ + A NG+++++ TG +Y G+ R
Sbjct: 323 ASGDDNSGVFVGFPASDDPWSAVDNGYEIQIDATDTPDRTTGSVY-----GFQSADVAAR 377
Query: 409 ENFLKER-EWNTMRIRVVGNQVTTWLNGEQMVDI--QDEKIGAGQGRIALQIHDGGGIKV 465
+ L EWNT +RV G ++ +LNG ++ D D QG I +Q H G G +V
Sbjct: 378 DAALNPPGEWNTYEVRVTGERLELFLNGRKINDFTNTDPARSLRQGHIGIQNH-GDGDEV 436
Query: 466 LWRNIRVK 473
+R++RVK
Sbjct: 437 SFRDVRVK 444
>gi|32471058|ref|NP_864051.1| conserved hypothetical protein-putative rhamnosidase
[Rhodopirellula baltica SH 1]
gi|32396760|emb|CAD71725.1| conserved hypothetical protein-putative rhamnosidase
[Rhodopirellula baltica SH 1]
Length = 824
Score = 56.6 bits (135), Expect = 3e-06, Method: Composition-based stats.
Identities = 57/201 (28%), Positives = 88/201 (43%), Gaps = 31/201 (15%)
Query: 301 LFNGKDLTGW-DVYGTEQWYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFK-QEAD 358
LFNGKDL+GW + Y + V D + ++ D ++ +L T + Y++F L+ D E
Sbjct: 53 LFNGKDLSGWRNPYSHGEASVVDGEIHLKA--DNKF-FLVTEQKYSNFRLSVDIHLPEGP 109
Query: 359 GNSGIFIRSFVEEGA--KVNGWQVEV-APKGFDTGGIYESYGRGWLIQIPDDR------- 408
NSG+ R V+E A KV G+Q E + +GG+++ R W+ R
Sbjct: 110 SNSGVMFRCHVDEDAEKKVYGYQAECDGSERRWSGGLFDEARRRWIWPSTKGRSTTQFRA 169
Query: 409 --------------ENFLKEREWNTMRIRVVGNQVTTWLNGEQMVDIQDEKIGAGQGRIA 454
+ L WN I + + +T LNG Q V +D G I
Sbjct: 170 HEEESQKFFAEPRVRDALNRNGWNRYTITCIDDVITIELNGVQTVRFRDAM--DSSGFIG 227
Query: 455 LQIHDGGGIKVLWRNIRVKTL 475
+Q H G +RN+ +K L
Sbjct: 228 IQHHGEKGQTYRFRNLFIKEL 248
>gi|32470990|ref|NP_863983.1| conserved hypothetical protein-putative secreted protein
[Rhodopirellula baltica SH 1]
gi|32396692|emb|CAD71657.1| conserved hypothetical protein-putative secreted protein
[Rhodopirellula baltica SH 1]
Length = 264
Score = 55.8 bits (133), Expect = 7e-06, Method: Composition-based stats.
Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 8/112 (7%)
Query: 58 MNCLCVMVLFGALTACSEKPQNVLTEEEKAAGWQLLFDGQTLEGWRDYNGQALTGPWEVV 117
M V+ L A T+ S + + +++A W LFDG++LEGW WEVV
Sbjct: 57 MGLALVLCLSAAGTSVSAEDN---AKADESAKWTTLFDGESLEGWEKVGRD--DSKWEVV 111
Query: 118 DGAIQADGEGSDENGYIVYDRIYENFELQWDWKISKGGNSGLLYHVVERPQY 169
DG I+ G S + Y+NF + + KI+ GGNSG+ + +P +
Sbjct: 112 DGVIKGTGGVS---MLVNTSGPYKNFRYRAEVKINDGGNSGVYFRTTRKPGF 160
>gi|149196925|ref|ZP_01873978.1| hypothetical protein LNTAR_10981 [Lentisphaera araneosa HTCC2155]
gi|149140035|gb|EDM28435.1| hypothetical protein LNTAR_10981 [Lentisphaera araneosa HTCC2155]
Length = 185
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 52/164 (31%), Positives = 78/164 (47%), Gaps = 16/164 (9%)
Query: 318 WYVQDSLLVCESGPDKQYGYLATCKYYNDFELTADFKQEA-DGNSGIFIRSFVEEGAKVN 376
W +D+L++ E+ DK+ L T K + D+EL FK ++ D +SG+F+R N
Sbjct: 32 WTTEDNLIIGEN-VDKKNSVLWTEKKFKDYELVVKFKTDSKDYDSGVFLRG--------N 82
Query: 377 GWQVEV----APKGFDTGGIYE-SYGRGWLIQIPDDRENFLKEREWNTMRIRVVGNQVTT 431
QV++ + K T IY S +G D K +WN M++ V G +
Sbjct: 83 SHQVQIGVSRSLKKDLTACIYAPSDKKGKYPASSDKVAELHKLGQWNEMKMIVQGKNMKV 142
Query: 432 WLNGEQMVDIQDEKIGAGQGRIALQIHDGGGIKVLWRNIRVKTL 475
+LNG Q VD KI G I LQ+H G K+ + + K L
Sbjct: 143 YLNGTQTVDYDGVKIKE-SGPIGLQLHAGVHQKMEFEIVSFKEL 185
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.320 0.139 0.451
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,052,168,933
Number of Sequences: 5470121
Number of extensions: 96143281
Number of successful extensions: 171161
Number of sequences better than 1.0e-05: 122
Number of HSP's better than 0.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 87
Number of HSP's that attempted gapping in prelim test: 170471
Number of HSP's gapped (non-prelim): 373
length of query: 475
length of database: 1,894,087,724
effective HSP length: 137
effective length of query: 338
effective length of database: 1,144,681,147
effective search space: 386902227686
effective search space used: 386902227686
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 132 (55.5 bits)