BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF2877
(536 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|29348500|ref|NP_812003.1| putative regulatory protein [B... 474 e-132
gi|156109833|gb|EDO11578.1| hypothetical protein BACOVA_027... 470 e-131
gi|29349113|ref|NP_812616.1| regulatory protein SusR [Bacte... 432 e-119
gi|150003950|ref|YP_001298694.1| regulatory protein SusR [B... 431 e-119
gi|156109143|gb|EDO10888.1| hypothetical protein BACOVA_035... 430 e-118
gi|156861033|gb|EDO54464.1| hypothetical protein BACUNI_019... 411 e-113
gi|156861186|gb|EDO54617.1| hypothetical protein BACUNI_016... 324 1e-86
gi|1561765|gb|AAB39215.1| regulatory protein [Bacteroides t... 323 3e-86
gi|156111629|gb|EDO13374.1| hypothetical protein BACOVA_009... 320 1e-85
gi|154492745|ref|ZP_02032371.1| hypothetical protein PARMER... 315 6e-84
gi|156861943|gb|EDO55374.1| hypothetical protein BACUNI_010... 306 2e-81
gi|29348718|ref|NP_812221.1| transcriptional regulator [Bac... 297 1e-78
gi|146301132|ref|YP_001195723.1| hypothetical protein Fjoh_... 296 2e-78
gi|156111662|gb|EDO13407.1| hypothetical protein BACOVA_009... 290 2e-76
gi|150008321|ref|YP_001303064.1| putative regulatory protei... 287 1e-75
gi|156862782|gb|EDO56213.1| hypothetical protein BACUNI_001... 270 2e-70
gi|29349477|ref|NP_812980.1| conserved hypothetical protein... 258 9e-67
gi|29347570|ref|NP_811073.1| putative regulatory protein [B... 254 2e-65
gi|156107261|gb|EDO09006.1| hypothetical protein BACOVA_048... 252 4e-65
gi|156111653|gb|EDO13398.1| hypothetical protein BACOVA_009... 249 3e-64
gi|156109538|gb|EDO11283.1| hypothetical protein BACOVA_031... 240 2e-61
gi|156858966|gb|EDO52397.1| hypothetical protein BACUNI_040... 230 2e-58
gi|153808734|ref|ZP_01961402.1| hypothetical protein BACCAC... 229 4e-58
gi|67940337|ref|ZP_00532774.1| transcriptional regulator [C... 229 4e-58
gi|146302068|ref|YP_001196659.1| hypothetical protein Fjoh_... 222 5e-56
gi|86140905|ref|ZP_01059464.1| transcriptional regulator [F... 207 1e-51
gi|146302167|ref|YP_001196758.1| hypothetical protein Fjoh_... 205 5e-51
gi|149279983|ref|ZP_01886109.1| hypothetical protein PBAL39... 196 3e-48
gi|4572635|emb|CAB40102.1| hypothetical protein [Prevotella... 58 2e-06
>gi|29348500|ref|NP_812003.1| putative regulatory protein [Bacteroides thetaiotaomicron VPI-5482]
gi|29340405|gb|AAO78197.1| putative regulatory protein [Bacteroides thetaiotaomicron VPI-5482]
Length = 550
Score = 474 bits (1219), Expect = e-132, Method: Composition-based stats.
Identities = 272/551 (49%), Positives = 371/551 (67%), Gaps = 32/551 (5%)
Query: 6 FLQIILP-LCCLTHLVQAAGSLAEHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSKLRQA 64
F+ I+LP L C + ++ L L+ +D +I +R Y EKE I L+ L +A
Sbjct: 8 FVTIVLPCLLCARNDDKSTDML------LREIDGIIRNRQTYGAEKEARISDLKKLLSEA 61
Query: 65 VSDQERFEWCSRLYETYIVYQTDSALRYVLESERLLRQLPDDAFRYRAQLNRVGVMTVTG 124
SD++R+ +C RL++ Y Y DS+ Y + L L + A +N VM TG
Sbjct: 62 TSDEQRYGFCGRLFDEYRAYNLDSSYVYAQRKQELASHLNKQDYLDDAAMNMAEVMGTTG 121
Query: 125 MYKEALDALNQIPRHAFSGDLLQSYFHHSRTLYGHMADYALTNDEKNAYQQSADRYRDSL 184
MYKEAL+ L QI + S L Y+H RT+YG M DYA+T EK Y + D YRDSL
Sbjct: 122 MYKEALEQLGQIDKKTLSDYLYPYYYHLYRTIYGLMGDYAVTEKEKKEYYRMTDLYRDSL 181
Query: 185 LFIMPHGEINTLIVEADRFNTHGQFDATIAML-----KPVTDTCRNTERMRFLAYTLSEA 239
L + ++V AD+ H Q+D I ML KP D + YTLSEA
Sbjct: 182 LQTNASDSLGHVLVMADKCTVHAQYDQAITMLTDFYRKPSLDD----HSKAMITYTLSEA 237
Query: 240 YALKGDRENQKYYLTLSAIADLKTSVREYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQ 299
Y LKGD++ QK++L LSAIADLK++V+EY+SLRKLA L+YEDGDIDRAY Y+KCSLEDA
Sbjct: 238 YRLKGDKKGQKHFLALSAIADLKSAVKEYVSLRKLASLVYEDGDIDRAYNYLKCSLEDAT 297
Query: 300 SCNARSRTIEVAEIFPVIDKAYRLRSERKQRTITLLLASVSLLALCLVAAIIYVYRQMKK 359
CNAR RT+E++++FP+ID+AY+L+++R+Q+ + + L +SLL++ L+ AI +VY+QMKK
Sbjct: 298 LCNARLRTLEISQVFPIIDQAYQLKTKRQQQEMKISLICISLLSVFLLVAIFFVYKQMKK 357
Query: 360 LAVARQ--------------ALADANRQLQTINNTLKETNLIKEEYIAQYINRCSAYIDK 405
+A AR+ L D+N QL+ +N+TL E N IKEEYI +Y+++CS Y+DK
Sbjct: 358 VAAARREVIDTNTLLQELNGELHDSNSQLKEMNHTLSEANYIKEEYIGRYMDQCSTYLDK 417
Query: 406 LDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLL 465
+D YRR L K+A+ ++EEL++AIKS +F+E E K+FY FD TFL LFPNFV +FN LL
Sbjct: 418 MDLYRRSLNKIAAAGRVEELYKAIKSSQFLEEELKDFYANFDMTFLQLFPNFVEEFNALL 477
Query: 466 NEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDK 525
E + PK G+LLNTELRIFALIRLGITDST+IA+FL+YS+TTIYNYR++VRNKA+G++
Sbjct: 478 VEP--MQPKQGELLNTELRIFALIRLGITDSTKIAQFLRYSVTTIYNYRTRVRNKALGER 535
Query: 526 NEFEAAVMRIG 536
+EFEA VM+IG
Sbjct: 536 DEFEAKVMKIG 546
>gi|156109833|gb|EDO11578.1| hypothetical protein BACOVA_02788 [Bacteroides ovatus ATCC 8483]
Length = 549
Score = 470 bits (1210), Expect = e-131, Method: Composition-based stats.
Identities = 262/523 (50%), Positives = 358/523 (68%), Gaps = 25/523 (4%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQTDSALRY 92
L+ +D +I +R Y EKE I L+ L +A SD++R+ +C RL++ Y Y DS+ Y
Sbjct: 29 LREIDGIIKNRQTYGAEKEARIADLKKLLAEATSDEQRYGFCGRLFDEYRAYNLDSSFVY 88
Query: 93 VLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHH 152
E L ++ + A +N VM TGMYKEAL+ L QI + L Y+H
Sbjct: 89 AQRKEELAHRMDKLDYLDDAAMNMAEVMGTTGMYKEALELLGQIDKKTLPDYLYGYYYHL 148
Query: 153 SRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDAT 212
RT+YG M DYA+T K Y + D YRDSLL + + ++V AD+ H Q+D
Sbjct: 149 YRTIYGLMGDYAVTEKVKKEYYRMTDLYRDSLLQVNASDSLGHVLVMADKCIVHAQYDEA 208
Query: 213 IAML-----KPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVRE 267
I ML KP D L YTLSE Y LKGD++ QK+YL LSAIADLK++V+E
Sbjct: 209 IRMLMEYYNKPSLDD----HSKAMLTYTLSEGYRLKGDKQGQKHYLALSAIADLKSAVKE 264
Query: 268 YISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSER 327
Y+SLRKLA L+Y++GDIDRAY Y+KCSLEDA CNAR RT+E++++FP+ID+AY+L+++R
Sbjct: 265 YVSLRKLASLVYDEGDIDRAYNYLKCSLEDATLCNARLRTLEISQVFPIIDQAYQLKTKR 324
Query: 328 KQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVAR--------------QALADANRQ 373
+Q+ + + L +SLL++ L+ AI +VY+QMKK+A AR + L D+N Q
Sbjct: 325 QQQEMKVSLICISLLSVFLLVAIFFVYKQMKKVAAARREVVDTNTLLQELNEELHDSNSQ 384
Query: 374 LQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDE 433
L+ +N+TL E N IKEEYI +Y+++CS Y+DK+D YRR L K+A+ ++EEL++AIKS +
Sbjct: 385 LKEMNHTLSEANYIKEEYIGRYMDQCSTYLDKMDLYRRSLNKIAAAGRVEELYKAIKSSQ 444
Query: 434 FIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGI 493
F++ E KEFY FD TFL LFPNFV +FN LL E + PKPG+LLNTELRIFALIRLGI
Sbjct: 445 FLDEELKEFYANFDMTFLQLFPNFVEEFNALLTEP--MQPKPGELLNTELRIFALIRLGI 502
Query: 494 TDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
TDST+IA+FL+YS+TTIYNYR++VRNKA+G+++EFE VM+IG
Sbjct: 503 TDSTKIAQFLRYSVTTIYNYRTRVRNKALGERDEFETKVMQIG 545
>gi|29349113|ref|NP_812616.1| regulatory protein SusR [Bacteroides thetaiotaomicron VPI-5482]
gi|29341020|gb|AAO78810.1| regulatory protein SusR [Bacteroides thetaiotaomicron VPI-5482]
Length = 582
Score = 432 bits (1111), Expect = e-119, Method: Composition-based stats.
Identities = 251/503 (49%), Positives = 335/503 (66%), Gaps = 1/503 (0%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQTDSALRY 92
LK LD +IS + Y I +EK I L+ +L + +E + L+ Y+ YQ DSAL Y
Sbjct: 81 LKKLDDIISKKETYQIRREKDITDLKVQLAHSTDPARNYELYASLFGAYLHYQADSALHY 140
Query: 93 VLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHH 152
+ +L QL Y +NR VM V GMY EA++ L +I + L SY+
Sbjct: 141 INRQMEILPQLNRPDLEYEIVINRATVMGVMGMYIEAMEQLEKIDPKKLNEWTLLSYYQT 200
Query: 153 SRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDAT 212
R YG +ADY EK Y + D YRDS++ MP E N IV A+R G+ D
Sbjct: 201 YRACYGWLADYTTNKTEKEKYLKKTDLYRDSIIAAMPPEE-NKTIVMAERCIVTGKADTA 259
Query: 213 IAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLR 272
I ML + + ++ YTLSEAY++K D E + YYL L+AIADL++SVREY SL+
Sbjct: 260 IGMLNDALKDMEDERQKVYIYYTLSEAYSMKKDVEKEVYYLILTAIADLESSVREYASLQ 319
Query: 273 KLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTI 332
KLA L+YE GDIDRAY+Y+ CS+EDA +CNAR R +EV E FP+IDKAY+L+ ER++
Sbjct: 320 KLAHLMYELGDIDRAYKYLSCSMEDAVACNARLRFMEVTEFFPIIDKAYKLKEERERAVS 379
Query: 333 TLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKETNLIKEEYI 392
+L SVSLL+L L+ AI Y+YR MKK++V R+ L+ AN+Q+ +N L++T IKE YI
Sbjct: 380 RAMLISVSLLSLFLLIAIFYLYRWMKKISVMRRNLSLANKQMSAVNKELEQTGKIKEVYI 439
Query: 393 AQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLS 452
A+Y++RC Y+DKL+ YRR L KLA +S++++LF+AIKS++FI ER EFYNEFD +FL
Sbjct: 440 ARYLDRCVNYLDKLETYRRSLAKLAMSSRIDDLFKAIKSEQFIRDERNEFYNEFDKSFLK 499
Query: 453 LFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIYN 512
LFP+F+T FN LL E+ R+ PK +LL TELRIFALIRLG+ DS +IA FL YSL TIYN
Sbjct: 500 LFPHFITSFNNLLVEEARVYPKSDELLTTELRIFALIRLGVVDSNKIAHFLGYSLATIYN 559
Query: 513 YRSKVRNKAVGDKNEFEAAVMRI 535
YRS++RNKA GDK+ FE VM +
Sbjct: 560 YRSRMRNKAAGDKDRFEQDVMNL 582
>gi|150003950|ref|YP_001298694.1| regulatory protein SusR [Bacteroides vulgatus ATCC 8482]
gi|149932374|gb|ABR39072.1| regulatory protein SusR [Bacteroides vulgatus ATCC 8482]
Length = 531
Score = 431 bits (1107), Expect = e-119, Method: Composition-based stats.
Identities = 250/503 (49%), Positives = 347/503 (68%), Gaps = 1/503 (0%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQTDSALRY 92
LK LDR+I ++ H++KEK I L+ +L ++ ++E++E C L+ Y+ YQ DSAL Y
Sbjct: 30 LKRLDRVIDNKTACHVQKEKEIVDLKQRLHRSKDNREKYELCGSLFNAYLHYQADSALYY 89
Query: 93 VLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHH 152
+ LL L + +NR VM V GMY EAL+ L ++ + L Y+
Sbjct: 90 INGKMNLLPLLNHPELKNEIVINRAEVMGVMGMYNEALEQLERVDPLELERETLAYYYRT 149
Query: 153 SRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDAT 212
R YG +ADY + EK Y + D YRDS+L I ++ IV A++ +G+ D+
Sbjct: 150 YRAYYGWVADYTTNDAEKVKYLKKTDAYRDSIL-IATDPCVDRSIVWAEKKIINGKVDSA 208
Query: 213 IAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLR 272
+ +L + + + ++ YTLSEAY ++GD + + YYL L+AI DLK+S+REY SL+
Sbjct: 209 LVILSDLLKETPDERQKGYIYYTLSEAYDMRGDIQKEIYYLALTAITDLKSSIREYASLQ 268
Query: 273 KLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTI 332
KLA L+YE GD+DRAY+Y+ CS+EDA +CNAR R IEV + FP+IDKAY+L+ E++++
Sbjct: 269 KLAQLMYEVGDLDRAYKYLNCSMEDAVACNARLRFIEVTQFFPIIDKAYKLKEEKERQIS 328
Query: 333 TLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKETNLIKEEYI 392
LL SVSLL+L L+AAI Y+YR MKKL+V R+ L+ AN+Q+Q +N L +T IKE YI
Sbjct: 329 RTLLISVSLLSLFLLAAIFYLYRWMKKLSVMRRNLSLANQQMQEVNAELAQTGKIKEVYI 388
Query: 393 AQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLS 452
A+Y++RC Y+DKL+ YRR L KLA S++++LF+AIKS++FI ERK+FYNEFD +FL
Sbjct: 389 ARYLDRCVIYLDKLEFYRRSLAKLAMASRIDDLFKAIKSEQFIRDERKDFYNEFDKSFLE 448
Query: 453 LFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIYN 512
LFP+F+T FN+LL E+ RI PK G+LL TELRIFALIRLG+TDS RIA FL YSL TIYN
Sbjct: 449 LFPHFITSFNELLVEEGRIYPKSGELLTTELRIFALIRLGVTDSNRIAHFLGYSLATIYN 508
Query: 513 YRSKVRNKAVGDKNEFEAAVMRI 535
YRSK+RNKA+G+K FE VM +
Sbjct: 509 YRSKMRNKAIGNKETFEQEVMNL 531
>gi|156109143|gb|EDO10888.1| hypothetical protein BACOVA_03521 [Bacteroides ovatus ATCC 8483]
Length = 557
Score = 430 bits (1106), Expect = e-118, Method: Composition-based stats.
Identities = 248/508 (48%), Positives = 340/508 (66%), Gaps = 1/508 (0%)
Query: 28 EHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQTD 87
+++ LK LD +IS + + I KEK I++L+ +L + ++E + L+ Y+ YQ D
Sbjct: 51 DNAAALKKLDEVISKKETFQIRKEKEINNLKLELAHSTDPVRKYELYASLFGAYLHYQAD 110
Query: 88 SALRYVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQ 147
S+L Y+ +L L Y +NR VM V GMY EA++ L +I +
Sbjct: 111 SSLYYINREMEILPLLNRPELEYEIIINRATVMGVMGMYIEAIEQLERIDPKKLNEWTRL 170
Query: 148 SYFHHSRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHG 207
SY+ R YG +ADY +EK Y + D YRDS++ MP E N IV A++ +G
Sbjct: 171 SYYQTYRACYGWLADYTTNKNEKEKYLKKTDLYRDSIIAAMP-PEANKTIVLAEKCIMNG 229
Query: 208 QFDATIAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVRE 267
+ D + ML ++ + ++ YTLSEAY++K D E + YYL L+AIADL+T VRE
Sbjct: 230 KADVAVDMLNNALKEIQDERQKVYIYYTLSEAYSMKKDIEKEVYYLILTAIADLETPVRE 289
Query: 268 YISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSER 327
Y SL+KLA L+YE GDIDRAY+Y+ CS+EDA +CNAR R IEV E FP+IDKAY+L+ E+
Sbjct: 290 YASLQKLAHLMYESGDIDRAYKYLSCSMEDAVACNARLRFIEVTEFFPIIDKAYKLKEEK 349
Query: 328 KQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKETNLI 387
++ +L SVSLL+L L+ AI Y+YR MKKL+V R+ L+ AN+Q+ +N L++T I
Sbjct: 350 ERAVSRAMLISVSLLSLFLLIAIFYLYRWMKKLSVMRRNLSLANKQMSAVNAELEQTGKI 409
Query: 388 KEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFD 447
KE YIA+Y++RC Y+DKL+ YRR L KLA S++E+LF+AIKS++FI ER EFYNEFD
Sbjct: 410 KEVYIARYLDRCVNYLDKLETYRRSLAKLAMASRIEDLFKAIKSEQFIRDERDEFYNEFD 469
Query: 448 HTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSL 507
+FL LFPNF++ FN LL E+ R+ PK +LL TELRIFALIRLG+ DS +IA FL YSL
Sbjct: 470 RSFLKLFPNFISAFNNLLVEEGRVYPKSDELLTTELRIFALIRLGVVDSNKIAHFLGYSL 529
Query: 508 TTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
TIYNYRS++RNKA GDK+ FE VM +
Sbjct: 530 ATIYNYRSRMRNKAAGDKDMFEQNVMNL 557
>gi|156861033|gb|EDO54464.1| hypothetical protein BACUNI_01940 [Bacteroides uniformis ATCC 8492]
Length = 538
Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats.
Identities = 244/519 (47%), Positives = 352/519 (67%), Gaps = 15/519 (2%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQTDSALRY 92
L LD+ I +R Y +KE + L+ +L + + D+ERF L + Y + TDSAL
Sbjct: 16 LLKLDQAIKERPIYMEQKELKLVELKRQLHRQIPDEERFAILGTLLDEYRSFNTDSALHM 75
Query: 93 VLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHH 152
E E++ +L + + A++N+ V+ +TGMYKE +D + I D+ Y+H
Sbjct: 76 AEEREQIAIRLGNREYIDNARMNKADVLGMTGMYKEVMDLMRNIHIDRLPVDIHPYYYHI 135
Query: 153 SRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDAT 212
RT+YG MADYA+T EK Y + D+YRDSLL + + ++++D++N ++D
Sbjct: 136 YRTVYGLMADYAVTAYEKKLYTELTDKYRDSLLLVNKDNLLIHTLIQSDQYNVRNEYDKA 195
Query: 213 IAMLKPVTDTCRNTER-MRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISL 271
I +L ++ E + AYTLSE+Y LKGD+E +K YL +SA+AD+KT+VREYISL
Sbjct: 196 IRLLTDYLALQKDYEHDVAICAYTLSESYRLKGDKEKEKEYLIVSAMADMKTAVREYISL 255
Query: 272 RKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRT 331
RKLA LLY++GDI+RAY Y+K +EDA +CNAR R +E+ EIFP+I+ AY+ ++E++Q
Sbjct: 256 RKLAVLLYQEGDIERAYSYVKICMEDAAACNARLRKLEILEIFPIINDAYQQKTEKQQEQ 315
Query: 332 ITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANR--------------QLQTI 377
+ L S+SLL+L L+ AI YVY+QMKK+A AR+ + DAN+ QL+
Sbjct: 316 MKWALVSISLLSLFLLLAIFYVYKQMKKVAAARREVIDANKRLKELNDELHLSNAQLKEA 375
Query: 378 NNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEA 437
N+++ E + +KEEYI +Y+++CS Y++K+D+YRR L K+A+T +EEL++ IKS +FIE
Sbjct: 376 NHSIAENSYLKEEYIGRYMDQCSVYLEKMDNYRRSLGKIAATGNVEELYKNIKSSKFIEG 435
Query: 438 ERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDST 497
E KEFY FD+TFL LFP FV DFN LL + ++I+ K G+ +NTELRIFALIRLGITDS
Sbjct: 436 ELKEFYTNFDNTFLQLFPTFVEDFNALLADDEQISLKAGERMNTELRIFALIRLGITDSV 495
Query: 498 RIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
+IA+FL+YS+TTIYNYR+KVRNKA GD++ E VM IG
Sbjct: 496 KIAQFLRYSVTTIYNYRTKVRNKAAGDRDLLEQEVMTIG 534
>gi|156861186|gb|EDO54617.1| hypothetical protein BACUNI_01600 [Bacteroides uniformis ATCC 8492]
Length = 554
Score = 324 bits (830), Expect = 1e-86, Method: Composition-based stats.
Identities = 202/526 (38%), Positives = 319/526 (60%), Gaps = 24/526 (4%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQA-VSDQERFEWCSRLYETYIVYQTDSALR 91
L LD+ I R +Y EKE+ I ++ +L+Q ++D ER+ +RL+ Y Y +DSAL
Sbjct: 27 LSVLDKTIVMRRQYEEEKERYISLIKDELKQGRLTDMERYLIQNRLFAEYNSYISDSALH 86
Query: 92 YVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFH 151
Y+ E+ + +L + + + LN+V ++ +G++ EA++ L +PR+ G+ + Y+
Sbjct: 87 YINENILIATRLNNRQWINSSILNKVHILNTSGLFVEAMELLKSLPRNTLEGENIVDYYV 146
Query: 152 HSRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDA 211
LY + A+YA + N Y + A+ YRDS++ ++P ++V A + G+
Sbjct: 147 CFENLYLYQAEYATDRNYVNNYLRIANLYRDSIISLVPEDTYRYVVVHAPQLIDQGKSQE 206
Query: 212 TIAMLKPVTDTCRNTERMRFLAYT-LSEAYALKGDRENQKYYLTLSAIADLKTSVREYIS 270
I +LK ++ R +A + L+ AY + G+++ + SAIAD++ V+E S
Sbjct: 207 AICLLKNFLPRLKSNTREYAVATSILAFAYHVVGNKQKEMEARISSAIADIRAVVKENYS 266
Query: 271 LRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQR 330
L LA LLY GD++RA Y+K S+EDA R R+ + +++ P+ID+AY+ E +Q+
Sbjct: 267 LCALAELLYGMGDLERANHYIKISMEDANYYTTRLRSSQNSKMLPLIDRAYQQEKEIQQQ 326
Query: 331 TITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINN----------- 379
+ + + +L++ L+ ++ V QMKKL+ AR+ + AN QL +N+
Sbjct: 327 RQRMFITGICILSVFLLLTVLCVLWQMKKLSYARKKVVAANSQLSILNSELKKLNKSQHE 386
Query: 380 ----------TLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAI 429
TL E N IKEEY+ ++++ S+YI K++ YRR L K A+ KLEEL+R +
Sbjct: 387 ANERLLHTNQTLTEANHIKEEYLGRFLSLSSSYISKMEEYRRILNKQAAAGKLEELYRTL 446
Query: 430 KSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALI 489
KSD FI E KEFY+ FD +FL +FPNFV +FN+LL ++++ PK +LL TELRIFALI
Sbjct: 447 KSDRFINQELKEFYHNFDVSFLKIFPNFVEEFNRLLPVEEQLHPKNEELLVTELRIFALI 506
Query: 490 RLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
RLGITDS RIAEFL+YS+TTIY YRSK++NK++ K +FE VM+I
Sbjct: 507 RLGITDSARIAEFLRYSITTIYTYRSKLKNKSLC-KEDFEERVMKI 551
>gi|1561765|gb|AAB39215.1| regulatory protein [Bacteroides thetaiotaomicron]
Length = 433
Score = 323 bits (827), Expect = 3e-86, Method: Composition-based stats.
Identities = 199/429 (46%), Positives = 269/429 (62%), Gaps = 22/429 (5%)
Query: 99 LLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSRTLYG 158
+L QL Y +NR VM V GMY EA++ L +I + L SY+ R YG
Sbjct: 3 ILPQLNRPDLEYEIVINRATVMGVMGMYIEAMEQLEKIDPKKLNEWTLLSYYQTYRACYG 62
Query: 159 HMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDATIAMLKP 218
+ADY EK Y + D YRDS++ MP E N IV A+R G+ D I ML
Sbjct: 63 WLADYTTNKTEKEKYLKKTDLYRDSIIAAMPPEE-NKTIVMAERCIVTGKADTAIGMLND 121
Query: 219 VTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLRKLAFLL 278
+ + ++ YTLSEAY++K D E + YYL L+AIADL++SVREY SL+KLA L+
Sbjct: 122 ALKDMEDERQKVYIYYTLSEAYSMKKDVEKEVYYLILTAIADLESSVREYASLQKLAHLM 181
Query: 279 YEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTITLLLAS 338
YE GDIDRAY+Y+ CS+EDA +CNAR R +EV E FP+IDKAY+L+ ER++ +L S
Sbjct: 182 YELGDIDRAYKYLSCSMEDAVACNARLRFMEVTEFFPIIDKAYKLKEERERAVSRAMLIS 241
Query: 339 VSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKETNLIKEEYIAQYINR 398
VSLL+L L+ AI Y+YR MKK++V R+ L+ AN+Q+ +N L++T IKE YIA+Y++R
Sbjct: 242 VSLLSLFLLIAIFYLYRWMKKISVMRRNLSLANKQMSAVNKELEQTGKIKEVYIARYLDR 301
Query: 399 CSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFV 458
C Y+DKL+ YRR L KLA +S++++LF+AIKS++FI ER EFYNEFD +FL+ +
Sbjct: 302 CVNYLDKLETYRRSLAKLAMSSRIDDLFKAIKSEQFIRDERNEFYNEFDKSFLNCSHTLL 361
Query: 459 ----------TDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLT 508
+K+ D TP LIRLG+ DS +IA FL YSL
Sbjct: 362 LLSITAGRRSKSLSKIRRTADNRTPD-----------LCLIRLGVVDSNKIAHFLGYSLA 410
Query: 509 TIYNYRSKV 517
TIYNYRS++
Sbjct: 411 TIYNYRSRI 419
>gi|156111629|gb|EDO13374.1| hypothetical protein BACOVA_00908 [Bacteroides ovatus ATCC 8483]
Length = 538
Score = 320 bits (820), Expect = 1e-85, Method: Composition-based stats.
Identities = 203/536 (37%), Positives = 302/536 (56%), Gaps = 5/536 (0%)
Query: 3 MRRFLQIILPLCCLTHLVQ-AAGSLAEHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSKL 61
M+RF ILPL + + A + E LK LD+ + ++ Y +K++ IDSL+ L
Sbjct: 1 MKRF---ILPLLIIFIFISLPAKANDEVKTLLKVLDKSLQNKASYTQQKQRQIDSLKIIL 57
Query: 62 RQAVSDQERFEWCSRLYETYIVYQTDSALRYVLESERLLRQLPDDAFRYRAQLNRVGVMT 121
RQ+ +++ L Y +Q DSAL Y + R ++ D A+L+ +++
Sbjct: 58 RQSRDIRKKVGTAQSLCFEYSSFQKDSALAYAIHMNRFAQESNDKELLIEAKLDYSRILS 117
Query: 122 VTGMYKEALDALNQIPRHAFSGDLLQSYFHHSRTLYGHMADYALTNDEKNAYQQSADRYR 181
G +KEAL N + + S L YF T+Y H +A D+ A YR
Sbjct: 118 SMGFFKEALAITNSMQQKQLSPKLKAEYFLGQVTIYNHQKAFASNEDDSQENDLIAQIYR 177
Query: 182 DSLLFIMPHGEINTLIVEADRFNTHGQFDATIAMLKPVTDTCRNTER-MRFLAYTLSEAY 240
DSLL + A H ++D I +L + R +AY+L+ AY
Sbjct: 178 DSLLQCKEVPSNVRAFITAPTLLFHKKYDDAIHILDSTYQSYTPYSRNAGIIAYSLASAY 237
Query: 241 ALKGDRENQKYYLTLSAIADLKTSVREYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQS 300
K D EN Y +SAI+D+ RE +SL+ LA L++E GDIDRA +YMK ++EDA
Sbjct: 238 QGKNDHENTIKYFAISAISDVLNGARENLSLKILAKLIFESGDIDRASKYMKNAMEDAIL 297
Query: 301 CNARSRTIEVAEIFPVIDKAYRLRSERKQRTITLLLASVSLLALCLVAAIIYVYRQMKKL 360
CNAR TIE ++++ IDKA++ + + K IT LL ++ ++ + L + + +Q K+
Sbjct: 298 CNARINTIEASDMYLFIDKAFQEKEKHKFIIITALLIALCIVCILLFILSVQLKKQKGKV 357
Query: 361 AVARQALADANRQLQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTS 420
A ++L+ ++Q +N+ L + N IKEEY+ Y+ + ++YI K+ ++++R K+A +
Sbjct: 358 EQANESLSYHLYEIQQMNSILADNNKIKEEYVGLYMEQYTSYISKIANFKKRALKIAKSE 417
Query: 421 KLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLN 480
++++ + S E + EFYN FD L+LFPNFV DFN LL ++ I P PG LL
Sbjct: 418 DIKKVTSFLHSSLNTEEDLAEFYNNFDKAILNLFPNFVEDFNALLLPENAIIPGPGKLLT 477
Query: 481 TELRIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
ELRIFALIRLGITDS +IA FLQYSL+TIYNYRSK+R KA GD+NEFE V RIG
Sbjct: 478 PELRIFALIRLGITDSVKIAHFLQYSLSTIYNYRSKMRIKANGDRNEFEEKVARIG 533
>gi|154492745|ref|ZP_02032371.1| hypothetical protein PARMER_02382 [Parabacteroides merdae ATCC
43184]
gi|154087050|gb|EDN86095.1| hypothetical protein PARMER_02382 [Parabacteroides merdae ATCC
43184]
Length = 547
Score = 315 bits (806), Expect = 6e-84, Method: Composition-based stats.
Identities = 203/549 (36%), Positives = 309/549 (56%), Gaps = 19/549 (3%)
Query: 3 MRRFLQIILPLCCLTHLVQAAGSLAEHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSK-L 61
M + +IIL L ++V A +L L +LD+ I Y +E I L+ K
Sbjct: 1 MGKRYRIILFLFFTANMVFAESNL---DSLLVNLDQTILQHEIYKDRREARIRELKGKAT 57
Query: 62 RQAVSDQERFEWCSRLYETYIVYQTDSALRYVLESERLLRQLPDDAFRYRAQLNRVGVMT 121
+ A + ++ +Y Y Y DSA+ Y+ ++ R+ R L D Y+++L +
Sbjct: 58 KTAPNSIAAYQLNDSIYREYKSYMCDSAVLYLTKNIRIARNLRDQEREYKSKLLLASLHA 117
Query: 122 VTGMYKEALDALNQIPRHAFSGDLLQSYFHHSRTLYGHMADYALTNDEKNAYQQSADRYR 181
TGMY+EA+D L ++ R L + Y+ +Y ++ + Y+ + YR
Sbjct: 118 ATGMYQEAIDVLEEVRREDLPASLTRDYYACKEQVYREISGNSRDPQSIRRYEDKSFVYR 177
Query: 182 DSLLFIMPHGEINTLIVEADRFNTHGQFDATIAMLKP-VTDTCRNTERMRFLAYTLSEAY 240
DSL ++P G + ++ G D + + + T +Y + Y
Sbjct: 178 DSLAMMLPEGAGKRVELQELALRADGHTDEALRINDTRLAKIPFGTPEYALTSYQRAMIY 237
Query: 241 ALKGDRENQKYYLTLSAIADLKTSVREYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQS 300
K DRE +KYYL LS+++D+++++ ++ SL LA LL +DGDI+RAY Y++ S ++
Sbjct: 238 RQKKDREKEKYYLALSSLSDIQSAITDHASLWMLADLLLKDGDIERAYHYIRFSWDETNR 297
Query: 301 CNARSRTIEVAEIFPVIDKAYRLRSERKQRTITLLLASVSLLALCLVAAIIYVYRQMKKL 360
ARSR+ + A+I +IDK Y+ E K R + L +S+L L L++AI+Y+YRQMK+L
Sbjct: 298 FRARSRSWQSADILSLIDKNYQATIEGKNRILVAYLTLISVLTLLLISAIVYIYRQMKRL 357
Query: 361 AVARQALADANRQLQTINNTL--------------KETNLIKEEYIAQYINRCSAYIDKL 406
A AR L + N QL+ +N L E+N IKEEYI ++++ CS+YIDKL
Sbjct: 358 AEARNHLQETNEQLKVLNGELYQMNDRLQSANLELSESNRIKEEYIGRFMSLCSSYIDKL 417
Query: 407 DHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLN 466
D YRR + K+ S+ ++ EL + +S + +EAE Y FD FL LFPNFVT FN LL
Sbjct: 418 DGYRRMVYKMVSSGQIGELVKVTRSSKGLEAELNALYKNFDTAFLHLFPNFVTQFNSLLL 477
Query: 467 EKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKN 526
E +++ K +LLNTELRIFALIRLGI DS++IAEFL+YS+ TIYNYR+KV+NKA ++
Sbjct: 478 EDEQVVLKRDELLNTELRIFALIRLGINDSSQIAEFLRYSVNTIYNYRAKVKNKACVSRD 537
Query: 527 EFEAAVMRI 535
+FE V I
Sbjct: 538 DFENLVREI 546
>gi|156861943|gb|EDO55374.1| hypothetical protein BACUNI_01046 [Bacteroides uniformis ATCC 8492]
Length = 585
Score = 306 bits (785), Expect = 2e-81, Method: Composition-based stats.
Identities = 205/541 (37%), Positives = 297/541 (54%), Gaps = 18/541 (3%)
Query: 1 MHMRRFLQIILPLCCLTHLVQAAGSLAEHSGRLKHLDRMISDRGRYHIEKEKGIDSL-RS 59
M R F L + +V AAG+ + LD I Y +E I L R
Sbjct: 56 MERRLFTFCFLAVVVWQSVVFAAGT-PSFTALTSSLDEAIEAHRHYVAVREGRIARLKRQ 114
Query: 60 KLRQAVSDQERFEWCSRLYETYIVYQTDSALRYVLE----SERLLRQLPDDAFRYRAQLN 115
L ++ F W +Y+ Y Y DSA+ Y+ +ER RQ D A R +L
Sbjct: 115 VLDTDTANISFFRWNGEIYKEYKAYICDSAIHYLRVNLDWAERYGRQ--DAALETRLELA 172
Query: 116 RVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSRTLYGHMADYALTNDEKNAYQQ 175
+ M GMY+EA + L Q + + LL Y++ LY ++ Y L + K YQ
Sbjct: 173 HL--MASAGMYEEAAELLRQTDKASLPSHLLPDYYNACHKLYTELSFYTLDDSFKKHYQA 230
Query: 176 SADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDATIAMLKPVTDTCR-NTERMRFLAY 234
A Y DSL+ ++ L R G D +++ + NT + Y
Sbjct: 231 LATHYDDSLMQVLLPSSPLYLERRETREAAAGHPDEALSINDTRLAHAKPNTPEYALVTY 290
Query: 235 TLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLRKLAFLLYEDGDIDRAYEYMKCS 294
S Y G+RE +K YL LSA+ D++ S+ ++ SL LA LLYE+GD++ AY Y++ S
Sbjct: 291 QRSLLYRRLGNREEEKRYLALSALTDIRLSITDHASLWNLAELLYEEGDMEHAYRYIRFS 350
Query: 295 LEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTITLLLASVSLLALCLVAAIIYVY 354
++ NARSR+++ A I +ID Y+ E++ + L L +S L + LV A+ ++Y
Sbjct: 351 WDETNRYNARSRSLQTAGILSLIDLTYQAMREKQNDRLRLYLWLISALIVLLVVAVGFIY 410
Query: 355 RQMKKLAVARQALADANRQLQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLT 414
RQM++L+ AR L AN QLQ NN IKEEY+ +++N CS YI++LD YRR +
Sbjct: 411 RQMQRLSAARNKLEHANEQLQLSNN-------IKEEYVGRFMNLCSVYINRLDTYRRMVN 463
Query: 415 KLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPK 474
K S ++EEL + ++S E ++ KE Y+ FD FL LFP+FV FN LL ++RI +
Sbjct: 464 KKISAGQMEELLKMVRSREVLDTGLKELYDNFDTAFLHLFPDFVDKFNDLLQPEERIVLR 523
Query: 475 PGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMR 534
G+LLNTELRIFALIRLGI DS++IAEFL+YS+ TIYNYR+KV+NKA + +FE +M+
Sbjct: 524 KGELLNTELRIFALIRLGIDDSSQIAEFLRYSVNTIYNYRAKVKNKARISREDFEIRLMQ 583
Query: 535 I 535
I
Sbjct: 584 I 584
>gi|29348718|ref|NP_812221.1| transcriptional regulator [Bacteroides thetaiotaomicron VPI-5482]
gi|29340624|gb|AAO78415.1| transcriptional regulator [Bacteroides thetaiotaomicron VPI-5482]
Length = 547
Score = 297 bits (761), Expect = 1e-78, Method: Composition-based stats.
Identities = 187/516 (36%), Positives = 294/516 (56%), Gaps = 16/516 (3%)
Query: 36 LDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQ-ERFEWCSRLYETYIVYQTDSALRYVL 94
LD+ I Y +++E I L+ + ER+ +++Y+ Y + DSA+ Y+
Sbjct: 31 LDQAILAHDTYVVQRESRIRHLKELAGDVAPNSIERYNLNNQIYKEYKAFICDSAIYYLN 90
Query: 95 ESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSR 154
E+ R+ L D ++L +++ TGMY E++D L + R + L+ Y+
Sbjct: 91 ENVRIAGNLGDTDREIESKLQLSLLLSSTGMYTESIDVLKSVDRQKVTSHLILDYYTCFD 150
Query: 155 TLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDATIA 214
+YG M Y Y++ + Y+DSL I+ +++ F ++D +
Sbjct: 151 HVYGEMGFYTQDQTLSAYYREISSAYKDSLYAILSPQSEEFMVMRETLFRDRHKYDEALE 210
Query: 215 ML-KPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLRK 273
+ + + +T + + Y S Y GD+ +K L LSAI+D++++++++ SL
Sbjct: 211 INDRRLMAAEPDTPQYALVTYHRSLIYKYLGDKIREKQNLCLSAISDIRSAIKDHASLWM 270
Query: 274 LAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTIT 333
LA LLYE+GD++RAY+YM+ S + NAR R+ + A++ +IDK Y+ E++ +
Sbjct: 271 LAQLLYENGDMERAYQYMRFSWNATKFYNARLRSWQSADVLSLIDKTYQAMIEKQNDRLQ 330
Query: 334 LLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLK----------- 382
L ++ L + L+ A+ Y+YRQMKKLAVAR L AN QL +N L+
Sbjct: 331 QYLVLITALLVLLIGALGYIYRQMKKLAVARNHLQTANHQLNQLNEELQQMNACLTSTNA 390
Query: 383 ---ETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAER 439
E+N IKEEYIA++I CS YI++LD YRR + K S ++ EL + +S + ++ E
Sbjct: 391 ELSESNQIKEEYIARFIKLCSTYINRLDAYRRMVNKKVSAGQIAELLKITRSQDALDEEL 450
Query: 440 KEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRI 499
+E Y FD FL LFP+FV FN LL + ++I K +LLNTELRIFALIRLGI DS++I
Sbjct: 451 EELYANFDTAFLHLFPDFVKKFNALLQDNEQIILKKDELLNTELRIFALIRLGIEDSSQI 510
Query: 500 AEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
AEFL+YS+ TIYNYR+KV+NKA G + +FE V +I
Sbjct: 511 AEFLRYSVNTIYNYRAKVKNKARGSREDFEDLVRKI 546
>gi|146301132|ref|YP_001195723.1| hypothetical protein Fjoh_3390 [Flavobacterium johnsoniae UW101]
gi|146155550|gb|ABQ06404.1| hypothetical protein Fjoh_3390 [Flavobacterium johnsoniae UW101]
Length = 556
Score = 296 bits (758), Expect = 2e-78, Method: Composition-based stats.
Identities = 190/525 (36%), Positives = 302/525 (57%), Gaps = 31/525 (5%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQTDSALRY 92
L L+ + ++ Y KE+ I + + ++ ++ + + LY Y + +DSA+ Y
Sbjct: 36 LTKLNFALKNKEHYVRLKEERILNFKKIKSDNLTQEQEYNYNKTLYTEYQKFNSDSAILY 95
Query: 93 VLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHH 152
V ++ ++ QL + A L V + + +G Y+E+ L I + S LL +Y+
Sbjct: 96 VKKNLKIAVQLQNKELEDLANLQLVTLYSSSGKYRESEAILKSINKKTLSAHLLPTYYIS 155
Query: 153 SRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLI--VEADRF-----NT 205
R + H A N Y+ +YRDSLL ++ ++ I ++ + F NT
Sbjct: 156 YREFFEHYA----ANSYNEEYRLQIIKYRDSLLKVLDPKTLDYKINRIQQNIFLRKFGNT 211
Query: 206 HGQFDATIAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSV 265
Q +A+LK +T + + Y L + K + E +K Y LSA +DLK +
Sbjct: 212 QKQL---LALLK---NTPEENPQYAMITYLLGKINEAKNNLELRKKYYALSAASDLKNAN 265
Query: 266 REYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRS 325
++ SL++LA + YE GD+D AY+ + ++EDA CN + RT+ ++E++ +I+ Y +
Sbjct: 266 KDNASLQELALVFYEIGDVDMAYKLTQSAIEDALYCNVQFRTLLMSEVYSIINTVYLEKE 325
Query: 326 ERKQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQAL--------------ADAN 371
+++ + L L +SLL++ LVAA+IYVY+QMKK++ R L + N
Sbjct: 326 AKRKTELQLYLICISLLSVFLVAAVIYVYKQMKKVSKIRAELYETTQKLAKLNKEITETN 385
Query: 372 RQLQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKS 431
QLQ N+ L E+N +KEEYIA + + CS YI+KL++YR L K A+ + +E+++ +KS
Sbjct: 386 EQLQETNSQLSESNHVKEEYIAHFFSLCSTYINKLENYRIILNKKATAKQFDEIYKMLKS 445
Query: 432 DEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRL 491
++ E +E Y FD FL+L+P FV DFN LL +++I K G+LLNTELRIFALIRL
Sbjct: 446 TTLVDNELEELYKNFDIIFLNLYPTFVKDFNTLLIPEEQIVLKQGELLNTELRIFALIRL 505
Query: 492 GITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
GITDS +IA FL+YSL+TIYNYR++ RNKA +N+FE VM+IG
Sbjct: 506 GITDSVKIAAFLRYSLSTIYNYRTRGRNKAAVSRNDFEEMVMKIG 550
>gi|156111662|gb|EDO13407.1| hypothetical protein BACOVA_00941 [Bacteroides ovatus ATCC 8483]
Length = 547
Score = 290 bits (741), Expect = 2e-76, Method: Composition-based stats.
Identities = 197/519 (37%), Positives = 293/519 (56%), Gaps = 16/519 (3%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQ-AVSDQERFEWCSRLYETYIVYQTDSALR 91
L LD I + Y ++E I L+ + S E + S++Y+ Y + DSA+
Sbjct: 28 LNVLDLTIQEHETYVAQRESRIRHLKELTHEIEASSAEHYNLNSQIYKEYKAFICDSAIH 87
Query: 92 YVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFH 151
Y+ E+ R+ +L D + +QL +++ TGMY E++D L + R L+ Y+
Sbjct: 88 YLNENIRIAERLHDTDRKIESQLQLSLLLSSTGMYTESIDVLESVDRQKVVSRLIADYYT 147
Query: 152 HSRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMP-HGEINTLIVEADRFNTHGQFD 210
+YG ++ Y Y + YRDSL I+P E ++ EA + H +
Sbjct: 148 CFDHVYGELSVYTQDKTLSGRYWSISQAYRDSLYAILPPESEEYLMMREASLRDQHQYEE 207
Query: 211 ATIAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYIS 270
A + + NT + + Y S Y D +K L LSAI+D++++++++ S
Sbjct: 208 ALKVNDLRLAEIEVNTPQYALVTYHRSLIYKYSNDSLGEKRNLCLSAISDIRSAIKDHAS 267
Query: 271 LRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQR 330
L LA LLYEDGD++RAY+YM+ S + NAR R+ + A++ +IDK Y+ E++
Sbjct: 268 LWMLAQLLYEDGDMERAYQYMRFSWNATKFYNARLRSWQSADVLSLIDKTYQAMIEKQND 327
Query: 331 TITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLK-------- 382
+ L ++ L + L+ A+ Y+YRQMKKLA AR L AN QL +N L+
Sbjct: 328 RLQQNLLLITALLVLLIVALGYIYRQMKKLADARNHLQVANGQLNGLNEELRQMNSCLSS 387
Query: 383 ------ETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIE 436
E+N IKEEYIA++I CS YI++LD YRR + K S ++ EL + +S + ++
Sbjct: 388 TNIELSESNQIKEEYIARFIKLCSTYINRLDAYRRMVNKKVSAGQIAELLKITRSQDALD 447
Query: 437 AERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDS 496
E +E Y FD FL LFPNFV FN LL + + I PK +LLNTELRIFALIRLGI DS
Sbjct: 448 EELEELYANFDTAFLHLFPNFVGKFNDLLQDNEHILPKKDELLNTELRIFALIRLGIEDS 507
Query: 497 TRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
++IAEFL+YS+ TIYNYR+KVRNKA G +++FE V +I
Sbjct: 508 SQIAEFLRYSVNTIYNYRAKVRNKARGSRDDFEILVRKI 546
>gi|150008321|ref|YP_001303064.1| putative regulatory protein [Parabacteroides distasonis ATCC 8503]
gi|149936745|gb|ABR43442.1| putative regulatory protein [Parabacteroides distasonis ATCC 8503]
Length = 551
Score = 287 bits (735), Expect = 1e-75, Method: Composition-based stats.
Identities = 189/519 (36%), Positives = 292/519 (56%), Gaps = 20/519 (3%)
Query: 34 KHLDRMISDRGRYHIEKEKGIDSLRSK-LRQAVSDQERFEWCSRLYETYIVYQTDSALRY 92
K LD + +R Y ++E+ I L+ L +S + +E +LYE + Q DSA+ Y
Sbjct: 31 KELDEAVLNRSFYLQQREQRITQLKDMFLLSKISLWQEYEINHQLYEEFKKIQQDSAIYY 90
Query: 93 VLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHH 152
+ + + + D A Y ++L + +GMY+E+ L I R S + Q ++
Sbjct: 91 IKRNMEIASFMKDTARIYTSRLRLATLYAFSGMYRESESLLRSIDRELLSKEQKQDFYEA 150
Query: 153 SRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDAT 212
+ + Y TN Y++ D Y+DSLL ++ I A ++ HGQ +
Sbjct: 151 YYSFF----SYYSTNLGSFEYRKQLDLYKDSLLSVLDTVSYRYKINLAQKYLAHGQARSA 206
Query: 213 IAMLKPVTDTC-RNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISL 271
+L P+ + R + Y L + Y + G + + Y TLS IAD+K + + S+
Sbjct: 207 EKVLIPLLEKEERYNPNFAMITYLLGDVYDMDGRTDLARDYYTLSVIADIKRAFLDSGSI 266
Query: 272 RKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRT 331
+KLA YE GD+ A +Y + ++E A +CN + R E+++ +P+I+ +Y+ R + +R
Sbjct: 267 QKLALNYYESGDLSNALKYAQLAIEGAVTCNIQFRMNEISKFYPIINASYQTREAQGKRQ 326
Query: 332 ITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKETN------ 385
+ S+SLL L L+ ++ YVYRQM+K++ R+ L + N +L +N + ETN
Sbjct: 327 LMTYFLSISLLTLFLILSLAYVYRQMRKISAIREELVNTNARLVKLNREISETNDLLQER 386
Query: 386 --------LIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEA 437
IKEEYIA +++ CS YI+KL+ Y++ L K A +L+ELF+ ++S +E
Sbjct: 387 NIQLSESNHIKEEYIAHFLDLCSTYINKLEDYQKSLQKKAMNKQLDELFKMLRSTRMVEN 446
Query: 438 ERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDST 497
E + Y FD FL L+P FV DFN LL ++RI K DLLN ELRIFAL+RLG+TDS
Sbjct: 447 EVEALYVNFDRIFLGLYPTFVKDFNALLQPEERIVLKSEDLLNKELRIFALMRLGVTDSV 506
Query: 498 RIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
RIA FL+ SL+TIYNYR+KVRNKA+ ++EFE VMRIG
Sbjct: 507 RIAAFLRCSLSTIYNYRTKVRNKALVPRDEFEGWVMRIG 545
>gi|156862782|gb|EDO56213.1| hypothetical protein BACUNI_00153 [Bacteroides uniformis ATCC 8492]
Length = 571
Score = 270 bits (690), Expect = 2e-70, Method: Composition-based stats.
Identities = 184/538 (34%), Positives = 285/538 (52%), Gaps = 37/538 (6%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSR-LYETYIVYQTDSALR 91
L LD +I+ + + + KE I L KLR+ V E W ++ LY+ Y VY DSA+
Sbjct: 31 LVRLDSLIAQKNTFAMLKEAKIAQLH-KLRKDVRTLEERYWLNKNLYDEYCVYNADSAMN 89
Query: 92 YVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFH 151
YV + ++ + D + ++ + ++ TG+ KEA D L+ + + +LL Y+
Sbjct: 90 YVAGNLDIVYKQNDKYRQMEWKIKKSFLLAATGLLKEAQDELDGVSGGSLPKELLVDYYG 149
Query: 152 HSRTLYGHMADYALTNDEKNA----YQQSADRYRDSL-LFIMPHGEINTLIVEADRFNTH 206
LY H Y T E Y Q Y+DSL + + P + T
Sbjct: 150 QMLYLYSHFNQY--TGSEMGTLHEHYAQLERVYKDSLNMVLTPEDPLFLWYKGQVVQGTD 207
Query: 207 GQFDATIAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVR 266
+ + K + ++ +T R AY L+ Y ++EN YL SA+AD++ S +
Sbjct: 208 SMYVFKERLQKGILNSAFDTRRDAMNAYVLACFYRESDEQENYLTYLIYSAMADVRISNK 267
Query: 267 EYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSE 326
+ SL +LA +L+ GDID AY YM L++A + R R + ++ + I + Y+ R++
Sbjct: 268 DIASLEELAGVLFSLGDIDHAYVYMSYCLQNALAYRNRVRVVGISAVQDTIHQIYQERNQ 327
Query: 327 RKQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADAN--------------- 371
R++ + + L VS+L+L + A +Y+Y+QMK+L +RQ L +AN
Sbjct: 328 RQEARLRMYLVLVSVLSLISLFAFLYIYKQMKRLKQSRQQLNEANNRLNKHVEELSKMHG 387
Query: 372 -------------RQLQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLAS 418
QL+ NN L+E+N +KEEYI + CS YI KLD YR+ + +
Sbjct: 388 QVAETNVQLTSLNEQLRDTNNQLRESNYVKEEYIGYVFSICSNYISKLDEYRKNINRKLK 447
Query: 419 TSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDL 478
++LE++ + + E KEFY+ FD FL ++P+FV+DFN LL+ ++I K G+L
Sbjct: 448 ANQLEDVKALTDTHSMAQNELKEFYHNFDAIFLHIYPDFVSDFNALLHPDEQIVLKDGEL 507
Query: 479 LNTELRIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
LNTELRI+AL+RLGITDS +IAEFL YS T+YN R K RNKA+ + EF A V +G
Sbjct: 508 LNTELRIYALVRLGITDSVKIAEFLHYSPQTVYNNRLKTRNKAIIPREEFAAVVRSLG 565
>gi|29349477|ref|NP_812980.1| conserved hypothetical protein, putative regulatory protein
[Bacteroides thetaiotaomicron VPI-5482]
gi|29341386|gb|AAO79174.1| conserved hypothetical protein, putative regulatory protein
[Bacteroides thetaiotaomicron VPI-5482]
Length = 534
Score = 258 bits (658), Expect = 9e-67, Method: Composition-based stats.
Identities = 170/528 (32%), Positives = 283/528 (53%), Gaps = 7/528 (1%)
Query: 13 LCCLTHLVQAAGSL---AEHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQE 69
LC +T L+ + +L E+ LK LD++IS+R Y +KE I L++K ++ + +
Sbjct: 5 LCFITILLGSLLNLYANNENDSLLKVLDKVISERLVYTEKKEATIKELKAKKKEQKTLDD 64
Query: 70 RFEWCSRLYETYIVYQTDSALRYVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEA 129
+ S + Y + DSA +Y+ E+ + ++L + + +L V +++G++ +A
Sbjct: 65 MYRLNSEILHQYETFVCDSAEQYINENIEIAKKLDNKTYLLEGRLQLAFVYSLSGLFIQA 124
Query: 130 LDALNQIPRHAFSGDLLQSYFHHSRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMP 189
D I L Y + Y ++ Y + Y + YRD+++ I+
Sbjct: 125 NDIFKSINCSDLPSHLQALYCWNRIRYYENLIKYTDDARFASEYLVEKEAYRDTVMSILY 184
Query: 190 HGEINTLIVEADRFNTHGQFDATIAMLKPVTDTCRN-TERMRFLAYTLSEAYALKGDREN 248
A + G + +L + + T ++ LS AY L G+ E
Sbjct: 185 DASEEYSKERAIKLQDQGNTKEALKILTKIYQKEKTGTHGFAMMSMGLSRAYRLVGEHEL 244
Query: 249 QKYYLTLSAIADLKTSVREYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTI 308
++ YL L+A+ D+K +V+E +L LA LY GDIDR+Y Y+K +L DA N+R +
Sbjct: 245 EEKYLILAAMTDIKLAVKENEALLTLAVNLYHKGDIDRSYNYIKVALSDAIFYNSRFKNT 304
Query: 309 EVAEIFPVIDKAYRLRSERKQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALA 368
+A I P+I+ Y R E++++ + + SL + L + + Y+Q K ++ A++ L
Sbjct: 305 VIARIHPIIENTYLYRLEKQKQNLRFYILLTSLFVVALAITLYFTYKQTKIVSRAKKNLN 364
Query: 369 DANRQLQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRA 428
N +L +N L E NLIKE Y+ ++N+C+ YI+KLD YR+ + + T ++++L+++
Sbjct: 365 VMNEELVALNKNLDEANLIKERYVGYFMNQCAVYINKLDEYRKNVNRKIKTGQVDDLYKS 424
Query: 429 IKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFAL 488
S E E + Y FD FL L+PNFV +FN LL +D D LNTELRIFAL
Sbjct: 425 --SSRPFEKELEGLYTNFDKAFLKLYPNFVEEFNSLLKPEDYYKLDK-DQLNTELRIFAL 481
Query: 489 IRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
+R+GITD ++IA FL YS+ TIYNY+SKV+ ++ D N FE V ++G
Sbjct: 482 MRMGITDVSQIAVFLHYSVQTIYNYKSKVKRMSLLDGNIFEEEVKKLG 529
>gi|29347570|ref|NP_811073.1| putative regulatory protein [Bacteroides thetaiotaomicron VPI-5482]
gi|29339471|gb|AAO77267.1| putative regulatory protein [Bacteroides thetaiotaomicron VPI-5482]
Length = 561
Score = 254 bits (648), Expect = 2e-65, Method: Composition-based stats.
Identities = 168/520 (32%), Positives = 282/520 (54%), Gaps = 29/520 (5%)
Query: 36 LDRMISDRGRYHIEKEKGIDSLRSKLRQA-VSDQERFEWCSRLYETYIVYQTDSALRYVL 94
L +MI + + +KE+ I ++ L+ + ++ ++ +LY Y + DSA+ YV
Sbjct: 51 LHKMIDAKPLFVQKKEQRIARIKCLLKDSGLTPDREYKVNLQLYNEYKKFNIDSAIHYVD 110
Query: 95 ESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSR 154
+ + RQL + +Y++ L V ++ G Y++A L ++ F LL +Y+
Sbjct: 111 RNLEIARQLNRNYLKYQSSLQLSLVYSMCGRYRDAELLLEKMKPSEFPRSLLATYY---- 166
Query: 155 TLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDATIA 214
Y +Y + N Y + + Y+DSL +M H + + A + H D+T A
Sbjct: 167 DTYARFWEYYSISATNNQYGKKREAYQDSLYALMDHTSFDYKLSRAYSYAGH---DSTKA 223
Query: 215 MLKPVTDTCRNTERMRFLAYTL-SEAYALKGD----RENQKYYLTLSAIADLKTSVREYI 269
+ + D N E + Y + + +YA+ ++ K YL +SAIAD++ + RE
Sbjct: 224 I--KILDELLNAEEVGTPNYAMITHSYAMLSRYLKREDDAKKYLMMSAIADIQNATRETA 281
Query: 270 SLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQ 329
SL+ LA + YE+ ++ A+++ + +++D S R +E+ + + +I+ AY+ R +
Sbjct: 282 SLQALALIQYEENNLADAFKFTQSAIDDVVSSGIHFRAMEIYKFYSIINTAYQTEEARSK 341
Query: 330 RTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLK------- 382
+ L S S+ L+ ++++Y QMKK ++ALA +N +L +N+ L
Sbjct: 342 SNLITFLISTSVSLFLLIVLVVFIYIQMKKTLRMKRALAQSNEELLRLNDKLNSMNSELN 401
Query: 383 -------ETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFI 435
E N IKE YIAQ+ + C +YI K++ Y+ L K+A EEL + +KS I
Sbjct: 402 DKNDELCEINNIKEHYIAQFFDVCFSYIHKMEKYQNMLYKIAINKCYEELIKKLKSSALI 461
Query: 436 EAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITD 495
+ E Y FD FL+L+P FV+DFN LL + ++I K LLN ELRI+AL+RLGITD
Sbjct: 462 DDELDALYTRFDRVFLNLYPTFVSDFNALLKDDEKIILKQDALLNRELRIYALLRLGITD 521
Query: 496 STRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
S +IA FL+ S +T+YNYR+K+RNKA D++EFE +M+I
Sbjct: 522 SGKIANFLRCSTSTVYNYRTKMRNKAAVDRDEFENEIMKI 561
>gi|156107261|gb|EDO09006.1| hypothetical protein BACOVA_04864 [Bacteroides ovatus ATCC 8483]
Length = 553
Score = 252 bits (644), Expect = 4e-65, Method: Composition-based stats.
Identities = 171/512 (33%), Positives = 280/512 (54%), Gaps = 6/512 (1%)
Query: 27 AEHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQT 86
+E+ LK LD++IS+R Y +KE I L+ K S ++ + + Y +
Sbjct: 41 SENDSLLKVLDKVISERLIYTQKKEATIKELKKKKVGLNSLEDIYNLNKEIIHQYETFVC 100
Query: 87 DSALRYVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLL 146
DSA +Y+ E+ + + + + + QL V +++G++ +A D I R A D L
Sbjct: 101 DSAEQYIHENIDIAKIIGNKEYLLEEQLRLAFVYSLSGLFIQANDIFKSI-RCADLPDHL 159
Query: 147 QSYFHHSRT-LYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNT 205
++ + +R Y ++ Y N Y + YRD+++ I+ +A +
Sbjct: 160 KALYCWNRIRYYENLIKYTDDVRFSNEYIAEKEAYRDTVMSILFDQSDEYRKEKAVKLQD 219
Query: 206 HGQFDATIAMLKPVTDTCRNTER-MRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTS 264
G + +LK + + +A L+ AY L G+ ++ +L L+A+ D K +
Sbjct: 220 KGSTKEALLILKEIYNKQEPASHGYAMMAMGLARAYRLTGNYILEEKFLMLAAMTDTKLA 279
Query: 265 VREYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLR 324
V+E +L LA LY GDIDRAY Y+K +L+DA N+R + +A I P+I+ Y +R
Sbjct: 280 VKENEALLTLAVNLYHKGDIDRAYNYIKVALDDAIFYNSRFKNTVIARIHPIIENTYLIR 339
Query: 325 SERKQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKET 384
E++++ + + SL + L + + Y+Q K ++ A++ L N +L +N L E
Sbjct: 340 LEKQKQNLRFYIFLTSLFVVALAITLYFTYKQTKIVSRAKRHLKAMNEELVGLNKNLDEA 399
Query: 385 NLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYN 444
NLIKE+Y+ ++N+C+ YI+KLD YR+ + + T ++++L+++ S E E +E Y+
Sbjct: 400 NLIKEKYVGYFMNQCAVYINKLDEYRKNVNRKIKTGQIDDLYKS--SSRPFEKELEELYH 457
Query: 445 EFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQ 504
FD FL L+PNFV +FN LL ++R + D LNTELRIFALIRLGITD +IA FL
Sbjct: 458 NFDKAFLKLYPNFVVEFNSLLKSEERYRLEK-DQLNTELRIFALIRLGITDVGQIAVFLH 516
Query: 505 YSLTTIYNYRSKVRNKAVGDKNEFEAAVMRIG 536
YS+ TIYNY+SKV+ + D N FE V +G
Sbjct: 517 YSVQTIYNYKSKVKRMSTLDSNIFEEEVKMLG 548
>gi|156111653|gb|EDO13398.1| hypothetical protein BACOVA_00932 [Bacteroides ovatus ATCC 8483]
Length = 520
Score = 249 bits (636), Expect = 3e-64, Method: Composition-based stats.
Identities = 185/524 (35%), Positives = 278/524 (53%), Gaps = 30/524 (5%)
Query: 36 LDRMISDRGRYHIEKEKGIDSLRSKLRQAVS-DQERFEWCSRLYETYIVYQTDSALRYVL 94
LD+ I + Y KE + L+ + R+ ER+ + +Y Y Y +DSAL Y+
Sbjct: 2 LDKTIKEADTYVQIKENKLHELKKEARKTPPFSVERYNLNNDIYLEYKAYSSDSALHYLN 61
Query: 95 ESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSR 154
E+ L RQL D + QL +++ GMY EA D LN I R LL Y+
Sbjct: 62 ENMLLARQLNDKERELKIQLELSYLLSSIGMYMEAADILNSIDRQTLPSSLLGHYY---- 117
Query: 155 TLYGHMADYALTNDEKNAYQQSADRY-------RDSLLFIMPHGEINTLIVEADRFNTHG 207
T Y H+ Y + Y+ A RY RDS+ + L + + G
Sbjct: 118 TCYEHV--YFEAGAAQPRYKMFASRYVKLSHAYRDSMQITLDPSSATYLWLRETQLREAG 175
Query: 208 QFDATIAML-KPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVR 266
++D + + + ++ T + +AY + G ++ YYL LSAI+D++++++
Sbjct: 176 KYDEALEFSDRRLAESSFGTPQYALVAYQRFRLFESMGKKDEHLYYLVLSAISDVRSAIK 235
Query: 267 EYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSE 326
E SL LA L GD+ RAY+Y+ S E +Q R R+ +I+ Y+ +
Sbjct: 236 EQSSLMVLAQELNSKGDLKRAYDYINFSWEISQFYKTRLRSWMNITPLSMINGNYQDIIK 295
Query: 327 RKQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADAN--------------R 372
++ R + + + V+LLAL LV A+IY+YRQMK L++A++ L + N R
Sbjct: 296 QQNRELLIYIVCVALLALLLVIALIYIYRQMKALSIAKKGLQEVNERLFSLNEELEEVNR 355
Query: 373 QLQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIK-S 431
L++ N L E+NLIKE YIA++ CS Y+D+L YR+ + K ++ EL + S
Sbjct: 356 HLRSTNLELSESNLIKEAYIARFFKLCSVYVDRLQAYRKLVNKKLQRGQVAELLKMTHLS 415
Query: 432 DEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRL 491
++ + E +E Y FD FL LFPNFV N LL ++I KP +LLNTELRIFALIRL
Sbjct: 416 NDIVTVEVQELYANFDSAFLHLFPNFVESLNALLLPDEQIVLKPDELLNTELRIFALIRL 475
Query: 492 GITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
GI DS++IAE L YS+ TIYNYRS+V+ KA +++FE V +I
Sbjct: 476 GIKDSSQIAELLHYSVNTIYNYRSRVKTKARVSRDDFEDLVAKI 519
>gi|156109538|gb|EDO11283.1| hypothetical protein BACOVA_03186 [Bacteroides ovatus ATCC 8483]
Length = 554
Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats.
Identities = 167/515 (32%), Positives = 273/515 (53%), Gaps = 19/515 (3%)
Query: 36 LDRMISDRGRYHIEKEKGIDSLRSKLRQ-AVSDQERFEWCSRLYETYIVYQTDSALRYVL 94
L R+I ++ + EKE I+ ++ LR ++ + + RLY Y + DSA+ YV
Sbjct: 38 LRRVIDEKHVFVKEKEDRINRIKCMLRSPGLTLEGEYRINLRLYNEYKKFHIDSAIHYVD 97
Query: 95 ESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSR 154
+ + RQL F ++ L+ + ++ G ++EA L I DLL +Y+
Sbjct: 98 RNIEISRQLNRPYFTNQSSLHLSLLYSMCGRFREAEIILKSIKTSELPRDLLINYYQTYS 157
Query: 155 TLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDATIA 214
+ +GH + + N Y + Y+DSL ++ H + + +A + +
Sbjct: 158 SFWGHYS----ISVANNLYGKQQAAYQDSLFALIDHTSWDYRMSQASYYIWRDTLKSKEI 213
Query: 215 MLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLRKL 274
+ + T + ++ S + + +K YL LSAIAD + + RE SL+ L
Sbjct: 214 FKELLEIEEVGTPNYAMITHSYSRLCHHQKKYDEEKKYLMLSAIADTRNATRENASLQSL 273
Query: 275 AFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTITL 334
A + Y++ ++ A+++ + +++D S R IE+ + +I+ AY+ R + +T
Sbjct: 274 ALIAYDEQNLADAFKFTQSAIDDVISSGIHFRAIEIYKFNSIINTAYQAEQARSRSHLTT 333
Query: 335 LLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANR--------------QLQTINNT 380
L S S++ LV ++++Y QMKK +QALA +N QL NN
Sbjct: 334 FLISTSIILFLLVLLVLFIYIQMKKTLKIKQALAQSNEELLRLNNKLNSMNSQLNDTNNQ 393
Query: 381 LKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERK 440
L E N IKE YIA++ + C +YI K++ Y+ L K+A +EL + +KS I+ E
Sbjct: 394 LCEINSIKEYYIAEFFDVCFSYIHKMEKYQNMLYKIAINKYYDELIKKLKSSALIDDELS 453
Query: 441 EFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIA 500
Y FD FL L+P FV+DFN LL ++++I KP LLN ELRI+AL+RLGITDS +IA
Sbjct: 454 ALYTRFDKVFLGLYPTFVSDFNALLKDEEKIILKPDALLNRELRIYALLRLGITDSGKIA 513
Query: 501 EFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
FL+ S +T+YNYR+K+RNKA D++EFE +M+I
Sbjct: 514 NFLRCSTSTVYNYRTKMRNKAAVDRDEFENEIMKI 548
>gi|156858966|gb|EDO52397.1| hypothetical protein BACUNI_04011 [Bacteroides uniformis ATCC 8492]
Length = 590
Score = 230 bits (586), Expect = 2e-58, Method: Composition-based stats.
Identities = 169/565 (29%), Positives = 269/565 (47%), Gaps = 41/565 (7%)
Query: 9 IILPLCCLTHLVQAAGSLAEHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQ 68
++L LC L +AA + + L LD +I + + +++ I+ + SK + Q
Sbjct: 29 LLLFLCSLQG--KAAFYSDKAANLLNTLDSLIENYPQVIEQRQSAINRIASK-PMPNTLQ 85
Query: 69 ERFEWCSRLYETYIVYQTDSALRYVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKE 128
R+E RL++ Y + DSAL YV + + +Q DD ++ R + + G E
Sbjct: 86 ARYEHNKRLFQEYQYFNIDSALVYVNANIVIAQQTGDDNMMNLCRIQRSYIYAIIGQLHE 145
Query: 129 ALDALNQIPRHAFSGDLLQSYFHHSRTLYGHMADYALT-NDEKNAYQQSADRYRDSLLFI 187
+++ + QI +G +L Y LY A+Y +D+++ Y + Y DS L +
Sbjct: 146 SVEEMRQIDHDKLTGGMLIEYLGQMSLLYSRFAEYTDGGSDKRDYYYEREHFYIDSTLNV 205
Query: 188 MPHGEINTLIVEADRFNTHGQFDATIAMLKPVTDTCRNTERMRF-LAYTLSEAYALKGDR 246
+PH ++ + + G + +I +K +T + L Y LS Y
Sbjct: 206 LPHENPYYILYKGLKDLRAGNYSGSIVAIKVYLETSKTISIANSKLLYMLSTLYRDNNQP 265
Query: 247 ENQKYYLTLSAIADLKTSVREYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSR 306
+ + L +AI D+ + + ++L LA LL E GD +RAY+Y+ LE A N R R
Sbjct: 266 DLRLECLAEAAIMDMTLANNDNVTLHDLATLLNEKGDNERAYKYISLCLEMAHIMNNRVR 325
Query: 307 TIEVAEIFPVIDKAYRLRSERKQRTITLLLASVSLLALCLVAAIIYVYRQMK-------- 358
+ F I K + K R ++ L + ++ L +IY YR +K
Sbjct: 326 MVSCLSTFDAIQKTSLQIKKAKNRQLSWSLWVIGFVSCLLTLTLIYTYRILKHRSRQRIE 385
Query: 359 ---------------------------KLAVARQALADANRQLQTINNTLKETNLIKEEY 391
++ V Q + + N L+ N L + N+IKEEY
Sbjct: 386 LKNLNGLLSASNTELESANERLKNSLHEITVLHQQITETNNLLKEANEKLYDQNIIKEEY 445
Query: 392 IAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFL 451
I + CS YI KLD YR+ + T + ++L + S IE+E KEFY FD FL
Sbjct: 446 IGFVLTLCSDYISKLDQYRKNINNKVKTQQYKDLLKFTDSPIMIESELKEFYRTFDAIFL 505
Query: 452 SLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIY 511
++P+FV+DFN LL + RI PK LNTELRIFAL+ LG+TDS++IA+F +S T+Y
Sbjct: 506 HVYPSFVSDFNSLLQPEYRIIPKEEGRLNTELRIFALMHLGVTDSSKIADFFHWSTQTVY 565
Query: 512 NYRSKVRNKAVGDKNEFEAAVMRIG 536
N R +R KA+ D+ F V ++G
Sbjct: 566 NKRVYIRQKAI-DRETFNDQVRKLG 589
>gi|153808734|ref|ZP_01961402.1| hypothetical protein BACCAC_03033 [Bacteroides caccae ATCC 43185]
gi|149128560|gb|EDM19778.1| hypothetical protein BACCAC_03033 [Bacteroides caccae ATCC 43185]
Length = 553
Score = 229 bits (584), Expect = 4e-58, Method: Composition-based stats.
Identities = 156/515 (30%), Positives = 279/515 (54%), Gaps = 20/515 (3%)
Query: 36 LDRMISDRGRYHIEKEKGIDSLRSKLR-QAVSDQERFEWCSRLYETYIVYQTDSALRYVL 94
L RMI ++ + +KE+ I+ ++ L+ ++ +++++ +L Y + DSA+ YV
Sbjct: 38 LRRMIDEKPLFIQQKEQRINRIKCLLKGSGLTLEQKYKINFQLCNEYKKFVVDSAIHYVD 97
Query: 95 ESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSR 154
++ + R+L + + ++ L + ++ G Y++A L +I S DLL Y+
Sbjct: 98 QNLEIARKLNNRDLKNQSSLQLSLLYSMCGRYRDAELILEKIKTSELSKDLLSVYYETYS 157
Query: 155 TLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFNTHGQFDATIA 214
+ + Y++T + + Q++ Y+DSLL ++ + + A + A
Sbjct: 158 RFWEY---YSITANSRYGKQRAV--YQDSLLSLLDQTSFDYKLSRAYYYGGRDSIKAKTV 212
Query: 215 MLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLRKL 274
+ + + T + + + + + +K YL +SAIAD++ + RE SL+ L
Sbjct: 213 LQELLDTEEVGTPHYAMITHAYASFCWHQKKMDERKKYLMMSAIADIRNATRETASLQAL 272
Query: 275 AFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTITL 334
A + YE+ ++ A+++ + +++D S R +E+ + + +I+ AY+ R + +
Sbjct: 273 ALIQYEEKNLSDAFKFTQSAIDDVVSSGIHFRAMEIYKFYSIINTAYQTEEARSKSNLIT 332
Query: 335 LLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLK------------ 382
L S S++ LV +I +Y QM+K+ ++AL +N +L +N L
Sbjct: 333 FLISTSIILFLLVLLVICIYIQMRKILKIKRALVQSNEKLLRLNEKLNTMNNQLNEQNNQ 392
Query: 383 --ETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERK 440
ETN IKE YIA++ + C +YI K++ Y+ L K A +EL + +KS I+ E
Sbjct: 393 LSETNNIKEHYIAEFFDVCFSYISKMEKYQNVLYKYAINKYYDELIKKLKSSALIDEELN 452
Query: 441 EFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIA 500
Y FD FL+L+P FV+DFN LL E++RI K LLN ELRI+AL+RLGI+DS +IA
Sbjct: 453 ALYARFDRVFLNLYPTFVSDFNALLKEEERIVLKTDTLLNRELRIYALLRLGISDSGKIA 512
Query: 501 EFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
FL+ S +T+YNYR+K+RNK+ D++EFE +M+I
Sbjct: 513 NFLRCSTSTVYNYRTKMRNKSAVDRDEFENEIMKI 547
>gi|67940337|ref|ZP_00532774.1| transcriptional regulator [Chlorobium phaeobacteroides BS1]
gi|67913452|gb|EAM62863.1| transcriptional regulator [Chlorobium phaeobacteroides BS1]
Length = 375
Score = 229 bits (583), Expect = 4e-58, Method: Composition-based stats.
Identities = 135/318 (42%), Positives = 197/318 (61%), Gaps = 14/318 (4%)
Query: 232 LAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLRKLAFLLYEDGDIDRAYEYM 291
+ Y S Y L+ E + Y LSAI+D+ S ++ SL +LA LL+ GD+DRA +++
Sbjct: 53 ITYERSLLYRLEKKEEEEMKYFCLSAISDIMGSNKDNASLTELALLLHRKGDVDRANKFI 112
Query: 292 KCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERKQRTITLLLASVSLLALCLVAAII 351
K S +DA + R ++EI P+I AY+L+ E++Q + +LL L + L+ +
Sbjct: 113 KFSYDDAVFFKSELRFKLLSEILPIIQDAYQLKIEKQQNRLGVLLLISGFLLMALIVSFF 172
Query: 352 YVYRQM--------------KKLAVARQALADANRQLQTINNTLKETNLIKEEYIAQYIN 397
+ RQ+ K+L + LAD N +LQ N+ L E+N +KE YI ++
Sbjct: 173 LILRQVAEVRKKKNELFKVNKQLKELNKDLADTNNKLQFTNDELSESNHLKEHYIGNFLT 232
Query: 398 RCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNF 457
CS YI KLD + +++ K K+EELF KS +F++AE K FY FD FL +FPNF
Sbjct: 233 ICSDYIYKLDQFSKKVNKQLVNKKIEELFVETKSKKFMDAEIKAFYENFDKAFLHIFPNF 292
Query: 458 VTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKV 517
+TDFN+LL+++ +I K + LNTELRIFALIRLGI +S++IA+ L+YS+ TIYNYR KV
Sbjct: 293 LTDFNQLLSDEKQIILKTEERLNTELRIFALIRLGINESSQIAKLLRYSVNTIYNYRVKV 352
Query: 518 RNKAVGDKNEFEAAVMRI 535
+NKA G++ FE VMRI
Sbjct: 353 KNKAKGERENFENEVMRI 370
>gi|146302068|ref|YP_001196659.1| hypothetical protein Fjoh_4332 [Flavobacterium johnsoniae UW101]
gi|146156486|gb|ABQ07340.1| hypothetical protein Fjoh_4332 [Flavobacterium johnsoniae UW101]
Length = 532
Score = 222 bits (565), Expect = 5e-56, Method: Composition-based stats.
Identities = 158/507 (31%), Positives = 276/507 (54%), Gaps = 6/507 (1%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLR---SKLRQAVSDQERFEWCSRLYETYIVYQTDSA 89
L+ LD+++ + Y +K + I++L+ SK + +++E + L++ Y ++ DSA
Sbjct: 25 LEELDKVLLKKEVYLKQKYRKIETLKKNVSKYTVSQNNEELYNTYMSLFDEYKSFKYDSA 84
Query: 90 LRYVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSY 149
Y+ +S+ + L D + ++++ V+ +G++KEA+D LN I Y
Sbjct: 85 YYYLEQSKIKAKILKDPKYLAKSRIKEGFVLLSSGLFKEAIDTLNVIDDKKLDQKNKFEY 144
Query: 150 FHHSRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEA-DRFNTHGQ 208
++ Y +ADY Y Q + + L ++ E+ R
Sbjct: 145 YNIKARAYYDLADYNRDQRFNIHYVQQGNHFLKKALELIGTNTNEYWAAESLKRLKQQDW 204
Query: 209 FDATIAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREY 268
A A + + + +L Y+ +G + +YL L+AIAD+K + +E
Sbjct: 205 RGAEFAFSYWINNYHLPPDYYGIATSSLGYIYSERGYTKKAIHYLALAAIADVKNATKET 264
Query: 269 ISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERK 328
++LR LA L++ G +D+A EY+ +++DA NAR R IE++ I P+I+KA + K
Sbjct: 265 VALRNLANELFKMGYLDKANEYINIAMDDATFYNARHRKIEISSILPIIEKAQLNNVKDK 324
Query: 329 QRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKETNLIK 388
+ ++ +++L L ++ I +++Q+K+ AR+ +A + QLQ +N +L E N IK
Sbjct: 325 NDKLEKIIILLTILTLIIILFSIIIFKQLKEKNKARKIMASSYAQLQEMNVSLSEANAIK 384
Query: 389 EEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDH 448
EEYI +I SA+I+K+DH ++ T K +E+ ++K ++ ER+ +++FD
Sbjct: 385 EEYITYFIKATSAFINKIDHIQKSTLHKIITKKTDEVIASLKRYN-VKEERENLFHQFDE 443
Query: 449 TFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLT 508
FL LFP FVTDFN L + K G+LLNTELRIFAL RLGI DS+++AEFL+ S+
Sbjct: 444 IFLKLFPTFVTDFNNLFPNDHKCIVKKGELLNTELRIFALYRLGIQDSSQMAEFLELSVA 503
Query: 509 TIYNYRSKVRNKAVGDKNEFEAAVMRI 535
TIY Y++++++K+ K+ FE +M I
Sbjct: 504 TIYTYKTRIKSKS-DFKDTFEEKIMAI 529
>gi|86140905|ref|ZP_01059464.1| transcriptional regulator [Flavobacterium sp. MED217]
gi|85832847|gb|EAQ51296.1| transcriptional regulator [Leeuwenhoekiella blandensis MED217]
Length = 561
Score = 207 bits (528), Expect = 1e-51, Method: Composition-based stats.
Identities = 159/530 (30%), Positives = 266/530 (50%), Gaps = 24/530 (4%)
Query: 27 AEHSGRLKHLDRMISDRGRYHIEKEKGIDSLRSKL--RQAVSDQERFEWCSRLYETYIVY 84
A+H+ +L L I Y K + IDSL ++L ++ S Q RFE L+ Y ++
Sbjct: 26 AQHAEQLTQLVETIERSAAYDARKIERIDSLYNELTLQRDSSLQRRFELNRSLFFEYRIF 85
Query: 85 QTDSALRYVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGD 144
+ DSA +Y L S+ L QL D L+ V GM+ EAL LN +
Sbjct: 86 KQDSAFKYSLASKALAEQLGDARLIAETTLDLANVCVSAGMFSEALAYLNGADLERIPEE 145
Query: 145 LLQSYFHHSRTLYGHMADYALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLIVEADRFN 204
+ S + Y MADY+ + + Y++ A Y L + G + +++
Sbjct: 146 IRFSLYGLLGRCYSDMADYSTISYYSDLYRKKAREYHQKSLELAEPGTWDYIMLRGYLNY 205
Query: 205 THGQFDATIAMLKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTS 264
+G + + LKP + ++ + L + Y GD E +L S+IAD+KTS
Sbjct: 206 KYGNLEEALEDLKPSLEMRQDLRSQAVVNSVLGDIYVQMGDEEQAIKHLAQSSIADIKTS 265
Query: 265 VREYISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLR 324
+E +S+ +LA L++ GD+ A ++K + DA++ A+ R I V I P I++ +
Sbjct: 266 AKENLSMIQLAEFLFQKGDVKLASVFIKKANADAEAYGAQQRKIRVGAILPNIEEQIINQ 325
Query: 325 SERKQRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADANRQLQT-------- 376
E ++ ++ +S+L L+ + Y Q+++L AR+AL A++ LQ
Sbjct: 326 VETQRARLSKQNIMLSVLLAVLLVLAVITYTQLRRLKRARKALLVAHKDLQAKNERIVEV 385
Query: 377 -------------INNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTKLASTSKLE 423
+N+ L E N IKEEYI + + + +K ++ + + T+ LE
Sbjct: 386 NETIHSQNEELNRVNDLLLEANTIKEEYIGFFFTQDADIFEKFKEFKTGIERNLKTNNLE 445
Query: 424 ELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTEL 483
+ + + + ++ E+K+ FD F+ LFPNF+ +FN LL +++I K G LLN EL
Sbjct: 446 RI-KYLTTVYDLKKEKKKLLESFDEAFIKLFPNFIAEFNALLKPEEQIQLKKGQLLNKEL 504
Query: 484 RIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVM 533
RIFALIRLGI + IA+ L YS+ +IY Y++K+RNK+ DK +F+ ++
Sbjct: 505 RIFALIRLGIKHNEIIAQILGYSVNSIYAYKTKIRNKSFLDKKDFDQMLL 554
>gi|146302167|ref|YP_001196758.1| hypothetical protein Fjoh_4435 [Flavobacterium johnsoniae UW101]
gi|146156585|gb|ABQ07439.1| hypothetical protein Fjoh_4435 [Flavobacterium johnsoniae UW101]
Length = 566
Score = 205 bits (522), Expect = 5e-51, Method: Composition-based stats.
Identities = 148/529 (27%), Positives = 265/529 (50%), Gaps = 31/529 (5%)
Query: 33 LKHLDRMISDRGRYHIEKEKGIDSLRSKLRQAVSDQER-FEWCSRLYETYIVYQTDSALR 91
LK LD++I Y K + I ++++KLR Q R E +L++ Y+V++ DSA
Sbjct: 38 LKRLDQVIDSSSLYDSNKGRIIKNIKNKLRHNKDGQYREVELNQQLFDQYVVFKRDSAFN 97
Query: 92 YVLESERLLRQLPDDAFRYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFH 151
Y L+ + L + D AF +A +N + GMYKE LD L+ I + + Y+
Sbjct: 98 YALKIKALSESINDKAFLMKANINLANISVAAGMYKEGLDYLDLIKPEEINNENGSLYYG 157
Query: 152 HSRTLYGHMADYALTNDEKNAYQQSADRYR-DSLLFIMPHGEINTLIVEADRFNTHGQFD 210
YG MA+Y+ Y + A R +L P N+ + +N + +
Sbjct: 158 LLGRCYGDMAEYSSIPFFSRKYNRLAKECRIKALNLTTPGTFFNSFL---KFYNEYKDGN 214
Query: 211 ATIAM--LKPVTDTCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREY 268
IA + + ++ N+ L Y L E G + Y + +AIAD++ S +E
Sbjct: 215 LIIAEEGFQSMFNSNINSRDEALLNYMLGEICRESGKADRAIDYYSKAAIADIQISTKES 274
Query: 269 ISLRKLAFLLYEDGDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERK 328
++L +L+ LL+ ++ A +K + +DA A+ R ++V I P+I++ ER+
Sbjct: 275 LALIRLSELLFRKKNLQSASSLVKKAYDDAVFYGAQQRKLQVGAILPLIEEEIVQNIERE 334
Query: 329 QRTITLLLASVSLLALCLVAAIIYVYRQMKKLAVARQALADA------------------ 370
++ + + + + ++ +I + Q +K+ A++ +A A
Sbjct: 335 RKRLYIQYIGAIVFSTIVICFLILTFIQFRKIQKAKKIIASAHSDLQKVNKQLVIVNEEV 394
Query: 371 ---NRQLQTINNTLKETNLIKEEYIAQYINRCSAYIDKLDHYRRRLTK-LASTSKLEELF 426
N+Q+++INN L E N I EEYI + +K + + + K + S ++ +
Sbjct: 395 NARNKQIESINNRLFEANKINEEYIGFFFTEYDDIFEKFNDFISSIEKDIDSGDYVKAKY 454
Query: 427 RAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVTDFNKLLNEKDRITPKPGDLLNTELRIF 486
R + D ++ E+++ N FD F++LFP+F+ +FN L+ ++I K +LN ELRIF
Sbjct: 455 RVSRYD--LKKEKEKLLNNFDTAFINLFPSFIEEFNSLMKPAEQIKLKDNQILNKELRIF 512
Query: 487 ALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRNKAVGDKNEFEAAVMRI 535
ALIRLGI + +IA+ L YS+ +IY Y++KVRNK++ + F+ +M++
Sbjct: 513 ALIRLGIKHNEKIAQILGYSVNSIYAYKTKVRNKSIIENEGFDKKLMKM 561
>gi|149279983|ref|ZP_01886109.1| hypothetical protein PBAL39_14404 [Pedobacter sp. BAL39]
gi|149229363|gb|EDM34756.1| hypothetical protein PBAL39_14404 [Pedobacter sp. BAL39]
Length = 552
Score = 196 bits (499), Expect = 3e-48, Method: Composition-based stats.
Identities = 160/496 (32%), Positives = 262/496 (52%), Gaps = 14/496 (2%)
Query: 46 YHIEKEKGIDSL-RSKLRQAVSDQERFEWC-SRLYETYIVYQTDSALRYVLESERLLRQL 103
Y +KEK I+SL RS + S ER + + E Y YQ DSA YV + L +
Sbjct: 45 YDRQKEKRINSLKRSVSETSKSGYERLHLLYNTILEEYKFYQFDSAHTYVRKIIDLSEK- 103
Query: 104 PDDAFRYR-AQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSRTLYGHMAD 162
D FR + A+L + ++T +GM+KEA + ++++ +L+ +Y + Y +A
Sbjct: 104 HGDLFRVKEAKLQLLQILTPSGMFKEASELVSELDHETMPMELMGNYHNLKARFYESLAK 163
Query: 163 YALTNDEKNAYQQSADRYRDSLLFIMPHGEINTLI-VEADRFNTHGQFDATIAMLKPVTD 221
Y Y+ A + ++P + +I + AD ++ + L +T
Sbjct: 164 YNNDPFYAAQYRALAAAQFSKMNSVLPPNKFEAVIGLAADSSSSSSRRRPAEFYLHFITH 223
Query: 222 TCRNTERMRFLAYTLSEAYALKGDRENQKYYLTLSAIADLKTSVREYISLRKLAFLLYED 281
+ + ++ +A LS Y G +Q +L LSAI+D+++S ++ ++ KL +LY+
Sbjct: 224 SGLSDHKIAMVAMRLS--YCFSG--ADQIMFLALSAISDIRSSTKDTQAILKLGEVLYKA 279
Query: 282 GDIDRAYEYMKCSLEDAQSCNARSRTIEVAEIFPVIDKAYRLRSERK-QRTITLLLASVS 340
GD++ AY + ++ DA+ +AR I++ I P+I A + SE + QR +++ A +S
Sbjct: 280 GDVNMAYTCINKAISDAEFYSARGHKIDIQNILPLI--AGKTISEAELQRDKSIMYALIS 337
Query: 341 -LLALCLVAAIIYVYRQMKKLAVARQALADANRQLQTINNTLKETNLIKEEYIAQYINRC 399
++ L ++ A+ VY Q+KK+ V+ Q + N QL +IN L E I EEYI + R
Sbjct: 338 AVVVLVVLCALFIVYIQLKKIRVSEQLINQKNNQLASINMKLMEHTRINEEYIGFFFKRS 397
Query: 400 SAYIDKLDHYRRRLTKLASTSKLEELFRAIKSDEFIEAERKEFYNEFDHTFLSLFPNFVT 459
A I L+ +R++ T +++E + S + I ER+ Y+ D FL LFPNFV
Sbjct: 398 FANISSLEKLKRKIAHSIKTKRIDEALDTVTSIQ-ISKEREYLYHTLDQIFLKLFPNFVP 456
Query: 460 DFNKLLNEKDRITPKPGDLLNTELRIFALIRLGITDSTRIAEFLQYSLTTIYNYRSKVRN 519
FN+LL +D+I P+ L T LRIFALIRLGI D IA L YS++TIY Y+ +V+
Sbjct: 457 AFNELLKPEDQIWPQKDQSLTTVLRIFALIRLGIKDDETIAVILDYSVSTIYTYKIRVKA 516
Query: 520 KAVGDKNEFEAAVMRI 535
KA+ EF+ + I
Sbjct: 517 KALVKGEEFDKRIAGI 532
>gi|4572635|emb|CAB40102.1| hypothetical protein [Prevotella albensis]
Length = 202
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 38/138 (27%), Positives = 67/138 (48%)
Query: 49 EKEKGIDSLRSKLRQAVSDQERFEWCSRLYETYIVYQTDSALRYVLESERLLRQLPDDAF 108
EKEK ID L++ L++ S + +F ++LYE Y V+Q DSA Y+ L ++ D +
Sbjct: 11 EKEKRIDYLKTSLQKEQSTKAQFNIYNQLYEEYYVFQFDSAQVYINRGIELAKKQNDKYY 70
Query: 109 RYRAQLNRVGVMTVTGMYKEALDALNQIPRHAFSGDLLQSYFHHSRTLYGHMADYALTND 168
+ + +M + G+Y EA D + I +L Y+ +Y + +DY +
Sbjct: 71 YSLFVIRKAQLMAIGGLYHEAKDLIETIDVSNLDKELQFDYYLSLFRIYSYWSDYCNDKE 130
Query: 169 EKNAYQQSADRYRDSLLF 186
K Y+ A+ + +F
Sbjct: 131 YKPRYRTLANTFLSKAIF 148
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.323 0.136 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,737,294,895
Number of Sequences: 5470121
Number of extensions: 68803831
Number of successful extensions: 209686
Number of sequences better than 1.0e-05: 29
Number of HSP's better than 0.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 209585
Number of HSP's gapped (non-prelim): 30
length of query: 536
length of database: 1,894,087,724
effective HSP length: 138
effective length of query: 398
effective length of database: 1,139,211,026
effective search space: 453405988348
effective search space used: 453405988348
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 132 (55.5 bits)