BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI1050
(791 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|156861421|gb|EDO54852.1| hypothetical protein BACUNI_013... 475 e-132
gi|150004846|ref|YP_001299590.1| hypothetical protein BVU_2... 334 2e-89
gi|153808003|ref|ZP_01960671.1| hypothetical protein BACCAC... 316 4e-84
gi|156110788|gb|EDO12533.1| hypothetical protein BACOVA_016... 314 2e-83
gi|29345738|ref|NP_809241.1| hypothetical protein BT_0328 [... 303 3e-80
gi|60681146|ref|YP_211290.1| hypothetical protein BF1652 [B... 300 2e-79
gi|53712934|ref|YP_098926.1| hypothetical protein BF1644 [B... 300 3e-79
gi|154490235|ref|ZP_02030496.1| hypothetical protein PARMER... 229 5e-58
gi|150008031|ref|YP_001302774.1| hypothetical protein BDI_1... 226 5e-57
gi|34540194|ref|NP_904673.1| hypothetical protein PG0362 [P... 169 7e-40
gi|149371925|ref|ZP_01891244.1| hypothetical protein SCB49_... 98 2e-18
gi|88805045|ref|ZP_01120565.1| hypothetical protein RB2501_... 92 2e-16
gi|89890456|ref|ZP_01201966.1| conserved hypothetical prote... 89 1e-15
gi|88713481|ref|ZP_01107564.1| hypothetical protein FB2170_... 87 6e-15
gi|91218355|ref|ZP_01255299.1| hypothetical protein P700755... 86 9e-15
gi|146299686|ref|YP_001194277.1| hypothetical protein Fjoh_... 83 7e-14
gi|83856530|ref|ZP_00950059.1| hypothetical protein CA2559_... 83 7e-14
gi|126663335|ref|ZP_01734333.1| hypothetical protein FBBAL3... 79 9e-13
gi|149279624|ref|ZP_01885753.1| hypothetical protein PBAL39... 78 2e-12
gi|150025265|ref|YP_001296091.1| hypothetical protein FP119... 77 4e-12
gi|86132078|ref|ZP_01050674.1| hypothetical protein MED134_... 75 1e-11
gi|120437244|ref|YP_862930.1| hypothetical protein GFO_2916... 73 8e-11
gi|126648123|ref|ZP_01720617.1| hypothetical protein ALPR1_... 71 3e-10
gi|88801502|ref|ZP_01117030.1| hypothetical protein PI23P_0... 70 6e-10
gi|86142126|ref|ZP_01060650.1| hypothetical protein MED217_... 67 5e-09
gi|86134238|ref|ZP_01052820.1| hypothetical protein MED152_... 65 2e-08
gi|110638203|ref|YP_678412.1| hypothetical protein CHU_1803... 64 4e-08
>gi|156861421|gb|EDO54852.1| hypothetical protein BACUNI_01374 [Bacteroides uniformis ATCC 8492]
Length = 736
Score = 475 bits (1222), Expect = e-132, Method: Composition-based stats.
Identities = 283/753 (37%), Positives = 414/753 (54%), Gaps = 69/753 (9%)
Query: 45 VSDSIQNQHKEIPRGLKVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLG 104
+ DS + + +P L +W + E G+RT DT S F +T G+ G YN LGNLG
Sbjct: 47 LEDSTNTEIQSLPPKLYMWKLSETLGNRTIIPADTASLNFQSTNLVEGMFGHYNYLGNLG 106
Query: 105 SPRQNRIFIDRAEPSEFIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQF 164
SPR +R+F +R E IF+AP+ F +P + FT++ P TNL+Y AG + +GE++F
Sbjct: 107 SPRLSRLFFEREESEPTIFMAPFSSFFTRPDQVLFTNSNVPYTNLTYYKAGSKVNGEERF 166
Query: 165 KALFAINAGKKWGFGFKFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQK 224
K+ F++N K+ FGF DYLYGRGYY NQ+ SH N +G+Y GEKYQ + + K
Sbjct: 167 KSYFSVNVNKQLAFGFNIDYLYGRGYYQNQSTSHFNAGLFGSYIGEKYQVQAVYNNFFLK 226
Query: 225 VSENGGIANDAYITHPEIFN---ESFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRK 281
++ENGGIA+D YIT PE + + + + IP L ++ NRN + +I+ RY LGF R+
Sbjct: 227 MNENGGIADDQYITRPENMSGGKKEYESTTIPVKLEQSSNRNKDFYIYLTQRYRLGFTRR 286
Query: 282 VPMTKEEIEAKKFAIKSMQSQEEDKARRKAMEEAKQNGEEFDEEEFNKRQKAKGRPDNAK 341
V T+E G+ + +A
Sbjct: 287 VRNTQE-------------------------------GKPTVSAPKASSSPSSEAEISAS 315
Query: 342 VIDKNPTSNALATNGEQRDSINIDRLEEKQIAASDAANEWIKDEYVPVTSFIHTARFDNF 401
D ++++A+ + D L A +D+ E +E+VPVTSFIHT + +
Sbjct: 316 NNDIALPNDSIASKSVSTLAAANDSLPSVSTADNDSLFE---EEFVPVTSFIHTLKVERS 372
Query: 402 RRIYQAYDTPNSFYANNYYYNNATASDSIYDQTKHWALKNTFAIALLEGFNKWAKAGLKA 461
R +++ P F+ +Y ++DS T +++KN F IALLEGFNK+AKAGL A
Sbjct: 373 RHQFRSGSEPEGFFPEDYKLYKNYSNDS----TTAFSVKNVFGIALLEGFNKYAKAGLTA 428
Query: 462 FVSHELRHYELPMLLTTTTTPAANPLFGGYEKINKNDISVGGQLLKANGKTLHYNINAEA 521
++SH+ Y+L + T T + + + +I +GG+L K GK LHYN+N E
Sbjct: 429 YISHKFSRYDL---MNTDTLTDMRRI-----RYTEQEIFLGGELAKREGKLLHYNVNGEV 480
Query: 522 WIAGDRAGQLHIDGNADLNFPLLGDTVQFAATAFLHRTAPTFYMNTFRSRHFWWGN-NLD 580
+ GQ ++ N DLNF L DTV F A ++ T P+FYM + S H+ W N N+D
Sbjct: 481 GLVDKAIGQFRVNANLDLNFRLWKDTVNFYARGYVSNTLPSFYMRHYHSNHYNWDNDNMD 540
Query: 581 QQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSG 640
++ +R+ G + + T LR G + +KNYTYF N Q+ G
Sbjct: 541 KEFRTRVEGELNISRWGTNLRAGVENIKNYTYF----------------NQSALPEQNGG 584
Query: 641 AISLLTLQLQQDFKLGIMNWQNLITFQKSSNEAVLPTPTLNIYSNLFIRFKIA-KVLDCD 699
I +L+ L+QDF+LG+ + N +T+QK+SNE VLP P L++Y N +I K+A KVL
Sbjct: 585 NIQVLSATLKQDFRLGVFHLDNEVTWQKTSNETVLPLPQLSLYHNFYILAKLAKKVLTVQ 644
Query: 700 FGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSH 759
G D RYFTKY AP Y PG+ F +Q T+ E+G YPI+N YAN L+RTR F MM H
Sbjct: 645 LGADVRYFTKYNAPAYAPGVQQFHLQPTD-DLVEIGGYPIVNVYANLHLKRTRIFAMMYH 703
Query: 760 INSGDG-GNYFFTPHYPLNQRVLRLGISWDFFN 791
+N+G G N F PHYP+N R+ ++G+SW+F++
Sbjct: 704 VNAGMGSANSFLVPHYPINPRLFKIGVSWNFYD 736
>gi|150004846|ref|YP_001299590.1| hypothetical protein BVU_2309 [Bacteroides vulgatus ATCC 8482]
gi|149933270|gb|ABR39968.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 703
Score = 334 bits (856), Expect = 2e-89, Method: Composition-based stats.
Identities = 194/440 (44%), Positives = 271/440 (61%), Gaps = 31/440 (7%)
Query: 357 EQRDSINIDRLEEKQIAASDAANEWIKDEYVPVTSFIHTARFDNFRRIYQAYDTPNSFYA 416
E++DS++ D ++ Q+ +VPVTSFIHT + D R Y +YD +
Sbjct: 290 EKQDSVDTDSVKITQV-------------FVPVTSFIHTLQVDLNNRKYISYDDAQN--- 333
Query: 417 NNYYYNNATASDSIYDQTKHWALKNTFAIALLEGFNKWAKAGLKAFVSHELRHYELPMLL 476
Y+ +N +DSI D+TK ++KNT IAL EGFNKWAKAGL AF+S+E R++ L
Sbjct: 334 QKYFEHNYLGTDSI-DKTKRTSIKNTIGIALQEGFNKWAKAGLTAFLSYEYRNFAL---- 388
Query: 477 TTTTTPAANPLFGGYEKINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGN 536
T TT + Y++ + +S+GG+L K GK LHYNI E IAG+ AGQ ++G
Sbjct: 389 TDTTNIPGQRIINNYKE---SSLSIGGELSKKQGKLLHYNILGELAIAGEDAGQFSVEGR 445
Query: 537 ADLNFPLLGDTVQFAATAFLHRTAPTFYMNTFRSRHFWWGNN-LDQQIHSRLLGAFTLKK 595
DLN L GDTV+ AF+ P FY F+S+H+WW NN L + + +RL G +L +
Sbjct: 446 GDLNLRLFGDTVRLDVNAFIKNQNPVFYFRHFQSKHYWWDNNDLSKIMRTRLEGKLSLNR 505
Query: 596 TRTKLRVGYDVLKNYTYFGLQNERV--ANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDF 653
T+LR G + +KNYTY + V + GN KNN VRQHSG I + T LQQ
Sbjct: 506 WGTQLRAGVENIKNYTYLANASIPVKDSEGNVTGFKNNAA-VRQHSGNIQIFTAMLQQKL 564
Query: 654 KLGIMNWQNLITFQKSSNEAVLPTPTLNIYSNLFIRFKIA-KVLDCDFGVDGRYFTKYYA 712
K+GI + + +QKSS + +LP P L+ Y NL+++F +A KVL + G D RYF+KYYA
Sbjct: 565 KVGIFHLDGEVAYQKSSEQDILPLPELSAYGNLYMKFGLAKKVLQIEMGADVRYFSKYYA 624
Query: 713 PEYIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSGDG-GNYFFT 771
P+Y P +G F Q + + E+G +PI+N YAN L+RTRFFIMM H+N+G G NYF
Sbjct: 625 PDYSPVIGQFYNQNPD-DKIEIGAFPIVNVYANLHLKRTRFFIMMYHVNAGSGKSNYFLA 683
Query: 772 PHYPLNQRVLRLGISWDFFN 791
PHYP+N R+++ G+SW+FF+
Sbjct: 684 PHYPINPRMIKFGLSWNFFD 703
Score = 218 bits (556), Expect = 1e-54, Method: Composition-based stats.
Identities = 124/284 (43%), Positives = 173/284 (60%), Gaps = 8/284 (2%)
Query: 5 IYLIINLFIISLLANAQSFNRVSRDGTSTSNGF---QGNRNLGVSDSIQN-QHKEIPRGL 60
I+L+ F SL+A ++ N ++ S++NG Q +RN D+ IP GL
Sbjct: 8 IFLLFLCFHTSLMAQ-RTMNTLNNQFGSSANGLDRNQYDRNGNPIDTTAVVDANTIPIGL 66
Query: 61 KVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSPRQNRIFIDRAEPSE 120
W +D RFG+ T DT+ F N+ G G+YN LGNLGSPR +RIF DR +PS+
Sbjct: 67 YSWKVDSRFGNVTYIPVDTLQHAFQNSNDMGGYTGQYNYLGNLGSPRLSRIFFDRRDPSQ 126
Query: 121 FIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKALFAINAGKKWGFGF 180
+ F PYD + +P FT+T SP TNLSY G DGE++FK+ FAINA K+ GFGF
Sbjct: 127 YFFTDPYDFCVQRPEDVIFTNTFSPFTNLSYYKGGGSRDGEERFKSYFAINANKRLGFGF 186
Query: 181 KFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQKVSENGGIANDAYITHP 240
DYLYGRG Y +Q+ + N + +Y G+KY +F+ + ++ K++ENGGI +D YIT+P
Sbjct: 187 YIDYLYGRGLYQDQSTAFFNGGLFSSYRGDKYDMHFIFNNDNLKMAENGGITDDRYITNP 246
Query: 241 EIFNE---SFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRK 281
E ++ NEIPT LS+ WN N + H F HRY+LGFY++
Sbjct: 247 LDMAEGKKEYSANEIPTRLSQIWNHNTSYHAFLTHRYNLGFYKE 290
>gi|153808003|ref|ZP_01960671.1| hypothetical protein BACCAC_02289 [Bacteroides caccae ATCC 43185]
gi|149129612|gb|EDM20826.1| hypothetical protein BACCAC_02289 [Bacteroides caccae ATCC 43185]
Length = 683
Score = 316 bits (809), Expect = 4e-84, Method: Composition-based stats.
Identities = 175/412 (42%), Positives = 250/412 (60%), Gaps = 32/412 (7%)
Query: 383 KDEYVPVTSFIHTARFDNFRRIYQAYD-TPNSFYANNYYYNNATASDSIYDQTKHWALKN 441
K E+VPVTSFIHT + + R + +YD + +Y N Y+ + A + D T + +KN
Sbjct: 301 KQEFVPVTSFIHTIQVERARHGFNSYDEVADGYYQNTYFDKDNKAF--VRDSTTYVGIKN 358
Query: 442 TFAIALLEGFNKWAKAGLKAFVSHELRHYELPMLLTTTTTPAANPLFGGYEKINKNDISV 501
T IALLEGFNK+AKAGL AF S+++ Y T PL +K N+N+I V
Sbjct: 359 TIGIALLEGFNKYAKAGLTAFASYKISKY-------TLMNKDGGPL---PDKYNENEIFV 408
Query: 502 GGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTVQFAATAFLHRTAP 561
GG+L K G LHY+ E +AG GQ ++ G+ DLNFPL DTV A +
Sbjct: 409 GGELSKREGNILHYHAIGEVGLAGKAIGQFNVKGDIDLNFPLWKDTVSLIARGEVSNQLA 468
Query: 562 TFYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFGLQNERVA 621
FYM + S+HF W N++D++ +R+ G ++ + +T+LR G + +KNYTYF
Sbjct: 469 PFYMRHYHSKHFMWDNDMDKEFRTRIEGELSIARWKTRLRAGVENIKNYTYF-------- 520
Query: 622 NGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKSSNEAVLPTPTLN 681
N Q Q SG++ +L+ L QDFKLGI + N +T+QKSS++ VLP P L+
Sbjct: 521 --------NQQATPEQKSGSLQVLSASLNQDFKLGIFHLDNEVTWQKSSDQTVLPLPDLS 572
Query: 682 IYSNLFIRFKIA-KVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGNYPII 740
+Y N +++FK+A KVL G D RYFTKY AP Y+P + +F +Q E + E+G YPI+
Sbjct: 573 LYHNFYMQFKLAKKVLSVQLGADVRYFTKYNAPAYMPAIQNFYLQPEEG-KVEIGGYPIV 631
Query: 741 NAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVLRLGISWDFFN 791
N YAN L+RTRF++MM H+N G +YF +PHYP+N RVL+ G+SW+F++
Sbjct: 632 NVYANLHLKRTRFYVMMYHVNQGISRPDYFLSPHYPINPRVLKFGLSWNFYD 683
Score = 220 bits (561), Expect = 3e-55, Method: Composition-based stats.
Identities = 109/250 (43%), Positives = 156/250 (62%), Gaps = 3/250 (1%)
Query: 47 DSIQNQHKEIPRGLKVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSP 106
DS + K +P L +W I + GDRT DT F NT G+ G YN L N+GSP
Sbjct: 52 DSANVEVKGLPPTLYMWRIKNQLGDRTIIPADTAYHHFQNTNLTEGITGHYNYLANMGSP 111
Query: 107 RQNRIFIDRAEPSEFIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKA 166
R +RIF DR P IF+ P+ F V+P++F+FT++ P TNL+Y AG++ +GE++FK+
Sbjct: 112 RMSRIFFDRRYPEPTIFMEPFSSFFVRPTEFNFTNSNVPYTNLTYHKAGNKVNGEERFKS 171
Query: 167 LFAINAGKKWGFGFKFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQKVS 226
F++N KK FGF DYLYGRGYY+NQN ++ N + +G+Y G++YQ + S N+ K +
Sbjct: 172 YFSVNVNKKLAFGFNIDYLYGRGYYNNQNTAYFNAAVFGSYIGDRYQVQGIYSNNYLKTN 231
Query: 227 ENGGIANDAYITHPEIFNE---SFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRKVP 283
ENGGI +D YIT PE E + + IPT+LS+ NRN + ++F RY+LGF R +P
Sbjct: 232 ENGGITDDRYITAPEEMAEGRREYESTNIPTVLSETTNRNHDFYVFLTQRYNLGFKRDIP 291
Query: 284 MTKEEIEAKK 293
+ + K
Sbjct: 292 QAENDTTPAK 301
>gi|156110788|gb|EDO12533.1| hypothetical protein BACOVA_01675 [Bacteroides ovatus ATCC 8483]
Length = 683
Score = 314 bits (804), Expect = 2e-83, Method: Composition-based stats.
Identities = 173/414 (41%), Positives = 255/414 (61%), Gaps = 34/414 (8%)
Query: 382 IKDEYVPVTSFIHTARFDNFRRIYQAYDTPNSFYANNYYYNNATASDS--IYDQTKHWAL 439
+K E+VPVTSFIHT + + R ++++ + NYY N +D+ + D T + +
Sbjct: 300 VKQEFVPVTSFIHTIQVERAR---HSFNSNDDMREKNYYQNTYFDTDNPNVRDSTTYVGI 356
Query: 440 KNTFAIALLEGFNKWAKAGLKAFVSHELRHYELPMLLTTTTTPAANPLFGGYEKINKNDI 499
KNT IALLEGFNK+AKAGL AF S+++ Y L + NPL +K N+N+I
Sbjct: 357 KNTIGIALLEGFNKYAKAGLTAFASYKISKYTLMNM-------EGNPL---PDKYNENEI 406
Query: 500 SVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTVQFAATAFLHRT 559
VGG+L K G LHY+ E +AG GQ ++ G+ DLNFPL DTV A +
Sbjct: 407 FVGGELSKREGNVLHYHAIGEVGLAGKAIGQFNVKGDIDLNFPLWKDTVSLIARGEVSNK 466
Query: 560 APTFYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFGLQNER 619
FYM + S+HF W +++D++ +R+ G ++ + RT+L+ G + +KNYTYF
Sbjct: 467 LAPFYMRHYHSKHFMWDDDMDKEFRTRIEGELSIARWRTRLKAGVENIKNYTYF------ 520
Query: 620 VANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKSSNEAVLPTPT 679
N + Q+SG+I +L+ L QDFKLGI + N +T+QKSS++ VLP P
Sbjct: 521 ----------NQEAKPEQNSGSIQVLSASLNQDFKLGIFHLDNEVTWQKSSDQIVLPLPD 570
Query: 680 LNIYSNLFIRFKIA-KVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGNYP 738
L++Y N +++FK+A KVL G D RYF+KY AP Y+P + +F +Q T+ + ++G YP
Sbjct: 571 LSLYHNFYMQFKLAKKVLSVQLGADVRYFSKYNAPAYMPAIQNFHLQPTD-DQVQIGGYP 629
Query: 739 IINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVLRLGISWDFFN 791
I+N YAN L+RTRF++MM H+N G NYF +PHYP+N RVL+ G+SW+F++
Sbjct: 630 IVNVYANLHLKRTRFYVMMYHVNQGMSSPNYFLSPHYPINPRVLKFGLSWNFYD 683
Score = 217 bits (552), Expect = 3e-54, Method: Composition-based stats.
Identities = 106/245 (43%), Positives = 155/245 (63%), Gaps = 3/245 (1%)
Query: 47 DSIQNQHKEIPRGLKVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSP 106
DS + + +P L +W I + GDRT DT F N+ GL G YN L N+GSP
Sbjct: 52 DSANVEVQGLPPKLYMWRIKNQLGDRTIIPADTAFHHFQNSNLTEGLTGHYNYLANMGSP 111
Query: 107 RQNRIFIDRAEPSEFIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKA 166
R +RIF DR +P IF+ P+ F ++P++F+FT++ P TNL+Y AG++ +GE++FK+
Sbjct: 112 RMSRIFFDRRDPEPTIFMEPFSSFFIRPTEFNFTNSNVPYTNLTYHKAGNKVNGEERFKS 171
Query: 167 LFAINAGKKWGFGFKFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQKVS 226
F++N KK FGF DYLYGRGYY+NQN ++ N + +G+Y G+KYQ + S N+ K +
Sbjct: 172 YFSVNVNKKLAFGFNVDYLYGRGYYNNQNTAYFNAAVFGSYIGDKYQMQAIYSNNYLKTN 231
Query: 227 ENGGIANDAYITHPEIFNE---SFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRKVP 283
ENGGI +D YIT PE + + + IPT+LS NRN + ++F RY+LGF R +P
Sbjct: 232 ENGGIEDDRYITAPEEMAQGQREYESTNIPTVLSATTNRNHDFYVFLTQRYNLGFSRDIP 291
Query: 284 MTKEE 288
+ +
Sbjct: 292 QAEND 296
>gi|29345738|ref|NP_809241.1| hypothetical protein BT_0328 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337631|gb|AAO75435.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 680
Score = 303 bits (776), Expect = 3e-80, Method: Composition-based stats.
Identities = 169/411 (41%), Positives = 239/411 (58%), Gaps = 33/411 (8%)
Query: 383 KDEYVPVTSFIHTARFDNFRRIYQAYDTPNSFYANNYYYNNATASDSIYDQTKHWALKNT 442
K E+VPVTSFIHT + + R +++ D ++Y T + D T + +KNT
Sbjct: 301 KQEFVPVTSFIHTIQVERSRHNFRSDDNVENYYKQTLLDKENTF---VRDSTVYIGVKNT 357
Query: 443 FAIALLEGFNKWAKAGLKAFVSHELRHYELPMLLTTTTTPAANPLFGGYEKINKNDISVG 502
IALLEGFNK+AKAGL AF SH+L Y L L +PL +K N+ +I +G
Sbjct: 358 IGIALLEGFNKYAKAGLTAFASHKLSKYSLMSL---------DPL--KQDKYNETEIYIG 406
Query: 503 GQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTVQFAATAFLHRTAPT 562
G+L K G LHY+ E +AG GQ + G+ DLN PL DTV A +
Sbjct: 407 GELTKKQGNFLHYHAIGEVGMAGKAIGQFDVKGDIDLNIPLWKDTVSVIARGEISNKLAP 466
Query: 563 FYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFGLQNERVAN 622
FYM + S+HFWW +++D++ +R+ G ++ T+LR G + +KNYTYF
Sbjct: 467 FYMRHYHSKHFWWDDDMDKEFRTRIEGELSIANWGTRLRAGVENIKNYTYF--------- 517
Query: 623 GNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKSSNEAVLPTPTLNI 682
N Q Q G++ +++ QDFK+GI + N + +QKSS++AVLP P L++
Sbjct: 518 -------NQQALPEQKGGSLQVVSACFNQDFKVGIFHLDNEVIWQKSSDQAVLPLPELSL 570
Query: 683 YSNLFIRFKIA-KVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGNYPIIN 741
Y N +++FK+A KVL G D RYFTKY AP Y+P F +Q E + E+G YPI+N
Sbjct: 571 YHNFYMQFKLAKKVLSVQLGADVRYFTKYDAPAYMPATQQFYLQPEEG-KVEIGGYPIVN 629
Query: 742 AYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVLRLGISWDFFN 791
YAN L+RTRF++MM H+N G NYF PHYP+N RVL+ G+SW+F++
Sbjct: 630 VYANLHLKRTRFYVMMYHVNQGMSKPNYFLAPHYPINPRVLKFGLSWNFYD 680
Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats.
Identities = 106/250 (42%), Positives = 156/250 (62%), Gaps = 3/250 (1%)
Query: 47 DSIQNQHKEIPRGLKVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSP 106
DS + + +P L +W I + GDRT DT F NT G+ G YN L NLGSP
Sbjct: 52 DSANVEVQGLPPTLYMWRIKNQLGDRTMIPADTTYHHFQNTNLTEGITGHYNYLANLGSP 111
Query: 107 RQNRIFIDRAEPSEFIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKA 166
R +RIF +R P IF+ + F ++P++F+FT++ P TNL+Y AG++ +GE++FK+
Sbjct: 112 RLSRIFFERRYPEPTIFMESFSSFFIRPTEFNFTNSNVPYTNLTYHKAGNKQNGEERFKS 171
Query: 167 LFAINAGKKWGFGFKFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQKVS 226
F++N KK FGF DYLYGRGYY+NQN S+ N + +G+Y G++YQ + S N+ K +
Sbjct: 172 YFSVNVNKKLAFGFNIDYLYGRGYYNNQNTSYFNAALFGSYIGDRYQVQGIYSNNYLKTN 231
Query: 227 ENGGIANDAYITHPEIFNE---SFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRKVP 283
ENGGI +D YIT PE E + + IPT+L+ + NRN + ++F RY+LGF+R +P
Sbjct: 232 ENGGITDDRYITAPEEMAEGRKEYESVNIPTVLNASANRNHDFYVFLTQRYNLGFHRDIP 291
Query: 284 MTKEEIEAKK 293
+ + K
Sbjct: 292 QAENDTMPAK 301
>gi|60681146|ref|YP_211290.1| hypothetical protein BF1652 [Bacteroides fragilis NCTC 9343]
gi|60492580|emb|CAH07352.1| conserved hypothetical exported protein [Bacteroides fragilis NCTC
9343]
Length = 684
Score = 300 bits (769), Expect = 2e-79, Method: Composition-based stats.
Identities = 176/437 (40%), Positives = 251/437 (57%), Gaps = 37/437 (8%)
Query: 358 QRDSINIDRLEEKQIAASDAANEWIKDEYVPVTSFIHTARFDNFRRIYQAYDTPNSFYAN 417
QR + R EK A D K E+VPVTSFIHT + + R + + D Y
Sbjct: 282 QRYKLGFTREVEK---AKDDTTNTQKTEFVPVTSFIHTMKVERSRHQFTSEDELYKIYPE 338
Query: 418 NYYYNNATASDSIYDQTKHWALKNTFAIALLEGFNKWAKAGLKAFVSHELRHYELPMLLT 477
Y + + D T + +KNT IALLEGFNK+AKAGL AF+SH+L +Y L +
Sbjct: 339 AYI---QPGNKLVNDSTSYIGVKNTLGIALLEGFNKYAKAGLTAFISHKLSNYRLMDRDS 395
Query: 478 TTTTPAANPLFGGYEKINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNA 537
+ +K +++++ VGG+L K GKTLHY E I GQ ++ +
Sbjct: 396 VSV-----------DKYSEHEVFVGGELAKRQGKTLHYRAMGEVGILDKAIGQFRVNADL 444
Query: 538 DLNFPLLGDTVQFAATAFLHRTAPTFYMNTFRSRHFWWGN-NLDQQIHSRLLGAFTLKKT 596
DLNF L DTV F A + T P FYM + S++F+W N N++++ +RL G ++
Sbjct: 445 DLNFRLWKDTVSFIARGSISNTLPAFYMRHYHSKYFYWDNDNMEKEFRTRLEGELNIEHW 504
Query: 597 RTKLRVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLG 656
+T L+ G + +KNYTYF N + +Q+ G I +L+ L Q+F+LG
Sbjct: 505 QTNLKAGVENIKNYTYF----------------NQKALPQQNGGNIQVLSATLSQNFRLG 548
Query: 657 IMNWQNLITFQKSSNEAVLPTPTLNIYSNLFIRFKIA-KVLDCDFGVDGRYFTKYYAPEY 715
I++ N +T+QKSSN VLP P L++Y NL+I+ +A KVL G D RYFTKYYAP Y
Sbjct: 549 ILHLDNEVTWQKSSNNTVLPLPELSLYHNLYIQTTLAKKVLHVQLGADVRYFTKYYAPAY 608
Query: 716 IPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSGDG-GNYFFTPHY 774
P + F +Q E + ++G YPIIN YAN +L+RTR F MM H+N G G NYF +PHY
Sbjct: 609 TPAIQQFHLQ-PEDDQVKIGGYPIINVYANLQLKRTRLFAMMYHVNQGMGNSNYFLSPHY 667
Query: 775 PLNQRVLRLGISWDFFN 791
P+N R+ ++G+SW+F++
Sbjct: 668 PINPRLFKIGVSWNFYD 684
Score = 225 bits (574), Expect = 9e-57, Method: Composition-based stats.
Identities = 123/297 (41%), Positives = 178/297 (59%), Gaps = 15/297 (5%)
Query: 3 RHIYLIINLFIISLLANAQSFN-------RVSRDGTSTSNGFQGNRNLGVS-DSIQNQHK 54
R I L L ++ LL FN R RD +NG Q + + V DS + +
Sbjct: 6 RRILLTYILLVVGLLTAQAQFNPTQQVDPRTGRD----ANGNQIDPAMRVQEDSTDVEIQ 61
Query: 55 EIPRGLKVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSPRQNRIFID 114
+P L +W + E G DT + +F NT GL G YN LGNLGSPR +R+F +
Sbjct: 62 GLPPTLYMWHVSENLGTIQRIPADTATHMFQNTNLVEGLTGHYNYLGNLGSPRLSRLFFE 121
Query: 115 RAEPSEFIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKALFAINAGK 174
R + IF+ P+ F ++P +F+FT++ P TNL+Y AG++ +GE++FK+ F++N K
Sbjct: 122 RRDAEPTIFMEPFSSFFIRPDEFNFTNSNVPFTNLTYYKAGNKINGEERFKSYFSVNVNK 181
Query: 175 KWGFGFKFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQKVSENGGIAND 234
+ FGF FDYLYGRGYY+NQN S+ N + +G+Y G++Y+A L S N+ K++ENGGI +D
Sbjct: 182 RLAFGFNFDYLYGRGYYNNQNTSYFNAALFGSYIGDRYEATLLYSNNYLKMNENGGITDD 241
Query: 235 AYITHPEIFNE---SFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRKVPMTKEE 288
YIT PE E + + IPT+LSK+ NRN + +IF RY LGF R+V K++
Sbjct: 242 RYITRPEEMAEGKKEYESQNIPTLLSKSANRNKDFYIFLTQRYKLGFTREVEKAKDD 298
>gi|53712934|ref|YP_098926.1| hypothetical protein BF1644 [Bacteroides fragilis YCH46]
gi|52215799|dbj|BAD48392.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 680
Score = 300 bits (768), Expect = 3e-79, Method: Composition-based stats.
Identities = 176/437 (40%), Positives = 251/437 (57%), Gaps = 37/437 (8%)
Query: 358 QRDSINIDRLEEKQIAASDAANEWIKDEYVPVTSFIHTARFDNFRRIYQAYDTPNSFYAN 417
QR + R EK A D K E+VPVTSFIHT + + R + + D Y
Sbjct: 278 QRYKLGFTREVEK---AKDDTTNTQKTEFVPVTSFIHTMKVERSRHQFTSEDELYKIYPE 334
Query: 418 NYYYNNATASDSIYDQTKHWALKNTFAIALLEGFNKWAKAGLKAFVSHELRHYELPMLLT 477
Y + + D T + +KNT IALLEGFNK+AKAGL AF+SH+L +Y L +
Sbjct: 335 AYI---QPGNKLVNDSTSYIGVKNTLGIALLEGFNKYAKAGLTAFISHKLSNYRLMDRDS 391
Query: 478 TTTTPAANPLFGGYEKINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNA 537
+ +K +++++ VGG+L K GKTLHY E I GQ ++ +
Sbjct: 392 VSV-----------DKYSEHEVFVGGELAKRQGKTLHYRAMGEVGILDKAIGQFRVNADL 440
Query: 538 DLNFPLLGDTVQFAATAFLHRTAPTFYMNTFRSRHFWWGN-NLDQQIHSRLLGAFTLKKT 596
DLNF L DTV F A + T P FYM + S++F+W N N++++ +RL G ++
Sbjct: 441 DLNFRLWKDTVSFIARGSISNTLPAFYMRHYHSKYFYWDNDNMEKEFRTRLEGELNIEHW 500
Query: 597 RTKLRVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLG 656
+T L+ G + +KNYTYF N + +Q+ G I +L+ L Q+F+LG
Sbjct: 501 QTNLKAGVENIKNYTYF----------------NQKALPQQNGGNIQVLSATLSQNFRLG 544
Query: 657 IMNWQNLITFQKSSNEAVLPTPTLNIYSNLFIRFKIA-KVLDCDFGVDGRYFTKYYAPEY 715
I++ N +T+QKSSN VLP P L++Y NL+I+ +A KVL G D RYFTKYYAP Y
Sbjct: 545 ILHLDNEVTWQKSSNNTVLPLPELSLYHNLYIQTTLAKKVLHVQLGADVRYFTKYYAPAY 604
Query: 716 IPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSGDG-GNYFFTPHY 774
P + F +Q E + ++G YPIIN YAN +L+RTR F MM H+N G G NYF +PHY
Sbjct: 605 TPAIQQFHLQ-PEDDQVKIGGYPIINVYANLQLKRTRLFAMMYHVNQGMGNSNYFLSPHY 663
Query: 775 PLNQRVLRLGISWDFFN 791
P+N R+ ++G+SW+F++
Sbjct: 664 PINPRLFKIGVSWNFYD 680
Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats.
Identities = 123/297 (41%), Positives = 178/297 (59%), Gaps = 15/297 (5%)
Query: 3 RHIYLIINLFIISLLANAQSFN-------RVSRDGTSTSNGFQGNRNLGVS-DSIQNQHK 54
R I L L ++ LL FN R RD +NG Q + + V DS + +
Sbjct: 2 RRILLTYILLVVGLLTAQAQFNPTQQVDPRTGRD----ANGNQIDPAMRVQEDSTDVEIQ 57
Query: 55 EIPRGLKVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSPRQNRIFID 114
+P L +W + E G DT + +F NT GL G YN LGNLGSPR +R+F +
Sbjct: 58 GLPPTLYMWHVSENLGTIQRIPADTATHMFQNTNLVEGLTGHYNYLGNLGSPRLSRLFFE 117
Query: 115 RAEPSEFIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKALFAINAGK 174
R + IF+ P+ F ++P +F+FT++ P TNL+Y AG++ +GE++FK+ F++N K
Sbjct: 118 RRDAEPTIFMEPFSSFFIRPDEFNFTNSNVPFTNLTYYKAGNKINGEERFKSYFSVNVNK 177
Query: 175 KWGFGFKFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQKVSENGGIAND 234
+ FGF FDYLYGRGYY+NQN S+ N + +G+Y G++Y+A L S N+ K++ENGGI +D
Sbjct: 178 RLAFGFNFDYLYGRGYYNNQNTSYFNAALFGSYIGDRYEATLLYSNNYLKMNENGGITDD 237
Query: 235 AYITHPEIFNE---SFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRKVPMTKEE 288
YIT PE E + + IPT+LSK+ NRN + +IF RY LGF R+V K++
Sbjct: 238 RYITRPEEMAEGKKEYESQNIPTLLSKSANRNKDFYIFLTQRYKLGFTREVEKAKDD 294
>gi|154490235|ref|ZP_02030496.1| hypothetical protein PARMER_00467 [Parabacteroides merdae ATCC
43184]
gi|154089127|gb|EDN88171.1| hypothetical protein PARMER_00467 [Parabacteroides merdae ATCC
43184]
Length = 703
Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats.
Identities = 140/432 (32%), Positives = 224/432 (51%), Gaps = 55/432 (12%)
Query: 386 YVPVTSFIHTARFDNFRRIYQA---------YDTPNSFYANNYYYNNATASDSIYDQTKH 436
++PV+S IHT + + RR +++ Y TP+ Y + N T D D+T +
Sbjct: 301 FIPVSSIIHTLEYQDNRRRFRSEADENLNECYLTPDG-YPRVFGLENGTGVD---DRTSY 356
Query: 437 WALKNTFAIALLEGFNKWAKAGLKAFVSHELRHYELPML-------------LTTTTTPA 483
W L+NTF ++L EGF WAK G+ AF + + R ++LP L + +
Sbjct: 357 WNLRNTFGLSLREGFQDWAKFGITAFATFDKRKFQLPAQIPGLSYDPEYGSGLNASPSTI 416
Query: 484 ANPLFGGYEKINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPL 543
P+ Y++ + +G ++ K G L YN E + GD G+ GN F L
Sbjct: 417 EFPITQVYDEFST---YIGAEISKRRGNILTYNARGELCVVGDDIGEFRATGNLQTKFKL 473
Query: 544 LGDTVQFAATAFLHRTAPTFYMNTFRSRHFWWGN---NLDQQIHSRLLGAFTLKKTRTKL 600
L +A ++ P+FYM F SR+FWW N N+ QQI++ + L+ TRT+L
Sbjct: 474 LKKDATISAEGYIKNVTPSFYMRHFHSRYFWWDNRDMNMIQQIYAGV--KINLESTRTQL 531
Query: 601 RVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNW 660
G + ++NY +F N Q Q SG + ++ +++QD W
Sbjct: 532 SAGVESIQNYVFF----------------NKQGMPEQKSGNLQVINARIKQDVMYRAFGW 575
Query: 661 QNLITFQKSSNEAVLPTPTLNIYSNLFIRFKIAKVLDCDFGVDGRYFTKYYAPEYIPGMG 720
+N + +Q SS+++VLP P +++Y+N++++FK+AKVL G + Y T YYAP Y P
Sbjct: 576 ENEVAYQLSSDKSVLPLPQISLYTNMYLKFKVAKVLMVQLGANMYYNTSYYAPYYEPATQ 635
Query: 721 SFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQR 779
F +Q+ +VGN+P++NAY NF L++ RFF+M ++ S NYF HYPL+
Sbjct: 636 QFQVQD----EVKVGNFPLVNAYVNFHLKQARFFVMGYNLGSKFVNPNYFSLAHYPLDPF 691
Query: 780 VLRLGISWDFFN 791
VL++G++ F N
Sbjct: 692 VLKMGVAVTFNN 703
Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats.
Identities = 93/304 (30%), Positives = 140/304 (46%), Gaps = 11/304 (3%)
Query: 28 RDGTSTSNGFQGNRNLGVSDSIQNQHKEIPRGLKVWTIDERFGDRTAAQPDTVSELFMNT 87
R G S N + N+ + S I + R + + + GD A DT F N+
Sbjct: 28 RKGFSLGNLSKANQEVPDSLLIPDSAAINNRRITAYRLTPLLGDPYIAPLDTHKLNFANS 87
Query: 88 FFNTGLRGEYNSLGNLGSPRQNRIFIDRAEPSEFIFLAPYDKFIVQPSKFHFTSTLSPIT 147
L N GSP Q RIF +R EP +F+F YD +I P + T P T
Sbjct: 88 TLVESESLAVGYLANTGSPAQTRIFSERKEPRDFLFADAYDYYITDPQNAQYYDTKIPYT 147
Query: 148 NLSYSTAGDRTDGEDQFKALFAINAGKKWGFGFKFDYLYGRGYYSNQNASHINYSFWGTY 207
N+ Y+T G ++ K +N G K G DY+Y RGYY N ++Y +G+Y
Sbjct: 148 NVMYTTMGGSESKNERLKGTLTMNFGPKINVGGDLDYIYSRGYYKNNGNKLLSYRLFGSY 207
Query: 208 TGEKYQAN-FLSSLNHQKVSENGGIANDAYITHPEIFNESFNTNEIPTILS-----KNWN 261
++Y+A+ +LS+ N ENGG+AND+ IT+P+ + + P + K WN
Sbjct: 208 KSDRYEAHAYLSNFNFINY-ENGGLANDSVITNPDQYFAGERNQDDPKAFNTRYPVKAWN 266
Query: 262 RNDNQHIFFNHRYSLGFYRKVPMTKEEI--EAKKFAIKS--MQSQEEDKARRKAMEEAKQ 317
R + F +H Y+LGF R++ + + K F S + + E RR+ EA +
Sbjct: 267 RVRGKQYFLSHHYNLGFERELEGEVDTLGNPVKVFIPVSSIIHTLEYQDNRRRFRSEADE 326
Query: 318 NGEE 321
N E
Sbjct: 327 NLNE 330
>gi|150008031|ref|YP_001302774.1| hypothetical protein BDI_1394 [Parabacteroides distasonis ATCC
8503]
gi|149936455|gb|ABR43152.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 693
Score = 226 bits (576), Expect = 5e-57, Method: Composition-based stats.
Identities = 143/419 (34%), Positives = 214/419 (51%), Gaps = 30/419 (7%)
Query: 382 IKDEYVPVTSFIHTARFDNFRRIYQAYDTPNSFYANNYYYNNATASDSIYDQTKHWALKN 441
+K+ ++PV+S IHT +++ RR + + D + Y +A S+ DQT W LKN
Sbjct: 296 VKEVFIPVSSIIHTIDYEDNRRRFISNDAGIDTCYTHLYGMDA----SLNDQTSSWNLKN 351
Query: 442 TFAIALLEGFNKWAKAGLKAFVSHELRHYELPMLLT--------TTTTPAANPLFGGYEK 493
TFA+AL EGF WAK GL AFV+ E R + L + ++ + F E
Sbjct: 352 TFALALREGFQDWAKFGLTAFVTFEKRRFRLASQVPGLDYGPDGRGSSEPSTLNFPTSEV 411
Query: 494 INKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTVQFAAT 553
++ +GG+L K G L YN+ E +AG AG++ + G+ F L A
Sbjct: 412 YDEFTTYIGGELSKRRGSLLTYNVRGELGLAGSDAGEVRVSGDLQTKFKLFKKDATIKAE 471
Query: 554 AFLHRTAPTFYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYF 613
++ P FY R++WW +L L+ TRT+L G + ++NY +F
Sbjct: 472 GYIKNITPAFYQRHHHGRYYWWDLSLKNVQRIYAGAKINLESTRTQLSGGVESIQNYVFF 531
Query: 614 GLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKSSNEA 673
N + Q S I +++ +++QD W+N + +Q SS ++
Sbjct: 532 ----------------NGKGLPEQSSKNIQVVSARIKQDIMYRAFGWENEVAYQLSSEKS 575
Query: 674 VLPTPTLNIYSNLFIRFKIAKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTE 733
LP P ++ YSN+++ FK+AKVL G + Y T YYAP Y P F +QE E + +
Sbjct: 576 QLPLPDISAYSNIYLAFKLAKVLTVQIGANVYYNTAYYAPYYEPATQQFQVQE-EDKKVK 634
Query: 734 VGNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVLRLGISWDFFN 791
VGNYP+INAYANF L++ RFFIM ++ S NYF HYPL+ VL++GIS F N
Sbjct: 635 VGNYPLINAYANFHLKQARFFIMAYNLGSKFVDPNYFSLAHYPLDPMVLKMGISVTFNN 693
Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats.
Identities = 81/232 (34%), Positives = 122/232 (52%), Gaps = 5/232 (2%)
Query: 69 FGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSPRQNRIFIDRAEPSEFIFLAPYD 128
G+R A DT F N+ L N+GSP Q RIF +R E +FIF YD
Sbjct: 71 LGERYIAPMDTNRLNFGNSTLVEANSLAVGYLANVGSPAQTRIFNERKEERDFIFADAYD 130
Query: 129 KFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKALFAINAGKKWGFGFKFDYLYGR 188
+I P+ +F T P T ++Y+T G + D+ K + +N GKK G + DY+Y R
Sbjct: 131 YYITTPTNAYFYDTKVPYTQVTYTTGGASQNKMDRLKGVLTMNFGKKINVGGEMDYIYSR 190
Query: 189 GYYSNQNASHINYSFWGTYTGEKYQAN-FLSSLNHQKVSENGGIANDAYITHPEIFNE-- 245
GYY++ ++Y F+G+Y ++Y+ N +LS+ N ENGG+ ND YIT+P+ E
Sbjct: 191 GYYNSNGNKLLSYRFFGSYITDRYELNAYLSNFNFVNY-ENGGLTNDQYITNPDDLAENQ 249
Query: 246 -SFNTNEIPTILSKNWNRNDNQHIFFNHRYSLGFYRKVPMTKEEIEAKKFAI 296
+ ++ P + WNR + F RY+LGF +++ T EE K+ I
Sbjct: 250 RNIDSKSYPVRYTDTWNRVRGKQYFLTQRYNLGFTKELEETDEEGNVKEVFI 301
>gi|34540194|ref|NP_904673.1| hypothetical protein PG0362 [Porphyromonas gingivalis W83]
gi|34396506|gb|AAQ65572.1| hypothetical protein PG_0362 [Porphyromonas gingivalis W83]
Length = 722
Score = 169 bits (428), Expect = 7e-40, Method: Composition-based stats.
Identities = 128/412 (31%), Positives = 187/412 (45%), Gaps = 38/412 (9%)
Query: 386 YVPVTSFIHTARFDNFRRIYQAYDT-PNSFYANNYYYNNATASDSIY---DQTKHWALKN 441
+VPV S HT + RR ++ + NS Y N + A + I D T+ N
Sbjct: 339 FVPVGSISHTFNYTKSRRRFKVKNPLDNSIYPNLFIKRLNDAGEMITLPNDTTRMEEYHN 398
Query: 442 TFAIALLEGFNKWAKAGLKAFVSHELRHYELPMLLTTTTTPAANPLFGGYEKINKNDISV 501
T A++L EGF++WAK GL A+V E R Y L + P + F Y V
Sbjct: 399 TLALSLREGFHRWAKFGLTAYVRLENRFYTLQD--SVVGVPPTDREFSTY---------V 447
Query: 502 GGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTVQFAATAFLHRTAP 561
GG++ + GK L++ + E + G AG + G F LL + A L T P
Sbjct: 448 GGEISRRGGKYLNFAADGELSVVGSDAGAFKLRGRLSTAFDLLRRKTEMEAWGQLLNTRP 507
Query: 562 TFYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFGLQNERVA 621
+++ WW + D RL G+ LK T L + LKNY YF
Sbjct: 508 GYFLRHHHGTVHWWDESFDFIQQLRLGGSLRLKDWGTTLTLQSATLKNYIYF-------- 559
Query: 622 NGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKSSNEAVLPTPTLN 681
D+ QV S I +L ++ ++ G + W+ +Q SSN LP P L
Sbjct: 560 ---DHTAFPQQV-----SSPIQVLEGRIAHAYRWGALGWEVEAAYQTSSNRTALPLPKLA 611
Query: 682 IYSNLFIRFKI---AKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGNYP 738
Y NL++ F++ KV+ GVD R + YYAP Y P + F Q+ G++P
Sbjct: 612 AYGNLYLDFRLPSSTKVMRIQTGVDARIHSSYYAPYYEPAVQQFTTQQEIKVG---GSFP 668
Query: 739 IINAYANFKLQRTRFFIMMSHINSGDGGNYFFT-PHYPLNQRVLRLGISWDF 789
++NAY N L+R+RFF M ++ + F+ H P N R LR+GI+ DF
Sbjct: 669 LMNAYVNIHLKRSRFFFEMYNLAEAFMDSKRFSLVHTPYNPRGLRMGIAIDF 720
Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats.
Identities = 76/238 (31%), Positives = 119/238 (50%), Gaps = 8/238 (3%)
Query: 52 QHKEIPRG--LKVWTIDERFGDRTAAQPDTVSELFMNTFFNTGLRGEYNSLGNLGSPRQN 109
+ ++I RG LKV+T++ GD + DT F N G LGNL SP ++
Sbjct: 80 EEEQIARGDKLKVYTLNPNTGDLLRSHIDTAMLGFYNRVKAEGRSLAIGYLGNLNSPWES 139
Query: 110 RIFIDRA-EPSEFIFLAPYDKFIVQPSKFHFTSTLSPITNLSYSTAGDRTDGEDQFKALF 168
++F DR +F+++ ++ P F T SP T + Y GD E++ ++
Sbjct: 140 KMFFDRPLYHPQFMYMYGLHGMLIDPFTARFYDTKSPTTQVLYLRHGDNQTREEELRSTL 199
Query: 169 AINAGKKWGFGFKFDYLYGRGYYSNQNASHINYSFWGTYTGEKYQANFLSSLNHQKVSEN 228
A+N GK+ G DY+Y G+Y++ I+Y F+G+Y ++Y+A N+ ++EN
Sbjct: 200 ALNIGKRVNIGNDVDYIYSNGFYNSNKTKAISYRFFGSYRSDRYEAYAYVGNNYYLMTEN 259
Query: 229 GGIANDAYITHPEIF---NESFNTNEIPTILSKN--WNRNDNQHIFFNHRYSLGFYRK 281
GGI ND Y+ HP F F + +IP N +N HRY+LGFYR+
Sbjct: 260 GGITNDDYVIHPGNFANGTNEFVSTDIPVKFPGNNMFNSLRQGTARLTHRYNLGFYRE 317
>gi|149371925|ref|ZP_01891244.1| hypothetical protein SCB49_08548 [unidentified eubacterium SCB49]
gi|149355065|gb|EDM43626.1| hypothetical protein SCB49_08548 [unidentified eubacterium SCB49]
Length = 641
Score = 97.8 bits (242), Expect = 2e-18, Method: Composition-based stats.
Identities = 76/253 (30%), Positives = 121/253 (47%), Gaps = 16/253 (6%)
Query: 546 DTVQFAATAFLHRTAPTFYMNTFRSRH--FWWGNNLDQQIHSRLLGAFTLKKTRTKLRVG 603
D ++ A +H AP F ++S + + W NN D ++++ L T +
Sbjct: 396 DDIKAVAGLKIHSVAPNFNTLLYQSDYVSYNWQNNFDN-VNTQELSFGIESPTFGNINAS 454
Query: 604 YDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNL 663
Y + NYTYFG A V + Q Q + + L ++L+++FK N
Sbjct: 455 YTGIDNYTYFGTTILEAATDETAEVIS-QPTPFQATERVDYLKIKLEKEFKYQNFRLANT 513
Query: 664 ITFQK-SSNEAVLPTPTLNIYSNLFIR---FKIAKVLDCDFGVDGRYFTKYYAPEYIPGM 719
I +Q +S EAV PT + L+ + FK K L GV +YF KY Y P +
Sbjct: 514 ILYQNVASGEAVFKVPTFITRNTLYYQDHWFK--KALFLQTGVTFKYFPKYEMDAYSPVL 571
Query: 720 GSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINS--GDGGNYFFTPHYPLN 777
G F +Q+TE E+G +P+I+ + N K+++TR F+ H+N+ ++F P YP
Sbjct: 572 GEFYVQDTE----EIGGFPMIDVFFNAKVRQTRIFVKYEHVNALFKSTNDHFAAPGYPYR 627
Query: 778 QRVLRLGISWDFF 790
+R G+ WDFF
Sbjct: 628 DAFIRFGLVWDFF 640
>gi|88805045|ref|ZP_01120565.1| hypothetical protein RB2501_09325 [Robiginitalea biformata
HTCC2501]
gi|88785924|gb|EAR17093.1| hypothetical protein RB2501_09325 [Robiginitalea biformata
HTCC2501]
Length = 688
Score = 92.0 bits (227), Expect = 2e-16, Method: Composition-based stats.
Identities = 60/197 (30%), Positives = 100/197 (50%), Gaps = 13/197 (6%)
Query: 600 LRVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVR--QHSGAISLLTLQLQQDFKLGI 657
LR NY++F + A+ + Q VR Q +G I+ L ++ Q++F+LG
Sbjct: 500 LRAQLSSTDNYSFFRSE----ADADQLAAGEEQAFVRPFQEAGRITHLRVKYQKEFQLGK 555
Query: 658 MNWQNLITFQK-SSNEAVLPTPTLNIYSNLFIRFKI-AKVLDCDFGVDGRYFTKYYAPEY 715
N + +Q+ + AVL P L + L+ + K + G+ RYFT YY Y
Sbjct: 556 WALNNTLLYQEVDQDAAVLNVPRLVTRNTLYFSSDVFKKAMFLQTGITFRYFTSYYMDAY 615
Query: 716 IPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHY 774
P +G F +Q+ R E G +P+++ + N ++++TR ++ H NS G NY+ P Y
Sbjct: 616 NPLLGDFYVQD----REEFGGFPMLDFFINARIRQTRIYLKAEHFNSSFSGNNYYSAPDY 671
Query: 775 PLNQRVLRLGISWDFFN 791
P V+R G+ W+FF+
Sbjct: 672 PYRDFVIRFGLVWNFFS 688
>gi|89890456|ref|ZP_01201966.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89517371|gb|EAS20028.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 658
Score = 89.0 bits (219), Expect = 1e-15, Method: Composition-based stats.
Identities = 76/276 (27%), Positives = 130/276 (47%), Gaps = 22/276 (7%)
Query: 523 IAGDRAGQLHIDGNADLNFPLLGDTVQFAATAFLHRTAPTFYMNTFRSRH--FWWGNNLD 580
IAGD GQ +++G+A F VQ A + AP F F+S + + W NN
Sbjct: 396 IAGDLDGQ-YLNGSAGYAF----KDVQIDAGLAISSRAPDFNYQLFQSDYVNYNWQNNFS 450
Query: 581 QQIHSRLLGAFTLKKTR-TKLRVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHS 639
+L F L+ + L V + L++Y YF Q +V N ++ + N Q +
Sbjct: 451 NVEKQQL--TFKLQSEKYLDLDVSFTTLQDYVYF--QENQVLNSDNEVTGYNATPA-QAT 505
Query: 640 GAISLLTLQLQQDFKL-GIMNWQNLITFQK-SSNEAVLPTPTLNIYSNLFIRFKIAK-VL 696
I+ ++ +D K N I FQ S ++ P + ++L+ + ++ K L
Sbjct: 506 EDITYFKIKAHKDIKFFKYFGIDNTIMFQSVSQGSDIINVPQITTRNSLYYKDRLFKNAL 565
Query: 697 DCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIM 756
G+ +YF++YY Y P +G F +Q + E+G +P+++ + N K+++TR F
Sbjct: 566 LLQTGISLKYFSEYYMDSYDPVLGEFTVQTDQ----ELGAFPMVDFFVNAKVRQTRIFFK 621
Query: 757 MSHINS--GDGGNYFFTPHYPLNQRVLRLGISWDFF 790
+ H+N G +YF P P +R G+ W+FF
Sbjct: 622 LEHLNQLFASGDDYFSAPRTPYRDFTVRFGVVWNFF 657
>gi|88713481|ref|ZP_01107564.1| hypothetical protein FB2170_08984 [Flavobacteriales bacterium
HTCC2170]
gi|88708391|gb|EAR00628.1| hypothetical protein FB2170_08984 [Flavobacteriales bacterium
HTCC2170]
Length = 662
Score = 86.7 bits (213), Expect = 6e-15, Method: Composition-based stats.
Identities = 65/198 (32%), Positives = 100/198 (50%), Gaps = 16/198 (8%)
Query: 600 LRVGYDVLKNYTYFGLQ-NERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIM 658
L Y L NYTYFG+ N+ + G + + Q S +IS L ++ ++ KLG
Sbjct: 475 LNAKYSTLGNYTYFGIDPNQALIAGQE----QATIKPFQESSSISHLKVKYNKEIKLGGF 530
Query: 659 NWQNLITFQK-SSNEAVLPTPTLNIYSNLFIR---FKIAKVLDCDFGVDGRYFTKYYAPE 714
N I +Q S + VL P L + L+ FK A L GV +YFT Y
Sbjct: 531 ALNNTIMYQSVSQSNDVLNVPQLVTRNTLYFSTDAFKKAMYLQT--GVTFKYFTAYNMDA 588
Query: 715 YIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPH 773
Y P +G F IQ E ++G +P+++ + N ++Q+TR ++ H NS G N++ P+
Sbjct: 589 YNPVLGEFYIQNDE----KLGGFPMLDFFINARIQQTRVYLKAEHFNSSFTGYNFYSAPN 644
Query: 774 YPLNQRVLRLGISWDFFN 791
YP V+R G+ W+FF+
Sbjct: 645 YPYRDFVIRFGLVWNFFS 662
>gi|91218355|ref|ZP_01255299.1| hypothetical protein P700755_04422 [Psychroflexus torquis ATCC
700755]
gi|91183493|gb|EAS69892.1| hypothetical protein P700755_04422 [Psychroflexus torquis ATCC
700755]
Length = 655
Score = 85.9 bits (211), Expect = 9e-15, Method: Composition-based stats.
Identities = 79/303 (26%), Positives = 137/303 (45%), Gaps = 19/303 (6%)
Query: 493 KINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTVQFAA 552
++ + +S+GG LK K A I GD G + G A + L D++QFAA
Sbjct: 366 RLKTDIVSLGGTYLKEY-KAFKLEGQAAYNITGDYDG-YEVHGKASMT---LSDSLQFAA 420
Query: 553 TAFLHRTAPTF--YMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNY 610
+ +AP F +N +++ W N ++++ L K ++ + N+
Sbjct: 421 KLDISSSAPNFNWILNQSDYKNYNWYNPDFDNVNTQRLEVKAESKVYGQVSASLTQIDNH 480
Query: 611 TYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKS- 669
YFG ER A G V ++ V Q I ++ Q+DFK G +N N I +Q
Sbjct: 481 AYFGF--ERTAEG---AVTDSLVRPLQTDQQIRYFKIKAQKDFKYGNLNLANTIMYQNVL 535
Query: 670 SNEAVLPTPTLNIYSNLFI-RFKIAKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETE 728
E+V P P + ++LF + + L G+ YF+ + + Y P + F IQ+ +
Sbjct: 536 DGESVFPVPEIVTRNSLFYDNYLFDRALYLQTGLTFNYFSSFMSKAYDPILSEFAIQDFQ 595
Query: 729 ASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSGDGGNYFFT-PHYPLNQRVLRLGISW 787
E+G + + + N K++ R ++ + + + GN F+ P YP +R G+ W
Sbjct: 596 ----ELGGFYTADFFINAKIREARIYLKLENFTTLFQGNSNFSAPRYPYRDFAIRFGLVW 651
Query: 788 DFF 790
+FF
Sbjct: 652 NFF 654
>gi|146299686|ref|YP_001194277.1| hypothetical protein Fjoh_1926 [Flavobacterium johnsoniae UW101]
gi|146154104|gb|ABQ04958.1| hypothetical protein Fjoh_1926 [Flavobacterium johnsoniae UW101]
Length = 657
Score = 83.2 bits (204), Expect = 7e-14, Method: Composition-based stats.
Identities = 76/257 (29%), Positives = 122/257 (47%), Gaps = 25/257 (9%)
Query: 544 LGDTVQFAATAFLHRTAPTFYMNTFRSRH--FWWGNNL-DQQIHSRLLGAFTLKKTRTKL 600
L D +QF P N ++S + + W NN +++I+S LGA ++
Sbjct: 415 LNDKIQFDFRYRNINKLPNNNYNLYQSSYVEYNWSNNFKNEKINS--LGA-SISTPWVNA 471
Query: 601 RVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNW 660
Y VLK++ YF +A ++ V Q+ I+ L ++ ++FK G
Sbjct: 472 EAQYTVLKDHLYF----VDMATPAQAALRTQIVKPAQYGNVINYLEIKASREFKFGKFAL 527
Query: 661 QNLITFQK-SSNEAVLPTP---TLNI--YSNLFIRFKIAKVLDCDFGVDGRYFTKYYAPE 714
N + +QK +E +L P T N Y+N F + K L GV YFTKYY
Sbjct: 528 DNTLLYQKVDQSELILNVPDFVTRNTFYYTNYFFK----KALYAQGGVVFNYFTKYYGNS 583
Query: 715 YIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINS-GDGGNYFFTPH 773
Y P +G F +Q T+ E+GNY + + N ++++TRF++ H+N+ NY+ P
Sbjct: 584 YNPVIGEFFVQNTK----EIGNYATFDVFINARIRQTRFYLKAEHLNALFSSSNYYSAPD 639
Query: 774 YPLNQRVLRLGISWDFF 790
P V+R G+ W+FF
Sbjct: 640 NPYRDFVIRFGLVWNFF 656
>gi|83856530|ref|ZP_00950059.1| hypothetical protein CA2559_05545 [Croceibacter atlanticus
HTCC2559]
gi|83850330|gb|EAP88198.1| hypothetical protein CA2559_05545 [Croceibacter atlanticus
HTCC2559]
Length = 646
Score = 83.2 bits (204), Expect = 7e-14, Method: Composition-based stats.
Identities = 84/320 (26%), Positives = 138/320 (43%), Gaps = 50/320 (15%)
Query: 484 ANPLFGGYEKINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPL 543
AN L GG + VG + GK N ++GD G + AD ++ +
Sbjct: 363 ANRLLGGIPQ-------VGAEYFNTIGK-FKVKANLMTSVSGDTNGNYFL---ADASYSI 411
Query: 544 LGDTVQFAATAFLHRTAPTFYMNTFRSRH--FWWGN--------NLDQQIHSRLLGAFTL 593
+ D + AT + +P + ++S + + W N N + ++ S G T
Sbjct: 412 I-DELNIGATISSNDRSPNYTFQLYQSDYINYNWQNDFSNESIQNFEVRLQSDKYGKLTA 470
Query: 594 KKTRTKLRVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDF 653
T ++ NYTYFGL E +QV Q+ +S L ++ Q +F
Sbjct: 471 SNT---------IIDNYTYFGLNEE------------DQVKPIQYDETVSYLRIKFQNNF 509
Query: 654 KLGIMNWQNLITFQKSSN-EAVLPTPTLNIYSNLFIR-FKIAKVLDCDFGVDGRYFTKYY 711
G N I +Q S+ + V P + + +L+ + + K L G G+YF+ +
Sbjct: 510 NFGYFGLANTIMYQNVSDGDTVFNVPEVVLRHSLYYQDYWFKKALYLQAGFTGKYFSGFN 569
Query: 712 APEYIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSGDGGNYFFT 771
A Y P + F +Q E + +P ++ + N K+++ R F + + NS GN FT
Sbjct: 570 ANGYDPVLSDFFVQNDE----RIEGFPAVDFFFNAKIRQARVFFKLENANSILLGNNNFT 625
Query: 772 -PHYPLNQRVLRLGISWDFF 790
P+YP V+R GI WDFF
Sbjct: 626 APNYPYRDFVVRFGIVWDFF 645
>gi|126663335|ref|ZP_01734333.1| hypothetical protein FBBAL38_08275 [Flavobacteria bacterium BAL38]
gi|126624993|gb|EAZ95683.1| hypothetical protein FBBAL38_08275 [Flavobacteria bacterium BAL38]
Length = 649
Score = 79.3 bits (194), Expect = 9e-13, Method: Composition-based stats.
Identities = 62/235 (26%), Positives = 111/235 (47%), Gaps = 19/235 (8%)
Query: 561 PTFYMNTFRSRH--FWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFGLQNE 618
P N ++S + + W NN + ++ F+++ L Y VL ++ YF +
Sbjct: 428 PNLNFNLYQSDYINYNWYNNFKNEKLNQF--EFSVQTKWINLSTTYKVLNDHLYF----D 481
Query: 619 RVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQK--SSNEAVLP 676
+ N + N +V Q+ I+ L + ++ K N I +QK SNE V
Sbjct: 482 NITND----ITNLEVKPMQYDNTINYLAVTASKEIKFWKFALDNTILYQKVDQSNEIVNV 537
Query: 677 TPTLNIYSNLFIRFKIAKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGN 736
+ + F F K + GV +YF KYYA +Y P +G F +Q + T++G
Sbjct: 538 PQIVTRNTLYFSDFVFKKAMLLQTGVTFQYFNKYYANDYNPLIGEFYVQ----NETKIGG 593
Query: 737 YPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVLRLGISWDFF 790
+P+ + + N ++++ R F+ H NS G +++ P+YP ++R G+ W+FF
Sbjct: 594 FPMFDFFINARVKQARIFLKAEHFNSAWTGYDFYSAPNYPYRDFIVRFGLVWNFF 648
>gi|149279624|ref|ZP_01885753.1| hypothetical protein PBAL39_08225 [Pedobacter sp. BAL39]
gi|149229660|gb|EDM35050.1| hypothetical protein PBAL39_08225 [Pedobacter sp. BAL39]
Length = 669
Score = 78.2 bits (191), Expect = 2e-12, Method: Composition-based stats.
Identities = 62/239 (25%), Positives = 105/239 (43%), Gaps = 14/239 (5%)
Query: 555 FLHRTAPTFYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFG 614
F +++ Y+ F H+ W N D+ L + K + Y ++ Y YF
Sbjct: 443 FQNKSPEEIYIRYF-GNHYRWTENFDRTKTINLSFRYLNDKLGLEAGAEYFLIDKYLYF- 500
Query: 615 LQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKSSNEAV 674
N+ D L SGA+++L + L + F G + +QK+ N +
Sbjct: 501 --NQVGGTNADSLAI-----APAQSGAMNMLKITLGKRFNFGKFSLSTFGVYQKTDNPNI 553
Query: 675 LPTPTLNIYSNLFIRFKIAKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEV 734
L TP L ++++++ KVL + G D RY T+Y A Y PG G F A
Sbjct: 554 LRTPELYAFASIYLDQTFFKVLKTNIGFDLRYNTEYTAYSYSPGAGQF---YNGAKDVTF 610
Query: 735 GNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVL-RLGISWDFFN 791
+ PI++ + L++ F+ + N G Y+ YP+ RVL + G+SW+F++
Sbjct: 611 ESTPILDFFVKASLRKANIFVKTDYANQGLLSKGYYTVDRYPMANRVLIKFGVSWNFYD 669
>gi|150025265|ref|YP_001296091.1| hypothetical protein FP1197 [Flavobacterium psychrophilum JIP02/86]
gi|149771806|emb|CAL43280.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 640
Score = 77.4 bits (189), Expect = 4e-12, Method: Composition-based stats.
Identities = 48/158 (30%), Positives = 85/158 (53%), Gaps = 7/158 (4%)
Query: 636 RQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKSSNEA-VLPTPTLNIYSNL-FIRFKIA 693
+Q++ I+ L+L++ ++ N I FQ+ + E +L P L + + F +
Sbjct: 486 KQYNKTINYLSLKVSKELTFRKFALDNTILFQEVTQENNILNVPKLVTRNTIYFSDYVFK 545
Query: 694 KVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRF 753
K + G + FTKYYA Y P +G F +QE T++GN+P+++ + N ++++ R
Sbjct: 546 KAMYLQTGFTFQTFTKYYANHYNPLIGEFYVQEN----TKIGNFPLVDFFINARVRQARI 601
Query: 754 FIMMSHINSG-DGGNYFFTPHYPLNQRVLRLGISWDFF 790
F+ H+NS G NY+ P+YP ++R G+ W FF
Sbjct: 602 FLKAEHLNSSFTGRNYYSAPNYPYKDFIIRFGLVWTFF 639
>gi|86132078|ref|ZP_01050674.1| hypothetical protein MED134_11866 [Cellulophaga sp. MED134]
gi|85817412|gb|EAQ38592.1| hypothetical protein MED134_11866 [Dokdonia donghaensis MED134]
Length = 668
Score = 75.5 bits (184), Expect = 1e-11, Method: Composition-based stats.
Identities = 87/323 (26%), Positives = 135/323 (41%), Gaps = 40/323 (12%)
Query: 472 LPMLLTTTTTPAANPLFGGYE-KINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQ 530
+P + TTT GGY+ KI DI +G + +G YNI EA A
Sbjct: 381 VPNRIQTTTIAVG----GGYKNKIGGFDI-LGDAQVNVSGDFDGYNIYGEAGYA------ 429
Query: 531 LHIDGNADLNFPLLGDTVQFAATAFLHRTAPTFYMNTFRSRHFWWGNNLDQQIHSRLLGA 590
ID + + +L + F H + Y+N + W + + L A
Sbjct: 430 --IDKDKRFSASILSNA---RPAGFNHLLFQSNYIN-----YNWSNQEAYATVKTNTLTA 479
Query: 591 FTLKKTRTKLRVGYDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQ 650
K L+ +K+Y YF L ++ VN Q S +I+ L +
Sbjct: 480 QLDSKKWVNLKASAATIKDYAYFALDP-----------VDSLVNSFQTSESINHLQVTAS 528
Query: 651 QDFKLGIMNWQNLITFQK-SSNEAVLPTPTLNIYSNLFIRFKI-AKVLDCDFGVDGRYFT 708
++ K N T+Q S + VL P +L+ ++ K L G YFT
Sbjct: 529 KNIKYKKFNLDLTATYQNVSGADGVLNVPDFVGRGSLYFTDRLFKKALFLQTGFTANYFT 588
Query: 709 KYYAPEYIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSGDGGNY 768
KY Y P + F +Q ++ E GN+P+++ + N K+++TR F+ H NS GN
Sbjct: 589 KYNLNAYDPILAEFYVQNSQ----EFGNFPLLDFFINMKVRQTRIFLKAEHFNSSFTGND 644
Query: 769 FFT-PHYPLNQRVLRLGISWDFF 790
F++ P YP +R GI W+FF
Sbjct: 645 FYSAPGYPYRDFNIRFGIVWNFF 667
>gi|120437244|ref|YP_862930.1| hypothetical protein GFO_2916 [Gramella forsetii KT0803]
gi|117579394|emb|CAL67863.1| conserved hypothetical protein [Gramella forsetii KT0803]
Length = 634
Score = 72.8 bits (177), Expect = 8e-11, Method: Composition-based stats.
Identities = 59/237 (24%), Positives = 103/237 (43%), Gaps = 22/237 (9%)
Query: 559 TAPTFYMNTFRSRH--FWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLKNYTYFGLQ 616
TAP F ++S + + W N+ D +I +L F KK + ++NYTYF L
Sbjct: 414 TAPNFNFLLYQSDYLNYNWQNDFDNEISQKLFADFKSKKL-FDVSGSLSQIENYTYFSLD 472
Query: 617 NERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQKS-SNEAVL 675
+ V Q + L+ +++F G+ N I +Q VL
Sbjct: 473 ------------EMGNVKPFQAGSQVRYFKLKAEKEFDFGLFGSYNTIMYQNVLEGLDVL 520
Query: 676 PTPTLNIYSNLFIR-FKIAKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQETEASRTEV 734
P ++++ + F L G +YF++Y Y P + F +Q ++ ++
Sbjct: 521 NLPDFVTRNSIYYKDFWFDNALYLQTGFTFKYFSEYEMNAYDPVLAEFYVQNSQ----KL 576
Query: 735 GNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVLRLGISWDFF 790
G YP+++ + N K+ + R F + ++N D N F P YP ++R G+ WD F
Sbjct: 577 GGYPVVDYFFNAKVDKARIFFKLQNLNDLIDSNNNFTAPGYPYTDFLIRFGLVWDLF 633
>gi|126648123|ref|ZP_01720617.1| hypothetical protein ALPR1_15394 [Algoriphagus sp. PR1]
gi|126575716|gb|EAZ80026.1| hypothetical protein ALPR1_15394 [Algoriphagus sp. PR1]
Length = 650
Score = 71.2 bits (173), Expect = 3e-10, Method: Composition-based stats.
Identities = 77/315 (24%), Positives = 126/315 (40%), Gaps = 42/315 (13%)
Query: 485 NPLFGGYEKINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLL 544
NP F +K N ++ +GGQL+ + N G+ + G LN +
Sbjct: 370 NPYFEEDDKFN--EVYIGGQLVGELSENWSLTAN----------GEYLLPGAYRLNGVFV 417
Query: 545 GDTVQFAATAFLHRTAPTFYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGY 604
++ T L++ PT + + W N+ + ++ G L + +LR
Sbjct: 418 SPWLELEYTKALYK--PTSVQQNYMGNFYQWDNDFENTGVDQIKGKVILNFDKIRLRPSL 475
Query: 605 DV--LKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGI-MNWQ 661
+ + NY YF +R+A ND G +L L +F++G W+
Sbjct: 476 TINRVNNYIYF--NEDRIATQND--------------GEAFMLMPGLIANFQIGKKFRWE 519
Query: 662 NLITFQKSSNEAVLPTPTLNIYSNLFIRFKIAKVLD---CDFGVDGRYFTKYYAPEYIPG 718
N + F + S Y+N F D G+D R+ + Y+A Y P
Sbjct: 520 NELIFTQISGVGSDAFRVPKFYANTKFYFDSPAFNDNVYIQLGIDIRFKSDYFADAYSPA 579
Query: 719 MGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSG--DGGNYFFTPHYPL 776
F IQ + V YP+ + + +F++ RTR + +H NSG YF TP Y
Sbjct: 580 TQQFFIQNS----FNVFAYPVGDVFLDFRINRTRVLLKYNHFNSGLMKEEGYFVTPGYTG 635
Query: 777 NQRVLRLGISWDFFN 791
+ + LGISW F+
Sbjct: 636 YKSFIDLGISWYLFD 650
>gi|88801502|ref|ZP_01117030.1| hypothetical protein PI23P_02547 [Polaribacter irgensii 23-P]
gi|88782160|gb|EAR13337.1| hypothetical protein PI23P_02547 [Polaribacter irgensii 23-P]
Length = 656
Score = 70.1 bits (170), Expect = 6e-10, Method: Composition-based stats.
Identities = 82/312 (26%), Positives = 133/312 (42%), Gaps = 49/312 (15%)
Query: 493 KINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTV---- 548
KI N IS G N K + +NA+A I G+ + GN L G+ V
Sbjct: 379 KIEGNAISFGADW---NAKIKKFQLNADASIT---PGKGRLSGNY-----LKGEAVYKRD 427
Query: 549 ---QFAATAFLHRTAPTFYMNTFRSRH--FWWGNNLDQQIHSRLLGAFTLKKTRTKLRVG 603
+ T L+ AP F +S + + W +N +++R LG F L +
Sbjct: 428 SIFKIKGTLLLNSKAPNFNTLLHQSEYDDYNWQHNF-SNVNTRDLG-FDLTSKWINASLN 485
Query: 604 YDVLKNYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNL 663
+ L NYTYF Q++ +Q IS L ++ ++F LG N
Sbjct: 486 FTNLDNYTYFDAQSQP----------------QQFDAQISYLKVKANKEFTLGKFALDNT 529
Query: 664 ITFQK-SSNEAVLPTPTLNIYSNLFIR---FKIAKVLDCDFGVDGRYFTKYYAPEYIPGM 719
+ +Q SS V P + L+ FK K + + G+ +YF+ Y A Y P +
Sbjct: 530 LMYQNVSSGSTVFRVPEFVSRNTLYYTDYWFK-GKPMLVNIGITFKYFSAYAANAYNPLL 588
Query: 720 GSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQ 778
F +Q E ++G +P + + N +++RTR + + + S NYF P+YP
Sbjct: 589 AEFRLQNEE----QIG-FPTFDVFFNAQVRRTRLYFKVDNATSKFTSKNYFSAPNYPYRD 643
Query: 779 RVLRLGISWDFF 790
+R G+ W++F
Sbjct: 644 LTIRFGLVWNWF 655
>gi|86142126|ref|ZP_01060650.1| hypothetical protein MED217_03305 [Flavobacterium sp. MED217]
gi|85831689|gb|EAQ50145.1| hypothetical protein MED217_03305 [Leeuwenhoekiella blandensis
MED217]
Length = 676
Score = 67.0 bits (162), Expect = 5e-09, Method: Composition-based stats.
Identities = 55/203 (27%), Positives = 94/203 (46%), Gaps = 19/203 (9%)
Query: 603 GYDVLKNYT-YFGLQNERVANGNDYLVK--------NNQVNVRQHSGA---ISLLTLQLQ 650
YD +K + YF +Q+ N + LV N + + Q A ++ L L+L
Sbjct: 477 AYDNVKTSSLYFNMQSNNWLNASAELVNIADYAYFAKNDLGLTQSFQAGENVTYLKLKLN 536
Query: 651 QDFKLGIMNWQNLITFQK-SSNEAVLPTPTLNIYSNLFIRFK-IAKVLDCDFGVDGRYFT 708
++F+ G N + +Q SS + L P + + L+ + K L G YFT
Sbjct: 537 KEFRFGKFALDNTVAYQNVSSGMSYLNVPEVITRNTLYYTDQWFQKALFVQTGFRFNYFT 596
Query: 709 KYYAPEYIPGMGSFGIQETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINS-GDGGN 767
+Y Y P + F +Q + ++G +P+++ + N K+++TR F + HINS +
Sbjct: 597 EYNMNAYDPVLAEFYVQNDQ----QLGGFPLLDLFFNMKVRQTRIFFIAEHINSLVSPSD 652
Query: 768 YFFTPHYPLNQRVLRLGISWDFF 790
Y+ P YP R G+ W+FF
Sbjct: 653 YYSAPGYPYRDFTFRFGLVWNFF 675
>gi|86134238|ref|ZP_01052820.1| hypothetical protein MED152_06000 [Tenacibaculum sp. MED152]
gi|85821101|gb|EAQ42248.1| hypothetical protein MED152_06000 [Polaribacter dokdonensis MED152]
Length = 639
Score = 65.1 bits (157), Expect = 2e-08, Method: Composition-based stats.
Identities = 76/307 (24%), Positives = 131/307 (42%), Gaps = 39/307 (12%)
Query: 493 KINKNDISVGGQLLKANGKTLHYNINAEAWIAGDRAGQLHIDGNADLNFPLLGDTVQFA- 551
++ N IS G N + ++N+NA+A I G + GN + FA
Sbjct: 362 RLESNAISFGADW---NARIKNFNLNADASIT---PGSGRLSGNYLRGTAVYEKDSLFAI 415
Query: 552 -ATAFLHRTAPTFYMNTFRSRH--FWWGNNLDQQIHSRLLGAFTLKKTRTKLRVGYDVLK 608
+ + +P F +S + + W N +++R LG F V + +
Sbjct: 416 KGSLLISSKSPNFNTLLHQSSYDDYNWQNEF-SNVNTRDLG-FAFSSKWLNGSVNFTNID 473
Query: 609 NYTYFGLQNERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDFKLGIMNWQNLITFQK 668
NYTYF + +Q + I+ L ++ ++FK + N + +Q
Sbjct: 474 NYTYF----------------DEDSKPQQFADQITYLKVKANREFKFWKLALDNTVMYQN 517
Query: 669 SSN-EAVLPTP---TLNIYSNLFIRFKIAKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGI 724
SN +V P T N + FK + + GV +YFT Y Y P + F +
Sbjct: 518 VSNGNSVFRVPEFVTRNTFYYADNWFK-GDAMFVNIGVTFKYFTAYNVNAYNPLLAEFRL 576
Query: 725 QETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSG-DGGNYFFTPHYPLNQRVLRL 783
Q E E+G +P I+ + N +++RTR ++ + ++ SG NYF P+YP +R
Sbjct: 577 QNDE----EIG-FPTIDVFFNARVRRTRLYLKVDNVTSGFTKKNYFSAPNYPYRDFTVRF 631
Query: 784 GISWDFF 790
G+ W++F
Sbjct: 632 GLVWNWF 638
>gi|110638203|ref|YP_678412.1| hypothetical protein CHU_1803 [Cytophaga hutchinsonii ATCC 33406]
gi|110280884|gb|ABG59070.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 648
Score = 63.9 bits (154), Expect = 4e-08, Method: Composition-based stats.
Identities = 62/250 (24%), Positives = 107/250 (42%), Gaps = 43/250 (17%)
Query: 560 APTFYMNTFRSRHFWWGNNLDQQIHSRLLGAFTLKKTRTKL--RVGYDVLKNYTYFGLQN 617
+PT +RS W ++L+ Q +R +T +K + + + Y++ +NY YF
Sbjct: 424 SPTMLQRAYRSNLVSWDSSLNLQHCNRFFAGYTFRKKKIDIIPSLTYNLYENYIYF---- 479
Query: 618 ERVANGNDYLVKNNQVNVRQHSGAISLLTLQLQQDF--KLGI------MNWQNLITFQKS 669
ND +G +S T + Q F KLG+ ++ N +
Sbjct: 480 ------ND-------------AGTVSQTTDKTQTQFQPKLGLRFNLWKIHIYNEFQYNVV 520
Query: 670 SNEAVLPTPTLNIYSNLFIR---FKIAKVLDCDFGVDGRYFTKYYAPEYIPGMGSFGIQ- 725
+ E+ L P + + LF+ FK A +L F D Y Y A Y P + + +
Sbjct: 521 NTESKLQMPPIVNATQLFVEAWLFKRATLLQVGF--DVLYRGAYQANGYNPLIQQYYVTT 578
Query: 726 ----ETEASRTEVGNYPIINAYANFKLQRTRFFIMMSHINSGDGGNYFFTPHYPLNQRVL 781
+ + Y +++ + N +++ R FI MS++ +G G Y+ TP Y R +
Sbjct: 579 PTGGDRALDYNYMNQYFVVDFFVNMQIRTARLFIKMSNLTTGKGNGYYSTPTYTAIPRTI 638
Query: 782 RLGISWDFFN 791
GISW FF+
Sbjct: 639 DFGISWRFFD 648
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.319 0.135 0.405
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,105,474,425
Number of Sequences: 5470121
Number of extensions: 141092504
Number of successful extensions: 456725
Number of sequences better than 1.0e-05: 28
Number of HSP's better than 0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 456567
Number of HSP's gapped (non-prelim): 56
length of query: 791
length of database: 1,894,087,724
effective HSP length: 141
effective length of query: 650
effective length of database: 1,122,800,663
effective search space: 729820430950
effective search space used: 729820430950
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 134 (56.2 bits)