BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0936
(352 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|150004587|ref|YP_001299331.1| hypothetical protein BVU_2... 249 2e-64
gi|53711891|ref|YP_097883.1| hypothetical protein BF0600 [B... 233 2e-59
gi|156109362|gb|EDO11107.1| hypothetical protein BACOVA_030... 224 8e-57
gi|29347821|ref|NP_811324.1| hypothetical protein BT_2411 [... 220 9e-56
gi|153808353|ref|ZP_01961021.1| hypothetical protein BACCAC... 219 3e-55
gi|156860346|gb|EDO53777.1| hypothetical protein BACUNI_023... 210 1e-52
>gi|150004587|ref|YP_001299331.1| hypothetical protein BVU_2044 [Bacteroides vulgatus ATCC 8482]
gi|149933011|gb|ABR39709.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 320
Score = 249 bits (637), Expect = 2e-64, Method: Composition-based stats.
Identities = 126/293 (43%), Positives = 179/293 (61%), Gaps = 8/293 (2%)
Query: 60 KLDSFAFRVKHHYSEGFNFVVSKDSLMLLRQQPEEVVNNLETDSFAVAKGKSLVVAGIRI 119
K DS + ++ HY+ NF V+ DSLML +Q P D V KG+ LVVA I
Sbjct: 36 KRDSVNYLIERHYTLNSNFEVTSDSLML-QQLPL-------VDVLPVFKGERLVVAEFMI 87
Query: 120 LPNDSIDSVWVRIATEDYVFGWARERQLLKKVDPDTPISQFISLFSNTHLLIFLVVISLI 179
DS+DSVWV++A + GW E++LL+KV P +SQFI FSN+H + F V++
Sbjct: 88 QAADSVDSVWVKVARDQETMGWIHEKELLEKVVPVDSVSQFIHFFSNSHTIAFFVILGFF 147
Query: 180 GVSYLMRKIMKRNANIVHFNDINSFYPTLLAIIVATSATFYASIQLFAPETWREFYFHPS 239
G+ Y+ R I ++ ++ NDI+S +PT+L+ +VAT+AT YASIQ F P TW +FY++PS
Sbjct: 148 GIWYIHRAIRRQKLQLIWLNDIDSVFPTVLSWLVATAATLYASIQHFVPGTWEQFYYNPS 207
Query: 240 LNPFSQPILLNIFLLLVWAMLLIGIAAFDDVRRLLRSGDAFMYLCGLAAICAVNYIVFSV 299
LNPFS P +L++F+ +W ++L+G+A DD+ A YL GL + C Y+ F+
Sbjct: 208 LNPFSLPFILSLFMFNIWGIILVGLATLDDLFHQTHIEAACFYLLGLMSCCIFLYLFFTF 267
Query: 300 LTLYYIGYLLLLVYIYYGIRVWFKKSSDAYECGNCGSRLHRKGRCPHCGVVNE 352
T YY+GY LLVY + S + CGNCG+++ KG CPHCG VNE
Sbjct: 268 TTYYYLGYPCLLVYAVWSFNRIKHASRYKFSCGNCGAKIKNKGVCPHCGAVNE 320
>gi|53711891|ref|YP_097883.1| hypothetical protein BF0600 [Bacteroides fragilis YCH46]
gi|60680119|ref|YP_210263.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
gi|52214756|dbj|BAD47349.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60491553|emb|CAH06305.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
Length = 324
Score = 233 bits (593), Expect = 2e-59, Method: Composition-based stats.
Identities = 117/291 (40%), Positives = 170/291 (58%), Gaps = 7/291 (2%)
Query: 62 DSFAFRVKHHYSEGFNFVVSKDSLMLLRQQPEEVVNNLETDSFAVAKGKSLVVAGIRILP 121
DS + K+HY+ N V DS+ L ++ NL KG +VVA + P
Sbjct: 41 DSLTYLSKYHYTWNTNLEVLDDSVRLEYLPLKDAYVNL-------YKGDRVVVAEFSVHP 93
Query: 122 NDSIDSVWVRIATEDYVFGWARERQLLKKVDPDTPISQFISLFSNTHLLIFLVVISLIGV 181
DS+DS+WV++A V GW R ++L++ P ISQFI LFS+TH F+ + +L
Sbjct: 94 QDSVDSIWVKVAHSQEVQGWVRNKELVRSFVPTDSISQFIHLFSDTHASYFVFIFALFVG 153
Query: 182 SYLMRKIMKRNANIVHFNDINSFYPTLLAIIVATSATFYASIQLFAPETWREFYFHPSLN 241
YL+R MK+ +V+FNDI+S YP L +++A SAT Y ++Q+F P+TW FYF+P+L+
Sbjct: 154 VYLLRAFMKKRLQMVYFNDIDSVYPLFLCLLMAFSATVYETMQVFVPDTWEHFYFNPTLS 213
Query: 242 PFSQPILLNIFLLLVWAMLLIGIAAFDDVRRLLRSGDAFMYLCGLAAICAVNYIVFSVLT 301
PF P +L++FL +W +++ +A DD+ R L A YL GL + C Y F + T
Sbjct: 214 PFKVPFILSVFLTGIWLFIIVTLAVLDDLFRQLTPAAAVFYLLGLMSCCIFCYFFFILTT 273
Query: 302 LYYIGYLLLLVYIYYGIRVWFKKSSDAYECGNCGSRLHRKGRCPHCGVVNE 352
YIGY L +I+ ++ + S Y CGNCG +L KG+CPHCG VNE
Sbjct: 274 HIYIGYFFLACFIWLFVKKVRRGSGYKYRCGNCGQKLREKGQCPHCGAVNE 324
>gi|156109362|gb|EDO11107.1| hypothetical protein BACOVA_03009 [Bacteroides ovatus ATCC 8483]
Length = 381
Score = 224 bits (570), Expect = 8e-57, Method: Composition-based stats.
Identities = 114/291 (39%), Positives = 166/291 (57%), Gaps = 7/291 (2%)
Query: 62 DSFAFRVKHHYSEGFNFVVSKDSLMLLRQQPEEVVNNLETDSFAVAKGKSLVVAGIRILP 121
DS + + HY+ N V DS+ L ++ L +G +VVA + P
Sbjct: 98 DSLTYLYERHYTWNTNLEVVDDSISLECLPIKDTFIQLN-------RGDRVVVAEFAVHP 150
Query: 122 NDSIDSVWVRIATEDYVFGWARERQLLKKVDPDTPISQFISLFSNTHLLIFLVVISLIGV 181
DS+DS+WV++A GW RE++L K P ISQ I LFS+TH F+V+ +L
Sbjct: 151 ADSVDSIWVKLAHTQDEQGWIREKELKKSFAPTDSISQAIHLFSDTHASYFVVIFALFVG 210
Query: 182 SYLMRKIMKRNANIVHFNDINSFYPTLLAIIVATSATFYASIQLFAPETWREFYFHPSLN 241
+YL R ++ +V+FNDI+S YP L +++A SAT Y S+Q+F PETW FYF+P+L+
Sbjct: 211 AYLFRAFRRKQLQMVYFNDIDSVYPLFLCLLMAFSATIYESMQVFVPETWEHFYFNPTLS 270
Query: 242 PFSQPILLNIFLLLVWAMLLIGIAAFDDVRRLLRSGDAFMYLCGLAAICAVNYIVFSVLT 301
PF P +L++FLL +W L++ +A DD+ + L A YL GL + C Y F ++T
Sbjct: 271 PFKVPFILSVFLLSIWLFLIVALAVLDDLFQQLTPAAAIFYLLGLMSCCIFCYFFFILMT 330
Query: 302 LYYIGYLLLLVYIYYGIRVWFKKSSDAYECGNCGSRLHRKGRCPHCGVVNE 352
YIGYL L +I + ++ Y CG+CG +L KG CPHCG +NE
Sbjct: 331 HIYIGYLFLTFFILVFAKKVYRNIGYKYRCGHCGEKLKEKGICPHCGAINE 381
>gi|29347821|ref|NP_811324.1| hypothetical protein BT_2411 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29339722|gb|AAO77518.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 344
Score = 220 bits (561), Expect = 9e-56, Method: Composition-based stats.
Identities = 116/291 (39%), Positives = 167/291 (57%), Gaps = 7/291 (2%)
Query: 62 DSFAFRVKHHYSEGFNFVVSKDSLMLLRQQPEEVVNNLETDSFAVAKGKSLVVAGIRILP 121
DS + HY+ N V+ DS+ + E + +T A+ KG +VVA I P
Sbjct: 61 DSLNYLYDRHYTWNTNLEVTTDSVTM------ECLPIKDT-YIALYKGDRVVVAEFAIHP 113
Query: 122 NDSIDSVWVRIATEDYVFGWARERQLLKKVDPDTPISQFISLFSNTHLLIFLVVISLIGV 181
DS+DSVWV++A GW RE +L + P ISQ I FS+TH F+++ +L
Sbjct: 114 ADSVDSVWVKLAHTQEEQGWIRETELKESFVPTDSISQAIHFFSDTHASYFMIIFALFVG 173
Query: 182 SYLMRKIMKRNANIVHFNDINSFYPTLLAIIVATSATFYASIQLFAPETWREFYFHPSLN 241
YL R ++ +V+FNDI+S YP L +++A SAT Y S+Q+F P+TW FYF+P+L+
Sbjct: 174 VYLFRAFRRKQLQLVYFNDIDSVYPLFLCLLMAFSATVYESMQVFVPDTWEHFYFNPTLS 233
Query: 242 PFSQPILLNIFLLLVWAMLLIGIAAFDDVRRLLRSGDAFMYLCGLAAICAVNYIVFSVLT 301
PF P +L++FLL +W L++ +A DD+ R L A YL GL + C Y F ++T
Sbjct: 234 PFKVPFILSVFLLSIWLFLIVALAVLDDLFRQLTPAAAVFYLLGLMSSCIFCYFFFILMT 293
Query: 302 LYYIGYLLLLVYIYYGIRVWFKKSSDAYECGNCGSRLHRKGRCPHCGVVNE 352
YIGYL L +I+ + + S Y CG+CG +L KG CPHCG +NE
Sbjct: 294 HIYIGYLFLAFFIWVFAKKVHRNISYKYRCGHCGEKLKEKGICPHCGAINE 344
>gi|153808353|ref|ZP_01961021.1| hypothetical protein BACCAC_02647 [Bacteroides caccae ATCC 43185]
gi|149129256|gb|EDM20472.1| hypothetical protein BACCAC_02647 [Bacteroides caccae ATCC 43185]
Length = 290
Score = 219 bits (557), Expect = 3e-55, Method: Composition-based stats.
Identities = 113/291 (38%), Positives = 170/291 (58%), Gaps = 7/291 (2%)
Query: 62 DSFAFRVKHHYSEGFNFVVSKDSLMLLRQQPEEVVNNLETDSFAVAKGKSLVVAGIRILP 121
DS + + HY+ N ++ DS++L ++ NL +G +VVA + P
Sbjct: 7 DSLNYLYERHYTWNTNLELTADSILLECLPIKDTYINL-------YRGDRVVVAEFAVHP 59
Query: 122 NDSIDSVWVRIATEDYVFGWARERQLLKKVDPDTPISQFISLFSNTHLLIFLVVISLIGV 181
DS+DSVWV++A GW RE +L + P ISQ I LFS+TH F+V+ +L
Sbjct: 60 ADSVDSVWVKLAHTQDEQGWIREVELKRSFVPTDSISQAIHLFSDTHASYFVVIFALFVG 119
Query: 182 SYLMRKIMKRNANIVHFNDINSFYPTLLAIIVATSATFYASIQLFAPETWREFYFHPSLN 241
YL+R ++ +V+FNDI+S YP L +++A SAT Y S+Q+F P+TW FYF+P+L+
Sbjct: 120 VYLLRAFRRKQLQMVYFNDIDSVYPLFLCLLMAFSATVYESMQVFVPDTWEHFYFNPTLS 179
Query: 242 PFSQPILLNIFLLLVWAMLLIGIAAFDDVRRLLRSGDAFMYLCGLAAICAVNYIVFSVLT 301
PF P +L++FLL +W +++ +A DD+ R L A YL GL + C Y F ++
Sbjct: 180 PFKVPFILSVFLLSIWLFVIVSLAVLDDLFRQLTPAAAIFYLLGLMSCCIFCYFFFILMA 239
Query: 302 LYYIGYLLLLVYIYYGIRVWFKKSSDAYECGNCGSRLHRKGRCPHCGVVNE 352
YIGYL L+++I ++ + S Y CG+CG +L KG CPHCG +NE
Sbjct: 240 HIYIGYLFLILFILVFVKKVYWNSRYKYCCGHCGEKLKEKGVCPHCGAINE 290
>gi|156860346|gb|EDO53777.1| hypothetical protein BACUNI_02392 [Bacteroides uniformis ATCC 8492]
Length = 351
Score = 210 bits (534), Expect = 1e-52, Method: Composition-based stats.
Identities = 116/292 (39%), Positives = 170/292 (58%), Gaps = 8/292 (2%)
Query: 62 DSFAFRVKHHYSEGFNFVVSKDSLMLLRQQPEEVVNNLETDSFAVAKGKSLVVAGIRILP 121
DS A+ + HY+ N V DS+ L ++ N L +G +VVA I P
Sbjct: 67 DSLAYLFERHYTWNTNLEVLADSVNLACLPVKDCYNML-------YRGDRVVVAEFAIHP 119
Query: 122 NDSIDSVWVRIATEDYVFGWARERQLLKKVDPDTPISQFISLFSNTHLLIFLVVISLIGV 181
DS+DSVWV++A V GW RE +++ P ISQ I LFS+TH F+++ +L
Sbjct: 120 ADSVDSVWVKLAHSQEVQGWLRESEMMHAFVPTDSISQAIYLFSDTHASYFIIIFALFVA 179
Query: 182 SYLMRKIMKRNANIVHFNDINSFYPTLLAIIVATSATFYASIQLFAPETWREFYFHPSLN 241
+L R ++ +V+FNDI+S YP LL +++A AT Y SIQ+FAPETW+ FYF+P+L+
Sbjct: 180 VWLFRAFRRKQLRMVYFNDIDSLYPLLLCLLMAFCATIYESIQVFAPETWQHFYFNPTLS 239
Query: 242 PFSQPILLNIFLLLVWAMLLIGIAAFDDVRRLLRSGDAFMYLCGLAAICAVNYIVFSVLT 301
PF P++L+ FL+ +W +++ +A DD+ R L A YL GLA+ C Y F + T
Sbjct: 240 PFKVPLVLSAFLMGIWLFIVVLLAVLDDLFRQLSPAAAVFYLLGLASCCIFCYFFFILTT 299
Query: 302 LYYIGYLLLLVYIYYGI-RVWFKKSSDAYECGNCGSRLHRKGRCPHCGVVNE 352
Y+GYL L +++ + R+ + Y CG CG +L KG CPHCG +NE
Sbjct: 300 SIYVGYLFLTAFVWVFLKRLRISLLASRYRCGRCGQKLREKGVCPHCGAINE 351
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.327 0.140 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,261,458,025
Number of Sequences: 5470121
Number of extensions: 50288163
Number of successful extensions: 197513
Number of sequences better than 1.0e-05: 6
Number of HSP's better than 0.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 197500
Number of HSP's gapped (non-prelim): 6
length of query: 352
length of database: 1,894,087,724
effective HSP length: 134
effective length of query: 218
effective length of database: 1,161,091,510
effective search space: 253117949180
effective search space used: 253117949180
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 130 (54.7 bits)