BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= pPI0464
(102 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540616|ref|NP_905095.1| hypothetical protein PG0843 [P... 153 2e-36
gi|153807867|ref|ZP_01960535.1| hypothetical protein BACCAC... 131 1e-29
gi|53712573|ref|YP_098565.1| hypothetical protein BF1281 [B... 131 1e-29
gi|153807827|ref|ZP_01960495.1| hypothetical protein BACCAC... 69 1e-10
>gi|34540616|ref|NP_905095.1| hypothetical protein PG0843 [Porphyromonas gingivalis W83]
gi|34396929|gb|AAQ65994.1| hypothetical protein PG_0843 [Porphyromonas gingivalis W83]
Length = 103
Score = 153 bits (387), Expect = 2e-36, Method: Composition-based stats.
Identities = 77/102 (75%), Positives = 90/102 (88%)
Query: 1 MTINELLGKPVWQMTGEELLFLAQHGNMSTSGETAKASSSKEERRYVYGLAGIARLFGCS 60
MT++ELL KPVW+MTGEELLFLAQ G+ G+T + +KEER +VYGL+G+ARLFGCS
Sbjct: 1 MTLHELLEKPVWKMTGEELLFLAQQGSTQQEGDTQDKAPAKEERHFVYGLSGLARLFGCS 60
Query: 61 LPTANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
LPTANRIKQSGKI+RAITQ+GRKII+DADLALELAGRK GGR
Sbjct: 61 LPTANRIKQSGKIDRAITQIGRKIIIDADLALELAGRKVGGR 102
>gi|153807867|ref|ZP_01960535.1| hypothetical protein BACCAC_02152 [Bacteroides caccae ATCC 43185]
gi|149129476|gb|EDM20690.1| hypothetical protein BACCAC_02152 [Bacteroides caccae ATCC 43185]
Length = 102
Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats.
Identities = 67/100 (67%), Positives = 81/100 (81%), Gaps = 2/100 (2%)
Query: 3 INELLGKPVWQMTGEELLFLAQHGNMSTSGETAKASSSKEERRYVYGLAGIARLFGCSLP 62
+ ELL KPVWQMTGEE +FL++H S E + ER+YVYG+ GIA+LFGCSLP
Sbjct: 4 LQELLLKPVWQMTGEEFIFLSKHA--SGQAEAQPRPITDTERKYVYGILGIAKLFGCSLP 61
Query: 63 TANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
TANRIK+SGKI++AITQ+GRKIIVD +LALELAG+KTGGR
Sbjct: 62 TANRIKKSGKIDKAITQIGRKIIVDVELALELAGKKTGGR 101
>gi|53712573|ref|YP_098565.1| hypothetical protein BF1281 [Bacteroides fragilis YCH46]
gi|150005002|ref|YP_001299746.1| hypothetical protein BVU_2467 [Bacteroides vulgatus ATCC 8482]
gi|52215438|dbj|BAD48031.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|149933426|gb|ABR40124.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 102
Score = 131 bits (329), Expect = 1e-29, Method: Composition-based stats.
Identities = 67/102 (65%), Positives = 81/102 (79%), Gaps = 1/102 (0%)
Query: 1 MTINELLGKPVWQMTGEELLFLAQHGNMSTSGETAKASSSKEERRYVYGLAGIARLFGCS 60
M I ELL KPVWQMTGEE + L +H A+ ++ E ++YVYG+ GIARLFGCS
Sbjct: 1 MEIRELLSKPVWQMTGEEFILLNRHALQEREARAAQPAADTE-KKYVYGIGGIARLFGCS 59
Query: 61 LPTANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
+PTANRIK+SGKI+RAITQ+GRKIIVDAD+ALELAG K+GGR
Sbjct: 60 MPTANRIKKSGKIDRAITQIGRKIIVDADMALELAGHKSGGR 101
>gi|153807827|ref|ZP_01960495.1| hypothetical protein BACCAC_02111 [Bacteroides caccae ATCC 43185]
gi|149129436|gb|EDM20650.1| hypothetical protein BACCAC_02111 [Bacteroides caccae ATCC 43185]
Length = 43
Score = 68.6 bits (166), Expect = 1e-10, Method: Composition-based stats.
Identities = 35/42 (83%), Positives = 41/42 (97%)
Query: 61 LPTANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
+PTANRIK+SGKI+RAITQ+GRKIIVDAD+ALELAG K+GGR
Sbjct: 1 MPTANRIKKSGKIDRAITQIGRKIIVDADMALELAGHKSGGR 42
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.316 0.133 0.372
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 342,910,851
Number of Sequences: 5470121
Number of extensions: 10956391
Number of successful extensions: 27562
Number of sequences better than 1.0e-05: 4
Number of HSP's better than 0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 27556
Number of HSP's gapped (non-prelim): 4
length of query: 102
length of database: 1,894,087,724
effective HSP length: 71
effective length of query: 31
effective length of database: 1,505,709,133
effective search space: 46676983123
effective search space used: 46676983123
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 124 (52.4 bits)