BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= pPI0464 
         (102 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|34540616|ref|NP_905095.1|  hypothetical protein PG0843 [P...   153   2e-36
gi|153807867|ref|ZP_01960535.1|  hypothetical protein BACCAC...   131   1e-29
gi|53712573|ref|YP_098565.1|  hypothetical protein BF1281 [B...   131   1e-29
gi|153807827|ref|ZP_01960495.1|  hypothetical protein BACCAC...    69   1e-10
>gi|34540616|ref|NP_905095.1| hypothetical protein PG0843 [Porphyromonas gingivalis W83]
 gi|34396929|gb|AAQ65994.1| hypothetical protein PG_0843 [Porphyromonas gingivalis W83]
          Length = 103

 Score =  153 bits (387), Expect = 2e-36,   Method: Composition-based stats.
 Identities = 77/102 (75%), Positives = 90/102 (88%)

Query: 1   MTINELLGKPVWQMTGEELLFLAQHGNMSTSGETAKASSSKEERRYVYGLAGIARLFGCS 60
           MT++ELL KPVW+MTGEELLFLAQ G+    G+T   + +KEER +VYGL+G+ARLFGCS
Sbjct: 1   MTLHELLEKPVWKMTGEELLFLAQQGSTQQEGDTQDKAPAKEERHFVYGLSGLARLFGCS 60

Query: 61  LPTANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
           LPTANRIKQSGKI+RAITQ+GRKII+DADLALELAGRK GGR
Sbjct: 61  LPTANRIKQSGKIDRAITQIGRKIIIDADLALELAGRKVGGR 102
>gi|153807867|ref|ZP_01960535.1| hypothetical protein BACCAC_02152 [Bacteroides caccae ATCC 43185]
 gi|149129476|gb|EDM20690.1| hypothetical protein BACCAC_02152 [Bacteroides caccae ATCC 43185]
          Length = 102

 Score =  131 bits (330), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 67/100 (67%), Positives = 81/100 (81%), Gaps = 2/100 (2%)

Query: 3   INELLGKPVWQMTGEELLFLAQHGNMSTSGETAKASSSKEERRYVYGLAGIARLFGCSLP 62
           + ELL KPVWQMTGEE +FL++H   S   E      +  ER+YVYG+ GIA+LFGCSLP
Sbjct: 4   LQELLLKPVWQMTGEEFIFLSKHA--SGQAEAQPRPITDTERKYVYGILGIAKLFGCSLP 61

Query: 63  TANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
           TANRIK+SGKI++AITQ+GRKIIVD +LALELAG+KTGGR
Sbjct: 62  TANRIKKSGKIDKAITQIGRKIIVDVELALELAGKKTGGR 101
>gi|53712573|ref|YP_098565.1| hypothetical protein BF1281 [Bacteroides fragilis YCH46]
 gi|150005002|ref|YP_001299746.1| hypothetical protein BVU_2467 [Bacteroides vulgatus ATCC 8482]
 gi|52215438|dbj|BAD48031.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
 gi|149933426|gb|ABR40124.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 102

 Score =  131 bits (329), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 67/102 (65%), Positives = 81/102 (79%), Gaps = 1/102 (0%)

Query: 1   MTINELLGKPVWQMTGEELLFLAQHGNMSTSGETAKASSSKEERRYVYGLAGIARLFGCS 60
           M I ELL KPVWQMTGEE + L +H         A+ ++  E ++YVYG+ GIARLFGCS
Sbjct: 1   MEIRELLSKPVWQMTGEEFILLNRHALQEREARAAQPAADTE-KKYVYGIGGIARLFGCS 59

Query: 61  LPTANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
           +PTANRIK+SGKI+RAITQ+GRKIIVDAD+ALELAG K+GGR
Sbjct: 60  MPTANRIKKSGKIDRAITQIGRKIIVDADMALELAGHKSGGR 101
>gi|153807827|ref|ZP_01960495.1| hypothetical protein BACCAC_02111 [Bacteroides caccae ATCC 43185]
 gi|149129436|gb|EDM20650.1| hypothetical protein BACCAC_02111 [Bacteroides caccae ATCC 43185]
          Length = 43

 Score = 68.6 bits (166), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 35/42 (83%), Positives = 41/42 (97%)

Query: 61  LPTANRIKQSGKINRAITQVGRKIIVDADLALELAGRKTGGR 102
           +PTANRIK+SGKI+RAITQ+GRKIIVDAD+ALELAG K+GGR
Sbjct: 1   MPTANRIKKSGKIDRAITQIGRKIIVDADMALELAGHKSGGR 42
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.316    0.133    0.372 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 342,910,851
Number of Sequences: 5470121
Number of extensions: 10956391
Number of successful extensions: 27562
Number of sequences better than 1.0e-05: 4
Number of HSP's better than  0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 27556
Number of HSP's gapped (non-prelim): 4
length of query: 102
length of database: 1,894,087,724
effective HSP length: 71
effective length of query: 31
effective length of database: 1,505,709,133
effective search space: 46676983123
effective search space used: 46676983123
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 124 (52.4 bits)