BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= SSA_0143 
         (111 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|125717022|ref|YP_001034155.1|  Conserved uncharacterized ...   172   6e-42
gi|157075388|gb|ABV10071.1|  lipoprotein, putative [Streptoc...   132   5e-30
gi|157074918|gb|ABV09601.1|  conserved hypothetical protein ...    59   8e-08
gi|146318510|ref|YP_001198222.1|  Translation initiation fac...    55   2e-06
gi|81096275|ref|ZP_00874622.1|  hypothetical protein SsuiDRA...    54   2e-06
gi|157076349|gb|ABV11032.1|  lipoprotein, putative [Streptoc...    54   3e-06
>gi|125717022|ref|YP_001034155.1| Conserved uncharacterized protein [Streptococcus sanguinis SK36]
 gi|125496939|gb|ABN43605.1| Conserved uncharacterized protein [Streptococcus sanguinis SK36]
          Length = 111

 Score =  172 bits (436), Expect = 6e-42,   Method: Composition-based stats.
 Identities = 111/111 (100%), Positives = 111/111 (100%)

Query: 1   MDISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGES 60
           MDISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGES
Sbjct: 1   MDISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGES 60

Query: 61  GGFLIEFLPKGVKVADKENFTDNSDAGQDRIWTGVGLNSFDEQGSFYYRVD 111
           GGFLIEFLPKGVKVADKENFTDNSDAGQDRIWTGVGLNSFDEQGSFYYRVD
Sbjct: 61  GGFLIEFLPKGVKVADKENFTDNSDAGQDRIWTGVGLNSFDEQGSFYYRVD 111
>gi|157075388|gb|ABV10071.1| lipoprotein, putative [Streptococcus gordonii str. Challis substr.
           CH1]
          Length = 181

 Score =  132 bits (333), Expect = 5e-30,   Method: Composition-based stats.
 Identities = 77/109 (70%), Positives = 94/109 (86%)

Query: 1   MDISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGES 60
           M+IS LA+GN+AS++GTWQ+  G QLVFD+ GLVS+ YE  GASLTDYGTAAGGVYGG++
Sbjct: 71  MNISELADGNFASIQGTWQNDKGEQLVFDENGLVSAEYEFGGASLTDYGTAAGGVYGGQT 130

Query: 61  GGFLIEFLPKGVKVADKENFTDNSDAGQDRIWTGVGLNSFDEQGSFYYR 109
           GGFL+EF+P GVK+AD ENF D+SD  +DR+WTGVG+ SF EQGSFYYR
Sbjct: 131 GGFLLEFIPSGVKLADTENFKDSSDISRDRLWTGVGIQSFAEQGSFYYR 179
>gi|157074918|gb|ABV09601.1| conserved hypothetical protein [Streptococcus gordonii str. Challis
           substr. CH1]
          Length = 203

 Score = 58.9 bits (141), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 5/110 (4%)

Query: 2   DISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGESG 61
           D +A+ +GN+ SVKGTW++A G  + FD+KG+V+    +   +    G     V  G + 
Sbjct: 98  DFNAMTHGNFTSVKGTWKNAEGQTITFDEKGIVADETSISSFNYDATGNLIVNVQTGLT- 156

Query: 62  GFLIEFLPKGVKVADKENFTDNSDAGQDRIWTGVGLNSFDEQGSFYYRVD 111
           G+ I F+ KG  +  +    D SD  +DRI+ G G+   D   + YYRVD
Sbjct: 157 GYSIWFVMKGNAIVPE---GDTSDTNRDRIFAGQGMPE-DIASAAYYRVD 202
>gi|146318510|ref|YP_001198222.1| Translation initiation factor 1 (IF-1) [Streptococcus suis 05ZYH33]
 gi|145689316|gb|ABP89822.1| Translation initiation factor 1 (IF-1) [Streptococcus suis 05ZYH33]
          Length = 249

 Score = 54.7 bits (130), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 30/73 (41%), Positives = 38/73 (52%)

Query: 1   MDISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGES 60
           MDI AL  GNY S+ GTWQ++ GN LVF+ KGLV+    L G      G    G      
Sbjct: 59  MDIDALMQGNYDSILGTWQNSQGNSLVFNSKGLVADSQSLLGRGKITDGIFETGYVDATI 118

Query: 61  GGFLIEFLPKGVK 73
           G   +  +PKG +
Sbjct: 119 GDVTLLMIPKGTQ 131
>gi|81096275|ref|ZP_00874622.1| hypothetical protein SsuiDRAFT_1709 [Streptococcus suis 89/1591]
 gi|80977619|gb|EAP41155.1| hypothetical protein SsuiDRAFT_1709 [Streptococcus suis 89/1591]
          Length = 250

 Score = 54.3 bits (129), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 30/73 (41%), Positives = 38/73 (52%)

Query: 1   MDISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGES 60
           MDI AL  GNY S+ GTWQ++ GN LVF+ KGLV+    L G      G    G      
Sbjct: 59  MDIDALMQGNYDSILGTWQNSQGNSLVFNSKGLVADSQSLLGRGKITDGIFETGYVDATI 118

Query: 61  GGFLIEFLPKGVK 73
           G   +  +PKG +
Sbjct: 119 GDVTLLMIPKGTQ 131
>gi|157076349|gb|ABV11032.1| lipoprotein, putative [Streptococcus gordonii str. Challis substr.
           CH1]
          Length = 332

 Score = 53.9 bits (128), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/94 (36%), Positives = 53/94 (56%), Gaps = 4/94 (4%)

Query: 1   MDISALANGNYASVKGTWQDASGNQLVFDDKGLVSSGYELYGASLTDYGTAAGGVYGGES 60
           +D+ A+  G+Y+S++G W++  G++LVFD  GLV++  +L            GGV  G  
Sbjct: 222 LDLEAIQKGDYSSIEGRWRNGKGSELVFDKNGLVNNDLKLGNNFRMIDSYLQGGVSSGGP 281

Query: 61  GGFLIEFLPKGVKV---ADKENFTDNSDAGQDRI 91
           G  ++ F+P GV +   AD +   D SD  QDRI
Sbjct: 282 GAAIL-FIPAGVDMSVTADDQTIADASDKKQDRI 314
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.313    0.136    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 469,708,247
Number of Sequences: 5470121
Number of extensions: 20416681
Number of successful extensions: 40930
Number of sequences better than 1.0e-05: 7
Number of HSP's better than  0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 40924
Number of HSP's gapped (non-prelim): 7
length of query: 111
length of database: 1,894,087,724
effective HSP length: 79
effective length of query: 32
effective length of database: 1,461,948,165
effective search space: 46782341280
effective search space used: 46782341280
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 124 (52.4 bits)