BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= SSA_1758 
         (201 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|125718557|ref|YP_001035690.1|  hypothetical protein SSA_1...   394   e-108
gi|125718556|ref|YP_001035689.1|  hypothetical protein SSA_1...   293   4e-78
gi|157075310|gb|ABV09993.1|  hypothetical protein SGO_1064 [...   131   3e-29
gi|157076257|gb|ABV10940.1|  hypothetical protein SGO_1065 [...   128   2e-28
gi|125718558|ref|YP_001035691.1|  hypothetical protein SSA_1...   126   8e-28
gi|157075074|gb|ABV09757.1|  hypothetical protein SGO_1063 [...   126   8e-28
gi|125717787|ref|YP_001034920.1|  hypothetical protein SSA_0...   125   1e-27
gi|125717788|ref|YP_001034921.1|  hypothetical protein SSA_0...   124   3e-27
gi|125717786|ref|YP_001034919.1|  hypothetical protein SSA_0...   123   5e-27
gi|157076021|gb|ABV10704.1|  hypothetical protein SGO_1061 [...   122   1e-26
gi|157075688|gb|ABV10371.1|  hypothetical protein SGO_1062 [...   119   8e-26
gi|125718559|ref|YP_001035692.1|  hypothetical protein SSA_1...    88   2e-16
>gi|125718557|ref|YP_001035690.1| hypothetical protein SSA_1758 [Streptococcus sanguinis SK36]
 gi|125498474|gb|ABN45140.1| Hypothetical protein SSA_1758 [Streptococcus sanguinis SK36]
          Length = 202

 Score =  394 bits (1011), Expect = e-108,   Method: Composition-based stats.
 Identities = 201/201 (100%), Positives = 201/201 (100%)

Query: 1   TIRVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIE 60
           TIRVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIE
Sbjct: 2   TIRVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIE 61

Query: 61  FSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVND 120
           FSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVND
Sbjct: 62  FSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVND 121

Query: 121 GSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPK 180
           GSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPK
Sbjct: 122 GSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPK 181

Query: 181 AEIVYNLEIRRIDERELDKWQ 201
           AEIVYNLEIRRIDERELDKWQ
Sbjct: 182 AEIVYNLEIRRIDERELDKWQ 202
>gi|125718556|ref|YP_001035689.1| hypothetical protein SSA_1757 [Streptococcus sanguinis SK36]
 gi|125498473|gb|ABN45139.1| Hypothetical protein SSA_1757 [Streptococcus sanguinis SK36]
          Length = 202

 Score =  293 bits (750), Expect = 4e-78,   Method: Composition-based stats.
 Identities = 143/199 (71%), Positives = 168/199 (84%)

Query: 3   RVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIEFS 62
           R KSILLSL C FVLVIGGKFY+DRMKVDNLYRHGFQLYEEQIATYLKEHYSG+SKIEFS
Sbjct: 4   RTKSILLSLLCFFVLVIGGKFYMDRMKVDNLYRHGFQLYEEQIATYLKEHYSGVSKIEFS 63

Query: 63  PIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVNDGS 122
           PIF SGG GE F +ARI+PVVYD++GNKVYLRNDG +D  VP+Y   +G++L FNVNDGS
Sbjct: 64  PIFISGGGGESFVNARIVPVVYDSYGNKVYLRNDGVLDMAVPDYGTLAGLDLSFNVNDGS 123

Query: 123 EIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPKAE 182
           EI+YL N + ES+    YQHLPE LKL+  + TD ++TA+S++G LKGVEKNSQGSP+AE
Sbjct: 124 EIVYLRNNERESVSSEVYQHLPEQLKLQKEEFTDEVMTAFSREGHLKGVEKNSQGSPQAE 183

Query: 183 IVYNLEIRRIDERELDKWQ 201
           I+YNLEIRR+  RE  +WQ
Sbjct: 184 IIYNLEIRRVQGREAFEWQ 202
>gi|157075310|gb|ABV09993.1| hypothetical protein SGO_1064 [Streptococcus gordonii str. Challis
           substr. CH1]
          Length = 189

 Score =  131 bits (329), Expect = 3e-29,   Method: Composition-based stats.
 Identities = 78/165 (47%), Positives = 99/165 (60%), Gaps = 4/165 (2%)

Query: 33  LYRHGFQLYEEQIATYLKEHYSGISKIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVY 92
           LY+ GF+L EEQ+ATY+KEHYSG+SKIEFSPIF  GG G+    A I+  VYD + NK Y
Sbjct: 25  LYKRGFRLLEEQLATYIKEHYSGVSKIEFSPIFVQGGDGQTMFSANIVFSVYDKNQNKAY 84

Query: 93  LRNDGTI-DTLVPNYVLTSGIELDFNVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKT 151
               GTI D   P Y     + +DFN  D  EII L     + I+V  +QHLP   KL  
Sbjct: 85  F--GGTIGDITYPTYNNVWDVRMDFNGWD-EEIIELRTSDEKLIDVSDFQHLPPEAKLTV 141

Query: 152 YKSTDAIITAYSQKGTLKGVEKNSQGSPKAEIVYNLEIRRIDERE 196
               D  I A  Q   ++GV K+S+GS   ++VYN EI++ DERE
Sbjct: 142 SNKVDENIEALVQDKQIEGVRKDSKGSADLDVVYNDEIKKGDERE 186
>gi|157076257|gb|ABV10940.1| hypothetical protein SGO_1065 [Streptococcus gordonii str. Challis
           substr. CH1]
          Length = 187

 Score =  128 bits (321), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 78/165 (47%), Positives = 101/165 (61%), Gaps = 5/165 (3%)

Query: 33  LYRHGFQLYEEQIATYLKEHYSGISKIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVY 92
           LY+ GF+L EEQ+ATY+KEHYSG+SKIEFSPIF  GG G+    A I+PV+YD HGNK Y
Sbjct: 25  LYKRGFRLLEEQLATYIKEHYSGVSKIEFSPIFVQGGDGQTMFDANIVPVIYDNHGNKAY 84

Query: 93  L-RNDGTIDTLVPNYVLTSGIELDFNVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKT 151
           L R  G       +Y L   + LDFN  D  E+I + +   + +++  Y+ LP   KL  
Sbjct: 85  LGRKVGKHG--YASYGLLGDLRLDFNGFD-EEVIEI-DVDGKFLDITNYKSLPPKAKLTI 140

Query: 152 YKSTDAIITAYSQKGTLKGVEKNSQGSPKAEIVYNLEIRRIDERE 196
             S D  I A    G LK V K+ +GS KAE+ YN EIR+ +E E
Sbjct: 141 NPSMDENIVALVNAGHLKDVVKSEKGSLKAEVAYNTEIRKGNEWE 185
>gi|125718558|ref|YP_001035691.1| hypothetical protein SSA_1759 [Streptococcus sanguinis SK36]
 gi|125498475|gb|ABN45141.1| Hypothetical protein SSA_1759 [Streptococcus sanguinis SK36]
          Length = 101

 Score =  126 bits (316), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 64/99 (64%), Positives = 80/99 (80%)

Query: 103 VPNYVLTSGIELDFNVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAY 162
           V +Y +T+GI+LDFNVNDGSEIIYL+N+KNES+ V  YQHLP+ LKL   + TD  + A+
Sbjct: 3   VADYEMTAGIDLDFNVNDGSEIIYLHNKKNESVSVEGYQHLPDGLKLNKDEITDEKMDAF 62

Query: 163 SQKGTLKGVEKNSQGSPKAEIVYNLEIRRIDERELDKWQ 201
           S++G LKGVEKNS GSPKAEIVYNLEIRR+  RE  +W+
Sbjct: 63  SREGYLKGVEKNSLGSPKAEIVYNLEIRRVQGREFYEWK 101
>gi|157075074|gb|ABV09757.1| hypothetical protein SGO_1063 [Streptococcus gordonii str. Challis
           substr. CH1]
          Length = 202

 Score =  126 bits (316), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 81/195 (41%), Positives = 108/195 (55%), Gaps = 11/195 (5%)

Query: 5   KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
           K  L  L  + VLV GG F  +++       K   LY+HGFQL EEQIATY+KEHY GI 
Sbjct: 3   KKFLTILALILVLVSGGIFVYNKLTKPNFGPKTTRLYQHGFQLLEEQIATYIKEHYKGIK 62

Query: 58  KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLV-PNYVLTSGIELDF 116
           KIEFSPI+ +G  G    +A I+P++YD HGNK   +  G       P Y     + L F
Sbjct: 63  KIEFSPIYVTGDDGSSMLNAEIVPIIYDYHGNKA--KFGGLYKNFQHPAYGTIGYLRLSF 120

Query: 117 NVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQ 176
           + + G   I L  +  E  +V   Q LPE +K K  K  D       ++  LKGVEK+  
Sbjct: 121 DYS-GKPYIELSTDTGEFKDVTYGQSLPEEIKGKKIKDIDFNFETLIKEERLKGVEKSDV 179

Query: 177 GSPKAEIVYNLEIRR 191
           GSP AE++YNLE+++
Sbjct: 180 GSPNAEVIYNLELKK 194
>gi|125717787|ref|YP_001034920.1| hypothetical protein SSA_0948 [Streptococcus sanguinis SK36]
 gi|125497704|gb|ABN44370.1| Hypothetical protein SSA_0948 [Streptococcus sanguinis SK36]
          Length = 197

 Score =  125 bits (315), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 75/195 (38%), Positives = 113/195 (57%), Gaps = 11/195 (5%)

Query: 5   KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
           K I++ L  +F+++ GG +  +++       K   LY+ GF+L EEQ  TY KEHY GI 
Sbjct: 3   KKIIIVLTILFIVISGGIYIYNKLTKPNLGPKTTKLYQRGFRLLEEQYGTYFKEHYKGIE 62

Query: 58  KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPN-YVLTSGIELDF 116
           KI+FSPI+  G  G    +A + P +YD +GNK  L    TI+   PN Y L + I LDF
Sbjct: 63  KIKFSPIYIEGDNGGSMLNAYVRPTIYDKYGNKATLGT--TIENYTPNSYGLVTHIFLDF 120

Query: 117 NVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQ 176
           +   G+++I L +     I+V   +HLP+  KL   K TD  I    + G LK V K+ +
Sbjct: 121 D-GAGNDVIELMDSHGNDIDVSNAKHLPDEAKLTRAKGTDLNIELLVEDGQLKDVVKDEK 179

Query: 177 GSPKAEIVYNLEIRR 191
           G+P+AEI+YN+++ +
Sbjct: 180 GTPEAEIIYNVKLSK 194
>gi|125717788|ref|YP_001034921.1| hypothetical protein SSA_0949 [Streptococcus sanguinis SK36]
 gi|125497705|gb|ABN44371.1| Hypothetical protein SSA_0949 [Streptococcus sanguinis SK36]
          Length = 202

 Score =  124 bits (312), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 75/195 (38%), Positives = 114/195 (58%), Gaps = 11/195 (5%)

Query: 5   KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
           K I++ L  +F++  GG +  +++       K   LY+HGF+L EEQI TY+KE+YSGI 
Sbjct: 3   KKIIIGLTILFIVFSGGIYMYNKLTKPNFGSKTTKLYQHGFRLLEEQIGTYIKENYSGIE 62

Query: 58  KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTL-VPNYVLTSGIELDF 116
           KIEFSPI+ +G  G    +A ++P+VYD++GNK   +  G       P Y     + + F
Sbjct: 63  KIEFSPIYITGDDGSSMLNAEVVPIVYDSYGNKA--KFGGLYKNFQQPAYGTIGYLRVSF 120

Query: 117 NVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQ 176
           + + G   I L  +  E  EV   Q LP+ +KL+  K  D       ++G LKG+EK+ +
Sbjct: 121 DYS-GKSYIELSTDSGEFKEVTYGQSLPKEIKLREMKDVDFNFETLIREGKLKGIEKSDK 179

Query: 177 GSPKAEIVYNLEIRR 191
           GSP AEIVYNL++++
Sbjct: 180 GSPDAEIVYNLQLKK 194
>gi|125717786|ref|YP_001034919.1| hypothetical protein SSA_0947 [Streptococcus sanguinis SK36]
 gi|125497703|gb|ABN44369.1| Hypothetical protein SSA_0947 [Streptococcus sanguinis SK36]
          Length = 197

 Score =  123 bits (309), Expect = 5e-27,   Method: Composition-based stats.
 Identities = 76/198 (38%), Positives = 113/198 (57%), Gaps = 17/198 (8%)

Query: 5   KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
           K +  +L  + VLV GG +  +++       K   LY+ GF+  EEQI TY+KE YSGI 
Sbjct: 3   KKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSGIE 62

Query: 58  KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTS-GIELDF 116
           KIEFSPI+ +G       +A + P +YD +GNK       T+ T +  YV  S GIE D 
Sbjct: 63  KIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNK------ATLGTQIKKYVPNSFGIEADL 116

Query: 117 NVN---DGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEK 173
            ++    G+E+I L + ++ SI+V   + LPE  KL   KS D  I    + G LK V K
Sbjct: 117 VLDFDWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVK 176

Query: 174 NSQGSPKAEIVYNLEIRR 191
           + +GSP+A+I+YN+++ +
Sbjct: 177 DEKGSPEAQIIYNVKLSK 194
>gi|157076021|gb|ABV10704.1| hypothetical protein SGO_1061 [Streptococcus gordonii str. Challis
           substr. CH1]
          Length = 206

 Score =  122 bits (307), Expect = 1e-26,   Method: Composition-based stats.
 Identities = 79/172 (45%), Positives = 106/172 (61%), Gaps = 9/172 (5%)

Query: 25  IDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIEFSPIFKSGGRGEGFAHARIIPVVY 84
           +DR K + + +HGF+L EEQ+ TY+KE+YSGISKIEFSPIF  GG+      A ++PV+Y
Sbjct: 32  LDR-KEEKIIKHGFRLLEEQLGTYIKENYSGISKIEFSPIFIQGGKDNPPFTADVLPVIY 90

Query: 85  DTHGNKVYLRNDGTI-DTLVPNYVLTSGIELDFNVNDGSEIIYLYNEKNE---SIEVGQY 140
           D  GN+  L   G I  T  P+Y L + + LDF+ ND +EII L +        ++V   
Sbjct: 91  DEEGNRAVL--GGQIGKTGYPSYGLLTDLRLDFDHND-NEIIELEDSAKGLGIEVDVSNA 147

Query: 141 QHLPEHLKLKT-YKSTDAIITAYSQKGTLKGVEKNSQGSPKAEIVYNLEIRR 191
           + LPE  K+K     TD  I A      LK V KN +GSPKAE++YNLEI+R
Sbjct: 148 KTLPEKAKIKAELDGTDENIAALVDTKKLKNVRKNKEGSPKAELIYNLEIKR 199
>gi|157075688|gb|ABV10371.1| hypothetical protein SGO_1062 [Streptococcus gordonii str. Challis
           substr. CH1]
          Length = 206

 Score =  119 bits (299), Expect = 8e-26,   Method: Composition-based stats.
 Identities = 80/198 (40%), Positives = 107/198 (54%), Gaps = 13/198 (6%)

Query: 5   KSILLSLFCVFVLVIGGKFYIDRM--------KVDNLYRHGFQLYEEQIATYLKEHYSGI 56
           K ILL    + +L   G  Y+ +         K + + +HGF+L EEQI TY+KE+YSGI
Sbjct: 3   KKILLIFSALIILFSTGFVYMTQFHSSTGLDRKEEKIIKHGFRLLEEQIGTYIKENYSGI 62

Query: 57  SKIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDF 116
           SKIEFSPIF  GG+      A ++PV+YD  GN+  +          P Y  T  + LDF
Sbjct: 63  SKIEFSPIFIQGGKDNPPFTADVLPVIYDKEGNRAVM-GKRIGKKGYPTYGTTGDLTLDF 121

Query: 117 NVNDGSEIIYLYNEK--NESIEVGQYQHLPEHLKLK-TYKSTDAIITAYSQKGTLKGVEK 173
           +   G+E I+L      +E I+V    HLPE  KL    K TD  I A    G LK V +
Sbjct: 122 D-QMGNENIFLKGSSILDEKIDVSSANHLPEEAKLTPPIKGTDDNIDALVNDGQLKNVNR 180

Query: 174 NSQGSPKAEIVYNLEIRR 191
           +  GSP A++ YNLEI+R
Sbjct: 181 SENGSPNAKMDYNLEIKR 198
>gi|125718559|ref|YP_001035692.1| hypothetical protein SSA_1760 [Streptococcus sanguinis SK36]
 gi|125498476|gb|ABN45142.1| Hypothetical protein SSA_1760 [Streptococcus sanguinis SK36]
          Length = 52

 Score = 88.2 bits (217), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 41/45 (91%), Positives = 43/45 (95%)

Query: 3  RVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIAT 47
          RVKSILLSL C+ VLVIGGKFY+DRMKVDNLYRHGFQLYEEQIAT
Sbjct: 8  RVKSILLSLICLSVLVIGGKFYMDRMKVDNLYRHGFQLYEEQIAT 52
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.319    0.139    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 774,769,377
Number of Sequences: 5470121
Number of extensions: 34155156
Number of successful extensions: 77919
Number of sequences better than 1.0e-05: 12
Number of HSP's better than  0.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 77895
Number of HSP's gapped (non-prelim): 12
length of query: 201
length of database: 1,894,087,724
effective HSP length: 126
effective length of query: 75
effective length of database: 1,204,852,478
effective search space: 90363935850
effective search space used: 90363935850
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 126 (53.1 bits)
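
Note on the effective lengths reported above: the figures are consistent with the usual length correction, in which the effective HSP length is subtracted once from the query and once per database sequence, and the effective search space is the product of the two effective lengths. The report itself does not state this formula, so the sketch below is an assumption checked against the numbers printed above. The function and variable names are hypothetical and are not part of BLAST.

```python
# Values copied from the statistics footer of this report.
QUERY_LENGTH = 201
DATABASE_LETTERS = 1_894_087_724
DATABASE_SEQUENCES = 5_470_121
EFFECTIVE_HSP_LENGTH = 126


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumed relationship (not stated in the report itself):
      effective query length    = query_len - hsp_len
      effective database length = db_letters - db_seqs * hsp_len
      effective search space    = product of the two
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DATABASE_LETTERS, DATABASE_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query)  # 75
print(eff_db)     # 1204852478
print(space)      # 90363935850
```

The three printed values match the "effective length of query", "effective length of database", and "effective search space" lines of this report.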