BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= SMT2002 
         (83 letters)

Database: nr 
           5,470,121 sequences; 1,894,087,724 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|951054|gb|AAC98437.1|  unknown [Streptococcus pneumoniae]      114   2e-24
gi|46019887|emb|CAE52418.1|  hypothetical protein [Streptoco...    94   3e-18
gi|55822652|ref|YP_141093.1|  hypothetical protein str0683 [...    92   8e-18
gi|46019858|emb|CAE52381.1|  hypothetical protein [Streptoco...    92   9e-18
gi|77406940|ref|ZP_00783961.1|  conserved hypothetical prote...    86   5e-16
gi|22537421|ref|NP_688272.1|  hypothetical protein SAG1272 [...    84   3e-15
gi|22537409|ref|NP_688260.1|  hypothetical protein SAG1259 [...    81   2e-14
gi|50914510|ref|YP_060482.1|  hypothetical protein M6_Spy116...    80   3e-14
gi|14578842|gb|AAK69029.1|AF274302_3  Orf3 [Streptococcus pn...    80   3e-14
gi|46019865|emb|CAE52389.1|  hypothetical protein [Streptoco...    80   5e-14
gi|15485440|emb|CAC67534.1|  hypothetical protein [Streptoco...    79   8e-14
gi|52345270|emb|CAG30574.1|  hypothetical protein [Streptoco...    67   3e-10
gi|157075692|gb|ABV10375.1|  conserved hypothetical protein ...    65   2e-09
gi|125717476|ref|YP_001034609.1|  hypothetical protein SSA_0...    64   3e-09
>gi|951054|gb|AAC98437.1| unknown [Streptococcus pneumoniae]
          Length = 96

 Score =  114 bits (285), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 70/72 (97%), Positives = 71/72 (98%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MNRKELYDDKLQLDYFSDSYL+FESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK
Sbjct: 1  MNRKELYDDKLQLDYFSDSYLQFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60

Query: 61 SKSLDNRDHYFV 72
          SKSLD RDHYFV
Sbjct: 61 SKSLDGRDHYFV 72
>gi|46019887|emb|CAE52418.1| hypothetical protein [Streptococcus thermophilus]
          Length = 100

 Score = 93.6 bits (231), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 51/71 (71%), Positives = 61/71 (85%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MNRKELY DKLQ+DYFS+SY++FE DFY+YSA   PLTFI DDIL +MA+S ++YFKLNK
Sbjct: 2  MNRKELYKDKLQIDYFSESYIKFEEDFYRYSADGTPLTFIIDDILLSMALSHRNYFKLNK 61

Query: 61 SKSLDNRDHYF 71
           KS+D RDHYF
Sbjct: 62 EKSVDGRDHYF 72
>gi|55822652|ref|YP_141093.1| hypothetical protein str0683 [Streptococcus thermophilus
          CNRZ1066]
 gi|55738637|gb|AAV62278.1| conserved hypothetical protein [Streptococcus thermophilus
          CNRZ1066]
          Length = 108

 Score = 92.4 bits (228), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 48/71 (67%), Positives = 62/71 (87%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          M+R+ELY DKLQLDYFS+SY++FE DFY+YSA + PLTFI DDIL +MA+S K+YFKLN+
Sbjct: 1  MSRQELYKDKLQLDYFSESYIKFEEDFYRYSAANTPLTFIIDDILLSMALSHKNYFKLNR 60

Query: 61 SKSLDNRDHYF 71
           KS+D ++HYF
Sbjct: 61 EKSVDGKEHYF 71
>gi|46019858|emb|CAE52381.1| hypothetical protein [Streptococcus thermophilus]
          Length = 97

 Score = 92.0 bits (227), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 50/71 (70%), Positives = 63/71 (88%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MNR+E+Y DKLQ+DYFS+SY++FE DFY+YSA +IPLTFI DDIL +MA+S K+YF+LNK
Sbjct: 1  MNRREVYKDKLQMDYFSESYIKFEEDFYRYSADNIPLTFIIDDILLSMALSHKNYFRLNK 60

Query: 61 SKSLDNRDHYF 71
           KS+D RDHYF
Sbjct: 61 QKSVDGRDHYF 71
>gi|77406940|ref|ZP_00783961.1| conserved hypothetical protein [Streptococcus agalactiae H36B]
 gi|77174460|gb|EAO77308.1| conserved hypothetical protein [Streptococcus agalactiae H36B]
          Length = 119

 Score = 86.3 bits (212), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 48/72 (66%), Positives = 57/72 (79%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MN+ E   + LQ+DYFSDSY +FE DFY+YSALD PLTF+TDDI+  MA SQK YFKLNK
Sbjct: 18 MNQLEFNKNLLQMDYFSDSYRKFEEDFYRYSALDTPLTFLTDDIMLAMAKSQKPYFKLNK 77

Query: 61 SKSLDNRDHYFV 72
            S D+RDHYF+
Sbjct: 78 ENSKDHRDHYFM 89
>gi|22537421|ref|NP_688272.1| hypothetical protein SAG1272 [Streptococcus agalactiae 2603V/R]
 gi|22534297|gb|AAN00145.1|AE014250_8 conserved hypothetical protein [Streptococcus agalactiae 2603V/R]
          Length = 102

 Score = 84.0 bits (206), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 48/72 (66%), Positives = 57/72 (79%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MN+ E   + LQ+DYFSDSY +FE DFY+YSALD PLTF+TDDI+  MA SQK YFKLNK
Sbjct: 1  MNQLEFNKNLLQMDYFSDSYRKFEEDFYRYSALDTPLTFLTDDIMLAMAKSQKPYFKLNK 60

Query: 61 SKSLDNRDHYFV 72
            S D+RDHYF+
Sbjct: 61 ENSKDHRDHYFM 72
>gi|22537409|ref|NP_688260.1| hypothetical protein SAG1259 [Streptococcus agalactiae 2603V/R]
 gi|22534284|gb|AAN00133.1|AE014249_14 conserved hypothetical protein [Streptococcus agalactiae 2603V/R]
          Length = 99

 Score = 80.9 bits (198), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 44/72 (61%), Positives = 58/72 (80%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MN+ E   + LQ+DY+S+SY  FE DFY+YS ++IPL F+TDDIL+TMA S+K+YF LNK
Sbjct: 1  MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLIFLTDDILKTMATSRKNYFVLNK 60

Query: 61 SKSLDNRDHYFV 72
           KS DNRDH+F+
Sbjct: 61 EKSKDNRDHFFI 72
>gi|50914510|ref|YP_060482.1| hypothetical protein M6_Spy1164 [Streptococcus pyogenes
          MGAS10394]
 gi|10119929|gb|AAG13000.1|AF227520_6 unknown [Streptococcus pneumoniae]
 gi|18478332|gb|AAL73131.1|AF227521_6 unknown [Streptococcus pyogenes]
 gi|40218536|gb|AAR83190.1| conserved hypothetical protein [Streptococcus pyogenes]
 gi|50261581|gb|AAT72349.1| unknown [Streptococcus pyogenes]
 gi|50903584|gb|AAT87299.1| Hypothetical Protein M6_Spy1164 [Streptococcus pyogenes
          MGAS10394]
          Length = 99

 Score = 80.5 bits (197), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 45/72 (62%), Positives = 59/72 (81%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MN+ E   + LQ+DY+S+SY  FE DFY+YS ++IPLTF+TDDIL+TMA S+K+YF LNK
Sbjct: 1  MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLTFLTDDILKTMATSRKNYFVLNK 60

Query: 61 SKSLDNRDHYFV 72
           KS DNRDH+F+
Sbjct: 61 EKSRDNRDHFFI 72
>gi|14578842|gb|AAK69029.1|AF274302_3 Orf3 [Streptococcus pneumoniae]
 gi|21322648|emb|CAC87434.1| hypothetical protein [Streptococcus salivarius]
 gi|38492182|gb|AAR22393.1| hypothetical protein [Streptococcus pneumoniae]
          Length = 99

 Score = 80.5 bits (197), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 45/72 (62%), Positives = 59/72 (81%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MN+ E   + LQ+DY+S+SY  FE DFY+YS ++IPLTF+TDDIL+TMA S+K+YF LNK
Sbjct: 1  MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLTFLTDDILKTMATSRKNYFVLNK 60

Query: 61 SKSLDNRDHYFV 72
           KS DNRDH+F+
Sbjct: 61 EKSRDNRDHFFI 72
>gi|46019865|emb|CAE52389.1| hypothetical protein [Streptococcus thermophilus]
          Length = 99

 Score = 79.7 bits (195), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 41/72 (56%), Positives = 55/72 (76%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          M+R+ LY   LQ+DYFS+SY  FE DF +YS + +PLTF+TDDILRTMA+   +YF+LN+
Sbjct: 1  MDRRNLYAGYLQIDYFSESYSHFEEDFQRYSNMSVPLTFLTDDILRTMALCHTNYFRLNQ 60

Query: 61 SKSLDNRDHYFV 72
            + D R+HYFV
Sbjct: 61 ENAKDGRNHYFV 72
>gi|15485440|emb|CAC67534.1| hypothetical protein [Streptococcus thermophilus]
          Length = 99

 Score = 79.0 bits (193), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 41/72 (56%), Positives = 55/72 (76%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          M+R+ LY   LQ+DYFS+SY  FE DF +YS + +PLTF+TDDILRTMA+   +YFKLN+
Sbjct: 1  MDRRNLYAGDLQIDYFSESYSHFEEDFQRYSNMSVPLTFLTDDILRTMALCHTNYFKLNQ 60

Query: 61 SKSLDNRDHYFV 72
            + D R+HYF+
Sbjct: 61 ENAKDRRNHYFI 72
>gi|52345270|emb|CAG30574.1| hypothetical protein [Streptococcus pyogenes]
          Length = 100

 Score = 67.0 bits (162), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 40/73 (54%), Positives = 53/73 (72%), Gaps = 1/73 (1%)

Query: 1  MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
          MN+ E   + LQ+DY+S+SY  FE DFY+YS ++IPLTF+TDDIL+TMA S K+YF L  
Sbjct: 1  MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLTFLTDDILKTMATSHKNYFVLPL 60

Query: 61 SKSLD-NRDHYFV 72
           K L    DH+F+
Sbjct: 61 RKRLGITVDHFFI 73
>gi|157075692|gb|ABV10375.1| conserved hypothetical protein [Streptococcus gordonii str.
          Challis substr. CH1]
          Length = 97

 Score = 64.7 bits (156), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 36/63 (57%), Positives = 47/63 (74%)

Query: 9  DKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNKSKSLDNRD 68
          ++LQ DYFS +Y +F+ DFY++S L  PL  + DDILR MA  Q  YFKL+K+KSLD +D
Sbjct: 8  EELQFDYFSHNYYQFQEDFYQFSNLPQPLMAMEDDILRHMASRQVTYFKLSKTKSLDQKD 67

Query: 69 HYF 71
          HYF
Sbjct: 68 HYF 70
>gi|125717476|ref|YP_001034609.1| hypothetical protein SSA_0618 [Streptococcus sanguinis SK36]
 gi|125497393|gb|ABN44059.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
          Length = 97

 Score = 63.5 bits (153), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 35/63 (55%), Positives = 46/63 (73%)

Query: 9  DKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNKSKSLDNRD 68
          ++LQ DYFS +Y +F+ DFY++S L  PL  + DDIL  MA  Q  YFKL+K+KSLD +D
Sbjct: 8  EELQFDYFSQNYYQFQEDFYQFSNLPQPLMVMEDDILLHMASRQATYFKLSKTKSLDKKD 67

Query: 69 HYF 71
          HYF
Sbjct: 68 HYF 70
  Database: nr
    Posted date:  Sep 17, 2007 11:41 AM
  Number of letters in database: 999,999,834
  Number of sequences in database:  2,976,859
  
  Database: /nucleus1/users/jsaw/ncbi/db/nr.01
    Posted date:  Sep 17, 2007 11:48 AM
  Number of letters in database: 894,087,890
  Number of sequences in database:  2,493,262
  
Lambda     K      H
   0.324    0.137    0.394 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 286,595,403
Number of Sequences: 5470121
Number of extensions: 9327061
Number of successful extensions: 37274
Number of sequences better than 1.0e-05: 15
Number of HSP's better than  0.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 37259
Number of HSP's gapped (non-prelim): 15
length of query: 83
length of database: 1,894,087,724
effective HSP length: 54
effective length of query: 29
effective length of database: 1,598,701,190
effective search space: 46362334510
effective search space used: 46362334510
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 124 (52.4 bits)