BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SMT2002
(83 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
gi|951054|gb|AAC98437.1| unknown [Streptococcus pneumoniae] 114 2e-24
gi|46019887|emb|CAE52418.1| hypothetical protein [Streptoco... 94 3e-18
gi|55822652|ref|YP_141093.1| hypothetical protein str0683 [... 92 8e-18
gi|46019858|emb|CAE52381.1| hypothetical protein [Streptoco... 92 9e-18
gi|77406940|ref|ZP_00783961.1| conserved hypothetical prote... 86 5e-16
gi|22537421|ref|NP_688272.1| hypothetical protein SAG1272 [... 84 3e-15
gi|22537409|ref|NP_688260.1| hypothetical protein SAG1259 [... 81 2e-14
gi|50914510|ref|YP_060482.1| hypothetical protein M6_Spy116... 80 3e-14
gi|14578842|gb|AAK69029.1|AF274302_3 Orf3 [Streptococcus pn... 80 3e-14
gi|46019865|emb|CAE52389.1| hypothetical protein [Streptoco... 80 5e-14
gi|15485440|emb|CAC67534.1| hypothetical protein [Streptoco... 79 8e-14
gi|52345270|emb|CAG30574.1| hypothetical protein [Streptoco... 67 3e-10
gi|157075692|gb|ABV10375.1| conserved hypothetical protein ... 65 2e-09
gi|125717476|ref|YP_001034609.1| hypothetical protein SSA_0... 64 3e-09
>gi|951054|gb|AAC98437.1| unknown [Streptococcus pneumoniae]
Length = 96
Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats.
Identities = 70/72 (97%), Positives = 71/72 (98%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MNRKELYDDKLQLDYFSDSYL+FESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK
Sbjct: 1 MNRKELYDDKLQLDYFSDSYLQFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
Query: 61 SKSLDNRDHYFV 72
SKSLD RDHYFV
Sbjct: 61 SKSLDGRDHYFV 72
>gi|46019887|emb|CAE52418.1| hypothetical protein [Streptococcus thermophilus]
Length = 100
Score = 93.6 bits (231), Expect = 3e-18, Method: Composition-based stats.
Identities = 51/71 (71%), Positives = 61/71 (85%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MNRKELY DKLQ+DYFS+SY++FE DFY+YSA PLTFI DDIL +MA+S ++YFKLNK
Sbjct: 2 MNRKELYKDKLQIDYFSESYIKFEEDFYRYSADGTPLTFIIDDILLSMALSHRNYFKLNK 61
Query: 61 SKSLDNRDHYF 71
KS+D RDHYF
Sbjct: 62 EKSVDGRDHYF 72
>gi|55822652|ref|YP_141093.1| hypothetical protein str0683 [Streptococcus thermophilus
CNRZ1066]
gi|55738637|gb|AAV62278.1| conserved hypothetical protein [Streptococcus thermophilus
CNRZ1066]
Length = 108
Score = 92.4 bits (228), Expect = 8e-18, Method: Composition-based stats.
Identities = 48/71 (67%), Positives = 62/71 (87%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
M+R+ELY DKLQLDYFS+SY++FE DFY+YSA + PLTFI DDIL +MA+S K+YFKLN+
Sbjct: 1 MSRQELYKDKLQLDYFSESYIKFEEDFYRYSAANTPLTFIIDDILLSMALSHKNYFKLNR 60
Query: 61 SKSLDNRDHYF 71
KS+D ++HYF
Sbjct: 61 EKSVDGKEHYF 71
>gi|46019858|emb|CAE52381.1| hypothetical protein [Streptococcus thermophilus]
Length = 97
Score = 92.0 bits (227), Expect = 9e-18, Method: Composition-based stats.
Identities = 50/71 (70%), Positives = 63/71 (88%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MNR+E+Y DKLQ+DYFS+SY++FE DFY+YSA +IPLTFI DDIL +MA+S K+YF+LNK
Sbjct: 1 MNRREVYKDKLQMDYFSESYIKFEEDFYRYSADNIPLTFIIDDILLSMALSHKNYFRLNK 60
Query: 61 SKSLDNRDHYF 71
KS+D RDHYF
Sbjct: 61 QKSVDGRDHYF 71
>gi|77406940|ref|ZP_00783961.1| conserved hypothetical protein [Streptococcus agalactiae H36B]
gi|77174460|gb|EAO77308.1| conserved hypothetical protein [Streptococcus agalactiae H36B]
Length = 119
Score = 86.3 bits (212), Expect = 5e-16, Method: Composition-based stats.
Identities = 48/72 (66%), Positives = 57/72 (79%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MN+ E + LQ+DYFSDSY +FE DFY+YSALD PLTF+TDDI+ MA SQK YFKLNK
Sbjct: 18 MNQLEFNKNLLQMDYFSDSYRKFEEDFYRYSALDTPLTFLTDDIMLAMAKSQKPYFKLNK 77
Query: 61 SKSLDNRDHYFV 72
S D+RDHYF+
Sbjct: 78 ENSKDHRDHYFM 89
>gi|22537421|ref|NP_688272.1| hypothetical protein SAG1272 [Streptococcus agalactiae 2603V/R]
gi|22534297|gb|AAN00145.1|AE014250_8 conserved hypothetical protein [Streptococcus agalactiae 2603V/R]
Length = 102
Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats.
Identities = 48/72 (66%), Positives = 57/72 (79%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MN+ E + LQ+DYFSDSY +FE DFY+YSALD PLTF+TDDI+ MA SQK YFKLNK
Sbjct: 1 MNQLEFNKNLLQMDYFSDSYRKFEEDFYRYSALDTPLTFLTDDIMLAMAKSQKPYFKLNK 60
Query: 61 SKSLDNRDHYFV 72
S D+RDHYF+
Sbjct: 61 ENSKDHRDHYFM 72
>gi|22537409|ref|NP_688260.1| hypothetical protein SAG1259 [Streptococcus agalactiae 2603V/R]
gi|22534284|gb|AAN00133.1|AE014249_14 conserved hypothetical protein [Streptococcus agalactiae 2603V/R]
Length = 99
Score = 80.9 bits (198), Expect = 2e-14, Method: Composition-based stats.
Identities = 44/72 (61%), Positives = 58/72 (80%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MN+ E + LQ+DY+S+SY FE DFY+YS ++IPL F+TDDIL+TMA S+K+YF LNK
Sbjct: 1 MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLIFLTDDILKTMATSRKNYFVLNK 60
Query: 61 SKSLDNRDHYFV 72
KS DNRDH+F+
Sbjct: 61 EKSKDNRDHFFI 72
>gi|50914510|ref|YP_060482.1| hypothetical protein M6_Spy1164 [Streptococcus pyogenes
MGAS10394]
gi|10119929|gb|AAG13000.1|AF227520_6 unknown [Streptococcus pneumoniae]
gi|18478332|gb|AAL73131.1|AF227521_6 unknown [Streptococcus pyogenes]
gi|40218536|gb|AAR83190.1| conserved hypothetical protein [Streptococcus pyogenes]
gi|50261581|gb|AAT72349.1| unknown [Streptococcus pyogenes]
gi|50903584|gb|AAT87299.1| Hypothetical Protein M6_Spy1164 [Streptococcus pyogenes
MGAS10394]
Length = 99
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 45/72 (62%), Positives = 59/72 (81%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MN+ E + LQ+DY+S+SY FE DFY+YS ++IPLTF+TDDIL+TMA S+K+YF LNK
Sbjct: 1 MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLTFLTDDILKTMATSRKNYFVLNK 60
Query: 61 SKSLDNRDHYFV 72
KS DNRDH+F+
Sbjct: 61 EKSRDNRDHFFI 72
>gi|14578842|gb|AAK69029.1|AF274302_3 Orf3 [Streptococcus pneumoniae]
gi|21322648|emb|CAC87434.1| hypothetical protein [Streptococcus salivarius]
gi|38492182|gb|AAR22393.1| hypothetical protein [Streptococcus pneumoniae]
Length = 99
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 45/72 (62%), Positives = 59/72 (81%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MN+ E + LQ+DY+S+SY FE DFY+YS ++IPLTF+TDDIL+TMA S+K+YF LNK
Sbjct: 1 MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLTFLTDDILKTMATSRKNYFVLNK 60
Query: 61 SKSLDNRDHYFV 72
KS DNRDH+F+
Sbjct: 61 EKSRDNRDHFFI 72
>gi|46019865|emb|CAE52389.1| hypothetical protein [Streptococcus thermophilus]
Length = 99
Score = 79.7 bits (195), Expect = 5e-14, Method: Composition-based stats.
Identities = 41/72 (56%), Positives = 55/72 (76%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
M+R+ LY LQ+DYFS+SY FE DF +YS + +PLTF+TDDILRTMA+ +YF+LN+
Sbjct: 1 MDRRNLYAGYLQIDYFSESYSHFEEDFQRYSNMSVPLTFLTDDILRTMALCHTNYFRLNQ 60
Query: 61 SKSLDNRDHYFV 72
+ D R+HYFV
Sbjct: 61 ENAKDGRNHYFV 72
>gi|15485440|emb|CAC67534.1| hypothetical protein [Streptococcus thermophilus]
Length = 99
Score = 79.0 bits (193), Expect = 8e-14, Method: Composition-based stats.
Identities = 41/72 (56%), Positives = 55/72 (76%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
M+R+ LY LQ+DYFS+SY FE DF +YS + +PLTF+TDDILRTMA+ +YFKLN+
Sbjct: 1 MDRRNLYAGDLQIDYFSESYSHFEEDFQRYSNMSVPLTFLTDDILRTMALCHTNYFKLNQ 60
Query: 61 SKSLDNRDHYFV 72
+ D R+HYF+
Sbjct: 61 ENAKDRRNHYFI 72
>gi|52345270|emb|CAG30574.1| hypothetical protein [Streptococcus pyogenes]
Length = 100
Score = 67.0 bits (162), Expect = 3e-10, Method: Composition-based stats.
Identities = 40/73 (54%), Positives = 53/73 (72%), Gaps = 1/73 (1%)
Query: 1 MNRKELYDDKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNK 60
MN+ E + LQ+DY+S+SY FE DFY+YS ++IPLTF+TDDIL+TMA S K+YF L
Sbjct: 1 MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLTFLTDDILKTMATSHKNYFVLPL 60
Query: 61 SKSLD-NRDHYFV 72
K L DH+F+
Sbjct: 61 RKRLGITVDHFFI 73
>gi|157075692|gb|ABV10375.1| conserved hypothetical protein [Streptococcus gordonii str.
Challis substr. CH1]
Length = 97
Score = 64.7 bits (156), Expect = 2e-09, Method: Composition-based stats.
Identities = 36/63 (57%), Positives = 47/63 (74%)
Query: 9 DKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNKSKSLDNRD 68
++LQ DYFS +Y +F+ DFY++S L PL + DDILR MA Q YFKL+K+KSLD +D
Sbjct: 8 EELQFDYFSHNYYQFQEDFYQFSNLPQPLMAMEDDILRHMASRQVTYFKLSKTKSLDQKD 67
Query: 69 HYF 71
HYF
Sbjct: 68 HYF 70
>gi|125717476|ref|YP_001034609.1| hypothetical protein SSA_0618 [Streptococcus sanguinis SK36]
gi|125497393|gb|ABN44059.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
Length = 97
Score = 63.5 bits (153), Expect = 3e-09, Method: Composition-based stats.
Identities = 35/63 (55%), Positives = 46/63 (73%)
Query: 9 DKLQLDYFSDSYLRFESDFYKYSALDIPLTFITDDILRTMAMSQKHYFKLNKSKSLDNRD 68
++LQ DYFS +Y +F+ DFY++S L PL + DDIL MA Q YFKL+K+KSLD +D
Sbjct: 8 EELQFDYFSQNYYQFQEDFYQFSNLPQPLMVMEDDILLHMASRQATYFKLSKTKSLDKKD 67
Query: 69 HYF 71
HYF
Sbjct: 68 HYF 70
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.324 0.137 0.394
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 286,595,403
Number of Sequences: 5470121
Number of extensions: 9327061
Number of successful extensions: 37274
Number of sequences better than 1.0e-05: 15
Number of HSP's better than 0.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 37259
Number of HSP's gapped (non-prelim): 15
length of query: 83
length of database: 1,894,087,724
effective HSP length: 54
effective length of query: 29
effective length of database: 1,598,701,190
effective search space: 46362334510
effective search space used: 46362334510
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 124 (52.4 bits)
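
Note: the effective lengths and search space reported above can be reproduced from the other figures in this report. The following is a minimal Python sketch, not the BLAST implementation itself. The function names are hypothetical, and it assumes the standard relation E = (effective search space) * 2^(-bit score). Because this run used composition-based statistics, the reported Expect values may differ slightly from what this simple formula gives.

```python
# Sketch: reproduce the search-space figures from the summary statistics.
# All function names are illustrative and are not part of any BLAST API.

def effective_db_length(db_letters: int, num_sequences: int, hsp_length: int) -> int:
    """Database length minus one effective HSP length per sequence."""
    return db_letters - num_sequences * hsp_length


def effective_search_space(query_length: int, db_letters: int,
                           num_sequences: int, hsp_length: int) -> int:
    """Product of the effective query length and effective database length."""
    effective_query = query_length - hsp_length
    effective_db = effective_db_length(db_letters, num_sequences, hsp_length)
    return effective_query * effective_db


def expect_from_bits(bit_score: float, search_space: int) -> float:
    """Assumed relation: E = search_space * 2^(-bit_score)."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from this report: query of 83 letters, database of
# 1,894,087,724 letters in 5,470,121 sequences, effective HSP length 54.
space = effective_search_space(83, 1_894_087_724, 5_470_121, 54)
print(space)  # 46362334510, the "effective search space" line

# Top hit (114 bits); the report lists Expect = 2e-24 for it.
print(f"{expect_from_bits(114, space):.1e}")
```

The two database volumes listed above also sum to the totals used here: 999,999,834 + 894,087,890 = 1,894,087,724 letters and 2,976,859 + 2,493,262 = 5,470,121 sequences.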