BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SSA_1758
(201 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|125718557|ref|YP_001035690.1| hypothetical protein SSA_1... 394 e-108
gi|125718556|ref|YP_001035689.1| hypothetical protein SSA_1... 293 4e-78
gi|157075310|gb|ABV09993.1| hypothetical protein SGO_1064 [... 131 3e-29
gi|157076257|gb|ABV10940.1| hypothetical protein SGO_1065 [... 128 2e-28
gi|125718558|ref|YP_001035691.1| hypothetical protein SSA_1... 126 8e-28
gi|157075074|gb|ABV09757.1| hypothetical protein SGO_1063 [... 126 8e-28
gi|125717787|ref|YP_001034920.1| hypothetical protein SSA_0... 125 1e-27
gi|125717788|ref|YP_001034921.1| hypothetical protein SSA_0... 124 3e-27
gi|125717786|ref|YP_001034919.1| hypothetical protein SSA_0... 123 5e-27
gi|157076021|gb|ABV10704.1| hypothetical protein SGO_1061 [... 122 1e-26
gi|157075688|gb|ABV10371.1| hypothetical protein SGO_1062 [... 119 8e-26
gi|125718559|ref|YP_001035692.1| hypothetical protein SSA_1... 88 2e-16
>gi|125718557|ref|YP_001035690.1| hypothetical protein SSA_1758 [Streptococcus sanguinis SK36]
gi|125498474|gb|ABN45140.1| Hypothetical protein SSA_1758 [Streptococcus sanguinis SK36]
Length = 202
Score = 394 bits (1011), Expect = e-108, Method: Composition-based stats.
Identities = 201/201 (100%), Positives = 201/201 (100%)
Query: 1 TIRVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIE 60
TIRVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIE
Sbjct: 2 TIRVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIE 61
Query: 61 FSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVND 120
FSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVND
Sbjct: 62 FSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVND 121
Query: 121 GSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPK 180
GSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPK
Sbjct: 122 GSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPK 181
Query: 181 AEIVYNLEIRRIDERELDKWQ 201
AEIVYNLEIRRIDERELDKWQ
Sbjct: 182 AEIVYNLEIRRIDERELDKWQ 202
>gi|125718556|ref|YP_001035689.1| hypothetical protein SSA_1757 [Streptococcus sanguinis SK36]
gi|125498473|gb|ABN45139.1| Hypothetical protein SSA_1757 [Streptococcus sanguinis SK36]
Length = 202
Score = 293 bits (750), Expect = 4e-78, Method: Composition-based stats.
Identities = 143/199 (71%), Positives = 168/199 (84%)
Query: 3 RVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIEFS 62
R KSILLSL C FVLVIGGKFY+DRMKVDNLYRHGFQLYEEQIATYLKEHYSG+SKIEFS
Sbjct: 4 RTKSILLSLLCFFVLVIGGKFYMDRMKVDNLYRHGFQLYEEQIATYLKEHYSGVSKIEFS 63
Query: 63 PIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDFNVNDGS 122
PIF SGG GE F +ARI+PVVYD++GNKVYLRNDG +D VP+Y +G++L FNVNDGS
Sbjct: 64 PIFISGGGGESFVNARIVPVVYDSYGNKVYLRNDGVLDMAVPDYGTLAGLDLSFNVNDGS 123
Query: 123 EIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQGSPKAE 182
EI+YL N + ES+ YQHLPE LKL+ + TD ++TA+S++G LKGVEKNSQGSP+AE
Sbjct: 124 EIVYLRNNERESVSSEVYQHLPEQLKLQKEEFTDEVMTAFSREGHLKGVEKNSQGSPQAE 183
Query: 183 IVYNLEIRRIDERELDKWQ 201
I+YNLEIRR+ RE +WQ
Sbjct: 184 IIYNLEIRRVQGREAFEWQ 202
>gi|157075310|gb|ABV09993.1| hypothetical protein SGO_1064 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 189
Score = 131 bits (329), Expect = 3e-29, Method: Composition-based stats.
Identities = 78/165 (47%), Positives = 99/165 (60%), Gaps = 4/165 (2%)
Query: 33 LYRHGFQLYEEQIATYLKEHYSGISKIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVY 92
LY+ GF+L EEQ+ATY+KEHYSG+SKIEFSPIF GG G+ A I+ VYD + NK Y
Sbjct: 25 LYKRGFRLLEEQLATYIKEHYSGVSKIEFSPIFVQGGDGQTMFSANIVFSVYDKNQNKAY 84
Query: 93 LRNDGTI-DTLVPNYVLTSGIELDFNVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKT 151
GTI D P Y + +DFN D EII L + I+V +QHLP KL
Sbjct: 85 F--GGTIGDITYPTYNNVWDVRMDFNGWD-EEIIELRTSDEKLIDVSDFQHLPPEAKLTV 141
Query: 152 YKSTDAIITAYSQKGTLKGVEKNSQGSPKAEIVYNLEIRRIDERE 196
D I A Q ++GV K+S+GS ++VYN EI++ DERE
Sbjct: 142 SNKVDENIEALVQDKQIEGVRKDSKGSADLDVVYNDEIKKGDERE 186
>gi|157076257|gb|ABV10940.1| hypothetical protein SGO_1065 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 187
Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats.
Identities = 78/165 (47%), Positives = 101/165 (61%), Gaps = 5/165 (3%)
Query: 33 LYRHGFQLYEEQIATYLKEHYSGISKIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVY 92
LY+ GF+L EEQ+ATY+KEHYSG+SKIEFSPIF GG G+ A I+PV+YD HGNK Y
Sbjct: 25 LYKRGFRLLEEQLATYIKEHYSGVSKIEFSPIFVQGGDGQTMFDANIVPVIYDNHGNKAY 84
Query: 93 L-RNDGTIDTLVPNYVLTSGIELDFNVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKT 151
L R G +Y L + LDFN D E+I + + + +++ Y+ LP KL
Sbjct: 85 LGRKVGKHG--YASYGLLGDLRLDFNGFD-EEVIEI-DVDGKFLDITNYKSLPPKAKLTI 140
Query: 152 YKSTDAIITAYSQKGTLKGVEKNSQGSPKAEIVYNLEIRRIDERE 196
S D I A G LK V K+ +GS KAE+ YN EIR+ +E E
Sbjct: 141 NPSMDENIVALVNAGHLKDVVKSEKGSLKAEVAYNTEIRKGNEWE 185
>gi|125718558|ref|YP_001035691.1| hypothetical protein SSA_1759 [Streptococcus sanguinis SK36]
gi|125498475|gb|ABN45141.1| Hypothetical protein SSA_1759 [Streptococcus sanguinis SK36]
Length = 101
Score = 126 bits (316), Expect = 8e-28, Method: Composition-based stats.
Identities = 64/99 (64%), Positives = 80/99 (80%)
Query: 103 VPNYVLTSGIELDFNVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAY 162
V +Y +T+GI+LDFNVNDGSEIIYL+N+KNES+ V YQHLP+ LKL + TD + A+
Sbjct: 3 VADYEMTAGIDLDFNVNDGSEIIYLHNKKNESVSVEGYQHLPDGLKLNKDEITDEKMDAF 62
Query: 163 SQKGTLKGVEKNSQGSPKAEIVYNLEIRRIDERELDKWQ 201
S++G LKGVEKNS GSPKAEIVYNLEIRR+ RE +W+
Sbjct: 63 SREGYLKGVEKNSLGSPKAEIVYNLEIRRVQGREFYEWK 101
>gi|157075074|gb|ABV09757.1| hypothetical protein SGO_1063 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 202
Score = 126 bits (316), Expect = 8e-28, Method: Composition-based stats.
Identities = 81/195 (41%), Positives = 108/195 (55%), Gaps = 11/195 (5%)
Query: 5 KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
K L L + VLV GG F +++ K LY+HGFQL EEQIATY+KEHY GI
Sbjct: 3 KKFLTILALILVLVSGGIFVYNKLTKPNFGPKTTRLYQHGFQLLEEQIATYIKEHYKGIK 62
Query: 58 KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLV-PNYVLTSGIELDF 116
KIEFSPI+ +G G +A I+P++YD HGNK + G P Y + L F
Sbjct: 63 KIEFSPIYVTGDDGSSMLNAEIVPIIYDYHGNKA--KFGGLYKNFQHPAYGTIGYLRLSF 120
Query: 117 NVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQ 176
+ + G I L + E +V Q LPE +K K K D ++ LKGVEK+
Sbjct: 121 DYS-GKPYIELSTDTGEFKDVTYGQSLPEEIKGKKIKDIDFNFETLIKEERLKGVEKSDV 179
Query: 177 GSPKAEIVYNLEIRR 191
GSP AE++YNLE+++
Sbjct: 180 GSPNAEVIYNLELKK 194
>gi|125717787|ref|YP_001034920.1| hypothetical protein SSA_0948 [Streptococcus sanguinis SK36]
gi|125497704|gb|ABN44370.1| Hypothetical protein SSA_0948 [Streptococcus sanguinis SK36]
Length = 197
Score = 125 bits (315), Expect = 1e-27, Method: Composition-based stats.
Identities = 75/195 (38%), Positives = 113/195 (57%), Gaps = 11/195 (5%)
Query: 5 KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
K I++ L +F+++ GG + +++ K LY+ GF+L EEQ TY KEHY GI
Sbjct: 3 KKIIIVLTILFIVISGGIYIYNKLTKPNLGPKTTKLYQRGFRLLEEQYGTYFKEHYKGIE 62
Query: 58 KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPN-YVLTSGIELDF 116
KI+FSPI+ G G +A + P +YD +GNK L TI+ PN Y L + I LDF
Sbjct: 63 KIKFSPIYIEGDNGGSMLNAYVRPTIYDKYGNKATLGT--TIENYTPNSYGLVTHIFLDF 120
Query: 117 NVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQ 176
+ G+++I L + I+V +HLP+ KL K TD I + G LK V K+ +
Sbjct: 121 D-GAGNDVIELMDSHGNDIDVSNAKHLPDEAKLTRAKGTDLNIELLVEDGQLKDVVKDEK 179
Query: 177 GSPKAEIVYNLEIRR 191
G+P+AEI+YN+++ +
Sbjct: 180 GTPEAEIIYNVKLSK 194
>gi|125717788|ref|YP_001034921.1| hypothetical protein SSA_0949 [Streptococcus sanguinis SK36]
gi|125497705|gb|ABN44371.1| Hypothetical protein SSA_0949 [Streptococcus sanguinis SK36]
Length = 202
Score = 124 bits (312), Expect = 3e-27, Method: Composition-based stats.
Identities = 75/195 (38%), Positives = 114/195 (58%), Gaps = 11/195 (5%)
Query: 5 KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
K I++ L +F++ GG + +++ K LY+HGF+L EEQI TY+KE+YSGI
Sbjct: 3 KKIIIGLTILFIVFSGGIYMYNKLTKPNFGSKTTKLYQHGFRLLEEQIGTYIKENYSGIE 62
Query: 58 KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTL-VPNYVLTSGIELDF 116
KIEFSPI+ +G G +A ++P+VYD++GNK + G P Y + + F
Sbjct: 63 KIEFSPIYITGDDGSSMLNAEVVPIVYDSYGNKA--KFGGLYKNFQQPAYGTIGYLRVSF 120
Query: 117 NVNDGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEKNSQ 176
+ + G I L + E EV Q LP+ +KL+ K D ++G LKG+EK+ +
Sbjct: 121 DYS-GKSYIELSTDSGEFKEVTYGQSLPKEIKLREMKDVDFNFETLIREGKLKGIEKSDK 179
Query: 177 GSPKAEIVYNLEIRR 191
GSP AEIVYNL++++
Sbjct: 180 GSPDAEIVYNLQLKK 194
>gi|125717786|ref|YP_001034919.1| hypothetical protein SSA_0947 [Streptococcus sanguinis SK36]
gi|125497703|gb|ABN44369.1| Hypothetical protein SSA_0947 [Streptococcus sanguinis SK36]
Length = 197
Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats.
Identities = 76/198 (38%), Positives = 113/198 (57%), Gaps = 17/198 (8%)
Query: 5 KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 57
K + +L + VLV GG + +++ K LY+ GF+ EEQI TY+KE YSGI
Sbjct: 3 KKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSGIE 62
Query: 58 KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTS-GIELDF 116
KIEFSPI+ +G +A + P +YD +GNK T+ T + YV S GIE D
Sbjct: 63 KIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNK------ATLGTQIKKYVPNSFGIEADL 116
Query: 117 NVN---DGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEK 173
++ G+E+I L + ++ SI+V + LPE KL KS D I + G LK V K
Sbjct: 117 VLDFDWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVK 176
Query: 174 NSQGSPKAEIVYNLEIRR 191
+ +GSP+A+I+YN+++ +
Sbjct: 177 DEKGSPEAQIIYNVKLSK 194
>gi|157076021|gb|ABV10704.1| hypothetical protein SGO_1061 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 206
Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats.
Identities = 79/172 (45%), Positives = 106/172 (61%), Gaps = 9/172 (5%)
Query: 25 IDRMKVDNLYRHGFQLYEEQIATYLKEHYSGISKIEFSPIFKSGGRGEGFAHARIIPVVY 84
+DR K + + +HGF+L EEQ+ TY+KE+YSGISKIEFSPIF GG+ A ++PV+Y
Sbjct: 32 LDR-KEEKIIKHGFRLLEEQLGTYIKENYSGISKIEFSPIFIQGGKDNPPFTADVLPVIY 90
Query: 85 DTHGNKVYLRNDGTI-DTLVPNYVLTSGIELDFNVNDGSEIIYLYNEKNE---SIEVGQY 140
D GN+ L G I T P+Y L + + LDF+ ND +EII L + ++V
Sbjct: 91 DEEGNRAVL--GGQIGKTGYPSYGLLTDLRLDFDHND-NEIIELEDSAKGLGIEVDVSNA 147
Query: 141 QHLPEHLKLKT-YKSTDAIITAYSQKGTLKGVEKNSQGSPKAEIVYNLEIRR 191
+ LPE K+K TD I A LK V KN +GSPKAE++YNLEI+R
Sbjct: 148 KTLPEKAKIKAELDGTDENIAALVDTKKLKNVRKNKEGSPKAELIYNLEIKR 199
>gi|157075688|gb|ABV10371.1| hypothetical protein SGO_1062 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 206
Score = 119 bits (299), Expect = 8e-26, Method: Composition-based stats.
Identities = 80/198 (40%), Positives = 107/198 (54%), Gaps = 13/198 (6%)
Query: 5 KSILLSLFCVFVLVIGGKFYIDRM--------KVDNLYRHGFQLYEEQIATYLKEHYSGI 56
K ILL + +L G Y+ + K + + +HGF+L EEQI TY+KE+YSGI
Sbjct: 3 KKILLIFSALIILFSTGFVYMTQFHSSTGLDRKEEKIIKHGFRLLEEQIGTYIKENYSGI 62
Query: 57 SKIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTSGIELDF 116
SKIEFSPIF GG+ A ++PV+YD GN+ + P Y T + LDF
Sbjct: 63 SKIEFSPIFIQGGKDNPPFTADVLPVIYDKEGNRAVM-GKRIGKKGYPTYGTTGDLTLDF 121
Query: 117 NVNDGSEIIYLYNEK--NESIEVGQYQHLPEHLKLK-TYKSTDAIITAYSQKGTLKGVEK 173
+ G+E I+L +E I+V HLPE KL K TD I A G LK V +
Sbjct: 122 D-QMGNENIFLKGSSILDEKIDVSSANHLPEEAKLTPPIKGTDDNIDALVNDGQLKNVNR 180
Query: 174 NSQGSPKAEIVYNLEIRR 191
+ GSP A++ YNLEI+R
Sbjct: 181 SENGSPNAKMDYNLEIKR 198
>gi|125718559|ref|YP_001035692.1| hypothetical protein SSA_1760 [Streptococcus sanguinis SK36]
gi|125498476|gb|ABN45142.1| Hypothetical protein SSA_1760 [Streptococcus sanguinis SK36]
Length = 52
Score = 88.2 bits (217), Expect = 2e-16, Method: Composition-based stats.
Identities = 41/45 (91%), Positives = 43/45 (95%)
Query: 3 RVKSILLSLFCVFVLVIGGKFYIDRMKVDNLYRHGFQLYEEQIAT 47
RVKSILLSL C+ VLVIGGKFY+DRMKVDNLYRHGFQLYEEQIAT
Sbjct: 8 RVKSILLSLICLSVLVIGGKFYMDRMKVDNLYRHGFQLYEEQIAT 52
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.319 0.139 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 774,769,377
Number of Sequences: 5470121
Number of extensions: 34155156
Number of successful extensions: 77919
Number of sequences better than 1.0e-05: 12
Number of HSP's better than 0.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 77895
Number of HSP's gapped (non-prelim): 12
length of query: 201
length of database: 1,894,087,724
effective HSP length: 126
effective length of query: 75
effective length of database: 1,204,852,478
effective search space: 90363935850
effective search space used: 90363935850
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 126 (53.1 bits)